Skip to content

Releases: smalot/pdfparser

v2.12.5

21 Apr 06:20
2cfa0d9

Choose a tag to compare

What's Changed

  • Updated .gitattributes so less files will be distributed when someone downloads the package (by @williamdes in #792)
  • For PHP 8.5: Fix chr() deprecation in ASCII85 last-tuple decoding by @Ciki in #793

New Contributors

Full Changelog: v2.12.4...v2.12.5

v2.12.4

11 Mar 10:55
028d7cc

Choose a tag to compare

Bugfix release

Refining a change in the latest release:

Ignore Form as well as Image XObjects when assembling the text array for a PDFObject. by @rupertj in #783

When assembling the text array for an object, skip Forms that don't contain any text, instead of all Forms. by @rupertj in #789

Full Changelog: v2.12.3...v2.12.4

v2.12.3

08 Jan 08:19
61c9bca

Choose a tag to compare

Security fix and refinements

Fix for potential Denial of Service vulnerability

Summary: The fix prevents the RawDataParser.php‎ to enter an endless loop under certain circumstances which would lead to memory exhaustion.

Details: When parsing a specifically crafted, malformed PDF file, the low-level RawDataParser enters a state that leads to uncontrolled memory allocation. This continues until the PHP script exhausts its memory_limit and crashes with a fatal error. An attacker can leverage this vulnerability by submitting a small, malicious PDF file to any service using this library, causing the server process to crash and become unavailable.

Thank you Yang LUO (https://github.com/N0zoM1z0) for reporting this and the provided details on the matter. #787 contains further information.

Refinement to improve extracted texts

Ignore Form as well as Image XObjects when assembling the text array for a PDFObject. by @rupertj in #783


Full Changelog: v2.12.2...v2.12.3

v2.12.2

24 Nov 07:42
370b7e9

Choose a tag to compare

What's Changed

  • Include PHP 8.5 in CI + Fixes deprecations by @k00ni in #779

Full Changelog: v2.12.1...v2.12.2

v2.12.1

18 Aug 08:15
98d31ba

Choose a tag to compare

What's Changed

  • Stop Image XObjects being included as empty strings in PDFObject::getTextArray(). by @rupertj in #775

Internal

  • CI: Updated OS images (Windows, Ubuntu) by @k00ni in #776

New Contributors

Full Changelog: v2.12.0...v2.12.1

v2.12.0

31 Mar 14:34
8440edb

Choose a tag to compare

What's Changed

Internal changes

  • chore: update phpunit.xml by @vitormattos in #757
  • CI: added PHP 8.4 by @k00ni in #723
  • Simplified Coding Style checks: PSR12 replaces Symfony, risky not allowed anymore by @k00ni in #737
  • Introducing CONTRIBUTING.md by @k00ni in #744
  • pull_request_template.md: Set path to CONTRIBUTING.md by @k00ni in #760

New Contributors

Full Changelog: v2.11.0...v2.12.0

v2.11.0

19 Aug 05:48
ac8e667

Choose a tag to compare

What's Changed

  • Account for inline images in formatContent() by @GreyWyvern in #693
  • Fix for two bugs related to Unicode translation support by Font objects by @unixnut in #698
  • Fix for adjacent escaped slashes and escaped parentheses in strings by @GreyWyvern in #711
  • Merge XMP Metadata if dc:format tag not found by @GreyWyvern in #722
  • Implement missing cm command by @DominikDostal in #720
  • fix typo and clarify sentence by @bernard-ng in #729
  • fix: check that the previous xref is not the just processed xref by @tkegan in #727

Internal changes

  • Continuous-integration.yml: let workflow run on each push event by @k00ni in #719

New Contributors

Full Changelog: v.2.10.0...v2.11.0

v2.10.0

30 Apr 12:06
14adf31

Choose a tag to compare

What's Changed

Internal changes

  • improved documentation: Return page width and height from document by @vitormattos in #700
  • Fixed CS issue in PDFObject.php by @k00ni in #695

Full Changelog: v2.9.0...v.2.10.0

v.2.10.0

29 Apr 14:21
14adf31

Choose a tag to compare

Replaced by v2.10.0

v2.9.0

05 Mar 09:49
6b53144

Choose a tag to compare

What's Changed

  • Fix returning empty text in some cases by @xAzoom in #666
  • Baseencoding fallback by @GreyWyvern in #669
  • Check for binary content in formatContent() before a problematic regexp by @GreyWyvern in #676
  • Fixed latest coding style issues and refined a few PHP doc entries to match types by @k00ni in #677
  • Fixes Scrutinizer integration (mostly failing tests) by @k00ni in #682
  • Fixed a few coding style issues by @k00ni in #670

New Contributors

Full Changelog: v2.8.0...v2.9.0