PDFlib TET PDF IFilter 5.3

Released: May 4, 2021

Updates in 5.3

Features

  • Optimized PDF resource handling to improve performance for documents with excessive numbers of images, patterns or other resources.
  • Security and performance updates of all third-party components.
  • Hardened processing of damaged and illegal PDF documents by testing against the full Issue Tracker PDF corpus, which contains tens of thousands of stressful PDF files.
  • Expanded platform and CPU support including macOS on ARM64 and Linux on ARM64.
  • A timeout can be specified to limit processing time for large or complex files.
  • Enhancements in all language bindings and updates for the latest language versions including .NET 5, PHP 8, Perl 5.32 and Ruby 3.0.
  • Support for native UTF-8, UTF-16 and UTF-32 Unicode strings in C++17 and C++20.
  • Implemented detection of certain kinds of attacks that use legal PDF constructs with overly large data structures.
  • Improved TETML output for edge cases.
  • Improved word boundary, list and paragraph detection.
  • Support for Unicode 13.
  • Improved performance for the Classic .NET binding.
  • Many minor bug fixes and improvements.
  • Updated CJK character collections and CMaps for PDF 2.0.