Cette page a été archivée et est n'est plus actualisée.

Nous ne fournissons plus ce produit.

PDFlib TET PDF IFilter

Extrayez le texte et les métadonnées des documents PDF.

Publié par PDFlib
Distribué par ComponentSource depuis 2003

Version : 5.5 Mise à jour : Jan 12, 2024

i

Please note: PDFlib TET PDF IFilter was officially retired as of December 19th 2024. If you are interested in this product, consider PDFlib instead.

PDFlib TET PDF IFilter 5.3

Released: May 4, 2021

Mises à jour de 5.3

Fonctionnalités

  • Optimized PDF resource handling to improve performance for documents with excessive numbers of images, patterns or other resources.
  • Security and performance updates of all third-party components.
  • Harden processing of damaged and illegal PDF documents by testing the full Issue Tracker PDF corpus with tens of thousands of stressful PDF files.
  • Expanded platform and CPU support including macOS on ARM64 and Linux on ARM64.
  • Timeout can be specified to limit processing time for large or complex files.
  • Enhancements in all language bindings and updates for the latest language versions including .NET 5, PHP 8, Perl 5.32 and Ruby 3.0.
  • Support for native UTF-8, UTF-16 and UTF-32 Unicode strings in C++17 and C++20.
  • Implement detection of certain kinds of attacks using legal PDF constructs overly large data structures.
  • Improved TETML output for edge cases.
  • Improved word boundary, list and paragraph detection.
  • Support for Unicode 13.
  • Improved performance for Classic .NET binding.
  • Many minor bug fixes and improvements.
  • Updated CJK character collections and CMaps for PDF 2.0.