This page has been archived and is no longer updated.

We do not supply this product anymore.

PDFlib TET PDF IFilter

Extract text and metadata from PDF documents.

Published by PDFlib
Distributed by ComponentSource since 2003

Version: 5.5 Updated: Jan 12, 2024

i

Please note: PDFlib TET PDF IFilter was officially retired as of December 19th 2024. If you are interested in this product, consider PDFlib instead.

PDFlib TET PDF IFilter 5.3

Released: May 4, 2021

Updates in 5.3

Features

  • Optimized PDF resource handling to improve performance for documents with excessive numbers of images, patterns or other resources.
  • Security and performance updates of all third-party components.
  • Harden processing of damaged and illegal PDF documents by testing the full Issue Tracker PDF corpus with tens of thousands of stressful PDF files.
  • Expanded platform and CPU support including macOS on ARM64 and Linux on ARM64.
  • Timeout can be specified to limit processing time for large or complex files.
  • Enhancements in all language bindings and updates for the latest language versions including .NET 5, PHP 8, Perl 5.32 and Ruby 3.0.
  • Support for native UTF-8, UTF-16 and UTF-32 Unicode strings in C++17 and C++20.
  • Implement detection of certain kinds of attacks using legal PDF constructs overly large data structures.
  • Improved TETML output for edge cases.
  • Improved word boundary, list and paragraph detection.
  • Support for Unicode 13.
  • Improved performance for Classic .NET binding.
  • Many minor bug fixes and improvements.
  • Updated CJK character collections and CMaps for PDF 2.0.