PDFlib TET
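Reliably extract text, images, and metadata from any PDF file.
- Available as a library/component and as a command-line tool
- Extracts PDF text content as Unicode strings and structured XML
- New version 4.1 extracts faster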
Description: View Office documents easily through your Web browser. OfficeHTMLFilter is the upgraded version of DMC HTMLFilter V1, part of the Dynamic Multiplatform Converter series. Designed for conversion from Office documents to HTML in various kinds of server ...
Description: PDF Information Retrieval Tool. PDFlib pCOS provides a simple and elegant facility for retrieving any information from a PDF document which is not part of the page contents. For example, PDF metadata, interactive elements (links etc.), or page dimensions ...