Read, modify and write Word documents without utilizing Microsoft Word. Aspose.Words for Java supports a wide array of features including document creation, content and formatting manipulation, powerful mail merge abilities, exporting to DOC, HTML and PDF (requires Aspose.Pdf). Aspose.Words for Java Application Programming Interface (API) is powerful yet easy to use. To minimize your Aspose.Words for Java learning curve, classes, properties and method names borrow the best practices from two well known APIs: Microsoft Word Object Model and System.Xml. High-level reporting and document building functionality is provided with Aspose.Words for Java as well as detailed programmatic access to all document elements.
Publisher: PDFlib Primary Category: PDF Product Type: Component / Application / .NET Class / ActiveX DLL / DLL / JavaBean
Text extraction toolkit. PDFlib TET (Text Extraction Toolkit) reliably extracts text, images and metadata from any PDF file. It is available as a library/component and as a command-line tool. PDFlib TET makes available the text contents of a PDF as Unicode strings or structured XML, plus detailed glyph and font information. With PDFlib TET you can retrieve the corresponding Unicode values for text in a PDF document, as well as its position on the page.