
jPDFText - Overview

by Qoppa Software - Product Type: Component / Java Class

Overview

jPDFText by Qoppa Software


Extract text from PDF documents. jPDFText is a Java library that lets you process PDF documents to extract the textual content for archiving, storage, searching or indexing. With jPDFText you can load PDF documents from files, network drives, URLs or input streams. You can extract the text as a whole or extract the words as a vector of strings. jPDFText is built on top of Qoppa's proprietary PDF technology, so there is no need for any third-party software or drivers.

jPDFText is a Java library that integrates seamlessly into your application or applet to extract words from PDF documents. jPDFText provides the following functions:

Load PDF documents from files, network drives, URLs or input streams

Get basic information from the PDF document, such as title, author, keywords and page count

Extract words from PDF documents as a vector of strings

Extract words page by page

jPDFText is built on top of Qoppa's proprietary PDF technology and doesn't require any third-party programs or drivers.

jPDFText - Getting Started

The starting point for using jPDFText is the com.qoppa.pdfText.PDFText class. This class is used to load a PDF document and extract the text from it. The class provides three constructors to load PDF files from the file system, a URL or an InputStream. All constructors take an additional parameter, an object that implements IPasswordHandler, which is queried if the PDF file requires a password to open. For PDF files that are not encrypted, this second parameter can be null.

jPDFText - Extract Text

Once a PDFText object has been created, the host application simply needs to call the getText method to get the text of the loaded PDF document.

jPDFText - Extracting Text Page by Page

To extract the text page by page, use the getText method that takes a page number as a parameter. You can get the number of pages from the PDFText object through the getPageCount method.
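The page loop described above can be sketched as follows. This is a minimal illustration, not the library's own sample: PagedText is a hypothetical stand-in interface that mirrors only the two method names mentioned in the text (getPageCount and getText with a page number), so that the sketch compiles and runs without jPDFText.jar. It also assumes page numbers are zero-based, which should be checked against the jPDFText API documentation.

```java
import java.util.ArrayList;
import java.util.List;

public class PageTextDemo {
    /** Hypothetical stand-in for the two PDFText methods named in the text. */
    interface PagedText {
        int getPageCount();
        String getText(int pageIndex); // assumption: zero-based page index
    }

    /** Collects the text of every page, one list entry per page. */
    static List<String> textByPage(PagedText doc) {
        List<String> pages = new ArrayList<>();
        for (int i = 0; i < doc.getPageCount(); i++) {
            pages.add(doc.getText(i));
        }
        return pages;
    }

    public static void main(String[] args) {
        // In-memory fake so the sketch runs without the library or a PDF file.
        final String[] fake = {"First page text", "Second page text"};
        PagedText doc = new PagedText() {
            public int getPageCount() { return fake.length; }
            public String getText(int pageIndex) { return fake[pageIndex]; }
        };

        List<String> pages = textByPage(doc);
        for (int i = 0; i < pages.size(); i++) {
            System.out.println("Page " + (i + 1) + ": " + pages.get(i));
        }
    }
}
```

In a real application, a thin adapter that forwards these two calls to a loaded PDFText object could take the place of the in-memory fake.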

jPDFText - Extracting Words as a Vector of Strings

Once a PDFText object has been created, the host application simply needs to call the getWords method to get the list of words from the loaded PDF document.

jPDFText - Extracting Words Page by Page

To extract words page by page, use the getWords method that takes a page number as a parameter. You can get the number of pages from the PDFText object through the getPageCount method.
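Since indexing is one of the uses named in the overview, here is a small sketch of how per-page word lists might feed a word-to-pages index. PagedWords is a hypothetical stand-in interface, not part of jPDFText; it mirrors only the getPageCount and getWords method names from the text. The sketch assumes getWords returns a Vector of String and that page numbers are zero-based; both assumptions should be verified against the jPDFText API documentation.

```java
import java.util.Arrays;
import java.util.Locale;
import java.util.Map;
import java.util.SortedSet;
import java.util.TreeMap;
import java.util.TreeSet;
import java.util.Vector;

public class WordIndexDemo {
    /** Hypothetical stand-in for the two PDFText methods named in the text. */
    interface PagedWords {
        int getPageCount();
        Vector<String> getWords(int pageIndex); // assumption: zero-based page index
    }

    /** Maps each lower-cased word to the sorted set of pages it appears on. */
    static Map<String, SortedSet<Integer>> buildIndex(PagedWords doc) {
        Map<String, SortedSet<Integer>> index = new TreeMap<>();
        for (int page = 0; page < doc.getPageCount(); page++) {
            for (String word : doc.getWords(page)) {
                String key = word.toLowerCase(Locale.ROOT);
                index.computeIfAbsent(key, k -> new TreeSet<>()).add(page);
            }
        }
        return index;
    }

    public static void main(String[] args) {
        // In-memory fake so the sketch runs without the library or a PDF file.
        final String[][] fake = {
            {"Java", "PDF", "text"},
            {"PDF", "library"}
        };
        PagedWords doc = new PagedWords() {
            public int getPageCount() { return fake.length; }
            public Vector<String> getWords(int pageIndex) {
                return new Vector<String>(Arrays.asList(fake[pageIndex]));
            }
        };

        for (Map.Entry<String, SortedSet<Integer>> e : buildIndex(doc).entrySet()) {
            System.out.println(e.getKey() + " -> pages " + e.getValue());
        }
    }
}
```

Lower-casing the keys is a design choice of this sketch so that "PDF" and "pdf" share one index entry; a case-sensitive index would simply skip that step.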

jPDFText - Getting Basic Information about the PDF Document (Title, Author, etc.)

To get basic information about the loaded PDF document, get the DocumentInfo object through PDFText.getDocumentInfo. From this object, you can read information about the document such as title, author, subject and keywords.
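As an illustration of what an application might do with these fields, the sketch below formats them into a short summary. InfoFields is a hypothetical stand-in interface, not the library's DocumentInfo class; its getter names are assumptions derived from the field names listed above. The sketch also assumes that a field can be missing (null or blank), which is common for PDF metadata in general but is not stated in this description.

```java
public class DocInfoDemo {
    /** Hypothetical stand-in for the document information fields named in the text. */
    interface InfoFields {
        String getTitle();
        String getAuthor();
        String getSubject();
        String getKeywords();
    }

    /** Returns the trimmed value, or a placeholder when the field is null or blank. */
    static String orPlaceholder(String value) {
        if (value == null || value.trim().isEmpty()) {
            return "(not set)";
        }
        return value.trim();
    }

    /** Builds a four-line, human-readable summary of the fields. */
    static String summarize(InfoFields info) {
        return "Title: " + orPlaceholder(info.getTitle()) + "\n"
             + "Author: " + orPlaceholder(info.getAuthor()) + "\n"
             + "Subject: " + orPlaceholder(info.getSubject()) + "\n"
             + "Keywords: " + orPlaceholder(info.getKeywords());
    }

    public static void main(String[] args) {
        // In-memory fake so the sketch runs without the library or a PDF file.
        InfoFields info = new InfoFields() {
            public String getTitle() { return "Quarterly Report"; }
            public String getAuthor() { return null; } // simulates a missing field
            public String getSubject() { return "Finance"; }
            public String getKeywords() { return "report, finance"; }
        };
        System.out.println(summarize(info));
    }
}
```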

jPDFText - Distribution and JAR Files

jPDFText is packaged in a single JAR file, jPDFText.jar, which is installed with the evaluation sample. When distributing an application that contains jPDFText, the jPDFText.jar file must be distributed along with it and must be included in the class path when running the application.


Operating System for Deployment: Windows 7, Windows Vista, Windows XP, Sun Solaris 10, Sun Solaris 9, HP-UX 11.x, HP-UX 10.x, IBM AIX 6.x, IBM AIX 5.x, Red Hat Enterprise Linux 5.x, Red Hat Enterprise Linux 4.x, RedHat Linux 9.x, RedHat Linux 8.x, SUSE Linux 11, SUSE Linux 10.x, Ubuntu 10.x, Ubuntu 9.x, Mac OS X

Architecture of Product: 32Bit

Product Type: Component

Component Type: Java Class

Compatible Containers: JBuilder 2008, JBuilder 2006, NetBeans IDE 6.7, Eclipse V3.5, Eclipse V3.4, IntelliJ IDEA 8

