GroupDocs.Parser for Java

GroupDocs.Parser for Java is a text, image and metadata extraction API for building applications that parse raw, structured and formatted text. It can also retrieve metadata from supported file formats, and it extracts text and metadata from password-protected files in all popular formats, including word processing documents, Microsoft Excel spreadsheets, Microsoft PowerPoint presentations, Microsoft OneNote, PDF files and ZIP archives.

Supported File Formats

Text Extraction

  • Text: DOC, DOCX, DOT, DOTX, DOTM, DOCM, RTF, ODT, OTT, OTS, XLA, XLAM, TXT, MD, WordprocessingML (XML)
  • Spreadsheets: XLS, XLSX, CSV, XLSM, XLSB, XLT, XLTX, XLTM, ODS, SpreadsheetML (XML), TSV
  • Presentations: PPT, PPTX, PPTM, PPS, PPSX, PPSM, POT, POTX, POTM, OTP, ODP
  • OneNote: ONE
  • Email: MSG, EML, EMLX, PST, OST, MS Exchange Server, POP, IMAP
  • Electronic Publishing: EPUB, FB2
  • Portable Document: PDF, PDF Portfolio, Encrypted PDF
  • DOM-based: XML, HTML, XHTML...

Latest News

GroupDocs.Parser for Java V23.9
Adds the ability to distinguish inline images from emails.
GroupDocs.Parser for Java V23.2
Adds the ability to load external resources.
GroupDocs.Parser for Java V22.11
Adds support for extracting attachments from presentations, spreadsheets and word processing documents.
GroupDocs.Parser for Java V22.3
Adds support for extracting barcodes from images.
GroupDocs.Parser for Java V21.2
Improves text extraction from word processing documents.
GroupDocs.Parser for Java V20.12
Adds the ability to identify whether a file is password-protected.

Prices from: $ 1,175.02

Developer Small Business License
A Developer Small Business license permits One (1) Developer to create an unlimited number of Derived Works using the Product which can be used at only One...

GroupDocs
As official and authorized distributors, ComponentSource supplies you with legitimate licenses directly from GroupDocs.
Component Type
  • Java Class
