GroupDocs.Text for .NET V16.11.0

NEW - 고급 문서 텍스트 추출 API를 사용하여 다른 파일 형식에서 원시 및 서식이 지정된 텍스트를 추출합니다.
11월 28, 2016
신제품

기능

GroupDocs.Text for .NET is a document text extraction API. It extracts text and metadata from Microsoft Word, Excel, PowerPoint, email messages, container files that contain other files like ZIP archives, plain text files and HTML without any document readers installed. The text extractor API performs operations with accuracy and speed. It also provides tools to detect encoding such as UTF32 LE, UTF32 BE, UTF16 LE , UTF16 BE and more.

  • Advanced Document Text Extraction API Features
    • Extract raw and formatted text.
    • Extract metadata.
    • Extract text from containers containing other files such as zip archives.
    • Extract formatted text from TXT, Markdown and HTML files.
    • Support for encoding detection.
    • Support for media type detectors.
  • Text and Metadata Extractors - GroupDocs.Text provides various metadata and text extractors for different files.
  • Container Text Extractor - Work with files that contain other documents like zip archives.
  • Supported Formats
    • DOCX : OOXML Document.
    • DOCM : OOXML Macro Enabled Document.
    • DOC : Word Document 97-2003.
    • RTF : Rich Text Format.
    • ODT : OpenDocument Text.
    • XLSX : OOXML 2007-2010.
    • XLSM : OOXML Macro Enabled Workbook.
    • XLSB : OOXML Binary Workbook.
    • XLS : Excel Workbook 97-2003.
    • CSV : Comma Separated Values.
    • ODS : OpenDocument Spreadsheet.
    • PPTX : OOXML Presentation.
    • PPSX : OOXML SlideShow.
    • PPSM : OOXML Macros Enabled Presentation.
    • PPT : PowerPoint Presentation 97-2003.
    • PPS : PowerPoint SlideShow 97-2003.
    • ODP : OpenDocument Presentation.
    • TXT : Plain text.
    • HTML (.xhtml, .htm) : Hypertext Markup Language document.
    • MHTML (.mht) : Web Archive Single File.
GroupDocs.Text for .NET

GroupDocs.Parser for .NET

다른 형식에서 원시 및 서식이 지정된 텍스트를 추출합니다.

궁금한 점이 있으세요?

GroupDocs 사 제품 라이선스 담당자와 라이브 채팅