Das neue Document Analyzer SDK extrahiert automatisch Daten aus textbasierten Office-Dokumenten, PDFs und Bildern.
August 28, 2020
Neue Version
Funktionen
New in the LEADTOOLS Recognition Engine
- New Document Analyzer SDK
- The Document Analyzer SDK intelligently and automatically extracts data from text-based Microsoft Office documents (DOC, DOCX, XLS, XLX), PDFs, and document images (JPG, TIFF, PNG PDF) without requiring a structured layout
- Automatically extracts data from images, documents, mixture of images and documents, and from images within documents.
- Flawlessly handles various data formats, including tables, text flows, data across multiple lines.
- Includes predefined rulesets and functions to add custom rulesets to find, collect, and act upon information.
- Performs deep analysis to ensure that information of interest is not missed.
- Customizable actions include collect, report, highlight, and redact.
- New ICR
- New ICR technology for remarkable unstructured handwritten print and cursive text recognition
- Support for hand-printed and cursive handwriting.
- Support for the English character set, including uppercase, lowercase, numerals, punctuation, and symbols.
- Powerful and automatic preprocessing to handle noisy and low-resolution images.
- Spelling dictionaries.
- Process color and bitonal images.
- Output strings or text-based formats such as PDF, DOCX, and XML.
- Comprehensive reports for text results:
- Character size, location, and baseline.
- Positional attributes, including end of word, end of line, and end of paragraph.
- Confidence values.
- Mixed Zone Recognition
- New mixed mode AutoZone capabilities that combine LEADTOOLS OCR, ICR, and Recognition technologies to automatically detect, recognize, and extract everything within an image.
- Automatically extract text from images that contain a mix of machine-printed text, handwritten text, MICR, MRZ, OMR, graphics, and table zones.
- OCR
- Performs deep analysis to ensure that targeted information is not missed in recognition.
- Superscript and subscript support.
- Automatic detection of E13-B and CMC7 MICR zones.
- Major enhancement to OCR speed, recognition, and accuracy.
- Detect text in graphics and image zones.
- Detect MRZ code orientation.
- Better OCR recognition for images taken by mobile.
- Detect slope for each word to produce better output.
- Enhanced mono- and proportional-font recognition.
- LEAD OCR engine startup optimization.
New in LEADTOOLS Document Engine
Document Framework
- Document Viewer
- New Document Viewer Component for Xamarin Forms.
- Added support to load and play any of LEAD's supported video formats in the .NET and Java Document Viewer (Microsoft Windows only).
- Client-side PDF rendering enhancements include:
- Stamp rendering.
- Memory optimization.
- Render digital signatures.
- Switch to service mode based on individual page size or image encoding.
- Speedup for calRGB/grayscale color space rendering.
- Customize text in thumbnails.
- Added support for history (previous/next document).
- Improved UI responsiveness when viewing documents with many pages.
- Added custom UI injection and binding support to the high-level ReactJS document viewer control, LEADVIEW.
- Attachments
- Added support for general purpose and PDF attachments.
- Added support for PDF Embedded files and Portfolio.
- Document Service
- Added support to load a mix of different annotations formats, including IBM P8/Deja annotations.
- Hybrid Caching
- Use rules to mix different cache back ends. For example, smaller items use in-memory cache and large items use a remote cache.
New Document and Image Compare Libraries
- Compares two documents or pages and intelligently highlights changes, additions, and deletions.
- Text compare.
- Bitmap compare.
Document Converter
- Document Converter optimizations to speed up text extraction.
PDF SDK Updates
- PDF Loading and Rendering
- Redesigned PDF loading and rendering to significantly optimize memory usage, load time, and render quality, with smooth zooming and no pixelation.
- Optimized memory usage, load time, and render quality.
- Optimized functions that control PDF metadata to be faster.
- Functionality added to load and render only a portion of a PDF document.
- Rasterization of documents at 2400 DPI.
- Added "Enhance Thin Lines" rendering feature to clarify thin lines in the viewer.
- PDF Portfolio and PDF Attachment
- Load PDF Portfolio documents.
- Load portfolio schema.
- Extract files within PDF Portfolio documents.
- Load PDF document with embedded attachments.
- Extract embedded attachments from PDF documents.
- PDF Redaction Updates
- Added support for image redaction.
- Updated text redaction to support text inside Form XObject.
- Additional PDF Updates
- Load PDF forms.
- Enhanced PDF annotations (create, modify, load and save).
- Enhanced PDF manipulation operations such as PDF merge, insert, and extract pages.
- Enhanced the PDF Optimizer.
- Enhanced PDF/A conversion with focus on font handling.
Updates for Microsoft Office Formats
- Added support to render many chart types, including:
- Radar.
- Area.
- Scattered.
- Line.
- Bar.
- Chart legends.
- Enhanced support for vector graphics rendering in all Microsoft Office formats.
- Enhanced SVG text rendering and extraction.
- Optimize speed and memory usage for all document formats: DOC, DOCX, XLS, XLSX, PPT, and PPTX.
- DOC/DOCX
- Mathematical equation model (OMML) support in DOCX only.
- Optimized load speed and memory usage.
- Added support for equation formula calculations.
- Sections and Drawing-ML shapes.
- Enhanced support for rendering nested tables.
- Enhanced text wrapping around text boxes.
- XLS/XLSX
- Optimized speed and memory usage.
- Faster loading for large sheets.
- Formula evaluator module to handle all logical operations and calculations for conditional formatting.
- Table design support.
New in the LEADTOOLS Multimedia Engine
- Multimedia SDK Updates
- Faster ISO format capture and conversion.
- New live audio processor to detect and fill dropped audio samples.
- Updates to Streaming SDK
- MPEG-2 transport error detection option added to the LEAD MPEG-2 Transport Multiplexer.
- Streaming server can use MPEG-2 Transport files for all protocols.
- Streaming server can use DICOM files as source files.
- Updates to Media Foundation Components
- New Media Foundation target formats:
- 3GP: Video (H264) - Audio (AAC and AMRNB).
- MP3.
- MPEG-2: Video (H264, HEVC) - Audio (AAC, AAC-ADTS, AC3, MP3, and MPEG-2 Audio).
- WAVE: Audio only (MP3, PCM, and FLOAT).
- FLAC.
- FMPEG4: Video (H264) - Audio (AAC, AC3, and ALAC).
- AVI: Video (Uncompressed video colors, M-JPEG) - Audio (PCM).
- MPEG-2 Transport: Video (H264 and HEVC) - Audio (AAC, AAC-ADTS, AC3, MP3, and MPEG-2 Audio).
- ADTS: Audio (AAC and AAC-ADTS).
- AC-3: Audio (Dolby AC-3 Audio).
- Update Media Foundation targets with new audio and video formats
- MP4: Added audio formats (AC3, ALAC, and FLAC).
- MKV: Added new video formats (VP8 and VP9) and new audio format (OPUS).
- Added support for .webm format to LEAD MKV media source.
- Added support to select between available hardware and software encoders.
- Added new decoders, encoders and target format:
- LEAD MCMW Decoder.
- LEAD MJ2K Decoder.
- LEAD MCMP Decoder.
- LEAD MJPEG Decoder.
- LEAD MJPEG Encoder.
- AVI Target format.
New in LEADTOOLS Imaging Engine
- Xamarin Camera
- Added control of preview size and frame rate.
- New Image Processing Functions
- Extract Object - Extracts connected groups of pixels from an image.
- Canny Edge Detector.
- Forms Field Detector.
- Updates to Formats
- Optimized memory usage for J2K format.
- Expanded J2K support to include some files that do not follow the specification.
- Added support to load 8-bit TIFF and PNG files with 8-bit alpha for transparency.
- Added support for ANSI, UTF7, UTF8, UTF16LE, UTF16BE encoded text files.
- Added support to load CGM files saved with text encoding.
- Added support to render Medium Map Overlay for AFP (MODCA) documents.
Additional Changes
- New powerful Xamarin demos that show more features of LEADTOOLS in a Xamarin development environment. These new demos are designed to look like end-user applications, but the source code for them is included so that anyone can get a head start. Below are some of the new demos included in version 21:
- Annotations Demo.
- Barcode Demo.
- Business Card Reader Demo.
- Camera Demo.
- Converter Demo.
- DICOM Demo.
- Document Viewer Demo.
- Image Processing Demo.
- Live Filter Demo.
- MICR Demo.
- OCR Demo.
- Maven repository for Android.
Changes to the LEADTOOLS Product Line in Version 21
- The following products have been renamed:
- LEADTOOLS Document Imaging to LEADTOOLS Document.
- LEADTOOLS Recognition Imaging to LEADTOOLS Recognition.
- LEADTOOLS Document Imaging Suite to LEADTOOLS Document Suite.
- LEADTOOLS Medical Imaging to LEADTOOLS Medical.
- LEADTOOLS Medical Imaging Suite to LEADTOOLS Medical Suite.
- The following products have been added.
- LEADTOOLS Forms - includes all features of the LEADTOOLS Recognition product, including ICR, MICR, MRZ, and OMR, plus forms recognition and processing.
- The following products have been removed.
- LEADTOOLS PACS Imaging - functionality moved into the LEADTOOLS Medical product.
- LEADTOOLS Medical Multimedia Module - functionality moved into the LEADTOOLS Medical product.
- LEADTOOLS Dental Display Module.
- The LEADTOOLS Document Suite and the Medical family of products include all features of LEADTOOLS Multimedia.
LEADTOOLS Document Suite SDK
Umfassende Sammlung von Dokument- und Bildbearbeitungsfunktionen, einschließlich Scanning, OCR, OMR, Formularerkennung, PDF u.a..
Sie haben eine Frage?
Live-Chat mit unseren LEADTOOLS-Lizenzierungs-Spezialisten.
Offizieller Händler seit 1996