Lanzamientos de Aspose.OCR for .NET

Released: May 21, 2025

Actualizaciones en V25.5

Características

  • Performance optimization: Decreased the recognition time for the default setting.

Correcciones

  • Fixed an issue with missed pages during PDF recognition.

Released: Apr 8, 2025

Actualizaciones en V25.4

Características

  • Added support for markdown output format with document layout.
  • Automatic language detection during recognition.
  • Improved DOCX output format.

Released: Mar 26, 2025

Actualizaciones en V25.3

Características

  • Added universal recognition of Arabic, Persian and English alphabets.
  • Added support for the automatic analysis of image content and detection of layout blocks.
  • Improved recognition speed.

Correcciones

  • Fixed an issue with saving recognition results to hOCR.

Released: Feb 27, 2025

Actualizaciones en V25.2

Características

  • Exposed control over ONNX session options for advanced users.
  • Added automatic detection of image language, supporting: English (Latin), Cyrillic, Arabic, Chinese, Japanese, Korean, Hindi, Tamil, Telugu, and Kannada.

Released: Jan 21, 2025

Actualizaciones en V25.1

Características

  • Recognition results can now be saved in hOCR format.
  • Optimized searchable PDFs to fully preserve the original image quality and maintain the file size.
  • Removed deprecated APIs to improve code readability and performance.
  • Changed the default language model to English (without diacritics) when no recognition language is explicitly specified.

Released: Dec 2, 2024

Actualizaciones en V24.12

Características

  • Added a container class for storing recognition results.
  • Added support for recognizing Mongolian text.
  • Added a method to release memory by unloading unneeded OCR modules.
  • Significantly enhanced the performance of saving recognition results to searchable PDFs.
  • Improved the calculation of line height in searchable PDFs.

Released: Nov 21, 2024

Actualizaciones en V24.11.1

Características

  • Added an experimental OCR model for extracting mixed-language Cyrillic/English text.
  • Added support for recognizing mixed-language Telugu/English text.
  • Added support for recognizing mixed-language Tamil/English text.
  • Added support for recognizing mixed-language Kannada/English text.
  • Added universal recognition of Indic text based on Devanagari script, including mixed Devanagari/English text.
  • Added universal recognition of Chinese/English text (language-agnostic).

Released: Nov 6, 2024

Actualizaciones en V24.11

Características

  • Added support for recognizing mixed-language Korean/English text.
  • Added support for recognizing mixed-language Japanese/English text.
  • Enhanced handling of custom fonts in searchable PDFs.
  • Improved searchable PDF generation.
  • Text extraction is now faster and more precise across various document types.

Released: Oct 21, 2024

Actualizaciones en V24.10

Características

  • Added support for recognizing mixed-language Chinese/English text.
  • Significantly improved Chinese text recognition accuracy.
  • Introduced simple and straightforward content structure detection modes.
  • Improved compatibility with TIFF images.

Released: Sep 28, 2024

Actualizaciones en V24.9

Características

  • Added the ability to reduce PDF file size at the expense of lower background image quality.
  • Implemented next-get text-in-wild OCR model with improved recognition accuracy and multi-language support.
  • Improved image orientation detection to prevent certain images from being incorrectly rotated upside-down.
  • Refined content area detection algorithms for TABLE and PHOTO modes.