About Aspose.OCR for C++

Extract text from images directly within your C++ application.

Aspose.OCR for C++ is an optical character recognition (OCR) API. The library supports text fragment detection, which lets you recognize headers and paragraphs on pages that also contain pictures or tables. It reads commonly used image formats, handles fonts in different styles, provides noise removal filters, and can recognize either a whole page or a single line.

Supported File Formats

Images

  • JPEG
  • PNG
  • TIFF
  • BMP

Batch OCR

  • Multi-page PDF
  • ZIP
  • Folder

Recognition results

  • Text
  • PDF
  • Microsoft Word
  • Microsoft Excel
  • RTF
  • JSON
  • XML

Features and Capabilities

  • Photo OCR - Extract text from smartphone photos with scan-level accuracy.
  • Searchable PDF - Convert any scan into a fully searchable and indexable document.
  • URL recognition - Recognize an image from a URL without downloading it locally.
  • Bulk recognition - Read all images from multi-page documents, folders and archives.
  • Any font and style - Identify and recognize text in all popular typefaces and styles.
  • Fine-tune recognition - Adjust every OCR parameter for best recognition results.
  • Spell checker - Improve results by automatically correcting misspelled words.
  • Find text in images - Search for text or a regular expression within a set of images.
  • Compare image texts - Compare the text in two images, regardless of case and layout.
  • Limit recognition scope - Limit the set of characters the OCR engine will look for.
  • Detect image defects - Automatically find potentially problematic areas of an image.
  • Recognize areas - Find and read only specific areas of an image, not all text.
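
To make the "Compare image texts" capability above more concrete, here is a minimal sketch of what a case- and layout-insensitive comparison can look like once text has already been extracted. It does not use the Aspose.OCR API and is not the library's implementation: normalize_text and same_text are hypothetical helper names introduced only for illustration, and the sketch assumes plain ASCII input.

```cpp
#include <cctype>
#include <string>

// Hypothetical helper (not part of Aspose.OCR): lowercases ASCII text and
// collapses every run of whitespace (spaces, tabs, line breaks) into a single
// space, dropping leading and trailing whitespace.
std::string normalize_text(const std::string& text) {
    std::string out;
    bool pending_space = false;
    for (unsigned char ch : text) {
        if (std::isspace(ch)) {
            // Remember the gap, but never emit a space before the first word.
            pending_space = !out.empty();
        } else {
            if (pending_space) {
                out.push_back(' ');
                pending_space = false;
            }
            out.push_back(static_cast<char>(std::tolower(ch)));
        }
    }
    return out;
}

// Hypothetical helper: two recognized texts count as equal when they match
// after normalization, so differences in case and line layout are ignored.
bool same_text(const std::string& a, const std::string& b) {
    return normalize_text(a) == normalize_text(b);
}
```

With these helpers, same_text("Invoice No. 42", "INVOICE   no.\n42") is true, while same_text("Invoice 42", "Invoice 43") is false. Text in other scripts would need Unicode-aware case folding, which this sketch leaves out.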