用 RESTful OCR API 從圖像中讀取文字

10月 11, 2023
PrizmDoc 13.25 提供了一個光學字元辨識 API 外掛程式,其可提供高精度識別,非常適合圖像或文件處理應用程式。

繼續用英語閱讀:

PrizmDoc Viewer is a Web-based HTML5 document viewer that allows developers to embed powerful document viewing and document conversion functionality into their Web applications. It supports a wide range of document formats, including PDF, Word, Excel, PowerPoint, AutoCAD, and more.

PrizmDoc Viewer 13.25 offers an optional RESTful API that lets you read text from images using Optical Character Recognition (OCR). The API is compatible with any programming language, making it easy to integrate into your own applications, no matter which platform you are targeting. It can perform high-accuracy OCR on both full pages and specific areas of interest.

The OCR API can help you quickly and accurately convert any image-based document into an editable text file or searchable PDF. It is compatible with a variety of input formats, including JPEG, TIFF, PNG, and BMP, and can automatically detect page orientation and recognize text in various page layouts. It can also generate image-over-text PDF output, providing searchable text that aligns with the text on the image. This allows you to convert scanned documents or other image-based files into searchable PDFs.

To see a full list of what’s new in version 13.25, see our release notes.

For more information, visit our PrizmDoc Viewer product page.