Extract Text from a PDF Programmatically Using C#

May 20, 2024
Use MESCIUS Document Solutions for PDF to automate text extraction from PDFs for indexing, search, and more.


Document Solutions for PDF (DsPdf) is a high-speed, feature-rich, server-side PDF API library for .NET with no dependencies on Adobe Acrobat. DsPdf allows developers to programmatically create, manipulate, import/export, and deploy PDF documents, including AcroForms, across desktop and web applications at scale. With full .NET support, you can generate, load, modify, and convert PDFs directly within your .NET, Mono, Xamarin.iOS, and Xamarin.Android apps. It also includes a fast JavaScript-based client-side viewer/editor that lets users view and, optionally, edit PDF documents in desktop and web applications.

In this blog post, MESCIUS Product Marketing Specialist Mackenzie Albitz demonstrates how to use DsPdf to unlock PDF content seamlessly by parsing and extracting text from PDFs for a variety of scenarios, including:

  • Extracting all text from a PDF file
  • Extracting text from a specific PDF page
  • Extracting text from predefined bounds in a PDF
  • Extracting fonts from a PDF 
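
As a rough illustration of the first two scenarios, the sketch below loads a PDF and reads its text, first for the whole document and then for a single page. It is a minimal sketch rather than the code from the full blog post: it assumes the DsPdf NuGet package is installed, that its types live in the `GrapeCity.Documents.Pdf` namespace, and that a file named `sample.pdf` (a hypothetical path) exists next to the executable. Check the product documentation for the exact API surface in your version.

```csharp
using System;
using System.IO;
using GrapeCity.Documents.Pdf; // assumed namespace of the DsPdf package

class ExtractTextSketch
{
    static void Main()
    {
        // "sample.pdf" is a placeholder path used only for this illustration.
        using (FileStream fs = File.OpenRead("sample.pdf"))
        {
            var doc = new GcPdfDocument();

            // Load the existing PDF; the stream is kept open while the document is used.
            doc.Load(fs);

            // Scenario 1: text of the entire document.
            string allText = doc.GetText();
            Console.WriteLine(allText);

            // Scenario 2: text of a specific page (here, the first page).
            if (doc.Pages.Count > 0)
            {
                string firstPageText = doc.Pages[0].GetText();
                Console.WriteLine(firstPageText);
            }
        }
    }
}
```

Extracting text from predefined bounds and listing fonts follow the same load-then-query pattern; the full blog post covers those calls.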

Sample code is included, along with links to a Quick Start Demo and a complete sample application to help you get started.

Read the full blog to learn how to unlock PDF content.

Document Solutions for PDF is licensed per developer and is available in several license options for differing distribution needs. Team licenses are also available for multiple developers within the same organization. See our Document Solutions for PDF licensing page for full details.

Learn more on our Document Solutions for PDF product page.