Reading scanned PDF


I have usecase. I want to read scanned pdf and convert into text format. Is there any possibility.

This Topic is created by shahzadlatif using the Email to Topic plugin.



Thank you for your inquiry.

Yes you can perform OCR operation on PDF and read the text using Aspose.OCR. In order to accomplish this requirement, you may use combination of Aspose.OCR and Aspose.Pdf. First, process your PDF file using Aspose.Pdf to get the PDF page as images and then perform OCR on the image files using Aspose.OCR to extract the text. For further details, you may visit our online documentation for .NET. The same functionality can be found for Java version as well.

Performing OCR on PDF Documents


Ikram ul Haq
Aspose – File Format APIs

Got a Complaint? File it!.