PDF to text

Hi Team,


For the last couple of days I have been working with Aspose.OCR for Java, and I am looking for a product that can convert scanned documents from PDF format to text. Is this possible with the Aspose Java libraries? Please help me with this.

Thanks,
Sibi
Hi Sibi,

Thank you for your inquiry.

The Aspose.OCR API accepts only images as input for OCR. To perform OCR on a PDF document, you need two Aspose APIs together: the Aspose.Pdf API to convert the PDF pages to images, and the Aspose.OCR API to perform OCR on the converted images. For details, please see the documentation article "Performing OCR on PDF Documents".
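To illustrate the two-step flow described above, here is a minimal sketch of its shape in Java. It deliberately does not call the Aspose APIs: the class name PdfOcrPipeline, the method extractText, and the two function parameters renderPage and recognizeImage are hypothetical placeholders. In a real project, renderPage would wrap the Aspose.Pdf page-to-image conversion and recognizeImage would wrap the Aspose.OCR recognition call, so please check the linked article for the actual class and method names.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Function;

// Hypothetical helper: shows only the structure of the pipeline,
// not the actual Aspose.Pdf or Aspose.OCR calls.
class PdfOcrPipeline {

    /**
     * Renders each page to an image, runs OCR on each image,
     * and joins the per-page text with newlines.
     *
     * @param pages          the PDF pages, in reading order
     * @param renderPage     assumption: wraps the PDF-to-image step
     * @param recognizeImage assumption: wraps the OCR step
     */
    static <P, I> String extractText(List<P> pages,
                                     Function<P, I> renderPage,
                                     Function<I, String> recognizeImage) {
        List<String> pageTexts = new ArrayList<>();
        for (P page : pages) {
            // Step 1: convert the PDF page to an image.
            I image = renderPage.apply(page);
            // Step 2: perform OCR on the converted image.
            pageTexts.add(recognizeImage.apply(image));
        }
        return String.join("\n", pageTexts);
    }
}
```

Keeping the two steps as separate functions mirrors the split between the two products: the rendering step can be changed (for example, to a different image format or resolution) without touching the OCR step.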

I hope this information helps. Feel free to contact us if you have any further questions or comments.