Problems generating searchable PDF with HOCR input

Hi, i downloaded Aspose PDF for Java 9.7.1 and 10.0.0 to test the changes on the subject with no success.

I’m attaching a snippet of the code i’m using for testing and the PDF i want to make searchable.

To make things simple, i loaded the contents of the HOCR output into a String and used them as a return for the invoke method.

Maybe i’m doing something wrong or the HOCR i’m generating with Tesseract is not compatible.

Thanks for your attention

Hi Carlos,

Thanks for your feedback. I have tested your sample code in which you have loaded OCR contents to invoke callback method. It is not generating a searchable PDF, so shared the code with product team in related issue PDFNEWJAVA-34536 for further investigation.

However I have tried your PDF file using code shared in documentation link and able to generate searchable PDF without any issue. Please note I have used [tesseract-ocr-setup-3.02.02.exe](http://prntscr.com/6hdhju) from tesseract download page. Please give this a try and share the results.

We are sorry for the inconvenience caused.

Best Regards,

The issues you have found earlier (filed as ) have been fixed in this update. This message was posted using BugNotificationTool from Downloads module by MuzammilKhan