We are trying to convert from PDF (Converted from image) to Searchable PDF. But it is not properly recognizing text.
Output-1.pdf (432.6 KB)
Before_OCR.pdf (121.1 KB)
aspose version : 23.3.0
We are trying to convert from PDF (Converted from image) to Searchable PDF. But it is not properly recognizing text.
Output-1.pdf (432.6 KB)
Before_OCR.pdf (121.1 KB)
aspose version : 23.3.0
Can you please share the sample code snippet as well that you used to generate searchable PDF? We will test the scenario in our environment and address it accordingly.
As per your official document we are using same code,
// Scanned multipage PDF Path
String fullPath = "multi_page.pdf";
// Initialize AsposeOcr class object
AsposeOCRPdf api = new AsposeOCRPdf();
DocumentRecognitionSettings settings = new DocumentRecognitionSettings(0,1);
// Recognize images from PDF
ArrayList<RecognitionResult> res = api.RecognizePdf(fullPath, settings);
// Save result as Searchable PDF
AsposeOCR.SaveMultipageDocument("output.pdf", Format.Pdf, res);
Reference Link : Convert Scanned PDF to Searchable PDF with OCR in Java
Questions :
Please let me know if you need more details.
The input PDF document has a low quality image and the image is also divided into multiple parts. Which is why API is facing issues while recognizing the text. Nevertheless, an investigation ticket as OCRJAVA-316 has been logged in our issue tracking system for the sake of further investigation. We will look into its details and keep you posted with the status of its correction. Please be patient and spare us some time.
PS: Yes, you can use Aspose.Total license with Aspose.OCR.
The issues are resolved on first come first serve basis in free support model. As soon as we are done with the issue investigation, we will share updates with you about ticket fix or its resolution ETA. Please spare us some time.
We apologize for your inconvenience.