@Anil1995
The PDF file can have many formats and they can have entirely different structures. For a PDF file, that contains multiple images inside it along with textual content and you need to extract text from the images only - OR if a PDF has a single image on one page, you can use the code snippet that involves ImagePlacementAbsorber usage.
In another case, where you want to extract the text of a complete page but the content on a page is a mixture of images and text, you need to convert the whole page into a single image and then perform OCR operation on it.
The PDF file which you have shared with us represents the second case and we used the below code snippet (it was shared in our previous response as well) to extract text from it. We did not notice any issue. The API was able to extract complete text from the converted images:
string file = @"MP09.pdf";
Document pdfDocument = new Document(file);
foreach(Page page in pdfDocument.Pages)
{
Aspose.Pdf.Devices.PngDevice pngDevice = new Aspose.Pdf.Devices.PngDevice(new Aspose.Pdf.Devices.Resolution(300));
pngDevice.Process(page, "output" + page.Number + ".jpg");
AsposeOcr libOcr = new AsposeOcr();
string slResult = "";
slResult = libOcr.RecognizeImage("output" + page.Number + ".jpg");
Console.WriteLine(slResult);
}
Please try using this code and let us know about the issues if you face some.