Unable to extract the text from PDF file

Hi Team,

We are trying to extract the pdf content from the pdf file. But, while extraction aspose failed to extract full content of pdf.

Please find the code snippet.
Sample code we are using.
TextAbsorber textAbsorber = new TextAbsorber(new TextExtractionOptions(TextExtractionOptions.TextFormattingMode.Raw));
textAbsorber.visit(pdfDocument);
String pdfText = textAbsorber.getText();

Also, please find pdf file. Please find the screenshot in which we have highlighted the text which was not extracted by aspose utility.
Appreciate for help.

Thanks
Issue PDF extraction.zip (2.0 MB)

@forasposeissues
I’ll investigate the issue and write you as soon as possible

@forasposeissues
It seems that text in the beginning isn’t text content but rather image
When opening Adobe Acrobat tries to convert it to text but when opened in browser you can’t even copy any content from mentioned parts
Nevertheless, I’ll add a task for development team to investigate if there any ways to extract text in such cases

@forasposeissues
We have opened the following new ticket(s) in our internal issue tracking system and will deliver their fixes according to the terms mentioned in Free Support Policies.

Issue ID(s): PDFNET-57487

You can obtain Paid Support Services if you need support on a priority basis, along with the direct access to our Paid Support management team.

Hi Team,

Any update on this issue?

@forasposeissues
I checked the issue, it seems that there wasn’t any updates on it