We're sorry Aspose doesn't work properply without JavaScript enabled.

Free Support Forum - aspose.com

Search/Extract text from PDF in Java using Aspose.PDF | Strange words extracted from document

On last page of attached document, when address paragraph was extracted (using ParagraphAbsorber), the following strange text was retuned to us:

1001 West Loop South, Suite 215 / / / 3/M/0 S
HoustonHouston., Texa Texass 7702 770277 ^ / /

What is the reason for this?
5950.pdf (1.6 MB)


I have been able to reproduce the issue on our end. A ticket with ID PDFJAVA-40615 has been created in our issue tracking system to further investigate the issue on our end. This thread has been linked with the issue so that you may be notified once the issue will be fixed.


We have investigated the earlier logged ticket and found that this is not a bug as this text is actually present on the page above the image. You can verify it by copy the text in this area and paste it into some document. It looks like that some OCR was used before that has not recognized all text correctly (3 / M / 0S instead of 3/11/05).