Excuse me.
I found a problem when using Aspose to parse a PDF.
After opening the file, the text is positioned incorrectly and the order of the text is scrambled. Details are as follows.
The x and y coordinates of the text exceed the width and height of the current page. For example, the string 2
Some content is in reverse order, as marked below.
I will paste the code below.
I hope you can look into this when you have time. Thanks.
Could you please share some more details about your requirements and use case? We will then investigate the issue and provide you with more information. Please also share your expected output. Thanks for your cooperation.
The code in the attachment uses Aspose to read the contents of the PDF file and draws the extracted text into a BMP file.
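When extracted text positions are drawn into a bitmap, the coordinates usually need converting first: PDF user space is measured in points (1/72 inch) with the origin at the bottom-left by default, while bitmap rows count down from the top-left. Below is a minimal sketch of that conversion, assuming an unrotated page whose lower-left corner is at (0, 0). The class and method names are hypothetical and are not part of Aspose.

```java
// Hypothetical helper, not part of Aspose: maps PDF points to bitmap pixels.
final class PointToPixel {
    private final double pageHeightPt; // page height in points, e.g. 842 for A4
    private final double scale;        // pixels per point, i.e. dpi / 72

    PointToPixel(double pageHeightPt, double dpi) {
        this.pageHeightPt = pageHeightPt;
        this.scale = dpi / 72.0;
    }

    // x grows to the right in both systems, so only scaling is needed.
    int toPixelX(double xPt) {
        return (int) Math.round(xPt * scale);
    }

    // y grows upward in PDF but downward in a bitmap, so flip before scaling.
    int toPixelY(double yPt) {
        return (int) Math.round((pageHeightPt - yPt) * scale);
    }

    public static void main(String[] args) {
        PointToPixel map = new PointToPixel(842, 72); // A4 at 1 pixel per point
        System.out.println(map.toPixelX(100) + ", " + map.toPixelY(800));
    }
}
```

A check like this only rules out a unit or axis mismatch on the drawing side; it says nothing about whether the coordinates returned by the library are themselves correct.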
According to the results, the page width and height returned by Aspose (getPageInfo().getWidth() and getPageInfo().getHeight()) are 595 x 842, yet the positions of the body text fall far outside this range. Besides this mismatch between the text positions and the page size, the order of the text is also scrambled. For example, the Chinese date string “220年11月18日” (meaning November 18, 2022; it is OCR output) should be at the end of the text, but according to the text positions reported by Aspose it appears at the top.
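Both symptoms described above can be checked mechanically once the extracted positions are in hand. The following is a minimal sketch in plain Java. TextBox is a hypothetical stand-in for whatever the extractor returns (it is not an Aspose type), coordinates are assumed to be in points with the origin at the bottom-left, and the sample values are invented for illustration.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Hypothetical stand-in for one extracted text fragment; not an Aspose type.
final class TextBox {
    final String text;
    final double x; // left edge in points
    final double y; // baseline in points, measured up from the page bottom

    TextBox(String text, double x, double y) {
        this.text = text;
        this.x = x;
        this.y = y;
    }
}

final class LayoutCheck {
    // Returns the fragments whose origin lies outside a page of the given size.
    static List<TextBox> outsidePage(List<TextBox> boxes, double width, double height) {
        List<TextBox> result = new ArrayList<>();
        for (TextBox b : boxes) {
            if (b.x < 0 || b.x > width || b.y < 0 || b.y > height) {
                result.add(b);
            }
        }
        return result;
    }

    // Returns a copy sorted top-to-bottom, then left-to-right.
    static List<TextBox> readingOrder(List<TextBox> boxes) {
        List<TextBox> sorted = new ArrayList<>(boxes);
        sorted.sort(Comparator.comparingDouble((TextBox b) -> -b.y)
                .thenComparingDouble(b -> b.x));
        return sorted;
    }

    public static void main(String[] args) {
        // Invented sample values, for illustration only.
        List<TextBox> boxes = List.of(
                new TextBox("date", 400, 60),
                new TextBox("title", 72, 780),
                new TextBox("stray", 1200, 2300));
        for (TextBox b : outsidePage(boxes, 595, 842)) {
            System.out.println("outside page: " + b.text);
        }
        for (TextBox b : readingOrder(boxes)) {
            System.out.println(b.text);
        }
    }
}
```

Sorting by descending y only approximates reading order on simple single-column pages; fragments on the same line can differ slightly in baseline, so a tolerance may be needed in practice.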
We can use Python packages, JavaScript libraries (such as PDF.js), or desktop software (such as Adobe PDF reader) to read the text in the PDF normally and locate the text correctly, so the PDF file itself should be fine. Please help us confirm whether this is a problem in Aspose itself or in how we use it. Thank you.
We have logged a ticket for your case in our issue tracking system as PDFJAVA-41239. We will inform you via this forum thread once there is an update available on it.