Incorrect paragraphs extracted by Aspose

I want to extract one specific paragraph from a PDF, but Aspose returns two paragraphs, each containing only some of the lines of the original paragraph.

Please see the attached example PDF and code for details.

AsposeExample.zip (195.7 KB)

example.pdf (122.9 KB)

Thank you!

@davidknn
Please provide the code you used.

Of course, you can find it in the attached AsposeExample.zip.

@davidknn
Thank you for the data provided - I will study the issue and write to you tomorrow.

@davidknn
I looked through the data provided - thanks for the detailed and clear description of the issue. The penultimate paragraph in the document is actually divided into two (No. 1 and 2 in the screenshot).
I assume that the word "CPU" at the beginning of a line, set in a different font/character set, was treated as the start of a new paragraph. I'll create a task for the development team.
screen.png (168.9 KB)
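
To make the assumption above concrete, here is a toy sketch in Python of how a "font change at the start of a line" rule could split one paragraph into two. This is not Aspose.PDF's actual algorithm, which is not shown in this thread. The names `Line`, `first_font`, `last_font` and `split_paragraphs`, the font labels and the sample text are all hypothetical and exist only for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Line:
    """One extracted text line (hypothetical model, not an Aspose type)."""
    text: str
    first_font: str  # font of the first word on the line
    last_font: str   # font of the last word on the line


def split_paragraphs(lines: List[Line], break_on_font_change: bool) -> List[str]:
    """Group lines into paragraphs.

    A blank line always ends a paragraph. If break_on_font_change is True,
    a line whose first word uses a different font than the end of the
    previous line is also treated as the start of a new paragraph.
    """
    paragraphs: List[List[Line]] = []
    current: List[Line] = []
    for line in lines:
        if not line.text.strip():
            # Blank line: close the paragraph collected so far.
            if current:
                paragraphs.append(current)
                current = []
            continue
        font_changed = bool(current) and line.first_font != current[-1].last_font
        if break_on_font_change and font_changed:
            # The suspected faulty rule: a font change alone starts a paragraph.
            paragraphs.append(current)
            current = []
        current.append(line)
    if current:
        paragraphs.append(current)
    return [" ".join(l.text for l in p) for p in paragraphs]


# Made-up sample: the second line begins with "CPU" in a different font.
sample = [
    Line("The scheduler assigns each task to a", "Body", "Body"),
    Line("CPU core based on the current load,", "Mono", "Body"),
    Line("then rebalances periodically.", "Body", "Body"),
]

with_rule = split_paragraphs(sample, break_on_font_change=True)
without_rule = split_paragraphs(sample, break_on_font_change=False)
```

With the font rule enabled, the sample comes back as two fragments, the second starting at "CPU". With the rule disabled, it comes back as a single paragraph.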

@davidknn
We have opened the following new ticket(s) in our internal issue tracking system and will deliver their fixes according to the terms mentioned in Free Support Policies.

Issue ID(s): PDFNET-56299

You can obtain Paid Support Services if you need support on a priority basis, along with the direct access to our Paid Support management team.

The issues you have found earlier (filed as PDFNET-56299) have been fixed in Aspose.PDF for .NET 24.4.