Hello,
I’m currently extracting the text from this sample.pdf (13.9 KB) with ParagraphAbsorber page by page like this:
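// Requires: using Aspose.Pdf; using Aspose.Pdf.Text;
Document pdfDocument = new(file);
ParagraphAbsorber paragraphAbsorber = new();
foreach (var page in pdfDocument.Pages)
{
    // Absorb the paragraphs of one page at a time.
    paragraphAbsorber.Visit(page);
}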
The problem I’m running into is that the last paragraph of page 1 overflows onto page 2. I know that the MarkupParagraph class has ContinuationPageNumbers and SecondaryPoints properties, which seem relevant to this issue. However, when I run this against my sample PDF file, both properties are null on the last MarkupParagraph of page 1. I’m not sure how to handle this overflow while using ParagraphAbsorber page by page.
Thank you for any help or advice!