We have found that the Aspose PDF TextAbsorber object does a remarkable job of extracting text from a page of a PDF and forming it into a string that is broken into lines with carriage return/line feeds (using the Pure option). This is a very difficult undertaking and the product is performing very well. The ability to have text broken into readable lines is extremely compelling and not something that a lot of other products can do (and certainly not without expensive server runtime licenses). Thank you!
The reason I ask is we would like to find the full character positions for each character that make up a line of text.
Thanks for considering this. We can piece the line text back together. But if we knew for each text block which line you considered it to be a part of that would be huge. Or else some kind of way to track the text components that were used to make up the line down so that we could find out their position information.