Hello there,
I have the next question. Some PDFs contain no text layers, only images and graphical glyphs. Other PDFs contain a text layer, which can be selected, copy-pasted, and also a search can be performed. I know, that Aspose.Pdf allows to detect and extract such text content with the "TextFragmentAbsorber" class. However, for very big PDFs this process takes a lot of time - you need to iterate over every page of a document in a cycle and apply a "TextFragmentAbsorber" instance for a single page, trying to grab something textual.
So I'm wondering, is there a way to perform a quick check in order to find out, does the particular PDF file contain a textual content at all or not? In case this would be possible, it would be a great opportunity to eliminate the page-by-page scanning with "TextFragmentAbsorber", when the PDF doesn't have text content at all.
Thanks in advance.
With best regards, Denis Gvardionov