I am using the HasNextImage() method of the PdfExtractor class, and our process appears to stop when multiple images are detected. We believe we could handle this more gracefully if we could determine which page of the PDF the problem occurred on. Some of the PDFs we process are blurry or dirty scans because of problems with the Xerox machines that produce them; this is to be expected, as some units are due for repair or replacement. If we can work around this, we would be able to move the process on without our application timing out. Is there any way to return the page number that the PdfExtractor is currently processing?
Also, is there a way to get the collection of images that is created when calling ExtractImage()? Is this collection publicly accessible?