How to determine if a page is a scan

Is there a way to find out whether a page contains one image only and that image covers most (or all) of the MediaBox of a page?

@yehuda.alon,

Thanks for contacting support.

You can get images count from PDF file and determine if page contains one image or multiple. Furthermore, as per your requirements, you may follow the instructions to Get the Resolution and Dimensions of Embedded Images.

What I need is the Image boundary (location on page), not its properties.

@yehuda.alon

Thanks for writing back.

In case if you want to determine location/coordinates of the images on page, please check “Working with Image Placemnet” article in our API documentation. ImagePlacement provides the member Rectangle that return actual image placement values. In case if you face any issue, please share your sample PDF with us, so that we can test the scenario in our environment and address it accordingly.