@choiys
We are sorry if any of our previous replies caused any confusion here. Please note that Aspose.PDF is specialized to deal with only PDF documents and their conversion into other file formats. Recognizing text from an image or shape is out of the scope of the API.
Furthermore, Aspose.PDF offers a feature to convert scanned PDF documents into searchable PDF document where it uses third party OCR.
Your provided PDF has mixed content i.e. text and images. Whereas, images in your PDF are drawn graphics actually. So we logged an investigation ticket in order to determine whether there is any workaround to extract text from it or not. Apparently, it could be achieved by converting the whole page to image and then perform OCR on that image because at the moment drawn objects/shapes from the PDF cannot be extracted. Another dedicated ticket to extract drawn shapes is already logged in our issue tracking system as PDFNET-51913.
Therefore, we are afraid that your requirements cannot be achieved using Aspose.PDF only. You can however, convert PDF Pages to Images and then perform OCR on them using Aspose.OCR. In case you have further concerns, please feel free to share.