Thanks for your patience.
We have investigated the reported issue further. The source PDF file looks like an image because it does in fact contain only an image, with invisible text on top of it.
The document is the output of an OCR tool: the image is placed on the PDF page as it is, and invisible text is added over the image to make the recognized text accessible.
The fonts are invisible and have no graphical representation. There is also no font face information.
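As a rough illustration of what "invisible text" can mean at the PDF level: one common way to hide an OCR text layer is text rendering mode 3 (the `3 Tr` operator, which neither fills nor strokes glyphs). Whether this particular file uses that mode or an invisible font instead is an assumption, not something confirmed above. The Python sketch below is not part of our API; the function names and the sample stream are hypothetical, and it scans an already decoded content stream naively, so operators inside string literals would also be matched.

```python
import re

# Matches the PDF text rendering mode operator, e.g. "3 Tr".
# Naive: it does not skip string literals or inline images.
_TR_PATTERN = re.compile(r"(?<![\w.])(\d+)\s+Tr\b")


def text_render_modes(content_stream: str) -> list[int]:
    """Return every text rendering mode set in the stream, in order."""
    return [int(m.group(1)) for m in _TR_PATTERN.finditer(content_stream)]


def has_invisible_text(content_stream: str) -> bool:
    """True if the stream sets rendering mode 3 (no fill, no stroke)."""
    return 3 in text_render_modes(content_stream)


# Hypothetical page content: an image is drawn, then text is placed
# over it in rendering mode 3, as an OCR tool might do.
sample = (
    "q 612 0 0 792 0 0 cm /Im0 Do Q "
    "BT 3 Tr /F1 10 Tf 72 700 Td (Hello) Tj ET"
)

if __name__ == "__main__":
    print(has_invisible_text(sample))
```

A page for which this check succeeds would display only the image, while the text remains selectable and searchable.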
We can implement an enhancement that converts invisible fonts into visible fonts so that PDF to DOC conversion can be performed, but it will have the following limitations, which are inherited from the OCR tool:
- There will be no font face information: only the CourierNew font will be used
- There will be no font style information: italic text will look like regular text
- The text will have different sizes even where it looks the same size in the image
Please see the attached 36333_analisys.png image for an illustration of these limitations. The enhancement is possible, but only with the limitations stated above, and the current ETA is version 9.4.0 (the early July release).
Furthermore, if you have any other examples of OCR documents, we strongly recommend sharing those files, as they will help us implement this feature more appropriately.