Please see the attached PDF and the resulting HTML file from a PDF-to-HTML conversion with Aspose.Pdf version 10.2.0. There appears to be a lot of misplaced or missing text.
Sorry about that, but it appears that the issue is with PDF-to-DOCX conversion, not HTML. Can you verify?
I have tested the scenario and am able to reproduce the problem: the contents from page 3 onward are missing or replaced with strange characters. I have logged it in our issue tracking system as PDFNEWJAVA-334809. We will investigate this issue in detail and keep you updated on the status of a fix.
We apologize for the inconvenience.
The issues you have found earlier (filed as PDFJAVA-34809) have been fixed in Aspose.PDF for Java 21.5.