PDF to HTML conversion issues (Issue 1 and Issue 2)

HI. I have attached two sets of inputs (pdf files) and output files(HTML Files) and I have the below issues.



1) for “sample1.pdf” input “sample1.html” is generated. The issue I want you to focus on is the special characters in the image heading. it is not properly generated.



2)for “sample2.pdf” input “sample2.html” is generated. The issue I want you to focus on is that the generated HTML is larger than the actual input which should not be the case.



I am attaching the sample java code as well. Please look into the issue and let me know if we can address the above two issues. Thank you

Hi Aravind,


Thanks for contacting support.

I have tested the scenarios using latest release of Aspose.Pdf for Java 11.4.0 and I am unable to notice any issue while converting Sample1.pdf file to HTML format. Furthermore, I am also unable to notice any issue during Sample2.pdf to HTML conversion. For your reference, I have attached the output files generated over my end. Please take a look and share your findings.