Pdf to Html issues

Hi,
I hope you can help me.

I have some issues converting PDF to HTML files.

You can see the files I used in the attachments.

I converted Docx.docx to PDF (DOCX_to_PDF.pdf) with Aspose.Word (17.1.0.0).

After that I converted the DOCX_to_PDF to HTML (PDF_to_HTML.html) with Aspose.Pdf (17.2.0.0).

Here there was some problems.
The conversion lost few css style-inline such as "text-decoration:underline" and "vertical-align:sub".

The code I used to convert PDF to HTML is simply:
Document document = new Document(dataDir + "DOCX_to_PDF.pdf");
document.Save(dataDir + "PDF_to_HTML.html", SaveFormat.Html);

I also tryed many options of "HtmlSaveOptions" class but the result still remains the same.

Is there any way to obtain a file like "PDF_to_HTML_GOAL.html"?


Kind regards,

Andrea



Hi Andrea,


Thanks for contacting support.

I tried to convert input file which you have shared into HTML and was able to notice that the formatting of the generated output was not correct. Therefore I have logged the issue as PDFNET-42237 in our issue tracking system. We will further investigate the issue and keep you updated on the status. Please spare us a little time. We are sorry for the inconvenience.

Best Regards,