Some of my CSS is applied, but much of what the documentation lists as supported is not. Another issue is that only h1 tags are recognized, not h2 through h6. Is there something I am missing here, or is this simply not supported in the conversion?
Thanks for your inquiry. I have tried to convert a sample PDF file into HTML and noticed that the headings in the PDF document were not rendered as conventional HTML heading tags. For the purpose of initial investigation, I have logged an issue as PDFNET-42476 in our issue tracking system, along with my sample file(s). We will look into this further and keep you updated on the status within this thread. Please bear with us while we investigate. For your reference, I used the following code snippet:
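// Load the source PDF (dataDir is the folder that holds the sample files)
Document doc = new Document(dataDir + "Sample_HTML_PDF.pdf");
// Save the document as HTML using default HtmlSaveOptions
HtmlSaveOptions options = new HtmlSaveOptions();
doc.Save(dataDir + "Sample_HTML.html", options);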
However, we would really appreciate it if you could share a sample input document and your code snippet so that we can investigate the issue with a better understanding. We are sorry for the inconvenience.