Hi Support Team,
I am trying to generate HTML from a PDF File. The generated html looks good except for the umlauts in the text which are not displayed. I have attached the PDF which i am using and also the generated output. I am using MAC OS and the code mentioned below for generating the output.
Document doc = new Document(ConvertPDFtoXLSX.class.getClassLoader().getResourceAsStream(“test.pdf”));
// Instantiate HTML Save options object
HtmlSaveOptions newOptions = new HtmlSaveOptions();
// Enable option to embed all resources inside the HTML
newOptions.PartsEmbeddingMode = HtmlSaveOptions.PartsEmbeddingModes.EmbedCssOnly;
// This is just optimization for IE and can be omitted
newOptions.LettersPositioningMethod = LettersPositioningMethods.UseEmUnitsAndCompensationOfRoundingErrorsInCss;
newOptions.RasterImagesSavingMode = HtmlSaveOptions.RasterImagesSavingModes.AsExternalPngFilesReferencedViaSvg;
//newOptions.FontSavingMode = HtmlSaveOptions.FontSavingModes.SaveInAllFormats;
// Output file path
String outHtmlFile = "Single_output.html";
// Save the output file
Output can be accessed from the following location: Dropbox - File Deleted - Simplify your life
test.pdf (44.5 KB)
Thanks for sharing further details.
We have logged an investigation ticket as PDFJAVA-39249 in our issue tracking system. We will further look into details of it and keep you posted with the status of its resolution. Please be patient and spare us some time.
We are sorry for the inconvenience.
Any updated on the ticket?
This issue is very important for us as the generated html is not at all readable because of the missing characters.
Please let me know once you have had time to look at the issue.
I like to inform this issue has been added recently in our issue tracking system and as per our company policy, the first priority for investigation is given to the Paid Support i.e. Enterprise and Priority Support on first come first serve basis. After that the issues from normal support forum are scheduled for investigation on first come first serve basis. I request for your patience and we will share good news with you soon.
If we buy the paid support how long will we have to wait then before all the issues which we have reported to be investigated ?
I like to inform that paid support issues are treated on priority basis as compare to free support issues and are resolved as soon as possible.
The issues you have found earlier (filed as PDFJAVA-39249) have been fixed in Aspose.PDF for Java 20.9.