Hi there
I am using Aspose PDF 11.7.0 for converting PDF files to HTML files.
There is a problem that a segment of text appear twice.
One is in the result html, and the other is in the background image.
Here is my code to test:
Document pdf = new Document(“custom/input/pdf/研究者のみなさまへ.pdf”);
HtmlSaveOptions htmlSaveOps = new HtmlSaveOptions();
htmlSaveOps.RasterImagesSavingMode = HtmlSaveOptions.RasterImagesSavingModes.AsEmbeddedPartsOfPngPageBackground;
htmlSaveOps.FontSavingMode = HtmlSaveOptions.FontSavingModes.AlwaysSaveAsWOFF;
htmlSaveOps.PartsEmbeddingMode = HtmlSaveOptions.PartsEmbeddingModes.EmbedAllIntoHtml;
htmlSaveOps.LettersPositioningMethod = LettersPositioningMethods.UseEmUnitsAndCompensationOfRoundingErrorsInCss;
htmlSaveOps.setSplitIntoPages(false);
for(int p = 1; p<=pdf.getPages().size();p++){
Document pageDoc = new Document();
pageDoc.getPages().add(pdf.getPages().get_Item§);
pageDoc.save(“custom/output/pdf/研究者のみなさまへ.”+p+".html", htmlSaveOps);
}
Please check this and file that cause this problem, thank you
P.S. This happens in page 14.