Problems with accents from docx to HTML

Hi,

Our company is considering purchasing Aspose for our future development work. We are currently testing batch conversion from docx to HTML.

Intermittently, and with no obvious pattern, accented characters come out corrupted.

This is an extract from the Word file: Java est utilisé dans une grande variété
The corresponding generated HTML is: Java est utilis?? dans une grande vari??t??

The same docx file is sometimes converted correctly and sometimes not.

We are using Aspose.Words 17.10 and the following code:

Document doc = …
ByteArrayOutputStream baos = new ByteArrayOutputStream();
HtmlSaveOptions options = new HtmlSaveOptions(SaveFormat.HTML);
options.setExportImagesAsBase64(true);
doc.save(baos, options);

JDK 1.7.0_55, 64-bit
CentOS, 64-bit

Here are my files : files.zip (6.5 KB)

Thanks

@ludovic_certi

Thanks for your inquiry. We tested the scenario on 64-bit CentOS with Aspose.Words for Java 17.10 in a loop, but we were unable to reproduce the reported issue. Please share some more details about the issue, along with your complete code. We will then try to replicate the issue on our end and guide you accordingly.
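
One thing that may be worth checking in the meantime. This is only a guess, because the posted snippet does not show how the bytes in baos are turned into text afterwards. If they are decoded with the platform default charset (for example via baos.toString() with no arguments), the result depends on the locale of the process, which can differ between an interactive shell and a batch job. The two "?" per accented letter in the output would be consistent with UTF-8 bytes decoded as a single-byte charset. The sketch below uses only the JDK; fakeSave is a hypothetical stand-in for doc.save(baos, options), and it assumes the HTML is written as UTF-8.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;

public class CharsetDecodeDemo {

    // Hypothetical stand-in for doc.save(baos, options):
    // writes the HTML text into the stream as UTF-8 bytes (assumption).
    static ByteArrayOutputStream fakeSave(String html) {
        ByteArrayOutputStream baos = new ByteArrayOutputStream();
        byte[] bytes = html.getBytes(StandardCharsets.UTF_8);
        baos.write(bytes, 0, bytes.length);
        return baos;
    }

    // Decoding with the same charset that was used for writing round-trips the text.
    static String decodeUtf8(ByteArrayOutputStream baos) {
        return new String(baos.toByteArray(), StandardCharsets.UTF_8);
    }

    // Decoding as US-ASCII mimics a process whose default charset cannot
    // represent the bytes: each non-ASCII byte becomes a replacement character.
    static String decodeAscii(ByteArrayOutputStream baos) {
        return new String(baos.toByteArray(), StandardCharsets.US_ASCII);
    }

    public static void main(String[] args) {
        // \u00e9 is "e" with an acute accent; escapes keep this source file ASCII-only.
        String original = "Java est utilis\u00e9 dans une grande vari\u00e9t\u00e9";
        ByteArrayOutputStream baos = fakeSave(original);

        System.out.println("UTF-8 decode matches original: "
                + decodeUtf8(baos).equals(original));
        System.out.println("US-ASCII decode matches original: "
                + decodeAscii(baos).equals(original));
    }
}
```

If this turns out to be the cause, decoding with an explicit charset, as in decodeUtf8 above, or starting the JVM with -Dfile.encoding=UTF-8 should make the result independent of the environment. If the bytes are never converted to a String in your code, this does not apply and the complete code will help us look further.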