Docx converted from Html is corrupted using the later version of Aspose.Words

Hi,

When converting an htm file to a docx file, the output is corrupted with Aspose.Words 18.4.0 onwards, this didn’t happen with 18.3.0 or earlier.

Code:

 string m_outputFilePath = @"C:\out.docx";
 string m_sourceDocument = @"C:\test.htm";
 string dir = @"C:\";
 Document convertedDocument = new Document(m_sourceDocument, new HtmlLoadOptions(LoadFormat.Html, "", dir));
 convertedDocument.Save(m_outputFilePath);

I’ve attached the test files and output from both 18.3.0 and 18.4.0 where the out_18.3.0.docx is not corrupted while out_18.4.0.docx is corrupted.

Could you please check if there’s any regression?

Thanks
test documents.zip (107.8 KB)

@ServerSide527,

Thanks for your inquiry. We have tested the scenario and have managed to reproduce the same issue at our side. For the sake of correction, we have logged this problem in our issue tracking system as WORDSNET-17219. You will be notified via this forum thread once this issue is resolved.

We apologize for your inconvenience.

The issues you have found earlier (filed as WORDSNET-17219) have been fixed in this Aspose.Words for .NET 18.10 update and this Aspose.Words for Java 18.10 update.

The issues you have found earlier (filed as ) have been fixed in this Aspose.Words for .NET 18.12 update and this Aspose.Words for Java 18.12 update.