Hi,
I hope you can help me.
I have some issues converting DOCX to HTML files.
You can see the files I used in the attachments.
I converted "Test_File.docx" to "Test_File.html" with Aspose.Word (17.1.0.0).
The html file contains too many tags that break the sentences.
For instance:
Docx file contains:
Normal text, Normal text, Normal text....
Html file contains:
Normal Text,
Normal Text,
Normal Text,
I think the result is too verbose. What I expect is something like that:
Normal Text, Normal Text, Normal Text,
The code I used to convert DOCX to HTML is simply:
Document document = new Document(dataDir + "Test_File.docx");
document.Save(dataDir + "Test_File.html", SaveFormat.Html);
I also tryed many options of "HtmlSaveOptions" class but the result still remains almost the same.
Is there any way to obtain a file like "Test_File_GOAL.html"?
I need to have html files cleaned for future custom edits.
Kind regards,
Andrea