是否可以将docx中的html内容转换为标准的xml格式

例如附件中的这种文档
工作量.docx (5.6 KB)

@JIYO_SANG 您可以使用以下代码将 DOCX 转换为 HTML:

Document doc = new Document(@"C:\Temp\in.docx");
doc.Save(@"C:\Temp\out.html");

To be precise, it is not about converting HTML to DOCX or converting DOCX to HTML. Instead, it is about converting a DOCX that is presented in HTML format into a DOCX that is presented in XML format. That is to say, it is a conversion from one DOCX to another DOCX.

@JIYO_SANG Unfortunately, it is still not clear what is your expected output. Could you please elaborate and provide you sample input and expected output.

problem solved. Thank you anyway!

1 Like