As per our requirement we need to compare content of two document after convert .docx to html.
we have two word document A.docx and B.docx, where B.docx is copy of A.docx. In document B.docx some content is changed by Author in some paragraphs or tables etc.
When we convert A.docx to A.html and B.docx to B.html and compare content of document based on elements p, span, table,h1 etc.
In some of the location number of tags are mismatch even I didn’t do any changes on that particular paragraphs.
Due to this “span” tags our comparison logic give us wrong result.
For more detail please find attached screen shot,java code and docx file (Refer Page 14 of document to check span tags).
Could you please suggest us why number on span tag not comes equal in both html file.
For Aspose team.zip (1.7 MB)