<?xml:namespace prefix = o ns = "urn:schemas-microsoft-com:office:office" />
Thank you for additional information. I tried converting your HTML to DOC using the latest version of Aspose.Words and size of output document is ~1.5MB. After open/save using MS Word file size is significantly decreased.
It seems the problem occurs because there are a lot of merged cells in your document and content in merged cells is duplicated. You can use the following code to work the problem around:
Document doc = new Document(@"Test001\HTMLText.htm");
/// Remove content from merged cells.
public void RemoveContentFromMergedCells(Document doc)
// Remove content from merged cells.
// Get collection of cells in the docuemnt.
NodeCollection cells = doc.GetChildNodes(NodeType.Cell, true);
foreach (Cell cell in cells)
// Check whether cell is merged with previouse.
if (cell.CellFormat.HorizontalMerge == CellMerge.Previous ||
cell.CellFormat.VerticalMerge == CellMerge.Previous)
// Remove content from the cell.
Hope this helps. Please let me know if you need more assistance, I will be glad to help you.