Some symbols and text are missing while converting docx to html

Hi Aspose support,
In our product we have functionality that converts word document to html.
After conversion word document to html, we were faced the issue while loading symbols, Equations and text.
We used Aspose.words (22.10.0) library.
We are using Windows 10 Pro (64-bit operating system, x64-based processor) and .NET Framework 6.0

case#1:
After conversion word document to html, Equations and R-squared values are missing in the last Chart.
(left side html document -right side word document)
Issue: Equations_R_Squared_Missing.zip (81.4 KB)
Source document:Chart Trendline Options.docx (162.9 KB)
Expected result: Converted html should be the same as word document

case#2:
After conversion word document to html, some Thai subset symbols are missing.
(Left side html document -right side word document)
Issue: Symbols_Missing.zip (240.6 KB)
Source document:Symbols in Angsana New font.docx (15.9 KB)
Expected result: Converted html should be the same as word document

case#3:
After conversion word document to html, some symbols are loaded in wrong order.
(left side html document -right side word document)
Issue:Order_Missing.zip (109.3 KB)
Source document:Symbols in Arial font.docx (30.1 KB)
Expected result: Converted html should be the same as word document

case#4:
After conversion word document to html, Basic Latin and Cyrillic subsets look incorrect.
The letters have different design and size.
(Left side html document -right side word document)
Issue:TablesWithSymbols.zip (347.9 KB)
Source document:TableWithSymbols.docx (18.8 KB)
Expected result: Converted html should be the same as word document

Please help on this.
Thanks.

@radha.bobbili

  1. Unfortunately, Aspose.Words does not support rendering trendline equation and R-squared values. This feature request is logged as WORDSNET-24798.

  2. I have managed to reproduce the problem and logged it as WORDSNET-24799. From your screenshot I made a conclusion that you are conversion the document to HtmlFixed format. Please confirm.

  3. I have managed to reproduce the problem and logged it as WORDSNET-24800.

  4. There are two issues here.
    a) Aspose.Words does not support rendering of colored symbols. This feature request is logged as WORDSNET-22379
    b) Symbol lost in “Basic Latin” section. The problem is logged as WORDSNET-24801.

We will keep you updated and let you know once the issues are resolved or we have more information for you.

We are using HtmlFixedSaveOptions to convert docx to html.
Please find the source code : sourcecode.zip (562 Bytes)

@radha.bobbili Thank you for additional information. My assumption was right. We will keep you informed and let you know once the reported issues are resolved.

The issues you have found earlier (filed as WORDSNET-24798) have been fixed in this Aspose.Words for .NET 23.2 update also available on NuGet.

1 Like