XLSX to HTML conversion does not retain sequence of invisible characters

Hi!

We are using Aspose.Cells 21.8.0 to process the xlsx file included in the archive. The text in the cells is surrounded by a sequence of Zero Width Space (U+200B) and Zero Width Non-Joiner (U+200C) characters, which are not retained after converting the file to HTML.

We reproduced this issue with versions 21.7.0 - 21.9.0.

This is the code we use for processing the file:

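    // Load the Aspose license before processing.
    AsposeLicenser.LoadLicense();

    // showComments, inputPath and targetPath are supplied by the calling code.
    var options = new HtmlSaveOptions
    {
        ExportImagesAsBase64 = true,
        ExportSingleTab = true,
        IsExportComments = showComments,
        // Leave hidden worksheets, rows and columns out of the output.
        ExportHiddenWorksheet = false,
        HiddenRowDisplayType = HtmlHiddenRowDisplayType.Remove,
        HiddenColDisplayType = HtmlHiddenColDisplayType.Remove,
        ExportGridLines = true
    };

    // Open the workbook and save it as HTML with the options above.
    using (var wb = new Workbook(inputPath))
        wb.Save(targetPath, options);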

AsposeXlsxToHtmlIssue.zip (21.0 KB)

Thanks!

@arnoldbeilandevozon

I have compared the text of both of your HTML files and they look almost identical. Please share a screenshot of the issue and explain how you are noticing the difference.

We’re using a hex editor to see the hidden characters.

With version 21.6.0, the “sheet001” HTML file looks like this: hex_editor_21.6_ok.png (245.7 KB)

With version 21.8.0: hex_editor_21.8_not_ok.png (187.8 KB)
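
For anyone who wants to check this without a hex editor, here is a minimal sketch that counts the raw U+200B and U+200C characters in an exported HTML file. `InvisibleCharCounter` is a hypothetical helper written for illustration and is not part of Aspose.Cells. It assumes the exported file is UTF-8 encoded and that the characters appear as raw code points; if they were written as numeric entities such as `&#8203;`, this count would miss them.

```csharp
using System;
using System.IO;
using System.Text;

// Hypothetical helper for illustration only; not an Aspose.Cells API.
public static class InvisibleCharCounter
{
    public const char ZeroWidthSpace = '\u200B';
    public const char ZeroWidthNonJoiner = '\u200C';

    // Counts raw U+200B and U+200C characters in the given text.
    public static (int ZeroWidthSpaces, int ZeroWidthNonJoiners) Count(string text)
    {
        int zwsp = 0, zwnj = 0;
        foreach (char c in text)
        {
            if (c == ZeroWidthSpace) zwsp++;
            else if (c == ZeroWidthNonJoiner) zwnj++;
        }
        return (zwsp, zwnj);
    }

    // Reads the file as UTF-8 (an assumption about the exported HTML)
    // and counts the invisible characters in its contents.
    public static (int ZeroWidthSpaces, int ZeroWidthNonJoiners) CountInFile(string htmlPath)
    {
        string html = File.ReadAllText(htmlPath, Encoding.UTF8);
        return Count(html);
    }
}
```

Calling `InvisibleCharCounter.CountInFile(...)` on the sheet file produced by each version and comparing the two results should show whether the characters were dropped. The tuple return type needs C# 7 or later.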

@arnoldbeilandevozon,

Thanks for providing us screenshots.

I have evaluated your issue using your sample file and logged a ticket with the ID “CELLSNET-49195” for it. We will investigate and look into your issue soon.

Once we have an update on it, we will let you know.

@arnoldbeilandevozon,

This is to inform you that we have now fixed your issue. We will provide the fixed version soon, after performing QA and incorporating other enhancements and fixes.

@arnoldbeilandevozon

The issue is fixed in v21.9.2.
Aspose.Cells21.9.2 For .Net2_AuthenticodeSigned.Zip (5.6 MB)
Aspose.Cells21.9.2 For .Net4.0.Zip (5.6 MB)
Aspose.Cells21.9.2 For .NetStandard20.Zip (5.6 MB)

The issues you have found earlier (filed as CELLSNET-49195) have been fixed in this update. This message was posted using Bugs notification tool by simon.zhao