Free Support Forum - aspose.com

Html data output result contain junk char

I am using the below code to save the html output

   HtmlSaveOptions option=new HtmlSaveOptions(SaveFormat.HTML);
             option.setEncoding(Charset.forName("UTF-8"));
             option.setExportRelativeFontSize(true);
            ByteArrayOutputStream output = new ByteArrayOutputStream();
             doc.save(output,option);

but output result contain the junk char data
like ’ &lt;br/&gt;’ ‘&#xa0;’ ,<p>

@ramveer

Thanks for your inquiry. Please ZIP and attach your input Word document and problematic output HTML here for testing. We will investigate the issue and provide you more information on it.

please find the input and output data in attached zip.Input&Output.zip (30.2 KB)

@ramveer

Thanks for sharing the detail. We have tested the scenario using latest version of Aspose.Words for Java 18.9 and have not found any issue with output HTML when it is open in browser. You have shared the output as image. Could you please share some more detail about your issue along with steps to reproduce this issue at our end? We will then investigate the issue and provide you more information on it.

we have RichText editor , we paste the input in Richtext editor and then input take process through Aspose to convert into HTML as output to save in database or for preview purpose.
Please let me know if anything else required.

@ramveer

Please note that Aspose.Words mimics the behavior of MS Word. Its mean that if you convert your document to HTML using MS Word, you will get the same output.

Please create a standalone application ( source code without compilation errors ) that helps us to reproduce your problem on our end and attach it here for testing. We will investigate the issue and provide you more information on it. Thanks for your cooperation.