We're sorry Aspose doesn't work properply without JavaScript enabled.

Free Support Forum - aspose.com

Extracting text from an html page


I have Aspose.Words. How do I extract plain text out of an html document? I tried to so do like the following, but got many lines duplicate-times:

Aspose.Words.Document doc = new Aspose.Words.Document(FullFileName);

text = doc.ToTxt();

This message was posted using Aspose.Live 2 Forum


<?xml:namespace prefix = o ns = "urn:schemas-microsoft-com:office:office" />

Thanks for your request. Could you please attach your HTML document here for testing? I will check the issue and provide you more information.

Best regards.