I would like to know how to extract text from an html file.
This is what I am doing right now:
Aspose.Words.Document doc = new Aspose.Words.Document(c:\myfile.html);
string text = doc.ToTxt();
My problem is that I am getting multiple lines - i.e. some of the lines in the page are returned multiple times.
Niloos Software ltd.