I am using Aspose.Words to load and print HTML files and have encountered a potential parsing bug.
If I open the HTML file in MS Word, it works fine, but if I try to load it using Aspose.Words I get the following error:
Aspose.Words.FileCorruptedException: The document appears to be corrupted and cannot be loaded.
I have narrowed the HTML down to this:
<IFRAME WIDTH=468 HEIGHT=60 NORESIZE SCROLLING=No FRAMEBORDER=0 MARGINHEIGHT=0 MARGINWIDTH=0
SRC="http://something.com/?queryVariable|2.0|107|182639|1|1|blah">
<script language=javascript src="http://soemthing.com/script.js;">
</script>
</IFRAME>
Specifically, the problem is the query string on this URL: http://something.com/?queryVariable|2.0|107|182639|1|1|blah
It’s the pipes in the query string that cause the issue. Although it’s a strange query string, it’s valid; I have successfully parsed it.
The reason I think this is a bug is that I can open the HTML file successfully in MS Word but not with Aspose.Words.
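In case it helps with reproducing or working around this, here is a minimal sketch of a preprocessing step that percent-encodes the pipes as %7C in quoted src attributes before the HTML is handed to Aspose.Words. It is plain Python and does not call Aspose at all; the function name encode_pipes_in_src is my own, and I am only assuming that Aspose.Words would accept the encoded URL.

```python
import re

# Matches src="..." or src='...' (case-insensitive).
# Group 1: 'src=' part, group 2: the quote character, group 3: the URL.
# Unquoted attribute values are not handled by this sketch.
SRC_ATTR = re.compile(r'(\bsrc\s*=\s*)(["\'])(.*?)\2', re.IGNORECASE | re.DOTALL)


def encode_pipes_in_src(html: str) -> str:
    """Percent-encode '|' as '%7C' inside quoted src attribute values only."""
    def fix(match: re.Match) -> str:
        prefix, quote, url = match.group(1), match.group(2), match.group(3)
        return f"{prefix}{quote}{url.replace('|', '%7C')}{quote}"

    return SRC_ATTR.sub(fix, html)


sample = '<IFRAME SRC="http://something.com/?queryVariable|2.0|107|182639|1|1|blah"></IFRAME>'
cleaned = encode_pipes_in_src(sample)
print(cleaned)
```

Text outside src attributes is left untouched, so any pipes in the visible page content would survive the rewrite.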
Thanks
Ed