we are facing an issue on Aspose.Html library version 20.9.1. The getInputEncoding method in previous 18.6 version was returning the correct value, that on our testing document is UTF-16, while in the 20.9.1 version returns UTF-8.
The code we are using is the following:
com.aspose.html.HTMLDocument hd = new com.aspose.html.HTMLDocument(" HTML_UTF-16.html");
encInput = hd.getInputEncoding();
We have verified the input file with an hex editor, showing the UTF-16 BigEndian BOM marker on top:
image.png (12.3 KB)
Attached you can find the source html for testing.
Many thanks in advance.
GianfrancoHTML_UTF-16.zip (833 Bytes)