Hello,
I have attached 3 minimal HTML files; the file type detection tool fails to detect all of them.
Aspose.Words detects all of them as HTML.
Perhaps the encoding handling is worth looking into?
Best.
htm.zip (1.1 KB)
Hello,
Thank you for providing the test files: htm.htm, htmbe.htm, and htmle.htm.
We have successfully reproduced the issue using Aspose.Cells 26.6. Our internal testing confirms that the FileFormatUtil.DetectFileFormat method incorrectly returns LoadFormat.Unknown for these specific files instead of identifying them as HTML.
It appears the detection logic is indeed struggling with the specific encoding and structure of these documents, even though they are valid HTML files. We have verified that this behavior is inconsistent with how other components handle these formats.
We have opened the following new ticket(s) in our internal issue tracking system and will deliver their fixes according to the terms mentioned in Free Support Policies.
Issue ID(s): CELLSNET-60330
You can obtain Paid Support Services if you need support on a priority basis, along with direct access to our Paid Support management team.
Thanks. Please add to the ticket that the same happens for XML files; please test this XML sample, which also comes back as unknown!
xml.zip (682 Bytes)
If this also happens when reading those XML/HTML-based files…
Best
Thank you for providing the additional sample file. I have successfully received the XML archive you attached.
I have added this new test case to the ongoing investigation regarding the file format detection issue. Our development team will analyze this sample alongside the previously provided HTML files to ensure the fix covers both formats and addresses the underlying encoding detection behavior.
We will keep you updated on the progress of the investigation through this thread.
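Until a fix is available, a pre-check along the following lines may help callers who need to recognize such files themselves. This is only a minimal sketch in Python, not how FileFormatUtil.DetectFileFormat works internally. It assumes the failing samples differ mainly in encoding (for example UTF-16 with or without a byte order mark), which has not been confirmed in this thread. The names `decode_head` and `sniff_markup` are hypothetical helpers introduced for illustration.

```python
import codecs

# BOM signatures, checked longest first so that a UTF-32 LE BOM
# is not mistaken for a UTF-16 LE BOM (they share the first two bytes).
_BOMS = [
    (codecs.BOM_UTF32_LE, "utf-32-le"),
    (codecs.BOM_UTF32_BE, "utf-32-be"),
    (codecs.BOM_UTF8, "utf-8"),
    (codecs.BOM_UTF16_LE, "utf-16-le"),
    (codecs.BOM_UTF16_BE, "utf-16-be"),
]


def decode_head(data: bytes, limit: int = 1024) -> str:
    """Decode the first `limit` bytes, honoring a BOM if one is present."""
    head = data[:limit]
    for bom, encoding in _BOMS:
        if head.startswith(bom):
            return head[len(bom):].decode(encoding, errors="ignore")
    # No BOM: ASCII-heavy UTF-16 text has a NUL in every other byte,
    # so the position of the first NUL hints at the byte order.
    if len(head) >= 2:
        if head[0] == 0 and head[1] != 0:
            return head.decode("utf-16-be", errors="ignore")
        if head[0] != 0 and head[1] == 0:
            return head.decode("utf-16-le", errors="ignore")
    # Fallback assumption: treat everything else as UTF-8.
    return head.decode("utf-8", errors="ignore")


def sniff_markup(data: bytes) -> str:
    """Return 'html', 'xml', or 'unknown' based on the leading markup."""
    text = decode_head(data).lstrip().lower()
    if text.startswith("<?xml"):
        return "xml"
    if text.startswith(("<!doctype html", "<html")):
        return "html"
    return "unknown"
```

The sketch only looks at the start of the file, so an XHTML document that begins with an XML declaration is reported as `xml`, and HTML that opens with a comment or another tag is reported as `unknown`. A real detector would need to scan further.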