Wrong language detection


#1

Hi,
When we use aspose to convert HTML to MS Word, the language detected by word is incorrect. Instead of Hebrew, it detects it as Arabic.
Can you please fix this issue?


#2

@omri-1

To ensure a timely and accurate response, please attach the following resources here for testing:

  • Your input HTML document.
  • Please attach the output Word file that shows the undesired behavior.
  • Please create a simple application ( source code without compilation errors ) that helps us to reproduce your problem on our end and attach it here for testing.

As soon as you get these pieces of information ready, we will start investigation into your issue and provide you more information. Thanks for your cooperation.

PS: To attach these resources, please zip and upload them.


#3

Attached 2 docs:
1 - the original doc
2 - the doc after converting doc 1 to html and convert it back to doc (with stds)
aspose 29.zip (58.3 KB)


#4

@omri-1

Thanks for sharing the detail. Please create a simple application ( source code without compilation errors ) that helps us to reproduce your problem on our end and attach it here for testing.

Thanks for your cooperation.


#5

Code:
new Aspose.Words.Document(@“E:\WordTest\31.docx”)
.Save(@“E:\WordTest\31.htm”, new HtmlSaveOptions
{
HtmlVersion = Aspose.Words.Saving.HtmlVersion.Html5,
ExportImagesAsBase64 = true,
ExportHeadersFootersMode = Aspose.Words.Saving.ExportHeadersFootersMode.None,
ExportListLabels = Aspose.Words.Saving.ExportListLabels.AsInlineText,
});
new Aspose.Words.Document(@“E:\WordTest\31.htm”)
.Save(@“E:\WordTest\31_2.docx”, SaveFormat.Docx);

31.docx language: Hebrew
31_2.docx language: Arabic


#6

@omri-1

We have tested the scenario and have managed to reproduce the same issue at our side for both documents. For the sake of correction, we have logged the problems in our issue tracking system as WORDSNET-18871 and WORDSNET-18872. You will be notified via this forum thread once these issues are resolved.

We apologize for your inconvenience.