Got the garbage symbols during convertion doc to txt

Hi,

I have doc file when I’m converting it to txt format.
In my file I got the garbage content also.

PFA my text file
I/P doc or docx file.
O/P txt file
aspose java version is 15.8.0
OS version ubantu 14.04 LTS
Java 1.8
using Aspose total license for java.
Hi Crimson,

Thanks for your inquiry. Could you please share some detail of garbage content which you noticed in output txt file format? We will investigate the issue on our side and provide you more information.

Hi,

Those are Ascii control chars when we open the txt file on windows OS.
In linux I go the symbols 00, 00
0C, 02
Please see the txt file that I have uploaded that text file contains symbols.

When we convert from doc to txt , I need to know if I want to save txt file in UTF-8 format then what should I do?

I have attached snapshot also.
Hi Crimson,

Thanks for sharing the detail. We have tested the scenario and have managed to reproduce the same issue at our side. For the sake of correction, we have logged this problem in our issue tracking system as WORDSNET-13598. You will be notified via this forum thread once this issue is resolved.

We apologize for your inconvenience.

The issues you have found earlier (filed as WORDSNET-13598) have been fixed in this Aspose.Words for .NET 16.10.0 update and this Aspose.Words for Java 16.10.0 update.


This message was posted using Notification2Forum from Downloads module by aspose.notifier.