PDF format issues while converting from word

Please consider the first ,fifth & sixth points as mentioned above.Because at most cases, there exists merging of letters in word & font type and font size changes.

Please go through the below samples.
Sample:samples.zip (2.3 MB)

@Gomathi,

We are working on your queries and will get back to you soon.

@Gomathi,

We have logged two more issues i.e. WORDSNET-18368 (related to 10G872J7MQ0.docx) and WORDSNET-18370 (related to 10TDH50Q9LZ.docx) in our issue tracking system. We will further look into the details of these problems and will keep you updated on the statuses of these issues. We apologize for your inconvenience.

Well, Thanks in advance.

@Gomathi,

Regarding WORDSNET-18370, it is to update you that the implementation of this issue has been postponed till a later date (no estimates are available at the moment). This is a rare case/scenario. We will inform you via this thread as soon as this issue is resolved. We apologize for your inconvenience.

As a workaround, please use the following code:

Document doc = new Document("E:\\temp\\10TDH50Q9LZ.docx");

PdfSaveOptions opts = new PdfSaveOptions();
opts.getMetafileRenderingOptions().setUseEmfEmbeddedToWmf(false);

doc.save("E:\\Temp\\awjava-19.3-setUseEmfEmbeddedToWmf-false.pdf", opts);

@Gomathi,

Regarding WORDSNET-18368 (related to 10G872J7MQ0.docx), please also convert this document to PDF format by using MS Word on your end and attach it here for our reference. Please also provide a comparison screenshot highlighting the problematic areas in Aspose.Words generated document with respect to MS Word generated document and attach it here for our reference. Please point out the exact problematic places for this issue. We do not see a big difference between Aspose.Words and MS Word generated PDF outputs on our end. Thanks for your cooperation.

Sorry for delayed reply.Please keenly see the x-axis fonts of that rendered pdf.Its not retained as in the original doc.,and usually the graph lines are little more darker.

Reference Output Pdf: Sample.zip (1.2 MB)

For time being, please give solution for other issues raised under the same topic “PDF format issues while converting from word”. Thanks in advance.

Regards,
Gomathi. N.

@Gomathi,

We need the resources mentioned in my previous post to be able to investigate your issue (WORDSNET-18368) further on our end. Please also tell, what OS, MS Word and JDK versions are you testing these scenarios on? This is because the output shown in MS Word on our end may be different than to what is displayed on your end. Please also see how MS Word 2019 produces the PDF output on our end.
msw-2019.pdf (53.1 KB)

@Gomathi,

We are waiting for your further input on this topic (WORDSNET-18368). Please see my previous posts and share the required resources. Thanks for your cooperation.

I’ am currently using below services

OS : Windows 10 OS(64-bit)
JDK : jdk1.8.0_181
MS word: Microsoft Word 2010

Sorry for delayed reply.

@Gomathi,

Thanks for the additional information. Please also convert (Save As) this document (10G872J7MQ0.docx) to PDF format by using MS Word 2010 on your end and attach it here for our reference.

Please find MS Word 2010 for the mentioned sample,
Sample :Sample.zip (225.8 KB)

@Gomathi,

We have logged these details in our issue tracking system and will keep you posted on any further updates.

@Gomathi,

Regarding WORDSNET-18368, we have completed the work on this issue and come to a conclusion to close this issue as “Won’t fix”.

The difference between Aspose.Words and MS Word outputs is rather minor and we consider it acceptable. We converted the original document to PDF by using MS Word 2016, and the result, although slightly, differs from what you provided. Some areas of image are better than MS Word output, some worse. Apparently, the result may vary somewhat from system to system and in different versions of MS Word. The perfect match will not be achieved, because we do not know the exact internal logic of MS Word. For the current result, the difference is insignificant.

Fine we will try for version updates.Can you resolve the other PDF conversion issues stated above?

Regards,
Gomathi. N.

@Gomathi,

Please check below the status of your outstanding issues:

  1. We have completed the analysis of the following issues and identified the root causes of the problems. However, because of complexity, the implementations of these issues have been postponed till a later date (no estimates are available at the moment).

WORDSNET-18290, WORDSNET-18331, WORDSNET-18333, WORDSNET-18334 and WORDSNET-18370

  1. The fix of WORDSNET-18332 is being developed.

  2. WORDSJAVA-2006 is currently pending for analysis and is in the queue.

We will inform you via this thread as soon as these issues are resolved. We apologize for any inconvenience.

The issues you have found earlier (filed as WORDSNET-18332) have been fixed in this Aspose.Words for .NET 19.5 update and this Aspose.Words for Java 19.5 update.

Yeah, will check the same.

@Gomathi,

Regarding WORDSJAVA-2006, please use the following workaround to fix this issue:

Document doc = new Document("E:\\temp\\10N760L2HNR_10_egi10N760L2HNR.doc");
PdfSaveOptions options = new PdfSaveOptions();
options.getMetafileRenderingOptions().setRenderingMode(MetafileRenderingMode.VECTOR);
doc.save("E:\\temp\\awjava-19.5-opts.pdf", options);

The issues you have found earlier (filed as WORDSJAVA-2006) have been fixed in this Aspose.Words for .NET 19.6 update and this Aspose.Words for Java 19.6 update.