Clarification on usePdfImageConversion Support in .NET vs Java + PDF to DOCX Conversion Quality Issues

Hello,

We are a licensed Aspose.Total customer using Aspose.PDF for Java (version 23.1), and we would like to request clarification and assistance regarding the following:


1. usePdfImageConversion — Feature Availability

We have encountered references to a setting called DocSaveOptions.UsePdfImageConversion, reportedly available in Aspose.PDF for .NET to ensure high-fidelity PDF to DOCX conversion by embedding each page as an image.

However:

  • We cannot find this setting documented in either the .NET or Java API references.
  • It does not appear to exist in the Java SDK at all.

Can you please confirm:

  • Whether usePdfImageConversion is officially supported in .NET?
  • Whether there is an equivalent (or planned) feature in Aspose.PDF for Java?
  • Whether this setting is deprecated, undocumented, or internal-only?

2. PDF to DOCX Conversion Quality (Java)

Since we could not find a built-in option to ensure visual fidelity in Java, we implemented a fallback approach:

  • Each PDF page is rendered to an image using JpegDevice (200 DPI)
  • These images are inserted into a .docx using Aspose.Words, with all page margins set to zero

While this works in general, we are seeing issues on some documents:

  • Font or formatting degradation (compared to the original PDF)
  • Excessive white space or image scaling artifacts
  • Inconsistent rendering across environments (e.g., different OS or Java runtime)

We can attach a few example PDFs & DOCXs where the output is not satisfactory.


3. Request for Guidance

Could you kindly:

  • Review the attached samples and advise if we are missing any configuration that could improve output quality?
  • Confirm if there are better rendering settings, layout modes, or recommended practices for converting PDF to DOCX with high fidelity in Java?

We would appreciate any guidance you can provide.

Thank you in advance for your support.

Best regards,

João Neves on behalf of Antonio DI MATTEO
European Council
antonio.di-matteo@consilium.europa.eu
[License/Support ID - We have a paid aspose.total license and we’ll provide the id later]

@JoaoNevesEUCouncil

Hello João,

Thank you for reaching out regarding your queries about usePdfImageConversion and PDF to DOCX conversion quality.

1. usePdfImageConversion — Feature Availability

  • .NET Support: Yes, DocSaveOptions.UsePdfImageConversion is available in Aspose.PDF for .NET. It allows for high-fidelity PDF to DOCX conversion by embedding each page as an image.

  • Java Equivalent: Currently, there is no direct equivalent of this feature in Aspose.PDF for Java, and it is not documented as a planned feature. If you require high fidelity, your current approach of rendering pages to images is a valid workaround.

2. PDF to DOCX Conversion Quality (Java)

Your fallback approach of using JpegDevice to render PDF pages as images is a common method. However, issues such as font degradation and excessive white space can arise due to the nature of image rendering. Here are some suggestions to improve output quality:

  • DPI Settings: Ensure you are using a higher DPI (e.g., 300 DPI) for better image quality.
  • Image Format: Consider using PNG instead of JPEG to avoid compression artifacts.
  • Margins and Layout: Double-check the margins and layout settings in your DOCX generation to minimize white space.

3. Request for Guidance

To enhance the quality of your PDF to DOCX conversion, you might want to explore the following:

  • Rendering Options: Investigate different rendering options available in the JpegDevice, or consider using PngDevice for improved fidelity.
  • Aspose.Words Settings: Review the settings in Aspose.Words for inserting images to ensure they are correctly aligned and sized.

If you continue to experience issues, sharing specific examples of the problematic PDFs and DOCXs would help in providing more targeted advice.

Thank you for your understanding, and feel free to reach out if you have further questions!

Best regards.