While converting pdf to word the document text is getting changed. The document is bilanguage English and Russian

thats orignal pdf file дог1673_26_11_2024 (3).pdf (168.1 KB)

and here is converted word file
дог1673_26_11_20241.docx (27.0 KB)

@apatil03

It sounds like you’re experiencing issues with text formatting when converting a bilingual PDF document to Word. This can occur due to the way text is encoded or structured in the original PDF, especially with languages that use different character sets.

Cause: The formatting issues may arise from unembedded fonts, outdated software, or specific encoding challenges in the PDF.

Solution: Here are several steps you can take to troubleshoot and resolve the issue:

  1. Check Font Embedding: Ensure that the fonts used in the PDF are embedded. If the fonts are not embedded, it can lead to text rendering issues during conversion.

  2. Use the Latest Version: Make sure you are using the latest version of Aspose.Words. Updates often include bug fixes and improvements for handling various document formats.

  3. Adjust Conversion Settings: You can try adjusting the conversion settings. For example, set the PdfLoadOptions to handle specific encoding or layout issues. Here’s a basic example:

    var pdfLoadOptions = new PdfLoadOptions();
    pdfLoadOptions.ConvertFontToUnicode = true; // This can help with multilingual documents
    
    Document doc = new Document("input.pdf", pdfLoadOptions);
    doc.Save("output.docx");
    
  4. Test with Different PDFs: If possible, test the conversion with other bilingual PDFs to see if the issue is consistent across different documents. This can help identify if the problem is specific to the original PDF.

  5. Contact Support: If the issue persists, consider reaching out to Aspose support with the specific files you are working with. They can provide more targeted assistance.

Next Steps: Please try the above steps and let me know if they help resolve the issue. If you continue to experience problems, feel free to provide more details, and I will assist you further. Thank you!

@apatil03 As I can see you are using Aspose.PDF for conversion PDF to Word. So I will move your request to the appropriate forum. My colleagues from Aspose.PDF team will help you shortly.
Also, I tested conversion your PDF to Word using Aspose.Words and there are problems too. So we have logged WORDSNET-28579 issue to address them.

@apatil03

We have opened the following new ticket(s) in our internal issue tracking system and will deliver their fixes according to the terms mentioned in Free Support Policies.

Issue ID(s): PDFNET-60552

You can obtain Paid Support Services if you need support on a priority basis, along with the direct access to our Paid Support management team.