Table formatting is disturbed when merging PDF into Word using Aspose .NET

We are merging different Word and PDF documents using the Aspose.PDF and Aspose.Words .NET libraries.

We are facing a formatting issue when merging a PDF document into the consolidated Word and PDF documents.

After merging, the table in the consolidated Word and PDF files is not formatted as it is in the original PDF document.

I have attached the original PDF and the merged PDF and Word documents for reference. Please help us resolve this issue.

Kindly arrange a meeting to resolve this issue.
OrgionalPDF.pdf (24.2 KB)

MergedDoc.Docx (72.2 KB)

MergedPDF.Pdf (208.0 KB)

@avinashchilke1

Can you please provide more details about the specific formatting issues you are encountering when merging the PDF documents into Word? Additionally, what version of Aspose libraries are you using?

Hello Team ,

I am using version 25.2.0 of Aspose.PDF and Aspose.Words.

Thanks,
Avinash

@avinashchilke1

Can you please also share the code snippet that you are using to process the shared files? We will test the scenario in our environment and address it accordingly.

Hello Team,
Code.zip (2.0 KB)

Please check the attached code.

Thanks,
Avinash

@avinashchilke1

It looks like you are merging the documents and generating the final merged PDF using Aspose.Words. We are moving the inquiry to the respective forum category, where you will be assisted accordingly.

Thanks @asad.ali for the response. Kindly forward this to the respective team.

We need to resolve this issue ASAP, as it is a showstopper for us.

Thanks,
Avinash

@avinashchilke1 First of all, please note that Aspose.Words is designed to work with MS Word documents. MS Word documents are flow documents, and their structure is very similar to the Aspose.Words Document Object Model. PDF documents, on the other hand, are fixed-page format documents.
In your code you are converting the PDF document to DOCX using Aspose.PDF with RecognitionMode.Textbox. In this mode all content in the output DOCX document is represented using floating frames. In addition, your code modifies the PDF document before saving by calling the RemovePageNumbersFromPdfDocument method, which is not provided, so we can only guess what it does.
If I understand correctly, you are merging several documents but have provided only the problematic input document and the output. The first document (the first 14 pages of the problematic output document) is not provided. If possible, please provide this document too.

If your goal is to merge PDF and MS Word documents while preserving their original layout and save the result as PDF, I would suggest converting the MS Word document to PDF first and then merging the PDF documents.
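
The suggestion above can be sketched roughly as follows. This is a minimal illustration, not the code from the thread: the file names `Report.docx`, `Report.pdf`, `Original.pdf`, and `Merged.pdf` are hypothetical placeholders, and it assumes both the Aspose.Words and Aspose.PDF packages are referenced. Types are fully qualified because both libraries define a `Document` class.

```csharp
// Sketch only: all file names below are placeholders, not taken from the thread.
class MergeAsPdf
{
    static void Main()
    {
        // Step 1: render the Word document to PDF with Aspose.Words,
        // so its layout is fixed before any merging happens.
        var wordDoc = new Aspose.Words.Document("Report.docx");
        wordDoc.Save("Report.pdf", Aspose.Words.SaveFormat.Pdf);

        // Step 2: merge at the PDF level with Aspose.PDF.
        // The original PDF is never converted to DOCX, so its tables stay untouched.
        using (var merged = new Aspose.Pdf.Document("Report.pdf"))
        using (var original = new Aspose.Pdf.Document("Original.pdf"))
        {
            // Append every page of the original PDF after the converted pages.
            merged.Pages.Add(original.Pages);
            merged.Save("Merged.pdf");
        }
    }
}
```

Because no PDF-to-DOCX conversion is involved, the recognition mode no longer plays a role in the merged PDF output.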

Thanks @alexey.noskov for your valuable input. I have made the changes as per your suggestion and the issue is resolved.

@avinashchilke1 It is great that you managed to resolve the problem. Please feel free to ask in case of any further issues. We are always glad to help you.

A post was split to a new topic: Removing page numbers from PDF