Tables with merged cells and custom inner-cell padding are broken when loading a PDF stream into an Aspose.Words.Document object


We came across broken output when loading a PDF into an Aspose.Words.Document object. We use this to concatenate DOCX and PDF documents.

The bug still occurs with this simplified code:

using System.IO;

// Load the PDF from a stream, then save it straight back to PDF.
using var stream = File.OpenRead("input.pdf");
var document = new Aspose.Words.Document(stream);
document.Save("broken_output.pdf", Aspose.Words.SaveFormat.Pdf);

A test project (with the input and output PDFs) is attached to this post.

Thank you,
Romain (316.3 KB)

broken_output.pdf (87.2 KB)

input.pdf (112.5 KB)

We have opened the following new ticket(s) in our internal issue tracking system and will deliver their fixes according to the terms mentioned in Free Support Policies.

Issue ID(s): WORDSNET-26580

You can obtain Paid Support Services if you need support on a priority basis, along with direct access to our Paid Support management team.

Please note that Aspose.Words is designed to work with MS Word documents. MS Word documents are flow documents, and their structure is very similar to the Aspose.Words Document Object Model. PDF documents, on the other hand, are fixed-page format documents. When a PDF document is loaded into the Aspose.Words DOM, its fixed-page document structure is converted into the flow Document Object Model. Unfortunately, such a conversion does not guarantee 100% fidelity, so it is not always possible to retain the PDF document's layout after processing it with Aspose.Words.