PDF to Docx Conversion Changes Formatting

There are formatting issues in some documents when converting from PDF to docx.

Examples were run through the online converter for testing, after seeing the issue present itself using the Aspose Words product within code. Some examples include images not being visible, text/charts changing size and moving text, and converting a Japanese doc is very malformed.

japanese.docx (12.8 KB)

japanese.pdf (75.8 KB)

example - converted from pdf (1).docx (173.6 KB)

example.pdf (353.9 KB)

somatosensory - pdf conversion (1).docx (395.9 KB)

somatosensory.pdf (141.9 KB)

file-sample_150kB - pdf conversion (1).docx (120.3 KB)

file-sample_150kB.pdf (139.4 KB)

@devinheld Please note, Aspose.Words is designed to work with MS Word documents. MS Word documents are flow documents and they have structure very similar to Aspose.Words Document Object Model. But on the other hand PDF documents are fixed page format documents . While loading PDF document, Aspose.Words converts Fixed Page Document structure into the Flow Document Object Model. Unfortunately, such conversion does not guaranty 100% fidelity.

We have opened the following new ticket(s) in our internal issue tracking system and will deliver their fixes according to the terms mentioned in Free Support Policies.

Issue ID(s): WORDSNET-28271,WORDSNET-28272,WORDSNET-28273,WORDSNET-28274

You can obtain Paid Support Services if you need support on a priority basis, along with the direct access to our Paid Support management team.