Aspose PDF formatting issues while converting from RTF to PDF, and then PDF to MS Word document!

Hi,

We are using Aspose.Words to convert an RTF document to PDF and then the PDF to a Word document. The reason for this conversion is that the initial RTF document contains frames, and we want to remove them because they cause formatting issues when editing the document in Word.

However, when the RTF document is converted to PDF and then to a Word document, we see a few issues.

In the screenshot below, the lines highlighted in yellow accept only a few characters before the text wraps to the next line. However, the one line highlighted in red does not have this issue. We are not sure why this happens, since all of these lines have the same format in the original RTF file. I have tried the Aspose.Words online solutions as well as .NET C#, with the same output. Could you please help us with this issue? I have attached all the required documents: the initial RTF document, the converted PDF, and the Word document converted from the PDF.

SampleDocs.zip (11.5 KB)

@nagasivaos Please note that Aspose.Words is designed to work with MS Word documents. MS Word documents are flow documents, and their structure is very similar to the Aspose.Words Document Object Model. PDF documents, on the other hand, are fixed-page format documents. When a PDF document is converted to an MS Word document, the fixed-page document structure has to be converted into the flow Document Object Model. Unfortunately, such a conversion does not guarantee 100% fidelity, so it is not always possible to retain the PDF document layout when converting it to an MS Word document.

In your particular case, the words “DOCTORS” and “FACTS” are recognized as a single narrow paragraph (see the space after the value of the paragraph), so the words are split across two lines.
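
To illustrate why a narrow paragraph produces this wrapping, here is a minimal toy sketch in Python. It does not use Aspose.Words and does not reproduce its layout engine. It only assumes that a paragraph has a fixed width, measured here in characters rather than real page units, and that words which do not fit move to the next line. The helper name `wrap_in_paragraph` and the widths 40 and 8 are hypothetical values chosen for the illustration.

```python
import textwrap


def wrap_in_paragraph(text, width_chars):
    """Break text into lines that fit a paragraph of the given width.

    Toy model: width is counted in characters, not in points or inches,
    so this only mimics the wrapping behaviour, not the real layout.
    """
    return textwrap.wrap(text, width=width_chars)


# A paragraph wide enough for both words keeps them on one line.
wide = wrap_in_paragraph("DOCTORS FACTS", 40)

# A paragraph recognized as narrow cannot hold both words,
# so the second word moves to the next line.
narrow = wrap_in_paragraph("DOCTORS FACTS", 8)

print(wide)
print(narrow)
```

With the wide paragraph both words stay on one line, while the narrow one places each word on its own line, which matches the behaviour seen in the converted document.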