Loss of linebreaks in PDF

Opening a PDF file in Aspose Words .NET then saving it again causes some linebreaks to be removed from the documents.

Please see attached 3 files:

  • Input.pdf
  • Output.pdf
  • Converted-with-msword.docx, which is Intput.pdf converted to DOCX with MS Word

You can see that in Input.pdf, there is a linebreak at the end of the first line “Fangorn Limited”. This linebreak is gone in Output.pdf.

The linebreak is still there when Input.pdf is converted to DOCX with MS Word, so it seems like there is some bug in Aspose Words causing the linebreak to be lost when the PDF is converted to a Document object.

Converted-with-msword.docx (17.4 KB)
Input.pdf (91.6 KB)
Output.pdf (61.0 KB)

@ssmolkin1,
Thank you for reporting this problem to us. We have tested the scenario and managed to reproduce the same issue at our side. For the sake of correction, we have logged this problem in our issue tracking system as WORDSNET - 23610. You will be notified via this forum thread once this issue is resolved.

We apologize for your inconvenience.

The issues you have found earlier (filed as WORDSNET-23610) have been fixed in this Aspose.Words for .NET 22.4 update also available on NuGet.