We are evaluating Aspose.PDF (v21.8.0) for converting PDF into DOCX.
I convert my PDF document (test.pdf) that is multi-column to DOCX (test_savedToDocx.docx) with the following code:
var pdfDocument = new Document("test.pdf");
pdfDocument.Save("test_savedToDocx.docx", SaveFormat.DocX);
The .docx output contains corrupted paragraph:
image.png (43.6 KB)
“blockchains, if…” is part of “The blockchain was …” paragraph, but moved into separate paragraph:
image.png (56.3 KB)
This is a bug or there is some save option that allows preventing such paragraph breaks?
test.pdf (34.4 KB)
test_savedToDocx.docx (15.9 KB)