We have been using Aspose.Words to convert DOCX to PDF/A-1a.
We have found that paragraph text is converted incorrectly: each word of the paragraph text is placed in a separate tag (see the attached image “converted_with_aspose”). However, if we save the same file from MS Word, all of the paragraph text is placed in one tag (see the attached image “save_from_word”).
Is this known behavior?
Is there any workaround to get the text in one tag after conversion?
Will this behavior be fixed?
The input file is attached.
We also checked with Aspose.Words 19.10, and the behavior is reproduced.
var inputFilePath = "test_doc.docx";
var outputFilePath = "result.pdf";

// Load the source DOCX document.
var inputDocument = new Aspose.Words.Document(inputFilePath);

// Configure PDF/A-1a output with a tagged document structure.
var pdfSaveOptions = new Aspose.Words.Saving.PdfSaveOptions();
pdfSaveOptions.OutlineOptions.HeadingsOutlineLevels = 9;
pdfSaveOptions.DisplayDocTitle = true;
pdfSaveOptions.DmlRenderingMode = Aspose.Words.Saving.DmlRenderingMode.DrawingML;
pdfSaveOptions.ExportDocumentStructure = true;
pdfSaveOptions.Compliance = Aspose.Words.Saving.PdfCompliance.PdfA1a;

// Convert and save the result.
inputDocument.Save(outputFilePath, pdfSaveOptions);