Hi
I am facing formatting issues in some documents on pdf to word(Docx) conversion.
Attaching pdf document and an image with highlighted issue after conversion.
FormatIssue(PdfToDoc).pdf (1.9 MB)
image_2023_07_14T14_17_39_290Z.png (26.6 KB)
Can you please share which API are you using? Aspose.PDF for Java or Aspose.PDF for .NET? We will further proceed to assist you accordingly.
We are using Aspose.PDF for .Net
Please check the attached DOCX file generated in our environment using 23.7 version of the API and let us know in case you find some issues in it.
output_flow.docx (2.9 MB)
@asad.ali the output document seems fine, I have tested using 23.7 version but still getting the formatting issue. But your output document seems fine.
So can you please send me the code that you are using to convert the pdf document.
Please check the below code snippet:
Document pdfDocument = new Document(dataDir + @"PreviewTest_21MB 1.pdf");
DocSaveOptions saveOptions = new DocSaveOptions();
saveOptions.Format = DocSaveOptions.DocFormat.DocX;
saveOptions.Mode = DocSaveOptions.RecognitionMode.Flow;
saveOptions.RelativeHorizontalProximity = 2.5f;
saveOptions.RecognizeBullets = true;
pdfDocument.Save(dataDir + @"output_flow.docx", saveOptions);