Hi,
I used Aspose PDF Java API 19.10 to convert pdf file to docx file and found the below issues.
- The Tables are not converted correctly as editable table format instead it is converting the table to an image and adding text to the table cells
Demo.pdf (102.1 KB)
Because the tables are converted to image , the ASPOSE WORDS API is unable to read the data .
Below is the pdf to word conversion code
// Load source PDF file
com.aspose.pdf.Document doc = new com.aspose.pdf.Document(DATADIR + pdfFileName);
// Instantiate Doc SaveOptions instance
DocSaveOptions saveOptions = new DocSaveOptions();
// Set output file format as DOCX
saveOptions.setFormat(DocSaveOptions.DocFormat.DocX);
saveOptions.setMode(DocSaveOptions.RecognitionMode.Flow);
// Set the Horizontal proximity as 2.5
saveOptions.setRelativeHorizontalProximity(2.5f);
// Enable the value to recognize bullets during conversion process
saveOptions.setRecognizeBullets(true);
// Save resultant DOCX file
doc.save(DATADIR + pdfFileName.replace(".pdf", ".docx"), saveOptions);
Regards,
Krsna