Free Support Forum - aspose.com

PDF not properly converted to DOC using Aspose.PDF(JAVA)

Hi,
When I convert a Thai-language PDF to DOC, some characters that sit above or below the base letters (vowel signs and tone marks) are missing. For example:
Original : เพื่อเป็นรากฐานในการดำเนินชีวีตได้อย่างรู้เท่าทัน
Result : เพอเปนรากฐานในการดาเนนชวตไดอยางรเทาทน
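
One way to check that the lost characters are exactly the above/below combining marks is to strip Unicode nonspacing marks (category Mn) from the original text and compare it with the converted text. The following is only a minimal sketch of that check, not a fix for the conversion; the class and method names (ThaiMarkCheck, stripMarks, lostMarks) are hypothetical and are not part of Aspose.PDF. It assumes that SARA AM (U+0E33) should be decomposed first, which NFKD normalization does, since the Result above keeps only its SARA AA part.

```java
import java.text.Normalizer;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for diagnosing dropped Thai combining marks.
class ThaiMarkCheck {

    // Removes every nonspacing mark (Unicode category Mn) from the text.
    // NFKD is applied first so that SARA AM (U+0E33) is split into
    // NIKHAHIT (U+0E4D, a mark) followed by SARA AA (U+0E32, a letter).
    static String stripMarks(String text) {
        String decomposed = Normalizer.normalize(text, Normalizer.Form.NFKD);
        return decomposed.replaceAll("\\p{Mn}", "");
    }

    // Lists the code points that stripMarks would remove, in order.
    static List<String> lostMarks(String text) {
        String decomposed = Normalizer.normalize(text, Normalizer.Form.NFKD);
        List<String> marks = new ArrayList<>();
        decomposed.codePoints()
                .filter(cp -> Character.getType(cp) == Character.NON_SPACING_MARK)
                .forEach(cp -> marks.add(String.format("U+%04X", cp)));
        return marks;
    }

    public static void main(String[] args) {
        // First word of the Original sentence, written with escapes:
        // SARA E, PHO PHAN, SARA UEE, MAI EK, O ANG.
        String word = "\u0E40\u0E1E\u0E37\u0E48\u0E2D";
        // The same word as it appears in the Result: SARA E, PHO PHAN, O ANG.
        String converted = "\u0E40\u0E1E\u0E2D";

        System.out.println(stripMarks(word).equals(converted)); // prints true
        System.out.println(lostMarks(word)); // prints [U+0E37, U+0E48]
    }
}
```

If stripMarks applied to the text of the source PDF equals the text in the converted document, the loss is limited to combining marks, which is a useful detail to include in a bug report.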

Here is my code (Java):

// Requires: import java.util.regex.Pattern;
com.aspose.pdf.Document pdfDoc = new com.aspose.pdf.Document(sPath);

// Conversion options shared by both output formats
com.aspose.pdf.DocSaveOptions saveOption = new com.aspose.pdf.DocSaveOptions();
saveOption.setMode(com.aspose.pdf.DocSaveOptions.RecognitionMode.Flow);
saveOption.setRelativeHorizontalProximity(2.5f);
saveOption.setRecognizeBullets(true);
saveOption.setMaxDistanceBetweenTextLines(2.5f);

// Choose the output format from the requested extension
if (Pattern.matches("doc|DOC", sExt)) {
    outputFileName += ".doc";
    saveOption.setFormat(com.aspose.pdf.DocSaveOptions.DocFormat.Doc);
    pdfDoc.save(outputFileName, saveOption);
} else if (Pattern.matches("docx|DOCX", sExt)) {
    outputFileName += ".docx";
    saveOption.setFormat(com.aspose.pdf.DocSaveOptions.DocFormat.DocX);
    pdfDoc.save(outputFileName, saveOption);
}

Is there a way to fix this issue?

Regards,
Rapeepan

Here is the example PDF: 865-file.pdf (795.6 KB)

@rcomniscien,

We have tested the scenario in our environment and were able to reproduce the issue. It has been logged as PDFJAVA-39077 in our issue tracking system. We will investigate it further and keep you posted on the status of the fix. Please be patient and allow us a little time.

We are sorry for the inconvenience.