I am using the evaluation version of Aspose.Pdf.Kit for Java, in order to evaluate it, and i am trying to extract the Arabic text from a PDF. however, the result in text file is only question marks.
I checked the regional settings, and the unicode language is selected as Arabic.
How can i solve this? or is this a limitation of the evaluation version?
Thank you for considering Aspose.
The Java edition of Aspose.Pdf.Kit supports extracting Arabic text from PDF file. Could you please attach the PDF here to let us check it at our end?
I have attached the PDF file i am trying to convert
Also, I have one more inquiry
will the Aspose be able to identify more the language in the PDF document (English and Arabic) and extract them as it?
We are working over this issue and will reply to you soon. Regarding your query, Aspose.Pdf.Kit is able to uniquely identify different languages texts (English & Arabic) while extracting them from Pdf file.
After thorough investigation and lots of tests based over
Pdf file that you have shared, it’s been observed that Aspose.Pdf.Kit for java
lacks the capability to extract the Arabic text contents. Our development team
is working hard, and we hope it will be resolved in a week time frame. Please
spare us little time, and soon we would be able to share the Beta version,
which includes this feature.
Your patience and understanding is highly appreciated.