PDF Kit Text Extraction Cyrillic

meieat · February 6, 2008, 8:25am

Hi,

We’re trying to extract data from a PDF which is written in Cyrillic. The problem is that after extraction all the text is just ‘??? ???’. If I open the PDF in Acrobat Reader, I can select, copy, and paste this text into Notepad and everything is readable (all the letters are there). What is causing this problem?

Thank you in advance

AdeelTaseer · February 6, 2008, 10:53am

Hi,

Can you please provide us with the PDF file so that we can determine the cause of the problem more accurately?

Thanks.

meieat · February 7, 2008, 5:16am

Hi,

Where should I send it?

Tnx

AdeelTaseer · February 7, 2008, 5:45am

Hi,

To share your PDF file with us, please follow the steps mentioned in the following article on how to share a document in a forum post:

https://forum.aspose.com/t/how-to-share-a-document-in-a-forum-post/225947/

Thanks.

AdeelTaseer · February 7, 2008, 6:36am

Hi,

This is a known issue and has been logged as PDFKITNET-4259. We are working on a fix. At present only English characters are supported. When a PDF contains Asian characters (fonts), the method will throw a "Cann't find fonts..." exception. When European languages are used, it won't throw an exception, but it will show broken characters or question marks like "????".
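For readers wondering where output like "????" typically comes from: one common way it arises is when text is forced into an encoding that cannot represent its characters. The thread does not say this is what happens inside PDF Kit, so treat the following as a minimal Python sketch of the general effect, not of the library itself. The helper names `to_ascii_lossy` and `looks_garbled`, and the 0.5 threshold, are hypothetical and chosen only for illustration.

```python
def to_ascii_lossy(text: str) -> str:
    """Encode to ASCII, replacing every unrepresentable character with '?'."""
    return text.encode("ascii", errors="replace").decode("ascii")


def looks_garbled(extracted: str, threshold: float = 0.5) -> bool:
    """Heuristic (assumption): True if at least `threshold` of the
    non-whitespace characters in the extracted text are '?'."""
    visible = [ch for ch in extracted if not ch.isspace()]
    if not visible:
        return False
    return visible.count("?") / len(visible) >= threshold


sample = "Привет мир"           # Cyrillic sample text
lossy = to_ascii_lossy(sample)  # every Cyrillic letter becomes '?'
print(lossy)
print(looks_garbled(lossy), looks_garbled(sample))
```

A check like `looks_garbled` could be run on extracted text to flag documents affected by this kind of problem before the output is used further.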

Thanks.

meieat · February 14, 2008, 12:58am

Hi again,

Where can we see whether this bug has been fixed? And is it known when the fix will be released?

Tnx in advance

forever · February 14, 2008, 3:24am

Hi,

We are working on this issue, but I am afraid we can’t support it in the short term. We will notify you on this thread when we make progress.

aspose.notifier · November 3, 2009, 2:19pm

The issues you have found earlier (filed as 4259) have been fixed in this update.

This message was posted using Notification2Forum from Downloads module by aspose.notifier.