We're sorry Aspose doesn't work properply without JavaScript enabled.

Free Support Forum - aspose.com

Extracting text from PDF image


i am working on a project where i need to read the text from an image
stored in a PDF file.Basically my PDF file contains the forms which
are stored as images and from these images i need to extract the
information like user name, user address.I have downloaded
Aspose.Pdf.Kit.msi from your site.

I just want to know whether i can extract the text from an
image using Aspose.Pdf.Kit. In your documentation you have provided
the examples for extracting the text and image from PDF but there is
no such example or topic which describes how to extract the text from
an image.


Hello Pandurang,<?xml:namespace prefix = o ns = "urn:schemas-microsoft-com:office:office" />

Thanks for considering Aspose.

I am sorry to inform you that extraction of text from Image file is currently not supported by Aspose.Pdf.Kit, and I am afraid we cannot support it in short time. We apologize for your inconvenience.

Hi Pandurang,

The functionality you refer to is named OCR after Optical Character Recognition. Presently it is unavailable but is planned for future releases of Aspose.Recognition component that is aimed at intelligent processing of PDF files.

You can learn more about it on the product page:

[http://www.aspose.com/categories/file-format-components/aspose.recognition-for-.net/default.aspx ](http://www.aspose.com/categories/file-format-components/aspose.recognition-for-.net/default.aspx)

Right now it supports text format extraction from PDF files into formatted layout documents like DOC, HTML, RTF etc.

If you like I can notify you once we have OCR module in place.