Convert scanned PDF (Image) to searchable PDF (Text)

Dear Support,

This is an urgent requirement for our client.

Our requirements:

1. Our customer will scan the newspaper and give it to us as an image PDF.
2. We should convert this image PDF into a readable/searchable newspaper in PDF form.
3. When a user searches for a keyword, we should highlight it in the PDF document.
4. We should save the images and text from the PDF document into a SQL / MySQL database.


We would be happy if you could provide a trial version or suggest the correct solution for this.

Please refer to the input file (image PDF), which needs to be converted into a searchable PDF.


Thank you for your support.

Regards,
Selvam.R

Hi Selvam,


Thanks for contacting support.

Aspose.Pdf for .NET can extract text from a PDF document, but if the source PDF file was generated from a scanned image, I am afraid Aspose.Pdf for .NET might not be able to extract or highlight text inside the file. For this particular requirement, you first need to extract the images from the PDF document, and then you may try using Aspose.OCR to extract text from the image files. For more information, please visit the following links


Once you have performed OCR on the images, you can convert the text contents back to PDF format using Aspose.Pdf for .NET. For further details, please visit How to Convert a text file to PDF
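
For the storage and keyword lookup parts of the requirement, here is a rough sketch of what could happen once the page images have been extracted and OCR has been run. It is not Aspose code: it is plain Python using only the standard library, with SQLite standing in for SQL / MySQL. The names `store_pages`, `find_keyword`, the `pages` table, and the `ocr` callable are all hypothetical; `ocr` is a placeholder for whatever OCR step you use (for example Aspose.OCR).

```python
import sqlite3
from typing import Callable, Iterable, List, Tuple


def store_pages(
    conn: sqlite3.Connection,
    page_images: Iterable[bytes],
    ocr: Callable[[bytes], str],
) -> int:
    """Store each page image with its OCR text; return the number of pages stored.

    `ocr` is a hypothetical hook: any function that takes image bytes and
    returns the recognized text.
    """
    conn.execute(
        "CREATE TABLE IF NOT EXISTS pages ("
        "page_no INTEGER PRIMARY KEY, "
        "image BLOB NOT NULL, "
        "text TEXT NOT NULL)"
    )
    count = 0
    for page_no, image in enumerate(page_images, start=1):
        conn.execute(
            "INSERT INTO pages (page_no, image, text) VALUES (?, ?, ?)",
            (page_no, image, ocr(image)),
        )
        count += 1
    conn.commit()
    return count


def find_keyword(conn: sqlite3.Connection, keyword: str) -> List[Tuple[int, int]]:
    """Return (page_no, character offset) for every case-insensitive match."""
    hits: List[Tuple[int, int]] = []
    needle = keyword.lower()
    if not needle:
        return hits
    rows = conn.execute("SELECT page_no, text FROM pages ORDER BY page_no")
    for page_no, text in rows:
        haystack = text.lower()
        start = haystack.find(needle)
        while start != -1:
            hits.append((page_no, start))
            start = haystack.find(needle, start + 1)
    return hits
```

The offsets returned by `find_keyword` are positions in the OCR text only. To highlight a match on the page, you would still need the word positions reported by the OCR step, which this sketch does not cover.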