Extracting Text from images demo does not work for my .tif files

Hi,

I am using IntraPDF pdf2image utility to generate TIF images from a PDF document. The attached TIF images are with DPI=100.
I need to then use Aspose OCR for .Net to extract text from these .tif files. I have already purchased licensed version of this component. So far, I never tested, if this is working for our files. Now I have started this task.
Before implementing in my code, I wanted to check it with your demo at http://www.aspose.com/demos/.net-components/aspose.ocr/csharp/LoadImage.aspx
I selected the files attached here & tried to extract text. But I always receive the error message “Invalid File”.

Can you pls tell me why I receive this error?

Regards
Uma Anand Ilango

Hi Uma,

Small text size is used in the attached images and Aspose.OCR for .NET unfortunately does not support small sizes at the moment but this issue has been logged into our issue tracking system as OCR-29048. We will keep you updated on this issue in this thread.

Regarding online samples, previous version was used in the online samples which does not support .tif extension. We will update the DLL soon. In the meantime, you can use the following solutions.

Sorry for the inconvenience.

Best Regards,

Hi,

Thank you for your reply. I downloaded the code from your community\files and ran the solution.

I changed the attached file extension to .tiff. The output I received from "Extract Text" is below

mPh
~
---(\)/ttt!!-()ti-s!-/-(!t- (Ife!--\
^<+c^^u

This is not a valid output. Can you pls tell me why I am receiving like this?

Regards

Uma Anand Ilango

Hi Uma,

As shared in my previous post, Aspose.OCR has some issues with small text sizes (smaller than 28pt) and this issue has been logged into our issue tracking system as OCR-29048. You are receiving incorrect output because text in the attached images is smaller than the supported sizes. We will let you know once this issue is resolved.

Sorry for the inconvenience.

Best Regards,

The issues you have found earlier (filed as ) have been fixed in this Aspose.Words for JasperReports 18.3 update.