OcrEngine does not read text from jpg file


#1

Just started to test extract text from jpg with below code; and not succeeded;
Aspose.OCR.License license = new Aspose.OCR.License();
license.SetLicense(“Aspose.OCR.lic”);

        // create an instance of OcrEngine
        var engine = new Aspose.OCR.OcrEngine();
        // set image file
        engine.Image = ImageStream.FromFile("C:\\temp\\test\\" + "test1.jpg");
        // process the image
        engine.Process();
        // get results
        //  Console.WriteLine(engine.Text);
        txtyaz.Text = engine.Text.ToString();

image below;

image.jpg (155.5 KB)


#2

@refikcoskun,

We have looked into the sample image shared by you. It was found that the data in the image is in tabular format. Please note that current implementation does not support extracting data from table format. Furthermore reading data from tabular format issue has been logged into our system with ID OCRNET-2941. The issue ID has been link with this thread. You will be notified automatically in this forum thread once any update is available.


#3

Thank you very much for your interest. This time I used my business card as an image (photo 1), with same code and I got below error (photo 2) ?
photo 1;

IMG_1150.JPG (336.2 KB)

photo 2;

error.JPG (37.6 KB)


#4

@refikcoskun,

We have looked into the issue. While investigation it was found that the image contains Turkish language. Turkish language is not supported. A much more sophisticated engine is required to support Turkish language. Aspose.OCR for .NET API currently supports the following languages.

  1. English
  2. Spanish
  3. French
  4. Portuguese

#5

Hi Rumi,

Thank you very much for your greate help. I have a last question;

with same code i have good result for example 1 below but error in example 2 ?

example 1;

test5.jpg (12.9 KB)

example 2;

test6.jpg (12.8 KB)

exception ;

errorDetail.JPG (44.3 KB)

thanks again,

Refik Coşkun


#6

Sorry exaple 2 and exception error are below;

exaple 2;

test6.jpg (11.5 KB)

exception ;

errorDetail2.JPG (58.9 KB)


#7

Hi Ikram,

is it possible that OcrEngine runs after an exception with the next character
instead of stoping the executing, giving a space, forexample, for unrecognized character.

Thank you and good day,

Refik Coşkun


#8

@refikcoskun,

We have tried to reproduce the issue at our end using the sample image and code snippet shared by you. Following are the OCR results.

abcdefghijklmnopqrstuvwxR

abcdefghijklmnopqrstuvwxyz A 

Further, there is no such functionality available that enable the OCR engine to carry on when any exception is encountered.