OCR Text Extract Accuracy


#1

We are using the below code for extracting text from image. But unable to extract text from the attached image .

Sample Code :

// Initialize an instance of OcrEngine

OcrEngine ocrEngine = new OcrEngine();

ocrEngine.ClearNotifies();

CorrectionFilters filters = new CorrectionFilters();

Filter filter = null;

filter = new RemoveNoiseFilter();

filters.Add(filter);

ocrEngine.Config.DoSpellingCorrection = true;

ocrEngine.Config.CorrectionFilters = filters;

ocrEngine.Config.RemoveNonText = true;

IRecognitionBlock blockB = RecognitionBlock.CreateTextBlock(657, 11, 289, 75);

blockB.Whitelist = new char[] { '0', '1', '2', '3', '4', '5', '6', '7', '8', '9' };

ocrEngine.Config.AddRecognitionBlock(blockB);

ocrEngine.Config.DetectTextRegions = false;

ocrEngine.Image = ImageStream.FromFile(dataDir);

if (ocrEngine.Process())

{

Console.WriteLine(ocrEngine.Text);

}

ocrEngine.Dispose();


#2

We are using the below code for extracting text from image. But unable to extract text from the attached image .

Sample Code :

// Initialize an instance of OcrEngine

OcrEngine ocrEngine = new OcrEngine();

ocrEngine.ClearNotifies();

CorrectionFilters filters = new CorrectionFilters();

Filter filter = null;

filter = new RemoveNoiseFilter();

filters.Add(filter);

ocrEngine.Config.DoSpellingCorrection = true;

ocrEngine.Config.CorrectionFilters = filters;

ocrEngine.Config.RemoveNonText = true;

IRecognitionBlock blockB = RecognitionBlock.CreateTextBlock(657, 11, 289, 75);

blockB.Whitelist = new char[] { '0', '1', '2', '3', '4', '5', '6', '7', '8', '9' };

ocrEngine.Config.AddRecognitionBlock(blockB);

ocrEngine.Config.DetectTextRegions = false;

ocrEngine.Image = ImageStream.FromFile(dataDir);

if (ocrEngine.Process())

{

Console.WriteLine(ocrEngine.Text);

}

ocrEngine.Dispose();


#3
Hi Prakash,

Thank you for your inquiry and sharing sample.

We have investigated the issue at our end. Initial investigation shows that the issue persists. The issue had been logged into our system with ID OCRNET-2938. Our product team will further look into it. We will update with the progress via this forum thread.


#4
Hi Prakash,

Thank you for your inquiry and sharing sample.

We have investigated the issue at our end. Initial investigation shows that the issue persists. The issue had been logged into our system with ID OCRNET-2938. Our product team will further look into it. We will update with the progress via this forum thread.

#5

Can we get updated status on this as it’s urgent for us to proceed further.


#6
Hi Prakash,

Thank you for writing us back.

This is to update you that we have asked for an update from our product team. We will share it with you as soon as we have any update on this issue.


#7

The issues you have found earlier (filed as ) have been fixed in this Aspose.Words for JasperReports 18.3 update.