We are evaluating using aspose ocr in a project so I am doing some tests to evaluate it. My test consists of doing OCR on a newspaper jpg (spanish). The jpg isn’t a scan but the digital copy of the newspaper page so I expect it to be pretty accurate. Unfortunately the result of the OCR process is:
4DESTAQUE Naercoreazemeservembremerzoadamsrrem
-ili/][][][]I][][][][][][][]I]I][l][][][I[][][///[][][][][][][][][][///l/][][<’/][][][][I][][][][][][][I/h/][][][][]]]]]]]]]]]]]]]][]I1I]]]]]][][]I][][][]]]]]]][][][][][][][][][][][][][][/////////][][][][][][]I][][][][]]]]][LI][][]I][].illili]]][I][][][][][][][][][][][][][][][][][][][.ili]]]]]]]]]]]]]I][][.[][][][[
The code is pretty simple however I’m not seeing any results. We have licenses for aspose pdf and cells which work great but am having trouble getting this test to work. The byte array is fine, I’ve written it to disk and the spa.zip (downloaded from the aspose as the spanish dictionary) is in the appropriate path. Any help?
oe.LanguageContainer.Clear();
oe.LanguageContainer.AddLanguage(LanguageFactory.Load(Path.Combine(path, “spa.zip”)));
oe.Config.DetectTextRegions = true;
foreach (var f in files)
{
<span style=“font-family: “Courier New”;”> oe.Image = ImageStream.FromStream(new MemoryStream(f.Contents), ImageStreamFormat.Jpg);
if (oe.Process())
{
f.Ocr = oe.Text.ToString();
db.Update(f, new List() { “Ocr” });
}
}