Hello,
We need to convert images to the searchable (text) pdf. As I see in this topic (2013)
its not possible without destroying format but now we are in 2017 and i though something was increased but I am disappointed for the result.
Well I am uploading all files here…
1- attachment is my temp image (an invoice)
2- this one is my result by using aspose.ocr and pdf.
Here is my code
Aspose.OCR.License license = new Aspose.OCR.License();
license.SetLicense(“Aspose.Total.lic”);
Aspose.Pdf.License licenseforPdf = new Aspose.Pdf.License();
licenseforPdf.SetLicense(“Aspose.Total.lic”);<span style="color:#2b91af;">OcrEngine</span> ocrEngine = <span style="color:blue;">new</span> <span style="color:#2b91af;">OcrEngine</span>(); Aspose.Pdf.Generator.<span style="color:#2b91af;">Pdf</span> pdf1 = <span style="color:blue;">new</span> Aspose.Pdf.Generator.<span style="color:#2b91af;">Pdf</span>(); <span style="color:#2b91af;">String</span> searchFolder = txtDirectoryPath.Text; <span style="color:blue;">var</span> filters = <span style="color:blue;">new</span> <span style="color:#2b91af;">String</span>[] { <span style="color:#a31515;">"jpg"</span>, <span style="color:#a31515;">"jpeg"</span>, <span style="color:#a31515;">"tif"</span>, <span style="color:#a31515;">"png"</span>, <span style="color:#a31515;">"gif"</span>, <span style="color:#a31515;">"tiff"</span>, <span style="color:#a31515;">"bmp"</span> }; <span style="color:blue;">var</span> images = GetFilesFrom(searchFolder, filters, <span style="color:blue;">false</span>); <span style="color:blue;">foreach</span> (<span style="color:blue;">var</span> image <span style="color:blue;">in</span> images) { ocrEngine.Image = <span style="color:#2b91af;">ImageStream</span>.FromFile(image); <span style="color:blue;">if</span> (ocrEngine.Process()) { Aspose.Pdf.Generator.<span style="color:#2b91af;">Section</span> sec1 = pdf1.Sections.Add(); <span style="color:green;">// Create a new text paragraph and pass the text to its constructor as argument</span> Aspose.Pdf.Generator.<span style="color:#2b91af;">Text</span> t2 = <span style="color:blue;">new</span> Aspose.Pdf.Generator.<span style="color:#2b91af;">Text</span>(ocrEngine.Text.ToString()); sec1.Paragraphs.Add(t2); pdf1.Save(<span style="color:#2b91af;">Path</span>.Combine(searchFolder, <span style="color:#a31515;">"Result"</span>, <span style="color:#2b91af;">Path</span>.GetFileName(image) + <span style="color:#a31515;">".Pdf"</span>)); <span style="color:green;">// Display the recognized text</span> <span style="color:#2b91af;">Console</span>.WriteLine(ocrEngine.Text); <span style="color:#2b91af;">Console</span>.WriteLine(ocrEngine.Text.PartsInfo[0].Box); } <span style="color:blue;">else</span> { <span style="color:#2b91af;">Console</span>.WriteLine(<span style="color:#a31515;">"Error in file "</span> + <span style="color:#2b91af;">Path</span>.GetFileName(image)); } } <span style="color:#2b91af;">MessageBox</span>.Show(<span style="color:#a31515;">"Completed"</span>);</pre><pre style="font-family: Consolas; font-size: 13px; background: white;"><br></pre><pre style="font-family: Consolas; font-size: 13px; background: white;">(I see that i can only get it as text format)</pre><pre style="font-family: Consolas; font-size: 13px; background: white;"><br></pre><pre style="font-family: Consolas; font-size: 13px; background: white;">3- I tried abby finereader online service and the result is on the third attachment</pre><pre style="font-family: Consolas; font-size: 13px; background: white;"><br></pre><pre style="font-family: Consolas; font-size: 13px; background: white;">As you see abby finereader is the perfect. </pre><pre style="font-family: Consolas; font-size: 13px; background: white;"><br></pre><pre style="font-family: Consolas; font-size: 13px; background: white;">Well i need to know that whether you have that kind of service or you will have in close feature? </pre><pre style="font-family: Consolas; font-size: 13px; background: white;">If not then we will look for other solution. </pre><pre style="font-family: Consolas; font-size: 13px; background: white;">Because the aspose.ocr result is completly unuseful (horrible) data for us.</pre><pre style="font-family: Consolas; font-size: 13px; background: white;"><br></pre><pre style="font-family: Consolas; font-size: 13px; background: white;"><br></pre><pre style="font-family: Consolas; font-size: 13px; background: white;"><br></pre><pre style="font-family: Consolas; font-size: 13px; background: white;"><br></pre></div>