Hi,
Is there a way to extract (absorb) a table from a non-editable PDF file? It appears to be a scanned image that was converted to PDF. We need to extract only the table from the document, exactly as it appears.
2020-01-31_12-17_1 (1).pdf (18.9 KB)
Please find the document attached, and let me know if you need anything else.
Thank you, M.S. Sathish.
@sathish.sundaresan
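We are afraid that Aspose.PDF does not offer this feature, i.e. recognizing a table from an image. However, you can try extracting the text by performing OCR on the PDF using Aspose.PDF or Aspose.OCR. For example, with Aspose.OCR you can limit recognition to a rectangular area of the page: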
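using System.Collections.Generic;
using Aspose.OCR;
// Create the OCR engine (requires the Aspose.OCR package).
AsposeOcr api = new AsposeOcr();
// imgPath is the path to the scanned PDF file.
OcrInput input = new OcrInput(InputType.PDF);
input.Add(imgPath);
// Limit recognition to one rectangle: x, y, width, height.
var result = api.Recognize(input, new RecognitionSettings
{
    RecognitionAreas = new List<Aspose.Drawing.Rectangle>
    {
        new Aspose.Drawing.Rectangle(10, 10, 200, 500)
    }
});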
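Since OCR returns plain text rather than a table object, the rows and cells have to be rebuilt by hand. The sketch below is one rough way to do that, not an Aspose feature: `OcrTableSketch` and `SplitIntoRows` are hypothetical names, and the code assumes you already have the recognized text as a string and that columns are separated by a tab or by two or more spaces. OCR output does not guarantee that spacing, so treat this as a starting point only.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;
using System.Text.RegularExpressions;

// Hypothetical helper; not part of Aspose.PDF or Aspose.OCR.
public static class OcrTableSketch
{
    // Turns recognized text into rows (one per non-empty line) and cells.
    // Assumption: cells are separated by a tab or by 2+ spaces/tabs.
    public static List<string[]> SplitIntoRows(string recognizedText)
    {
        var rows = new List<string[]>();
        foreach (var line in recognizedText.Split('\n'))
        {
            // Trim also removes a trailing '\r' from Windows line endings.
            var trimmed = line.Trim();
            if (trimmed.Length == 0)
            {
                continue; // skip blank lines
            }
            var cells = Regex.Split(trimmed, @"[ \t]{2,}|\t")
                             .Select(cell => cell.Trim())
                             .ToArray();
            rows.Add(cells);
        }
        return rows;
    }
}
```

Calling `OcrTableSketch.SplitIntoRows(text)` on the recognized text returns one string array per row. Single spaces inside a cell are kept, so a cell such as "Unit price" stays together as long as the columns are at least two spaces apart.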