Hello,
We are trying to parse a table from a PDF file with the TableAbsorber class, but some of the content (whole columns) is missing from the absorber's output. The PDF file and the parsing results are attached.
We used the code snippet below:
Document pdfDocument = new Document(dataDir + "Max.pdf");
// Extract tables from the first page only (Pages is 1-based).
TableAbsorber absorber = new TableAbsorber();
absorber.Visit(pdfDocument.Pages[1]);
foreach (AbsorbedTable table in absorber.TableList)
{
    foreach (AbsorbedRow row in table.RowList)
    {
        foreach (AbsorbedCell cell in row.CellList)
        {
            // Print every text fragment in the cell, separated by spaces.
            foreach (TextFragment text in cell.TextFragments)
            {
                Console.Write(text.Text + " ");
            }
            // Mark the end of the cell.
            Console.Write("|");
        }
        // Mark the end of the row.
        Console.WriteLine("-------------------------------------------");
    }
    // Mark the end of the table.
    Console.WriteLine("===========================================");
}
Test.pdf (38.5 KB)
Result.pdf (41.0 KB)