Hello,
I tried to extract tables from a pdf file which contain merged cells but I couldn’t have the correct results, please find here my source code.
package aspose;
import com.aspose.pdf.*;
public class App
{
public static void main( String[] args )
{
Document doc = new Document("RIVP000C8E3B.pdf");
try {
TableAbsorber absorber = new TableAbsorber();
PageCollection pc = doc.getPages();
for(Page pg:pc){
absorber.visit(pg);
com.aspose.pdf.internal.ms.System.Collections.Generic.IGenericList<AbsorbedTable> l = absorber.getTableList();
for(AbsorbedTable table:l){
com.aspose.pdf.internal.ms.System.Collections.Generic.IGenericList<AbsorbedRow> r = table.getRowList();
for(AbsorbedRow row:r)
{
com.aspose.pdf.internal.ms.System.Collections.Generic.IGenericList<AbsorbedCell> c = row.getCellList();
for(AbsorbedCell cell:c)
{
for(TextFragment tf:cell.getTextFragments())
{
for(TextSegment ts:tf.getSegments())
{
System.out.println(ts.getText());
}
}
}
}
}
}
} catch (Exception e) {
// TODO Auto-generated catch block
e.printStackTrace();
}
}
}
Thank you for your help.
Appreciate
Thamer