How to get merged columns from PDF files

We are using Aspose.PDF in our project. When a PDF file has merged columns, we receive only a single column value, but we need to receive all the column values from a row. How can we get multiple values when a row consists of multiple columns and values?
At the moment we are using AbsorbedTable, which provides only a single column even when a row has multiple column values.
Pdfprasing.JPG (30.7 KB)

@Srikanth_Pulla,

Can you share a pdf document with that layout and your code snippet, please?

Import_Export.pdf (430.6 KB)
Please find the attachment, where the table has merged columns and we are able to get a single column only. Please find the link below for our code snippet.

@Srikanth_Pulla,

Here is some code that will show all the content.

// Requires: using System; using System.Linq; using Aspose.Pdf; using Aspose.Pdf.Text;
private void Logic(Document doc)
{
    var pageList = doc.Pages.Cast<Page>().ToList();

    foreach (var page in pageList)
    {
        // Create TableAbsorber object to find tables
        TableAbsorber absorber = new TableAbsorber();
        absorber.Visit(page);
        foreach (var table in absorber.TableList)
        {
            foreach (var row in table.RowList)
            {
                bool foundSomething = false;
                string rowText = string.Empty;
                foreach (var cell in row.CellList)
                {
                    foreach (TextFragment fragment in cell.TextFragments)
                    {
                        if (!string.IsNullOrWhiteSpace(fragment.Text))
                        {
                            foundSomething = true;
                            rowText += $"{fragment.Text}|";
                        }
                    }                            
                }

                if (foundSomething)
                    Console.WriteLine(rowText);
            }
            Console.WriteLine();
        }
        Console.WriteLine("-----------------------------");
    }

    Console.ReadKey();
}

The problem is that when a cell spans multiple rows, the rest of the content appears in a separate table, as you can see in the output of my code.

So I will be creating a ticket for the dev team.

@Srikanth_Pulla
We have opened the following new ticket(s) in our internal issue tracking system and will deliver their fixes according to the terms mentioned in Free Support Policies.

Issue ID(s): PDFNET-54503

You can obtain Paid Support Services if you need support on a priority basis, along with the direct access to our Paid Support management team.

Hi @carlos.molina,

We have already purchased the licenses. Could you please treat this as high priority?

Regards,
Srikanth Pulla

@Srikanth_Pulla,

I am sorry, but I am not part of the dev team. I am only a member of the free support team.

Remember that having a paid license does not qualify you for Paid Support Services.

I did send a note to the dev team with your message, saying that you need this soon. However, prioritization will be handled according to the Free Support Policies.