PDF text and table extraction

How to extract text and tables from PDF using Aspose PDF for .Net to create a word document out of it by preserving their order. E.g., PDF file starts with text followed by table which is again followed by text.

@bhargavgaglani07

You can simply convert the entire PDF document into Word format and output file will have content in the same order. In case some issue is there, please share your sample file along with expected output with us so that we can test the scenario in our environment and address it accordingly.

Hi @asad.ali,

Thank you for providing information. Converting to PDF has performance issues and takes too much time in case of large PDFs. We are primary looking for text and table only, extracting this data is way faster. So if you can help out to retain order while extracting these then that would be really helpful.

@bhargavgaglani07

Would you please share your sample PDF document for our reference so that we can test the scenario in our environment and address it accordingly.

Find attached sample file.

Sample Text Document With Tables.pdf (1.5 MB)

@bhargavgaglani07

We have opened the following new ticket(s) in our internal issue tracking system and will deliver their fixes according to the terms mentioned in Free Support Policies.

Issue ID(s): PDFNET-54769

You can obtain Paid Support Services if you need support on a priority basis, along with the direct access to our Paid Support management team.