If a PDF contains a table (text within can be copy and pasted), will the table retain its HTML structure after converting the PDF to HTML?
Thank you for your response,
I have a requirement to convert PDF tables for data extraction.
Most of the tables follow a simple row-column layout.
HTML is chosen because we have a XSLT application to perform data transformation quickly.
Depending on how much effort is required, extracting the information directly from the PDF is still within consideration.
Thanks for your feedback. In addition to your existing approach of converting PDF to HTML for table data extraction, you may covert PDF to excel using Aspose.Pdf for .NET and later can extract row/column data from excel workbook using Aspose.Cells for .NET. Please check following Aspose.Cells documentation link for the purpose.
Please feel free to contact us for any further assistance.