Hi, Support:
Is there any method to draw text data from a table on a given page in a PDF?
Please refer to the demo PDF file, where the data to be drawn, in a given cell of a given table on a given page, has been marked.
Thanks.
Data in Table Cell to be drawn.pdf (65.3 KB)
If you have a marker defined in the table cell that needs to be replaced with the desired text, you can achieve this with the text replacement functionality.
Thanks for your suggestion. However, this does not look like an ideal method to achieve this goal. If the API could locate the table on the page in the PDF, and then locate the data to be drawn in the cell, that would be the right way to do this.
We will surely investigate the feasibility of this requirement. One question: by drawing the data, do you mean adding text?
I mean extracting the text from the given cell in the table and saving the extracted text as a txt file.
This can be achieved with the following code:
' Requires Imports Aspose.Pdf and Imports Aspose.Pdf.Text at the top of the file.
' pdfDocument is an Aspose.Pdf.Document that has already been loaded.
Dim absorber As New TableAbsorber()
absorber.Visit(pdfDocument.Pages(1)) ' page numbers start at 1
Dim cellText As String = ""
For Each table As AbsorbedTable In absorber.TableList
    For Each row As AbsorbedRow In table.RowList
        For Each cell As AbsorbedCell In row.CellList
            For Each fragment As TextFragment In cell.TextFragments
                ' Join the segments of one fragment, then end the line.
                For Each seg As TextSegment In fragment.Segments
                    cellText &= seg.Text
                Next
                cellText &= vbCrLf
                'Console.WriteLine(cellText)
            Next
        Next
    Next
Next
However, if the table is present as a scanned image, TableAbsorber cannot detect it. I hope this feature will be supported in a later version.
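To read only one cell instead of every cell, the same TableAbsorber lists can be indexed directly. The following is a minimal sketch and not an official API method: GetCellText is a hypothetical helper, "input.pdf" and "cell.txt" are placeholder file names, and the table, row and cell positions are assumed to be 0-based indexes into TableList, RowList and CellList (the page number is 1-based).

```vb
Imports System.Text
Imports Aspose.Pdf
Imports Aspose.Pdf.Text

Module CellTextExport
    ' Hypothetical helper (not part of Aspose.PDF): returns the text of one cell.
    ' pageNumber is 1-based; tableIndex, rowIndex and cellIndex are 0-based.
    Function GetCellText(doc As Document, pageNumber As Integer,
                         tableIndex As Integer, rowIndex As Integer,
                         cellIndex As Integer) As String
        Dim absorber As New TableAbsorber()
        absorber.Visit(doc.Pages(pageNumber))

        ' Throws ArgumentOutOfRangeException if any index does not exist.
        Dim cell As AbsorbedCell =
            absorber.TableList(tableIndex).RowList(rowIndex).CellList(cellIndex)

        Dim sb As New StringBuilder()
        For Each fragment As TextFragment In cell.TextFragments
            For Each seg As TextSegment In fragment.Segments
                sb.Append(seg.Text)
            Next
            sb.AppendLine()
        Next
        Return sb.ToString()
    End Function

    Sub Main()
        ' Placeholder file names; replace with real paths.
        Dim doc As New Document("input.pdf")
        ' First cell of the first row of the first table on page 1.
        Dim cellText As String = GetCellText(doc, 1, 0, 0, 0)
        System.IO.File.WriteAllText("cell.txt", cellText)
    End Sub
End Module
```

The order of tables, rows and cells is whatever TableAbsorber detects on the page, so it is worth checking the indexes against the actual document before relying on them.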
As per our understanding, you want a method in the API where you can specify the table, row, and cell number and have it add text inside that cell. Secondly, you need to be able to extract tables that are present as scanned images. Can you please confirm? Also, please share a PDF that contains a table as a scanned image.
We have opened the following new ticket(s) in our internal issue tracking system and will deliver their fixes according to the terms mentioned in Free Support Policies.
Issue ID(s): PDFNET-55685
Regarding your second requirement of extracting tables that are present as scanned images, we are afraid this would not be possible. Aspose.PDF does not process or manipulate scanned images. The most you can do is perform OCR on such images to extract the table text. Changing the text on the image itself would still not be possible.
We apologize for the inconvenience.