Could you please share your sample PDF document? We will test the case in our environment and address it accordingly.
AccumTermsheet_LGT(UBS).pdf (193.5 KB)
food.pdf (151.1 KB)
Please find the attached PDFs and help with a solution.
Error.png (112.1 KB)
Is it possible to have a call to discuss this in detail? It would solve my problem on a priority basis, and it would also help fix the problem if it is present in the library.
I am able to read the table but not the data in each cell. Reading data from a cell always shows “Enumeration yielded no results”. Also, for a table that is split across two pages, only part of the table is recognized, not the full table.
If a call is possible, please tell me what time would work.
I’m sorry, we do not provide phone support under free support. Phone support is only provided under paid Enterprise Support or Business Support.
However, we’re going to investigate the issue you shared in detail and will update you soon. Please note that under free support, issues are handled on a first-come, first-served basis, but we’ll try to prioritize the investigation of this one.
Thanks for your reply. Please try to investigate as early as possible.
We have tested the scenario in our environment using Aspose.PDF for .NET 20.10 and the following code snippet:
using System;
using Aspose.Pdf;
using Aspose.Pdf.Text;

// dataDir is the folder that contains the sample PDF.
Document pdfDocument = new Document(dataDir + "food.pdf");
foreach (var page in pdfDocument.Pages)
{
    // Detect the tables on the current page.
    TableAbsorber absorber = new TableAbsorber();
    absorber.Visit(page);
    foreach (AbsorbedTable table in absorber.TableList)
    {
        foreach (AbsorbedRow row in table.RowList)
        {
            foreach (AbsorbedCell cell in row.CellList)
            {
                // Each cell exposes its text as a collection of fragments.
                foreach (TextFragment fragment in cell.TextFragments)
                {
                    // Concatenate the fragment's segments into one string.
                    string txt = "";
                    foreach (TextSegment seg in fragment.Segments)
                    {
                        txt += seg.Text;
                    }
                    Console.WriteLine(txt);
                }
            }
        }
    }
}
The API was able to extract table data from one of the PDFs, i.e. AccumTermsheet_LGT(UBS).pdf. It also detected a table inside food.pdf but was unable to extract any text from it. Hence, an issue has been logged in our issue tracking system for this particular file as PDFNET-48939. We will look into its details further and keep you informed about the status of its fix. Please give us some time.
We apologize for the inconvenience.
PS: We tested the scenario with a valid license applied as well. Please make sure that you are using the API with a license. In case you do not have one, please try applying for a free 30-day temporary license to evaluate the API without any limitations.
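For reference, a minimal sketch of applying a license before running the extraction code. The file name "Aspose.Pdf.lic" is a placeholder assumed for illustration; use the path of the license file you actually received.

```csharp
using Aspose.Pdf;

// Apply the license once at startup, before creating any Document.
// "Aspose.Pdf.lic" is a placeholder path, not a file named in this thread.
Aspose.Pdf.License license = new Aspose.Pdf.License();
license.SetLicense("Aspose.Pdf.lic");
```

Setting the license once per process is normally enough; it does not need to be repeated for each document.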
Thanks for your prompt reply. We are waiting for the PDFNET-48939 fix.
We also need one more thing: if a table is divided across two pages, it must be read as a single table, not as two different tables. Reading it as two tables causes problems while parsing the PDF. This works when parsing Word documents but not PDFs. Please look into this on priority as well.
Please try to set up a call; we need to discuss more scenarios like this.
We have logged another issue as PDFNET-48942 in our issue tracking system for the case above. We will check the feasibility of its implementation and let you know as soon as it is resolved. Furthermore, as shared earlier by @shahzadlatif, issues are resolved on a first-come, first-served basis under the free support model. We have, however, recorded your concerns and will consider them during the investigation.
Please also note that we encourage using our free support forum; phone support is only provided under paid support. If you have more scenarios to report, please share them in this forum thread and we will address them after testing them in our environment.
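Until the split-table request is addressed in the API, one possible interim approach is to stitch the tables together in your own code. The following is only a sketch under stated assumptions, not the fix tracked as PDFNET-48942: `TableStitcher`, `ReadRows`, and `ExtractMerged` are hypothetical names, the code assumes `TableList` returns tables in top-to-bottom order, and it treats the first table on a page as a continuation of the last table on the previous page whenever both have the same number of columns in their first row. That heuristic can merge unrelated tables, and it cannot help with files where no cell text is extracted at all.

```csharp
using System;
using System.Collections.Generic;
using Aspose.Pdf;
using Aspose.Pdf.Text;

// Hypothetical helper for illustration; not part of Aspose.PDF.
static class TableStitcher
{
    // Flatten one absorbed table into rows of cell strings.
    static List<List<string>> ReadRows(AbsorbedTable table)
    {
        var rows = new List<List<string>>();
        foreach (AbsorbedRow row in table.RowList)
        {
            var cells = new List<string>();
            foreach (AbsorbedCell cell in row.CellList)
            {
                // Text that wraps inside a cell arrives as several fragments.
                var parts = new List<string>();
                foreach (TextFragment fragment in cell.TextFragments)
                {
                    parts.Add(fragment.Text);
                }
                cells.Add(string.Join(" ", parts));
            }
            rows.Add(cells);
        }
        return rows;
    }

    // Collect every table in the document. A table that opens a page is
    // appended to the last table of the previous page when the column
    // counts of their first rows match (an assumed heuristic).
    public static List<List<List<string>>> ExtractMerged(Document document)
    {
        var tables = new List<List<List<string>>>();
        bool previousPageHadTable = false;
        foreach (Page page in document.Pages)
        {
            var absorber = new TableAbsorber();
            absorber.Visit(page);
            bool firstOnPage = true;
            bool pageHadTable = false;
            foreach (AbsorbedTable table in absorber.TableList)
            {
                var rows = ReadRows(table);
                if (rows.Count == 0) continue; // nothing usable was extracted
                bool continuesPrevious = firstOnPage && previousPageHadTable
                    && tables[tables.Count - 1][0].Count == rows[0].Count;
                if (continuesPrevious)
                    tables[tables.Count - 1].AddRange(rows);
                else
                    tables.Add(rows);
                firstOnPage = false;
                pageHadTable = true;
            }
            previousPageHadTable = pageHadTable;
        }
        return tables;
    }
}
```

A call such as `TableStitcher.ExtractMerged(new Document(dataDir + "AccumTermsheet_LGT(UBS).pdf"))` would then return one list of rows per logical table. If the continuation repeats a header row on the second page, that row would need to be dropped separately.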
OK, understood. We will post whatever challenges we face here.
Sure, please take your time to evaluate the API and let us know in case you face any issue.
Please share a temporary license for the latest version…
Please fix the reported issues on priority. It will be helpful to us.
You can apply for a 30-day temporary license yourself on our website.
We will resolve the logged issues; however, they will be resolved on a first-come, first-served basis as per the policy of normal support. We will inform you as soon as the tickets associated with this forum thread are resolved. We highly appreciate your patience in this regard. Please give us some time.
We apologize for the inconvenience.
Please help with this. Is this issue fixed or not? If it is fixed, please share the latest patch so we can upgrade at our end.
I am also not able to read text from a PDF table cell if that text is broken across two lines.
The ticket logged above is more like a new feature that needs to be added to the API. It will take a certain amount of time to investigate its feasibility and implement it. We will inform you within this forum thread as soon as we have some definite updates in this regard. We request your patience in this matter.
Are you referring to another issue you are facing with some PDF? Please share a sample PDF document and code snippet so that we can replicate this issue on our side and address it accordingly.
Please tell me the status of the issues I reported. Are they all solved, or not yet? We have some client requirements that depend on them.
We are afraid that the earlier logged tickets have not yet been resolved. As soon as they are completely investigated, we will be able to share an ETA for their fixes with you. We highly appreciate your patience and understanding in this matter. Please give us some time.
We are sorry for the inconvenience.