Hi
I am evaluating Aspose.PDF. I installed the latest version from NuGet (20.8.0) and activated a temporary license.
I have to migrate a project that currently uses an old version of 3Heights extract (from 2011-2012). Basically, I have to read all the text objects in a PDF, even when it has many pages (10,000+), and extract each text object with all its properties (position, font, text, and so on).
string pathExe = Path.GetDirectoryName(System.Reflection.Assembly.GetEntryAssembly().Location);
FileStream streamLicense = new FileStream(Path.Combine(pathExe, "Aspose.Pdf.lic"), FileMode.Open);
License license = new Aspose.Pdf.License();
license.SetLicense(streamLicense);
Aspose.Pdf.Document document = new Aspose.Pdf.Document(filePdf);
TextFragmentAbsorber textFragmentAbsorber = new TextFragmentAbsorber();
document.Pages.Accept(textFragmentAbsorber);
I get an “Out of memory” error on the Accept call.
I’ve also tried processing one page at a time, with the same result:
Aspose.Pdf.Document document = new Aspose.Pdf.Document(filePdf);
for (int pagina = 1; pagina <= document.Pages.Count; pagina++)
{
    TextFragmentAbsorber textFragmentAbsorber = new TextFragmentAbsorber();
    document.Pages[pagina].Accept(textFragmentAbsorber);
}
Any tips?