I’ve been evaluating the text extraction capabilities of Aspose.Pdf.Kit and came across a bunch of Pdf’s that cause exceptions. I know a couple are because they are corrupt, or password protected, but most work fine in other programs or viewers. Can you take a look and get back to me ASAP, as I am evaluating this and several other text extraction components?
http://datarg.com/docs/badpdf.rar
Also, the performance doesn’t seem to be what it should be. Is that because of the text garbling, or is it standard?