I've attached several documents that cause the following code to throw an exception and then cause the program to exit without any chance of recovery. The exceptions we get are similar to:
Wrong text extracting, please check your pdf.: v3.8.0.1
at x30a678191d9b1780.x59ea5a64f22abd3a.xc61a6090bb4b2d9b()
at Aspose.Pdf.Kit.PdfExtractor.ExtractText()
Environment is 32-bit executable running on Windows Server 2008 x64.
Test code is: (Note: The stream is because the files do not come from the filesystem, but from a remote server.)
PdfExtractor extractor = new PdfExtractor();
FileStream pdfstream = File.Open( filename, FileMode.Open );
extractor.BindPdf( pdfstream );
try {
extractor.ExtractText();
} catch( Exception ex ) {
Console.WriteLine( ex.Message );
}
Also, any plans to increase the performance of just getting out the text from pdf files. We've got other toolkits that we use to supplement the Aspose toolkit that are a lot faster, but aren't as stable overall.
Thanks for your help in this matter.