Hi, we have a (writable) PDF file that we’re trying to extract text from using the following code:
' Load the PDF bytes into a stream and bind it to the extractor.
Using contentStream As New IO.MemoryStream(Me.ContentFileData)
    Dim extractor As New Aspose.Pdf.Facades.PdfExtractor()
    'extractor.Password = ""
    extractor.BindPdf(contentStream)
    extractor.ExtractText(System.Text.Encoding.Unicode)

    ' Write the extracted text to a second stream, then read it back as a string.
    Using PDFTextStream As New IO.MemoryStream()
        extractor.GetText(PDFTextStream)
        PDFTextStream.Seek(0, IO.SeekOrigin.Begin)
        Dim reader As New IO.StreamReader(PDFTextStream, System.Text.Encoding.Unicode)
        Dim allText As String = reader.ReadToEnd()
        reader.Close()
    End Using
End Using
It works fine for all our other files, just not this one. I’ve tried this with the latest release of Aspose.Pdf. Is there a known issue with this? I can provide the failing file through regular e-mail.
Thanks,
Andy.
Hi Andy,
Thank you for the details.
As your template file has some confidential data, you may mark this thread as private by clicking the “Make Private” button available at the top of every forum post or by checking the “Keep this post as private” checkbox at the bottom when replying to the post. This way, only Aspose staff and you would be able to see your attachments. (Recommended)
If you would rather send the file by email, please follow the steps described at the link below. We will check it and get back to you.
https://about.aspose.cloud/legal/gdpr/
Sorry for the inconvenience.