ParagraphAbsorber throwing Object reference not set to an instance of an object error in Linux environment

jaladev · August 31, 2018, 4:46am

We’re using Aspose.Pdf to extract all paragraph text from PDF files. Everything worked fine during development on a Windows 10 workstation, but when I deploy the application to our server running Ubuntu, an "Object reference not set to an instance of an object" error is thrown when I use the ParagraphAbsorber on the document.

    using (var stream = new MemoryStream(FileBytes))
    {
        using (var pdfDocument = new Document(stream))
        {
            var absorber = new ParagraphAbsorber();

            absorber.Visit(pdfDocument);

            foreach (var pageMarkup in absorber.PageMarkups)
            {
                foreach (var markupSection in pageMarkup.Sections)
                {
                    foreach (var paragraph in markupSection.Paragraphs)
                    {
                        // Extract paragraph text
                    }
                }
            }
        }
    }

Stack trace of the error:

Object reference not set to an instance of an object.
at .(Operator )
at ()
at .(BaseOperatorCollection , Resources , Page )
at .(BaseOperatorCollection , Resources )
at .()
at Aspose.Pdf.Text.TextFragmentAbsorber.Visit(Page page)
at Aspose.Pdf.Text.PageMarkup.(Page )
at Aspose.Pdf.Text.ParagraphAbsorber.Visit(Page page)
at Aspose.Pdf.Text.ParagraphAbsorber.Visit(Document doc)
at Jala.ProjectProcessor.Extractors.PdfExtractor.Extract()
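
The trace shows `ParagraphAbsorber.Visit(Document doc)` calling `ParagraphAbsorber.Visit(Page page)`, so one way to narrow the failure down might be to visit the pages one at a time and catch the exception per page. The following is only a minimal diagnostic sketch, not a fix: the class `ParagraphAbsorberDiagnostics` and the method `VisitPageByPage` are hypothetical names made up for illustration, and it assumes `Document.Pages` can be enumerated and that `Page.Number` gives the page number.

```csharp
using System;
using System.IO;
using Aspose.Pdf;
using Aspose.Pdf.Text;

public static class ParagraphAbsorberDiagnostics
{
    // Hypothetical helper (not part of Aspose.Pdf): visits each page separately
    // so that a failure on one page is reported instead of aborting the whole run.
    public static void VisitPageByPage(byte[] fileBytes)
    {
        using (var stream = new MemoryStream(fileBytes))
        using (var pdfDocument = new Document(stream))
        {
            foreach (Page page in pdfDocument.Pages)
            {
                try
                {
                    // Use a fresh absorber per page so earlier results do not carry over.
                    var absorber = new ParagraphAbsorber();
                    absorber.Visit(page);
                    Console.WriteLine($"Page {page.Number}: OK");
                }
                catch (Exception ex)
                {
                    // Report the exception type and message for the failing page only.
                    Console.WriteLine($"Page {page.Number}: {ex.GetType().Name}: {ex.Message}");
                }
            }
        }
    }
}
```

Calling `ParagraphAbsorberDiagnostics.VisitPageByPage(FileBytes)` on both Windows and Ubuntu should show whether the error is tied to a specific page of the sample file or occurs on every page.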

Server specifications:
OS: Ubuntu 16.04.4 x64
.NET Core Runtime SDK installed: aspnetcore-runtime-2.1
Aspose.Pdf version: 18.8.0

I have also attached a sample PDF file that throws the same error.

Winged.pdf (49.7 KB)

asad.ali · August 31, 2018, 1:57pm

@jaladev

We are setting up an environment to test the scenario and will get back to you shortly.

asad.ali · September 18, 2018, 7:23pm

@jaladev

Thanks for being patient.

We have set up a test environment (Linux CentOS 7 x64 with the .NET Core 2.1 Runtime installed) and tested the scenario. We were able to replicate the error and have logged it in our issue tracking system under the ticket ID PDFNET-45412. We will look further into the details of the issue and keep you informed about the status of its resolution. Please be patient and give us a little time.

We are sorry for the inconvenience.