Extract text from PDF using Java with Aspose.PDF for Java - java.lang.IllegalStateException occurs

报关单.PDF (141.0 KB)
Aspose.PDF for Java 19.4
macOS 10.14.4
java version "1.8.0_181"
Java™ SE Runtime Environment (build 1.8.0_181-b13)
Java HotSpot™ 64-Bit Server VM (build 25.181-b13, mixed mode)

Code:
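// Load the attached file (path assumed) as a com.aspose.pdf.Document
Document pdfDocument = new Document("报关单.PDF");
TextFragmentAbsorber textFragmentAbsorber = new TextFragmentAbsorber();
pdfDocument.getPages().accept(textFragmentAbsorber);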

Stack:
java.lang.IllegalStateException: Resource file GBK2K-H not found in assembly
at com.aspose.pdf.internal.l0k.ld.lI(Unknown Source)
at com.aspose.pdf.internal.l4v.lv.lf(Unknown Source)
at com.aspose.pdf.internal.l4v.lv.lI(Unknown Source)
at com.aspose.pdf.internal.l4j.lt.lI(Unknown Source)
at com.aspose.pdf.internal.l4j.lt.lI(Unknown Source)
at com.aspose.pdf.internal.l4j.lt.lI(Unknown Source)
at com.aspose.pdf.internal.l4y.lI.ld(Unknown Source)
at com.aspose.pdf.internal.l4y.lk.(Unknown Source)
at com.aspose.pdf.internal.l4y.lI.(Unknown Source)

@JamesGuo

Thank you for contacting support.

We were able to reproduce the issue in our environment using the file you shared. It has been logged in our issue management system as PDFJAVA-38594 for further investigation and resolution. The ticket ID is linked to this thread, so you will be notified as soon as the ticket is resolved.

We are sorry for the inconvenience.

The issue you reported earlier (filed as PDFJAVA-38594) has been fixed in Aspose.PDF for Java 20.4.
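
If you need to confirm whether a given deployment already carries this fix, a rough sketch of a version check might look like the following. The class and method names (AsposeVersionCheck, compareVersions, includesFix) are hypothetical and not part of the Aspose API. The sketch assumes version strings are dot-separated numbers such as 19.4 or 20.4, and that the version in use is supplied by you, for example from your build file.

```java
final class AsposeVersionCheck {
    // First release reported in this thread as containing the fix for PDFJAVA-38594.
    static final String FIXED_IN = "20.4";

    // Compares two dot-separated numeric versions part by part.
    // Missing trailing parts count as 0, so "20.4" equals "20.4.0".
    static int compareVersions(String a, String b) {
        String[] x = a.trim().split("\\.");
        String[] y = b.trim().split("\\.");
        int n = Math.max(x.length, y.length);
        for (int i = 0; i < n; i++) {
            int xi = i < x.length ? Integer.parseInt(x[i]) : 0;
            int yi = i < y.length ? Integer.parseInt(y[i]) : 0;
            if (xi != yi) {
                return Integer.compare(xi, yi);
            }
        }
        return 0;
    }

    // True if the given version is 20.4 or later.
    static boolean includesFix(String version) {
        return compareVersions(version, FIXED_IN) >= 0;
    }

    public static void main(String[] args) {
        System.out.println(includesFix("19.4")); // version from the original report
        System.out.println(includesFix("20.4")); // version named as containing the fix
    }
}
```

Comparing the parts numerically rather than as strings means a release such as 20.10 is treated as later than 20.4.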