Mac OS 10.14.4
java version “1.8.0_181”
Java™ SE Runtime Environment (build 1.8.0_181-b13)
Java HotSpot™ 64-Bit Server VM (build 25.181-b13, mixed mode)
Aspose: aspose-pdf-19.8
read text from PDF file, return more text than display in viewer.
just like export to new pdf file with font Embedded false below
Maybe CJK characters
orignal.pdf (72.7 KB)
export.pdf (125.1 KB)
Document pdfDocument = new Document(**orignal.pdf**);
Font font = FontRepository.findFont("SimSun");
font.setEmbedded(false);
TextFragmentAbsorber textFragmentAbsorber = new TextFragmentAbsorber();
pdfDocument.getPages().get_Item(1).accept(textFragmentAbsorber);
for (TextFragment item : textFragmentAbsorber.getTextFragments()) {
item.getTextState().setFont(font);
}
pdfDocument.save("export.pdf");