We're sorry Aspose doesn't work properply without JavaScript enabled.

Free Support Forum - aspose.com

Extract text and image from pdf but emoji can't be extracted

Hi,

I’m extract text and image from pdf using Aspose.PDF but emoji can’t be extracted.
my code:

BufferedInputStream bis = new BufferedInputStream(new FileInputStream("C:\\Users\\xxx\\Desktop\\lucy-test-1.pdf"));
Document document = new Document(bis);
PdfExtractor ext = new PdfExtractor();
ext.setExtractTextMode(1);
ext.bindPdf(document);
ext.extractText(StandardCharsets.UTF_8);
ext.getText(new FileOutputStream("C:\\Users\\xxx\\Desktop\\lucy-test-1.txt"));

here is my pdf :
file-test.pdf (1.2 MB)
result:
1.JPG.png (136.4 KB)

@lucy.hq

A ticket with ID PDFJAVA-40948 has been created in our issue tracking system to further investigate the issue on our end. This thread has been linked with the issue so that you may be notified once the issue will be fixed.