PDF With RTL Text

Hi,

The PDF file contains the following RTL text: מע-4.
However, when running the below code, I get the following simple text: מע4-
Any idea?

Thanks.

com.aspose.pdf.Document pdfDoc = new com.aspose.pdf.Document(file);
pdfDoc.setDirection(com.aspose.pdf.Direction.R2L);
com.aspose.pdf.TextAbsorber textAbsorber = new com.aspose.pdf.TextAbsorber();
textAbsorber.setExtractionOptions(new TextExtractionOptions(TextExtractionOptions.TextFormattingMode.Pure));
pdfDoc.getPages().accept(textAbsorber);
String content = textAbsorber.getText();
System.err.println(content);
pdfDoc.close();

test.pdf (26.1 KB)

@asingernarrative

We have opened the following new ticket(s) in our internal issue tracking system and will deliver their fixes according to the terms mentioned in Free Support Policies.

Issue ID(s): PDFJAVA-43055

You can obtain Paid Support Services if you need support on a priority basis, along with the direct access to our Paid Support management team.