Hi,
The PDF file contains the following RTL text: מע-4.
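However, when I run the code below, the extracted text comes out in a different order: מע4-
Any idea why the digit and the hyphen are swapped, and how to get the text as it appears in the PDF?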
Thanks.
// Load the PDF and mark the document direction as right-to-left
com.aspose.pdf.Document pdfDoc = new com.aspose.pdf.Document(file);
pdfDoc.setDirection(com.aspose.pdf.Direction.R2L);
// Extract text from all pages in Pure formatting mode
com.aspose.pdf.TextAbsorber textAbsorber = new com.aspose.pdf.TextAbsorber();
textAbsorber.setExtractionOptions(new com.aspose.pdf.TextExtractionOptions(com.aspose.pdf.TextExtractionOptions.TextFormattingMode.Pure));
pdfDoc.getPages().accept(textAbsorber);
// Print the extracted text
String content = textAbsorber.getText();
System.err.println(content);
pdfDoc.close();
test.pdf (26.1 KB)
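Since RTL strings can look different depending on how the console or browser renders them, here is a minimal sketch that prints the two strings as code points in storage order, so the swap is unambiguous. It uses only the Java standard library. The class name `CodePointDump` and the method `describe` are made up for illustration, and the two literals are simply the strings as typed in this post (an assumption about what the PDF and `getText()` actually contain).

```java
public class CodePointDump {
    // Formats each code point of s in logical (storage) order, e.g. "U+05DE U+05E2".
    static String describe(String s) {
        StringBuilder sb = new StringBuilder();
        s.codePoints().forEach(cp -> {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(String.format("U+%04X", cp));
        });
        return sb.toString();
    }

    public static void main(String[] args) {
        // Unicode escapes keep the editor and console from reordering the literals.
        String expected = "\u05DE\u05E2-4";      // the text as typed for the PDF content (assumption)
        String actual = "\u05DE\u05E2" + "4-";   // the text as typed for the extraction result (assumption)
        System.out.println("expected: " + describe(expected));
        System.out.println("actual:   " + describe(actual));
    }
}
```

Passing `content` from the snippet above to `describe` instead of the hard-coded literal would show what the absorber really returned, independent of how the terminal displays bidirectional text.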