There is an issue when searching for Hebrew text in PDF with regex. I have a C# Regex object with a Hebrew pattern and when I use the TextFragmentAbsorber it doesn’t find anything but the attached pdf file contains the text that should be matched.
When I extract the Page text with the TextAbsorber the searched word contains some spaces in the output text and I don’t know why.
@erdeiga
We have opened the following new ticket(s) in our internal issue tracking system and will deliver their fixes according to the terms mentioned in Free Support Policies.
Issue ID(s): PDFNET-56566
You can obtain Paid Support Services if you need support on a priority basis, along with the direct access to our Paid Support management team.
@zpopswat
We’ve investigated the issue and found that it requires significant changes to several components related to right-to-left text handling. We’ll continue to investigate and track the issue internally, but a fix won’t be available anytime soon as we’re prioritizing paid support and can’t provide an ETA.