Hi Team,
Hebrew regex search doesn’t work as I expected since Aspose.Pdf 21.10.0.
We are searching with regex in pdf file using TextFragmentAbsorber. If the regex patter contains Hebrew it is seems like the order is reversed and some cases the regex doesn’t match but it should.
Maybe this is related to the following issue: Aspose.Pdf.Text.TextFragmentAbsorber throws exception with hebrew regex range
When we are using Aspose.Pdf 21.9.0 it is working fine except the above use case.
I created a sample project where I compare the TextFragmentAbsorber (Aspose.PDF 21.11.0) with the .Net Regex Class.
AsposePdfHebrew.zip (126.3 KB)
My expectation is that the TextFragmentAbsorber search results should be the same as the System.Text.RegularExpressions.Regex match results in the example project.
Please check the attached .Net Core project and let me know what could be the issue.
It is very important to clarify this because this is a blocker issue for us and we cannot upgrade to the latest version of Aspose.PDF (21.9.0 → 21.11.0)
Thank you for your help.