We're sorry Aspose doesn't work properply without JavaScript enabled.

Free Support Forum - aspose.com

TextFragmentAbsorber not able to search text fragment by regexp

Hi. When we trying to find text fragment using regexp - TextFragmentAbsorber not able to find any text fragment.

Java version: 11
Aspose PDF version: 22.2

Code snippet

	void testTextFragmentSearch() throws IOException {
		var inputStream = new ClassPathResource("pdf/testExtraction.pdf").getInputStream();
		var document = new Document(inputStream);
		var page = document.getPages().get_Item(1);
		var rectangle = new Rectangle(72.024, 381.17000000953675, 181.5959996213913, 393.31399996757506);
		var absorber = new TextFragmentAbsorber();
		var searchValue = "Canada";

		searchValue = Pattern.quote(searchValue);
		absorber.setTextSearchOptions(new TextSearchOptions(rectangle, true));

		var textFragments = absorber.getTextFragments();
		for (var textFragment : textFragments) {
			drawRectangleOnPage(page, textFragment.getRectangle(), new SetRGBColorStroke(1, 0, 0), new SetLineWidth(1));


	private static void drawRectangleOnPage(Page page, Rectangle rectangle, SetRGBColorStroke colorStroke, SetLineWidth width) {
		page.getContents().add(new GSave());
		page.getContents().add(new ConcatenateMatrix(1, 0, 0, 1, 0, 0));
		page.getContents().add(new Re(rectangle.getLLX(), rectangle.getLLY(), rectangle.getWidth(), rectangle.getHeight()));
		page.getContents().add(new ClosePathStroke());
		page.getContents().add(new GRestore());

For Aspose PDF version: 21.8 this code works properly.

Source file:testExtraction.pdf (402.5 KB)
Result: result.pdf (410.0 KB)
Expected result: Absorber.png (42.2 KB)


Please comment above line of code and execute your code to get the desired output.

quote(String) method of a Pattern class used to returns a literal pattern String for the specified String passed as parameter to method . This method produces a String equivalent to s that can be used to create a Pattern. Metacharacters or escape sequences in the input sequence will be given no special meaning


We have logged this problem in our issue tracking system as PDFJAVA-41447. You will be notified via this forum thread once this issue is resolved.

We apologize for your inconvenience.

The issues you have found earlier (filed as PDFJAVA-41447) have been fixed in Aspose.PDF for Java 22.4.