I need to OCR, search, and extract pages from non-searchable PDFs, and I need to do this in a Java servlet running in Tomcat. My requirements are:
1. OCR a non-searchable PDF document and search the recognized text for some strings.
2. Extract all pages from the original PDF that contain the string and store these pages in a new PDF file.
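
To make the second requirement concrete, here is a rough sketch of the page-selection step I have in mind. It assumes the OCR step has already produced one text string per page; the class name `PageMatcher` and the method `matchingPages` are my own hypothetical names, not part of any Aspose API, and the OCR and the actual PDF page copying are not shown.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical helper, not an Aspose class: picks the pages to copy.
class PageMatcher {

    /**
     * Returns the 1-based numbers of the pages whose OCR text contains
     * the search string, ignoring case.
     *
     * @param pageTexts OCR output, one entry per page, in page order
     *                  (assumed to come from whatever OCR engine is used)
     * @param needle    the string to look for
     */
    static List<Integer> matchingPages(List<String> pageTexts, String needle) {
        String wanted = needle.toLowerCase(Locale.ROOT);
        List<Integer> hits = new ArrayList<>();
        for (int i = 0; i < pageTexts.size(); i++) {
            String text = pageTexts.get(i);
            // Skip pages where OCR produced no text at all.
            if (text != null && text.toLowerCase(Locale.ROOT).contains(wanted)) {
                hits.add(i + 1); // PDF page numbers are usually 1-based
            }
        }
        return hits;
    }
}
```

The returned page numbers would then be handed to whichever PDF library does the extraction, so that only those pages are copied into the new file.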
I see that Aspose offers the products 'Java-OCR' and 'Java-PDF'. Based on its description, 'Java-OCR' can only work on BMP files.
Does anyone know whether any single Aspose product, or a combination of them, can satisfy the requirements above?