Hello Java-OCR Tech Support,
I was exchanging emails with your Sales group but before I buy your software, I need to make sure that it can do the job. Please provide a technical solution to my requirements using your products.
I have to OCR, search, and extract pages on non-searchable PDFs. I need to do in a Java servlet in Tomcat. My requirements are:
1. I have to OCR a non-searchable PDF document (multiple pages) and look for some strings.
2. On the pages where I see these strings, I need to pull out the pages and store into a new PDF file.
I see that you have these products 'Java-OCR' and 'Java-PDF'. Based on the description, 'Java-OCR' can only work on BMP files. Does your Java-OCR now support PDF file format? If not, can I pull each page using Java-PDF and convert into BMP and do the OCR to locate the strings?
Thank you.