Aspose OCR using any built-in model or AI/ML

kathirmsc85 · August 31, 2023, 12:57pm

Hi All,

I would like to know more about the Aspose OCR functionality.
While extracting the data from PDF, whether Aspose OCR is using any AI/ML model to read the data? or just an algorithm used to read the data?. what kind of algorithm?
Will it train the data itself while extracting the data?

Thanks

asad.ali · August 31, 2023, 8:35pm

@kathirmsc85

We use parsing and decryption algorithms to extract images from PDF file and then we use ML model to get text from images.

kathirmsc85 · September 1, 2023, 3:12am

Hi Asad Ali,

Lets assume a pdf file has text -

For reading the text Aspose OCR using any ML Model? what is the name of the ML Model
Just parsing Algorithm? what is the name
If part of image then using ML model, will use the same data for training the model?

Could you please help me to get the answers for this. Part of project i have to share this information to management to get approved.

Thanks,

asad.ali · September 1, 2023, 7:37pm

@kathirmsc85

If PDF file contains text - it is the task to extract this text without using ML model. It’s not our main orientation to extract text. We recognize text on images. Its OCR. But we plan to add extraction function in future.
Now we can offer to use Aspose.PDF in combination with Aspose.OCR.

Also about ML - we use new 10 models our own developing and we use advanced world technologies in the subject of image recognition.

kathirmsc85 · September 6, 2023, 7:32am

@asad.ali thank you so much for the response.

Thanks.