Optical Character Recognition (OCR) is a process by which text characters can be input to a computer by providing the computer with an image. The computer uses an OCR Engine--a computer program with the specific function of making a guess which letter (recognizable to a computer) an image (recognizable to a human) represents.
The exact mechanisms that allow humans to recognize objects are yet to be understood, but the three basic principles are already well known by scientists – integrity, purposefulness and adaptability (IPA). These principles constitute the core of ABBYY Fine Reader OCR allowing it to replicate natural or human-like recognition.
Let’s take a look on how Fine Reader OCR recognizes text. First, the program analyzes the structure of document image. It divides the page into elements such as blocks of texts, tables, images, etc. The lines are divided into words and then - into characters. Once the characters have been singled out, the program compares them with a set of pattern images. It advances numerous hypotheses about what this character is. Basing on these hypotheses the program analyzes different variants of breaking of lines into words and words into characters. After processing huge number of such probabilistic hypotheses, the program finally takes the decision, presenting you the recognized text.
In addition, ABBYY Fine Reader provides dictionary support for 48 languages. This enables secondary analysis of the text elements on word level. With dictionary support, the program ensures even more accurate analysis and recognition of documents and simplifies further verification of recognition results.
The most advanced optical character recognition systems, such as ABBYY Fine Reader OCR, are focused on replicating natural or “animal like” recognition. In the heart of these systems lie three fundamental principles: Integrity, Purposefulness and Adaptability. The principle of integrity says that the observed object must always be considered as a “whole” consisting of many interrelated parts. The principle of purposefulness supposes that any interpretation of data must always serve some purpose. And the principle of adaptability means that the program must be capable of self-learning.
One does not have to be an OCR specialist to see the advantages of an OCR application built on the IPA principles. These principles endow the program withmaximum flexibility and intelligence, bringing it as close as possible to human recognition.
After years of research, ABBYY was able to implement the IPA principles described above in its OCR technologies.One does not have to be an OCR specialist to see the advantages of an OCR application built on the IPA principles. These principles endow the program withmaximum flexibility and intelligence, bringing it as close as possible to human recognition.
With Fine Reader OCR, recognized document looks just like the original. Advanced, powerful OCR software allows you to save a lot of time and effort when creating, processing and repurposing various documents. With ABBYY Fine Reader OCR, you can scan paper documents for further editing and sharing with your colleagues and partners. You can extract quotes from books and magazines and use them for creating your course studies and papers without the need of retyping. With a digital camera and Fine Reader OCR, you can capture text outdoors from banners, posters and timetables and then use the captured information for your purposes. In the same way, you can capture information from paper documents and books – for example if there is no a scanner close at hand or you cannot use it. In addition, you can use OCR software for creating searchable PDF archives.
The entire process of data conversion from original paper document, image or PDF takes less than a minute, and the final recognized document looks just like the original!
Learn more information on how our OCR software can help you in everyday life in features and benefits descriptions of Fine Reader software for Windows and Mac.