Introduction to OCR
The field of OCR even predates the advent of computers, with original OCR-related patents dating back as far as 1929. Of course, much of the significant advances in OCR are directly related to the computer age, and usually, the more advanced the OCR system is, the more computer resources (e.g., faster CPU) are required to run the OCR system. OCR, and its related discipline ICR (Intelligent Character Recognition), are changing the way industry handles its documents. ICR is defined as the computer translation of manually entered text characters into machine-readable characters.
In many applications, including legal, accounting, banking, digital libraries, insurance, remote backups, and records management, OCR is automating the way that businesses process files. Accurate OCR directly lends itself to data extraction which reduces the costs associated with form processing.
A Summary of the OCR Primer
In this OCR primer (OCR tutorial), we review the basic concepts of Optical Character Recognition. We also look at the technical aspects that need to be solved to obtain accurate OCR results. Some of the complex recognition problems that can arise are considered, along with some suggestions on obtaining improved, more accurate OCR results.