Optical Character Recognition is much more than character classification. An industrial OCR application combines
algorithms studied in detail by different researchers in the area of image processing, pattern recognition, machine
learning, language analysis, document understanding, data mining, and other, artificial intelligence domains. There is no
single perfect algorithm for any of the OCR problems, so modern systems try to adapt themselves to the actual features
of the image or document to be recognized. This paper describes the architecture of a modern OCR system with an
emphasis on this adaptation process.
In this paper we focus on some features frequently employed in Optical Character Recognition (OCR) especially in algorithms based on contour analysis. These features are of a topological and/or geometric kind somehow describing the characters. The results of an experiment to examine the identification and separation power of some features are summarized.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.