Robust Character Recognition in Low-Resolution Images and Videos
Kopf, Stephan
;
Haenselmann, Thomas
;
Effelsberg, Wolfgang
URL:
|
http://ub-madoc.bib.uni-mannheim.de/980
|
URN:
|
urn:nbn:de:bsz:180-madoc-9807
|
Dokumenttyp:
|
Arbeitspapier
|
Erscheinungsjahr:
|
2005
|
Titel einer Zeitschrift oder einer Reihe:
|
None
|
Sprache der Veröffentlichung:
|
Englisch
|
Einrichtung:
|
Fakultät für Wirtschaftsinformatik und Wirtschaftsmathematik > Sonstige - Fakultät für Wirtschaftsinformatik und Wirtschaftsmathematik
|
MADOC-Schriftenreihe:
|
Veröffentlichungen der Fakultät für Mathematik und Informatik > Institut für Informatik > Technical Reports
|
Fachgebiet:
|
004 Informatik
|
Normierte Schlagwörter (SWD):
|
Bildsegmentierung , Bildanalyse , Filmanalyse , Optische Zeichenerkennung
|
Freie Schlagwörter (Deutsch):
|
OCR , Krümmungsbasierter Skalenraum , CSS
|
Freie Schlagwörter (Englisch):
|
OCR , Curvature scale space , CSS , Character segmentation
|
Abstract:
|
Although OCR techniques work very reliably for high-resolution documents, the recognition of superimposed text in low-resolution images or videos with a complex background is still a challenge. Three major parts characterize our system for recognition of superimposed text in images and videos: localization of text regions, segmentation (binarization) of characters, and recognition. We use standard approaches to locate text regions and focus in this paper on the last two steps. Many approaches (e.g., projection profiles, k-mean clustering) do not work very well for separating characters with very small font sizes. We apply in a vertical direction a shortest-path algorithm to separate the characters in a text line. The recognition of characters is based on the curvature scale space (CSS) approach which smoothes the contour of a character with a Gaussian kernel and tracks its inflection points. A major drawback of the CSS method is its poor representation of convex segments: Convex objects cannot be represented at all due to missing inflection points. We have extended the CSS approach to generate feature points for concave and convex segments of a contour. This generic approach is not only applicable to text characters but to arbitrary objects as well. In the experimental results, we compare our approach against a pattern matching algorithm, two classification algorithms based on contour analysis, and a commercial OCR system. The overall recognition results are good enough even for the indexing of low resolution images and videos.
|
Zusätzliche Informationen:
|
|
| Das Dokument wird vom Publikationsserver der Universitätsbibliothek Mannheim bereitgestellt. |
Suche Autoren in
Sie haben einen Fehler gefunden? Teilen Sie uns Ihren Korrekturwunsch bitte hier mit: E-Mail
Actions (login required)
|
Eintrag anzeigen |
|
|