Abstract
It is very common that the filled-in character images in the form documents touch, cross, or overlap the formatted-line images. In that case, it is not easy to extract characters correctly because the shapes of characters are transformed by line images. In this paper, we propose a new method to reconstruct the character images damaged by the preprinted lines of documents. The method consists of two stages - the character decomposition stage and the character reconstruction stage. In the character decomposition stage, an input character is decomposed into some line-elements which are units of reconstruction, through the hierarchical steps. In the character reconstruction stage, the various reconstruction methods are used to restore the characters according to the 4 types of line-elements. To evaluate the performance of the proposed method objectively, we used simple recognition modules on CENPARMI handwritten digits and NIST handwritten alphabets. Experimental results showed that the difference of the recognition rates between the original characters without any damages by lines and the characters reconstructed by the proposed method is within about 1%, and the shapes of reconstructed character images are almost the same as those of the original ones.
Original language | English |
---|---|
Title of host publication | Graphics Recognition |
Subtitle of host publication | Algorithms and Systems - 2nd International Workshop, GREC 1997, Selected Papers |
Editors | Karl Tombre, Atul K. Chhabra |
Publisher | Springer Verlag |
Pages | 149-162 |
Number of pages | 14 |
ISBN (Print) | 3540643818, 9783540643814 |
DOIs | |
Publication status | Published - 1998 |
Event | 2nd International Workshop on Graphics Recognition, GREC 1997 - Nancy, France Duration: 1997 Aug 22 → 1997 Aug 23 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 1389 |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Other
Other | 2nd International Workshop on Graphics Recognition, GREC 1997 |
---|---|
Country/Territory | France |
City | Nancy |
Period | 97/8/22 → 97/8/23 |
Bibliographical note
Publisher Copyright:© Springer-Verlag Berlin Heidelberg 1998.
All Science Journal Classification (ASJC) codes
- Theoretical Computer Science
- Computer Science(all)