23.4 Character Recognition Systems

Chapter Contents (Back)
OCR. Character Recognition. Application, Character Recognition. A large number of character recognition papers appear in the ICPR proceedings every year. Most of these are not included here. Much more work is done on Chinese characters since printed (fixed) font recognition is generally a commercial operation.

23.4.1 Character Recognition Survey, Overview, Evaluations

Chapter Contents (Back)
Survey, OCR. OCR. Character Recognition. Evaluation, OCR.

Bledsoe, W.W., and Browning, I.,
Pattern Recognition and Reading by Machine,
EJCC59(225-232). BibRef 5900

Stevens, M.E.,
Automatic Character Recognition: A State of the Art Report,
NBSTechnical Tote, No. 112, 1961. BibRef 6100

Harmon, L.D.,
Automatic Recognition of Print and Script,
PIEEE(60), No. 10, October 1972, pp. 1165-1177. BibRef 7210

Kamentsky, L.A., and Liu, C.N.,
A Theoretical and Experimental Study of a Model for Pattern Recognition,
In: Computer and Info. SciencesSpartan, 1963, pp. 194-218. N-Tuple matching for OCR. BibRef 6300

Ledley, R.S.[Robert S.],
Special issue on Optical Character Recognition,
PR(2), No. 3, September 1970, pp. 145.
WWW Version. 0309 BibRef

Niemann, H.[Heinrich],
Classification of characters by man and by machine,
PR(9), No. 4, 1977, pp. 173-179.
WWW Version. 0309 BibRef

Pavlidis, T., Mori, S., (Eds.)
Special Issue on Optical Character Recognition,
PIEEE(80), No. 7, July 1992, pp. 1027-1029.
IEEE Top Reference. BibRef 9207

Suen, C.Y.,
Character Recognition by Computer and Applications,
HPRIP86(569-586). BibRef 8600

Mantas, J.,
An Overview of Character Recognition Methodologies,
PR(19), No. 6, 1986, pp. 425-430.
WWW Version. Application, Character Recognition. BibRef 8600

Mantas, J.,
Methodologies in Pattern Recognition and Image Analysis: A Brief Survey,
PR(20), No. 1, 1987, pp. 1-6.
WWW Version. 0309 Survey, OCR. BibRef

Eckhouse, R., (Editor)
Intelligent Character Recognition,
Computer(23), No. 6, June 1990, pp. 99-103. Survey, OCR Products. Product Survey. BibRef 9006

Trier, O.D., Jain, A.K., Taxt, T.,
Feature-Extraction Methods for Character-Recognition: A Survey,
PR(29), No. 4, April 1996, pp. 641-662.
WWW Version. Survey, OCR. BibRef 9604

Lopresti, D.P., Zhou, J.Y.,
Using Consensus Sequence Voting to Correct OCR Errors,
CVIU(67), No. 1, July 1997, pp. 39-47. 9707
WWW Version. BibRef

Li, Y.H., Lopresti, D.P., Nagy, G., Tomkins, A.,
Validation of Image Defect Models for Optical Character-Recognition,
PAMI(18), No. 2, February 1996, pp. 99-107.
IEEE Abstract. IEEE Top Reference.
WWW Version. BibRef 9602

Wang, P.S.P., and Bunke, H., (Eds.)
Handbook on Optical Character Recognition and Document Image Analysis,
World ScientificPublishing, 1997. Referenced as BibRef 9700 OCRDIA97
WWW Version. Survey, OCR. BibRef

Lee, S.W.[Seong-Whan],
Frontiers in Handwriting Recognition,
IJDAR(2), No. 1, 1999, pp. 1-1. Issue Introduction BibRef 9900

Lorette, G.,
Handwriting Recognition or Reading? What is the Situation at the Dawn of the 3rd Millennium?,
IJDAR(2), No. 1, 1999, pp. 2-12. BibRef 9900

Rice, S.V.[Stephen V.], Nagy, G.[George], Nartker, T.A.[Thomas A.],
Optical Character Recognition: An Illustrated Guide to the Frontier,
KluwerMay 1999. ISBN 0-7923-8492-X.
WWW Version. BibRef 9905

Francesconi, E., Gori, M., Marinai, S., Soda, G.,
A serial combination of connectionist-based classifiers for OCR,
IJDAR(3), No. 3, 2001, pp. 160-168.
HTML Version. 0105 BibRef

Brundick, F.S., Brodeen, A.E.M.[Ann E.M.], Taylor, M.S.[Malcolm S.],
A statistical approach to the generation of a database for evaluating OCR software,
IJDAR(4), No. 3, 2002, pp. 170-176.
HTML Version. 0205 BibRef

Baird, H.S.[Henry S.], Coates, A.L.[Allison L.], Fateman, R.J.[Richard J.],
PessimalPrint: a reverse Turing test,
IJDAR(5), No. 2-3, April 2003, pp. 158-163.
HTML Version. 0308 BibRef
Earlier: A2, A1, A3: ICDAR01(1154-1158).
IEEE DOI may work or IEEE-CS DOI may work. 0109 BibRef

Fairhurst, M.C.[Michael C.], Rahman, A.F.R.[A. Fuad R.], Guest Editors,
Special issue on multiple classifiers for document analysis applications,
IJDAR(5), No. 4, July 2003, pp. 165.
HTML Version. 0308 BibRef

Jaeger, S., Liu, C.L., Nakagawa, M.,
The state of the art in Japanese online handwriting recognition compared to techniques in western handwriting recognition,
IJDAR(6), No. 2, 2003, pp. 75-88.
WWW Version. 0310 BibRef
Earlier: A1, A3, A2:
Comparing On-Line Recognition of Japanese and Western Script in Preparation for Recognizing Multi-Language Documents,
FHR02(84-89).
IEEE Top Reference. 0209 BibRef

Smith, E.H.B.[Elisa H. Barney], Qiu, X.H.[Xiao-Hui],
Statistical image differences, degradation features, and character distance metrics,
IJDAR(6), No. 3, March 2004, pp. 146-153.
WWW Version. 0406 BibRef
Earlier:
Relating Statistical Image Differences and Degradation Features,
DAS02(1 ff.).
HTML Version. 0303 BibRef

Lucas, S.M.[Simon M.], Panaretos, A.[Alex], Sosa, L.[Luis], Tang, A.[Anthony], Wong, S.[Shirley], Young, R.[Robert], Ashida, K.[Kazuki], Nagai, H.[Hiroki], Okamoto, M.[Masayuki], Yamamoto, H.[Hiroaki], Miyao, H.[Hidetoshi], Zhu, J.M.[Jun-Min], Ou, W.[WuWen], Wolf, C.[Christian], Jolion, J.M.[Jean-Michel], Todoran, L.[Leon], Worring, M.[Marcel], Lin, X.[Xiaofan],
ICDAR 2003 robust reading competitions: Entries, results, and future directions,
IJDAR(7), No. 2-3, July 2005, pp. 105-122.
WWW Version. 0508 BibRef

Lucas, S.M., Panaretos, A., Sosa, L., Tang, A., Wong, S., Young, R.,
ICDAR 2003 robust reading competitions,
ICDAR03(682-687).
IEEE Abstract. IEEE Top Reference. 0311 BibRef

Ink Markup Language: InkML,
Online2002. InkML.
WWW Version. InkML is an XML data format for representing digital ink data that is input with an electronic pen or stylus as part of a multimodal system. BibRef 0200

Unipen Project,
Online1994. Dataset, Handwriting.
WWW Version. This is a working group organized through IAPR to maintain and protect (ensure available to researchers) various databases of handwriting data. BibRef 9400

Simple OCR.,
2006
WWW Version. Vendor, OCR. A free (shareware) executable. Code may be purchased for inclusion in your product.

Creative ICR Inc.,
2006
WWW Version. Vendor, OCR. Forms processing.

Datacap,
2006
WWW Version. Vendor, OCR. Forms processing and OCR products.

Novo Dynamics,
2006
WWW Version. Vendor, OCR. Omnifont/multi-font OCR.

Adlib Software,
2006
WWW Version. Vendor, OCR. Document conversion.

OmniPage,
2006
WWW Version. Vendor, OCR. from Nuance. Standard OCR package.

Prime Recognition,
2006
HTML Version. Vendor, OCR.

ABBYY FineReader,
2007
WWW Version. Vendor, OCR. OCR products.

GOCR,
2002. Open Source OCR.
WWW Version. Code, OCR.

Google Tesseract-OCR,
1995 OCR originally developed at HP.
WWW Version. Code, OCR.

Smith, R.,
An Overview of the Tesseract OCR Engine,
ICDAR07(629-633).
IEEE DOI may work or IEEE-CS DOI may work. 0709 BibRef

Stubbe, A.[Andrea], Ringlstetter, C.[Christoph], Schulz, K.U.[Klaus U.],
Genre as noise: noise in genre,
IJDAR(10), No. 3-4, December 2007, pp. 199-209.
WWW Version. 0712 BibRef

Ringlstetter, C., Reffle, U., Gotscharek, A., Schulz, K.U.[Klaus U.],
Deriving Symbol Dependent Edit Weights for Text Correction: The Use of Error Dictionaries,
ICDAR07(639-643).
IEEE DOI may work or IEEE-CS DOI may work. 0709 BibRef


Watt, S.,
New Aspects of InkML for Pen-Based Computing,
ICDAR07(457-460).
IEEE DOI may work or IEEE-CS DOI may work. 0709InkML -- ink mark up languague. An XML specification. See also Ink Markup Language: InkML. BibRef

Keshari, B., Watt, S.,
Streaming-Archival InkML Conversion,
ICDAR07(1253-1257).
IEEE DOI may work or IEEE-CS DOI may work. 0709 BibRef

Schulz, K., Mihov, S., Mitankin, P.,
Fast Selection of Small and Precise Candidate Sets from Dictionaries for Text Correction Tasks,
ICDAR07(471-475).
IEEE DOI may work or IEEE-CS DOI may work. 0709 BibRef

Fujisawa, H.,
A View on the Past and Future of Character and Document Recognition,
ICDAR07(3-7).
IEEE DOI may work or IEEE-CS DOI may work. 0709 BibRef

Wong, A.K.S.[Alex K. S.], Lee, J.W.T.[John W. T.], Yeung, D.S.[Daniel S.],
Improving Text Classifier Performance based on AUC,
ICPR06(III: 268-271).
WWW Version. 0609AUC: Area under the ROC. BibRef

Li, L.L.[Lin-Lin], Tan, C.L.[Chew Lim],
Improving OCR Text Categorization Accuracy with Electronic Abstracts,
DIAL06(82-87).
IEEE DOI may work or IEEE-CS DOI may work. 0604OCR with some prior knowledge, the abstract. BibRef

Kompalli, S.[Suryaprakash], Setlur, S.[Srirangaraj], Govindaraju, V.[Venu],
Design and Comparison of Segmentation Driven and Recognition Driven Devanagari OCR,
DIAL06(96-102).
IEEE DOI may work or IEEE-CS DOI may work. 0604 BibRef

Kompalli, S.[Suryaprakash], Nayak, S.[Sankalp], Setlur, S.[Srirangaraj], Govindaraju, V.[Venu],
Challenges in OCR of Devanagari documents,
ICDAR05(I: 327-331).
IEEE DOI may work or IEEE-CS DOI may work. 0508 BibRef

Murata, M.[Mayo], Busagala, L.S.P.[Lazaro S.P.], Ohyama, W.[Wataru], Wakabayashi, T.[Tetsushi], Kimura, F.[Fumitaka],
The Impact of OCR Accuracy and Feature Transformation on Automatic Text Classification,
DAS06(506-517).
WWW Version. 0602 BibRef

Sankar, K.P.[K. Pramod], Ambati, V.[Vamshi], Pratha, L.[Lakshmi], Jawahar, C.V.,
Digitizing a Million Books: Challenges for Document Analysis,
DAS06(425-436).
WWW Version. 0602 BibRef

Agrawal, M., Bali, K., Madhvanath, S., Vuurpijl, L.,
UPX: a new XML representation for annotated datasets of online handwriting data,
ICDAR05(II: 1161-1165).
IEEE DOI may work or IEEE-CS DOI may work. 0508 BibRef

Lenaghan, A.P., Malyan, R.R.,
XPEN: an XML based format for distributed online handwriting recognition,
ICDAR03(1270-1274).
IEEE Abstract. IEEE Top Reference. 0311 BibRef

Mihov, S., Schulz, K.U., Ringlstetter, C., Dojchinova, V., Nakova, V., Kalpakchieva, K., Gerasimov, O., Gotscharek, A., Gercke, C.,
A corpus for comparative evaluation of OCR software and postcorrection techniques,
ICDAR05(I: 162-166).
IEEE DOI may work or IEEE-CS DOI may work. 0508 BibRef

Ringlstetter, C., Schulz, K.U., Mihov, S., Louka, K.,
The same is not the same: Postcorrection of alphabet confusion errors in mixed-alphabet OCR recognition,
ICDAR05(I: 406-410).
IEEE DOI may work or IEEE-CS DOI may work. 0508 BibRef

Strohmaier, C.M., Ringlstetter, C., Schulz, K.U., Mihov, S.,
Lexical postcorrection of OCR-results: The Web as a Dynamic Secondary Dictionary?,
ICDAR03(1133-1137).
IEEE Abstract. IEEE Top Reference. 0311 BibRef

Luo, X.P.[Xi-Ping], Zhen, L.X.[Li-Xin], Peng, G.[Gang], Li, J.[Jun], Xiao, B.H.[Bai-Hua],
Camera based mixed-lingual card reader for mobile device,
ICDAR05(II: 665-669).
IEEE DOI may work or IEEE-CS DOI may work. 0508 BibRef

Luo, X.P.[Xi-Ping], Li, J.[Jun], Zhen, L.X.[Li-Xin],
Design and implementation of a card reader based on build-in camera,
ICPR04(I: 417-420).
IEEE DOI may work or IEEE-CS DOI may work. 0409 BibRef

Simard, P.Y., Szeliski, R., Benaloh, J., Couvreur, J., Calinov, I.,
Using character recognition and segmentation to tell computer from humans,
ICDAR03(418-423).
IEEE Abstract. IEEE Top Reference. 0311 BibRef

Nagy, G.,
Teaching a computer to read,
ICPR92(II:225-229).
IEEE DOI may work or IEEE-CS DOI may work. 9208 BibRef

Nagy, G.,
Advanced character recognition 6610,
ICDAR01(2-6). 0109 BibRef

Ratzlaff, E.H.,
Methods, report and survey for the comparison of diverse isolated character recognition results on the UNIPEN database,
ICDAR03(623-628).
IEEE Abstract. IEEE Top Reference. 0311 BibRef

Aksoy, S., Ye, M., Schauf, M., Song, M., Wang, Y., Haralick, R.M., Parker, J.R., Pivovarov, J., Royko, D., Sun, C., Farnebäck, G.,
Algorithm Performance Contest,
ICPR00(Vol IV: 870-876).
IEEE DOI may work or IEEE-CS DOI may work.
HTML Version. 00093 problems: Binary Shape, Symbol Recogniton, and Image Flow. BibRef

di Lecce, V., Dimauro, G., Guerriero, A., Impedovo, S., Pirlo, G., Salzo, A.,
A new database of confusing characters for testing character recognition algorithms,
CIAP99(939-944).
IEEE DOI may work or IEEE-CS DOI may work. 9909 BibRef

Mao, J.[Jianchang], Sinha, P.[Prasun],
Combining Multiple OCRs for Optimizing Word Recognition,
ICPR98(Vol I: 436-438).
IEEE DOI may work or IEEE-CS DOI may work. 9808 BibRef

Miletzki, U.,
Character Recognition in Practice Today and Tomorrow,
ICDAR97(We-1C) 9708 BibRef

Guyon, I., Schomaker, L., Plamondon, R., Liberman, M., Janet, S.,
UNIPEN project of on-line data exchange and recognizer benchmarks,
ICPR94(B:29-33).
IEEE DOI may work or IEEE-CS DOI may work. 9410 BibRef

Schuermann, J.,
Reading Machines,
ICPR82(xx). BibRef 8200

Chapter on OCR, Document Analysis and Character Recognition Systems continues in
General Character Recognition Issues .


Last update:Oct 10, 2008 at 17:20:17