25.2.2 Form and Layout Analysis

Chapter Contents (Back)
Document Analysis. Application, Document Layout. Page Segmentation.

25.2.2.1 Extract Data from Specific Forms

Chapter Contents (Back)
Document Analysis. Form Analysis.

Owens, C.J.[Clifford J.], Rutledge, T.L.[Thomas L.],
Method and apparatus for electronic image processing of documents for accounting purposes,
US_Patent4,264,808, Apr 28, 1981
WWW Link. BibRef 8104

Maderlechner, G.,
Symbolic Subtraction from Fixed Formatted Graphics and Text from Filled in Forms,
MVA(3), 1990, pp. 457-459. BibRef 9000

Reid-Green, K.S.[Keith S.], Bostain, D.R.[David R.], Charlesworth, J.M.[Jeffrey M.], Quardt, D.[Dennis], Rojo, J.Z.[Joan Z.], Wynings, C.[Christopher],
Image processing system,
US_Patent5,001,769, Mar 19, 1991
WWW Link. BibRef 9103

Vincent, K.D.[Kent D.], Jamp, R.M.[Ruei-Ming],
Method and apparatus for extracting information from forms,
US_Patent5,010,580, Apr 23, 1991
WWW Link. BibRef 9104

Nanba, H.[Hiromi],
Method of effectively reading data written on data sheet, and data reading apparatus therefor,
US_Patent5,038,393, Aug 6, 1991
WWW Link. BibRef 9108

Lee, Y.C.[Yong-Chun],
Polygon-based technique for the automatic classification of text and graphics components from digitized paper-based forms,
US_Patent5,050,222, Sep 17, 1991
WWW Link. BibRef 9109

Taylor, S.L.[S. Liebowitz], Fritzson, R., and Pastor, J.A.,
Extraction of Data from Preprinted Forms,
MVA(5), 1992, pp. 211-222. BibRef 9200

Taylor, S.L.[S. Liebowitz], Fritzson, R.,
Registration and Region Extraction of Data from Forms,
ICPR92(I:173-176).
IEEE DOI BibRef 9200

Elmenhurst, B.J.[Brian J.], Tyler, R.H.[Richard H.],
Extracting user data from a scanned image of a pre-printed form,
US_Patent6,330,357, Dec 11, 2001
WWW Link. BibRef 0112

Casey, R., Ferguson, D., Mohiuddin, K., and Walach, E.,
Intelligent Forms Processing System,
MVA(5), 1992, pp. 143-155. BibRef 9200

Garris, M.D., Dimmick, D.L.,
Form Design for High-Accuracy Optical Character-Recognition,
PAMI(18), No. 6, June 1996, pp. 653-656.
IEEE DOI 9607
BibRef
Earlier:
Evaluating Form Designs for Optical Character Recognition,
NISTIR5364, February 1994. How to design a form to make it easier for OCR. BibRef

Garris, M.D.,
Correlated Run Length Algorithm (CURL) for Detecting Form Structure within Digitized Documents,
" SDAIR94(413-424). BibRef 9400

Lin, J.Y., Lee, C.W., Chen, Z.,
Identification of Business Forms Using Relationships Between Adjacent Frames,
MVA(9), No. 2, 1996, pp. 56-64.
Springer DOI 9609
Use relations between frames (blocks of the form). Convert to a graph, then to a 1-D string for matching. BibRef

Yu, B.[Bin], Jain, A.K.,
A Generic System for Form Dropout,
PAMI(18), No. 11, November 1996, pp. 1127-1134.
IEEE DOI 9612
BibRef
Earlier:
A Form Dropout System,
ICPR96(III: 701-705).
IEEE DOI 9608
(Michigan State Univ., USA) Getting the entered text out of the form. BibRef

Glasgow, B., Mandell, A., Binney, D., Ghemri, L., Fisher, D.,
Mita: An Information Extraction Approach to the Analysis of Free-Form Text in Life-Insurance Applications,
AIMag(19), No. 1, Spring 1998, pp. 59-71. 9804
BibRef

Fan, K.C.[Kuo-Chin], Lu, J.M.[Jeng-Ming], Chen, G.D.[Gwo-Dong],
A Feature Point Clustering Approach to the Recognition of Form Documents,
PR(31), No. 9, September 1998, pp. 1205-1220.
Elsevier DOI 9808
BibRef

Tseng, L.Y.[Lin Yu], Chen, R.C.[Rung Ching],
Recognition and Data Extraction of Form Documents Based on 3 Types of Line Segments,
PR(31), No. 10, October 1998, pp. 1525-1540.
Elsevier DOI 9808
BibRef
Earlier:
The Recognition of Form Documents Based on Three Types of Line Segments,
ICDAR97(71-75).
IEEE DOI 9708
BibRef

Chen, J.L.[Jiun-Lin], Lee, H.J.[Hsi-Jian],
An Efficient Algorithm for Form Structure Extraction Using Strip Projection,
PR(31), No. 9, September 1998, pp. 1353-1368.
Elsevier DOI 9808
BibRef
Earlier:
A Novel Form Structure Extraction Method Using Strip Projection,
ICPR96(III: 823-827).
IEEE DOI 9608
(National Chiao Tung Univ., ROC) BibRef

Chen, J.L.[Jiun-Lin], Lee, H.J.[Hsi-Jian],
Field data extraction for form document processing using a gravitation-based algorithm,
PR(34), No. 9, September 2001, pp. 1741-1750.
Elsevier DOI 0108
BibRef
Earlier: A2, A1:
Field-Data Grouping for Form Document Processing Using a Gravitation-Based Algorithm,
ICPR98(Vol II: 1095-1097).
IEEE DOI 9808
BibRef

Cracknell, C., Downton, A.C., Du, L.,
An object-oriented descriptive language to facilitate advanced handwritten form processing,
IVC(16), No. 12-13, 24 August 1998, pp. 843-853.
Elsevier DOI BibRef 9808
Earlier:
TABS: A New Software Framework for Document Image Processing, Analysis and Understanding,
ICDAR97(1001-1005).
IEEE DOI 9708
BibRef
And:
Hierarchical recognition of structured hand-printed documents using rule-trees,
BMVC97(xx-yy).
HTML Version. 0209
BibRef

Cracknell, C., Downton, A.C.,
TABS: Script-Based Software Framework for Research in Image Processing, Analysis and Understanding,
VISP(145), No. 3, June 1998, pp. 194-202. 9808
BibRef

Du, L., Downton, A.C., Lucas, S.M., Al-Badr, B.,
Generalized Contextual Recognition of Hand-Printed Documents Using Semantic Trees With Lazy Evaluation,
ICDAR97(238-242).
IEEE DOI 9708
BibRef

Downton, A.C.[Andy C.], Cracknell, C.,
Document Image Understanding of Handwritten Forms Using Rule-Trees,
ICPR98(Vol I: 936-938).
IEEE DOI 9808
BibRef

Cracknell, C., Downton, A.C., Du, L.,
An object-oriented form description language and approach to handwritten form processing,
ICDAR97(180-184).
IEEE DOI 9708
BibRef

Cesarini, F.[Francesca], Gori, M.[Marco], Marinai, S.[Simone], Soda, G.[Giovanni],
INFORMYS: A Flexible Invoice Like Form Reader System,
PAMI(20), No. 7, July 1998, pp. 730-745.
IEEE DOI 9808
Extract text from accounting documents. BibRef

Cesarini, F., Francesconi, E., Gori, M., Soda, G.,
Using Physical and Logical Constraints for Invoice Understanding,
PAA(3), No. 2, 2000, pp. 182-195. 0010
BibRef

Cesarini, F., Francesconi, E., Gori, M., Soda, G.,
Analysis and understanding of multi-class invoices,
IJDAR(6), No. 2, 2003, pp. 102-114.
Springer DOI 0310
BibRef

Cesarini, F., Francesconi, E., Gori, M., Marinai, S., Sheng, J.Q., Soda, G.,
Rectangle Labelling for an Invoice Understanding System,
ICDAR97(324-330).
IEEE DOI 9708
BibRef

Ishitani, Y.,
Flexible and Robust Model Matching based on Association Graph for Form Image Understanding,
PAA(3), No. 2, 2000, pp. 104-119. 0010
BibRef

Ishitani, Y.,
Document transformation system from papers to XML data based on pivot XML document method,
ICDAR03(250-255).
IEEE DOI 0311
BibRef

Ishitani, Y.[Yasuto],
Logical Structure Analysis of Document Images Based on Emergent Computation,
IEICE(E88-D), No. 8, August 2005, pp. 1831-1842.
DOI Link 0508
BibRef
Earlier:
Model-based information extraction method tolerant of OCR errors for document images,
ICDAR01(908-915).
IEEE DOI 0109
BibRef
Earlier:
Document Layout Analysis Based on Emergent Computation,
ICDAR97(45-50).
IEEE DOI 9708
BibRef

Ming, D.[Delie], Liu, J.[Jian], Tian, J.W.[Jin-Wen],
Research on Chinese financial invoice recognition technology,
PRL(24), No. 1-3, January 2003, pp. 489-497.
Elsevier DOI 0211
BibRef

Ramdane, S.[Saďd], Taconet, B.[Bruno], Zahour, A.[Abderrazak],
Classification of forms with handwritten fields by planar hidden Markov models,
PR(36), No. 4, April 2003, pp. 1045-1060.
Elsevier DOI 0304
BibRef

Taylor, G.S.[Garland S.],
Method of optical mark recognition,
US_Patent6,741,738, May 25, 2004
WWW Link. Marks on forms. BibRef 0405

Xi, D.H.[Di-Hua], Lee, S.W.[Seong-Whan],
Extraction of reference lines and items from form document images with complicated background,
PR(38), No. 2, February 2005, pp. 289-305.
Elsevier DOI 0411
BibRef
Earlier:
Reference line extraction from form documents with complicated backgrounds,
ICDAR03(1080-1084).
IEEE DOI 0311
BibRef

Romanowski, C.J., Nagi, R.,
On Comparing Bills of Materials: A Similarity/ Distance Measure for Unordered Trees,
SMC-A(35), No. 2, March 2005, pp. 249-260.
IEEE Abstract. 0501
BibRef

Vinciarelli, A.[Alessandro],
Noisy Text Categorization,
PAMI(27), No. 12, December 2005, pp. 1882-1895.
IEEE DOI 0512
BibRef
Earlier: ICPR04(II: 554-557).
IEEE DOI 0409
BibRef

Milewski, R.J.[Robert J.], Govindaraju, V.[Venu],
Binarization and cleanup of handwritten text from carbon copy medical form images,
PR(41), No. 4, April 2008, pp. 1308-1315.
Elsevier DOI 0801
BibRef
Earlier:
Extraction of Handwritten Text from Carbon Copy Medical Form Images,
DAS06(106-116).
Springer DOI 0602
BibRef
Earlier:
Medical word recognition using a computational semantic lexicon,
FHR02(401-406).
IEEE Top Reference. 0209
BibRef

Milewski, R.J., Setlur, S., Govindaraju, V.,
A lexicon reduction strategy in the context of handwritten medical forms,
ICDAR05(II: 1146-1150).
IEEE DOI 0508
BibRef

Milewski, R.J.[Robert Jay], Govindaraju, V.[Venu], Bhardwaj, A.[Anurag],
Automatic recognition of handwritten medical forms for search engines,
IJDAR(11), No. 4, March 2009, pp. xx-yy.
Springer DOI 0903
BibRef

Cao, H.[Huaigu], Govindaraju, V.[Venu],
Preprocessing of Low-Quality Handwritten Documents Using Markov Random Fields,
PAMI(31), No. 7, July 2009, pp. 1184-1194.
IEEE DOI 0905
BibRef
Earlier:
Handwritten Carbon Form Preprocessing Based on Markov Random Field,
CVPR07(1-7).
IEEE DOI 0706
BibRef
And:
Vector Model Based Indexing and Retrieval of Handwritten Medical Forms,
ICDAR07(88-92).
IEEE DOI 0709
Statistical approach. Binarization, form line removal.
See also Handwritten Text Separation from Annotated Machine Printed Documents Using Markov Random Fields. BibRef

Wang, J., Wang, Y., Wang, Y.,
CAPFF: A Context-Aware Assistant for Paper Form Filling,
HMS(47), No. 6, December 2017, pp. 903-908.
IEEE DOI 1712
Cameras, Computational modeling, Context awareness, Image recognition, Man-machine systems, Semantics, Context-aware, paper form filling BibRef

Afifi, M.[Mahmoud], Hussain, K.F.[Khaled F.],
The achievement of higher flexibility in multiple-choice-based tests using image classification techniques,
IJDAR(22), No. 2, June 2019, pp. 127-142.
WWW Link. 1906
BibRef

Ha, H.T., Horák, A.,
Information extraction from scanned invoice images using text analysis and layout features,
SP:IC(102), 2022, pp. 116601.
Elsevier DOI 2202
OCR, Information extraction, Scanned documents, Document metadata, Invoice metadata extraction, Metadata indexing BibRef

Boillet, M.[Mélodie], Kermorvant, C.[Christopher], Paquet, T.[Thierry],
Confidence Estimation for Object Detection in Document Images,
PRL(166), 2023, pp. 31-37.
Elsevier DOI 2302
Confidence estimation, Document object detection, Active learning BibRef

Zhang, L.M.[Lu-Ming], Peng, J.J.[Jun-Jie], Liu, W.[Wenfu], Yuan, H.C.[Hao-Chen], Tan, S.H.[Shu-Hua], Wang, L.[Lu], Yi, F.[Fen],
A semantic fusion based approach for express bill detection in complex scenes,
IVC(135), 2023, pp. 104708.
Elsevier DOI 2306
Oriented objects, Object detection, Semantic fusion, Complex scenes BibRef

Prieto, J.R.[Jose Ramón], Flores, J.J.[Juan José], Vidal, E.[Enrique], Toselli, A.H.[Alejandro Hector],
Open set classification of untranscribed handwritten text image documents,
PRL(172), 2023, pp. 113-120.
Elsevier DOI 2309
Open set document classification, Handwritten text images, Probabilistic indexing, Neural networks BibRef

Flores, J.J.[Juan José], Prieto, J.R.[Jose Ramón], Garrido, D.[David], Alonso, C.[Carlos], Vidal, E.[Enrique],
Classification of Untranscribed Handwritten Notarial Documents by Textual Contents,
IbPRIA22(14-26).
Springer DOI 2205
BibRef

Li, X.L.[Xiao-Long], Zhang, W.[Wu], Wang, Y.J.[Yan-Jie], Tan, Y.B.[Yong-Bin], Xia, J.[Jing],
Spatio-Temporal Information Extraction and Geoparsing for Public Chinese Resumes,
IJGI(12), No. 9, 2023, pp. 377.
DOI Link 2310
BibRef

Mao, W.D.[Wen-Dong], Yang, S.[Shuai], Shi, H.[Huihong], Liu, J.Y.[Jia-Ying], Wang, Z.F.[Zhong-Feng],
Intelligent Typography: Artistic Text Style Transfer for Complex Texture and Structure,
MultMed(25), 2023, pp. 6485-6498.
IEEE DOI 2311
BibRef

Wang, W.J.[Wen-Jing], Liu, J.Y.[Jia-Ying], Yang, S.[Shuai], Guo, Z.M.[Zong-Ming],
Typography With Decor: Intelligent Text Style Transfer,
CVPR19(5882-5890).
IEEE DOI 2002
BibRef


Liao, H.[Haofu], RoyChowdhury, A.[Aruni], Li, W.J.[Wei-Jian], Bansal, A.[Ankan], Zhang, Y.T.[Yu-Ting], Tu, Z.W.[Zhuo-Wen], Satzoda, R.K.[Ravi Kumar], Manmatha, R., Mahadevan, V.[Vijay],
DocTr: Document Transformer for Structured Information Extraction in Documents,
ICCV23(19527-19537)
IEEE DOI 2401
BibRef

Zhang, H.[Hongkuan], Whittaker, E.[Edward], Kitagishi, I.[Ikuo],
Extending TrOCR for Text Localization-Free OCR of Full-Page Scanned Receipt Images,
REDLCV23(1471-1477)
IEEE DOI 2401
BibRef

Da, C.[Cheng], Luo, C.[Chuwei], Zheng, Q.[Qi], Yao, C.[Cong],
Vision Grid Transformer for Document Layout Analysis,
ICCV23(19405-19415)
IEEE DOI 2401
BibRef

Zhang, J.[Junyi], Guo, J.Q.[Jia-Qi], Sun, S.Z.[Shi-Zhao], Lou, J.G.[Jian-Guang], Zhang, D.M.[Dong-Mei],
LayoutDiffusion: Improving Graphic Layout Generation by Discrete Diffusion Probabilistic Models,
ICCV23(7192-7202)
IEEE DOI Code:
WWW Link. 2401
BibRef

Belhadj, D.[Djedjiga], Belaďd, A.[Abdel], Belaďd, Y.[Yolande],
Low-dimensionality Information Extraction Model for Semi-Structured Documents,
CAIP23(I:76-85).
Springer DOI 2312
BibRef

Zaryab, M.A.[Muhammad Ateeque], Ng, C.R.[Chuen Rue],
Optical Character Recognition for Medical Records Digitization with Deep Learning,
ICIP23(3260-3263)
IEEE DOI 2312
BibRef

Hayashi, S.Y.[Sergio Y.], Hirata, N.S.T.[Nina S. T.],
Understanding attention-based encoder-decoder networks: A case study with chess scoresheet recognition,
ICPR22(1586-1592)
IEEE DOI 2212
Training, Knowledge engineering, Handwriting recognition, Visualization, Image recognition, Recurrent neural networks, Collaboration BibRef

Kessi, L.[Louisa], Lebourgeois, F.[Frank], Garcia, C.[Christophe],
Unsupervised Recognition of the Logical Structure of Business Documents Based on Spatial Relationships,
CAIP21(II:57-72).
Springer DOI 2112
BibRef

Burch, M.[Michael], Wallner, G.[Günter], van de Wetering, H.[Huub], Tufail, S.[Shahrukh], Zandt-Sloot, L.[Linda], Gladkis, S.[Stasius], Hong, M.J.[Min-Ji], Lepelaars, C.[Carlo],
FamSearch: Visual Analysis of Genealogical Data,
ISVC21(II:374-385).
Springer DOI 2112
BibRef

Porter, W.P.[William P.], Murphy, C.P.[Conor P.], Williams, D.R.[Dane R.], O'Handley, B.J.[Brendan J.], Wang, C.[Chaoli],
Hierarchical Sankey Diagram: Design and Evaluation,
ISVC21(II:386-397).
Springer DOI 2112
Sankey diagrams are a type of flow diagram in which the width of the arrows is proportional to the flow rate. BibRef

Zhi, X.F.[Xiao-Fan], Shen, Z.[Zhen], Zhao, B.[Bo],
A Method for Identifying the Key Information of Electronic Invoicing in Complex Scenes,
ICIVC21(90-94)
IEEE DOI 2112
Training, Image quality, Regulators, Image recognition, Text recognition, Knowledge based systems, Lighting, complex scenes BibRef

Dang, T.A.N.[Tuan Anh Nguyen], Hoang, D.T.[Duc Thanh], Tran, Q.B.[Quang Bach], Pan, C.W.[Chih-Wei], Nguyen, T.D.[Thanh Dat],
End-to-End Hierarchical Relation Extraction for Generic Form Understanding,
ICPR21(5238-5245)
IEEE DOI 2105
Correlation, Face recognition, Semantics, Neural networks, Deep architecture, Predictive models, Noise measurement BibRef

Suh, S.[Sungho], Lukowicz, P.[Paul], Lee, Y.O.[Yong Oh],
Fusion of Global-Local Features for Image Quality Inspection of Shipping Label,
ICPR21(2643-2649)
IEEE DOI 2105
Image quality, Image recognition, Text recognition, System performance, Transforms, Object detection, Inspection BibRef

Aggarwal, M.[Milan], Sarkar, M.[Mausoom], Gupta, H.[Hiresh], Krishnamurthy, B.[Balaji],
Multi-Modal Association based Grouping for Form Structure Extraction,
WACV20(2064-2073)
IEEE DOI 2006
Image segmentation, Semantics, Task analysis, Data mining, Pipelines, Layout, Convolution BibRef

Men, Y.F.[Yi-Fang], Lian, Z.H.[Zhou-Hui], Tang, Y.M.[Ying-Min], Xiao, J.G.[Jian-Guo],
DynTypo: Example-Based Dynamic Text Effects Transfer,
CVPR19(5863-5872).
IEEE DOI 2002
BibRef

Gal, R.[Rinon], Morag, N.[Nimrod], Shilkrot, R.[Roy],
Visual-Linguistic Methods for Receipt Field Recognition,
ACCV18(II:542-557).
Springer DOI 1906
BibRef

Raoui-Outach, R., Million-Rousseau, C., Benoit, A., Lambert, P.,
Deep learning for automatic sale receipt understanding,
IPTA17(1-6)
IEEE DOI 1804
data analysis, document image processing, feedforward neural nets, learning (artificial intelligence), Semantic Analysis BibRef

Rahal, N.[Najoua], Benjlaiel, M.[Mohamed], Alimi, A.M.[Adel M.],
Entity Extraction and Correction Based on Token Structure Model Generation,
SSSPR16(401-411).
Springer DOI 1611
Scanned invoices. BibRef

Lee, H.[Hyeogjin], Kwak, N.[Nojun],
Character recognition for the machine reader zone of electronic identity cards,
ICIP15(387-391)
IEEE DOI 1511
E-passport; Machine Reader Zone; Optical Character Recognition BibRef

de las Heras, L.P.[Lluis-Pere], Terrades, O.R.[Oriol Ramos], Llados, J.[Josep], Fernandez-Mota, D.[David], Canero, C.[Cristina],
Use case visual Bag-of-Words techniques for camera based identity document classification,
ICDAR15(721-725)
IEEE DOI 1511
Structured documents, not forms as such. BibRef

Sadeh, G.[Gil], Wolf, L.B.[Lior B.], Hassner, T.[Tal], Dershowitz, N.[Nachum], Ben-Ezra, D.S.[Daniel Stokl],
Viral transcript alignment,
ICDAR15(711-715)
IEEE DOI 1511
BibRef

Antonacopoulos, A., Clausner, C., Papadopoulos, C., Pletschacher, S.,
ICDAR2015 competition on recognition of documents with complex layouts - RDCL2015,
ICDAR15(1151-1155)
IEEE DOI 1511
datasets BibRef

Kim, J.[Jieun], Boutin, M.[Mireille],
Estimating the Nutrient Content of Commercial Foods from their Label Using Numerical Optimization,
MADiMa15(309-316).
Springer DOI 1511
BibRef

Galibert, O.[Olivier], Kahn, J.[Juliette], Oparin, I.[Ilya],
The zonemap metric for page segmentation and area classification in scanned documents,
ICIP14(2594-2598)
IEEE DOI 1502
Algorithm design and analysis BibRef

Green, R., Oliver, C.,
Layout analysis of book pages,
IVCNZ13(118-123)
IEEE DOI 1402
Gaussian processes BibRef

Zhu, S.[Siyu], Zanibbi, R.,
A Text Detection System for Natural Scenes with Convolutional Feature Learning and Cascaded Classification,
CVPR16(625-632)
IEEE DOI 1612
BibRef
And:
Label Detection and Recognition for USPTO Images Using Convolutional K-Means Feature Quantization and Ada-Boost,
ICDAR13(633-637)
IEEE DOI 1312
image classification Patent office forms. BibRef

Rusinol, M., Benkhelfallah, T., d'Andecy, V.P.,
Field Extraction from Administrative Documents by Incremental Structural Templates,
ICDAR13(1100-1104)
IEEE DOI 1312
document image processing BibRef

Romero, V.[Verónica], Fornés, A.[Alicia], Vidal, E.[Enrique], Sánchez, J.A.[Joan Andreu],
Information Extraction in Handwritten Marriage Licenses Books Using the MGGI Methodology,
IbPRIA17(287-294).
Springer DOI 1706
BibRef
Earlier: A1, A4, Only:
Category-Based Language Models for Handwriting Recognition of Marriage License Books,
ICDAR13(788-792)
IEEE DOI 1312
data mining BibRef

Nion, T., Menasri, F., Louradour, J., Sibade, C., Retornaz, T., Metaireau, P.Y., Kermorvant, C.,
Handwritten Information Extraction from Historical Census Documents,
ICDAR13(822-826)
IEEE DOI 1312
document image processing BibRef

Romero, V., Sanchez, J.A.[J. Andreu],
Human Evaluation of the Transcription Process of a Marriage License Book,
ICDAR13(1255-1259)
IEEE DOI 1312
document image processing BibRef

Santosh, K.C., Belaid, A.,
Document Information Extraction and Its Evaluation Based on Client's Relevance,
ICDAR13(35-39)
IEEE DOI 1312
document image processing BibRef

Santosh, K.C., Belaďd, A.[Abdel],
Pattern-Based Approach to Table Extraction,
IbPRIA13(766-773).
Springer DOI 1307
BibRef

Nagy, G.[George],
Learning the characteristics of critical cells from web tables,
ICPR12(1554-1557).
WWW Link. 1302
BibRef

Mandal, R.[Ranju], Roy, P.P.[Partha Pratim], Pal, U.[Umapada], Blumenstein, M.[Michael],
Date field extraction from handwritten documents using HMMs,
ICDAR15(866-870)
IEEE DOI 1511
BibRef

Mandal, R.[Ranju], Roy, P.P.[Partha Pratim], Pal, U.[Umapada],
Date field extraction in handwritten documents,
ICPR12(533-536).
WWW Link. 1302
BibRef

Cao, H.G.[Huai-Gu], Subramanian, K.[Krishna], Peng, X.J.[Xu-Jun], Chen, J.Y.[Jin-Ying], Prasad, R.[Rohit], Natarajan, P.[Prem],
Extracting information from handwritten content in census forms,
ICPR12(306-309).
WWW Link. 1302
BibRef

Kacem, A.[Afef], Saďdani, A.[Asma], Belaďd, A.[Abdel],
A System for an Automatic Reading of Student Information Sheets,
ICDAR11(1265-1269).
IEEE DOI 1111
BibRef

Emilie, P.[Philippot], Yolande, B.[Belaďd], Abdel, B.[Belaďd],
Use of Semantic and Physical Constraints in Bayesian Networks for Form Recognition,
ICDAR11(946-950).
IEEE DOI 1111
BibRef

Tanaka, H.[Hiroshi], Takebe, H.[Hiroaki], Hotta, Y.[Yoshinobu],
Robust Cell Extraction Method for Form Documents Based on Intersection Searching and Global Optimization,
ICDAR11(354-358).
IEEE DOI 1111
BibRef

Hirayama, J.[Junichi], Shinjo, H.[Hiroshi], Takahashi, T.[Toshikazu], Nagasaki, T.[Takeshi],
Development of Template-Free Form Recognition System,
ICDAR11(237-241).
IEEE DOI 1111
BibRef

Kuo, T.Y.[Tien-Ying], Lo, Y.C.[Yi-Chung],
A novel form detection and removal scheme for document images,
ICIP10(2141-2144).
IEEE DOI 1009
BibRef

Sarkar, P.[Prateek],
Learning Image Anchor Templates for Document Classification and Data Extraction,
ICPR10(3428-3431).
IEEE DOI 1008
BibRef

Cao, H.G.[Huai-Gu], Prasad, R.[Rohit], Natarajan, P.[Premkumar], Govindaraju, V.[Venu],
Nested state indexing in pairwise Markov networks for fast handwritten document image rule-line removal,
ICIP09(2009-2012).
IEEE DOI 0911
BibRef

Kermorvant, C.[Christopher], Bianne-Bernard, A.L.[Anne-Laure], Marty, P.[Patrick], Menasri, F.[Farčs],
From Isolated Handwritten Characters to Fields Recognition: There's Many a Slip Twixt Cup and Lip,
ICDAR09(1031-1035).
IEEE DOI 0907
BibRef

Ebert, S.[Sebastian], Liwicki, M.[Marcus], Dengel, A.R.[Andreas R.],
Ontology-Based Information Extraction from Handwritten Documents,
FHR10(483-488).
IEEE DOI 1011
BibRef

Schulz, F.[Frederick], Ebbecke, M.[Markus], Gillmann, M.[Michael], Adrian, B.[Benjamin], Agne, S.[Stefan], Dengel, A.R.[Andreas R.],
Seizing the Treasure: Transferring Knowledge in Invoice Analysis,
ICDAR09(848-852).
IEEE DOI 0907
BibRef

Navon, Y.[Yaakov], Barkan, E.[Ella], Ophir, B.[Boaz],
A Generic Form Processing Approach for Large Variant Templates,
ICDAR09(311-315).
IEEE DOI 0907
BibRef

Arlandis, J.[Joaquim], Castello-Fos, V.[Vicent], Perez-Cortes, J.C.[Juan-Carlos],
Filled-in Document Identification Using Local Features and a Direct Voting Scheme,
IbPRIA11(548-555).
Springer DOI 1106
BibRef

Arlandis, J.[Joaquim], Perez-Cortes, J.C.[Juan-Carlos], Ungria, E.[Emilio],
Identification of Very Similar Filled-in Forms with a Reject Option,
ICDAR09(246-250).
IEEE DOI 0907
BibRef

Rosman, G., Tzadok, A., Tal, D.,
A New Physically Motivated Warping Model for Form Drop-Out,
ICDAR07(774-778).
IEEE DOI 0709
BibRef

Mace, S.[Sebastian],
Context-Driven Constraint Multiset Grammars with Incremental Parsing for On-line Structured Document Interpretation,
ICDAR07(442-446).
IEEE DOI 0709
BibRef

Hamza, H.[Hatem], Belaid, Y.[Yolande], Belaid, A.[Abdel],
A Case-Based Reasoning Approach for Invoice Structure Extraction,
ICDAR07(327-331).
IEEE DOI 0709
BibRef
And:
A Case-Based Reasoning Approach for Unknown Class Invoice Processing,
ICIP07(V: 353-356).
IEEE DOI 0709
BibRef

Chen, S., Mao, S., Thoma, G.R.,
Simultaneous Layout Style and Logical Entity Recognition in a Heterogeneous Collection of Documents,
ICDAR07(118-122).
IEEE DOI 0709
BibRef

Nagasaki, T.[Takeshi], Marukawa, K.[Katsumi], Kagehiro, T.[Tatsuhiko], Sako, H.[Hiroshi],
A Coupon Classification Method Based on Adaptive Image Vector Matching,
ICPR06(III: 280-283).
IEEE DOI 0609
BibRef

Taghva, K.[Kazem], Beckley, R.[Russell], Coombs, J.[Jeffrey],
The Effects of OCR Error on the Extraction of Private Information,
DAS06(348-357).
Springer DOI 0602
BibRef

Flaster, M.[Michael], Hillyer, B.[Bruce], Ho, T.K.[Tin Kam],
Exploratory Analysis System for Semi-structured Engineering Logs,
DAS06(291-301).
Springer DOI 0602
BibRef

Klein, B.[Bertin], Agne, S.[Stefan], Dengel, A.R.[Andreas R.],
On Benchmarking of Invoice Analysis Systems,
DAS06(312-323).
Springer DOI 0602
BibRef
Earlier:
Results of a Study on Invoice-Reading Systems in Germany,
DAS04(451-462).
Springer DOI 0505
BibRef

Agne, S., Dengel, A.R., Klein, B.,
Evaluating SEE-a benchmarking system for document page segmentation,
ICDAR03(634-638).
IEEE DOI 0311
BibRef

Tuganbaev, D., Pakhchanian, A., Deryagin, D.,
Universal data capture technology from semi-structured forms,
ICDAR05(I: 458-462).
IEEE DOI 0508
BibRef

Shima, Y., Ohya, H., Yasuda, M.,
A form dropout method based on line-elimination and image-subtraction,
ICDAR05(I: 126-130).
IEEE DOI 0508
BibRef

Viola, P.[Paul], Rinker, J.[James], Law, M.[Martin],
Automatic Fax Routing,
DAS04(484-495).
Springer DOI 0505
BibRef

Biagioli, C.[Carlo], Francesconi, E.[Enrico], Spinosa, P.[Pierluigi], Taddei, M.[Mirco],
XML Documents Within a Legal Domain: Standards and Tools for the Italian Legislative Environment,
DAS04(413-424).
Springer DOI 0505
BibRef

Cascini, G.[Gaetano], Fantechi, A.[Alessandro], Spinicci, E.[Emilio],
Natural Language Processing of Patents and Technical Documentation,
DAS04(508-520).
Springer DOI 0505
BibRef

Hadjar, K.[Karim], Ingold, R.[Rolf],
Physical Layout Analysis of Complex Structured Arabic Documents Using Artificial Neural Nets,
DAS04(170-178).
Springer DOI 0505
BibRef

Atkins, C.B.,
Adaptive photo collection page layout,
ICIP04(V: 2897-2900).
IEEE DOI 0505
BibRef

Tam, V., Setiono, R., Santoso, A.,
Applying the conjugate gradient method for text document categorization,
ICPR04(II: 558-561).
IEEE DOI 0409
BibRef

Belaid, Y., Belaid, A.,
Morphological tagging approach in document analysis of invoices,
ICPR04(I: 469-472).
IEEE DOI 0409
BibRef

Belaid, A., Belaid, Y., Valverde, L.N., Kebairi, S.,
Adaptive technology for mail-order form segmentation,
ICDAR01(689-693).
IEEE DOI 0109
BibRef

Downton, A.C., Lucas, S.M., Patoulas, G., Beccaloni, G.W., Scoble, M.J., Robinson, G.S.,
Computerising natural history card archives,
ICDAR03(354-358).
IEEE DOI 0311
BibRef

Downton, A.C., Tams, A.C., Wells, G.J., Holmes, A.C., Lucas, S.M., Beccaloni, G.W., Scoble, M.J., Robinson, G.S.,
Constructing Web-based legacy index card archives-architectural design issues and initial data acquisition,
ICDAR01(854-858).
IEEE DOI 0109
BibRef

Sako, H., Seki, M., Furukawa, N., Ikeda, H., Imaizumi, A.,
Form reading based on form-type identification and form-data recognition,
ICDAR03(926-930).
IEEE DOI 0311
BibRef

Shimamura, T., Zhu, B.L.[Bi-Lan], Masuda, A., Onuma, M., Sakurada, T., Nakagawa, M.,
A prototype of an active form system,
ICDAR03(921-925).
IEEE DOI 0311
BibRef

Yan, H.P.[He-Ping], Wang, Z.Y.[Zhi-Yan], Guo, S.[Sen],
An evaluation system for string extraction in the airline coupon project,
ICDAR05(II: 930-934).
IEEE DOI 0508
BibRef

Zhao, S.H.[Shan-Heng], Wang, Z.Y.[Zhi-Yan],
A high accuracy rate commercial flight coupon recognition system,
ICDAR03(82-86).
IEEE DOI 0311
BibRef

Thoma, G.R.[George R.], Ford, G.[Glenn], Le, D.[Daniel], Li, Z.R.[Zhi-Rong],
Text Verification in an Automated System for the Extraction of Bibliographic Data,
DAS02(423 ff.).
Springer DOI 0303
BibRef

Wnek, J.[Janusz],
Machine Learning of Generalized Document Templates for Data Extraction,
DAS02(457 ff.).
Springer DOI 0303
BibRef

Murshed, N.[Nabeel],
Automatic Reading of Traffic Tickets,
DAS02(66 ff.).
Springer DOI 0303
BibRef

Wong, W.S.[Wing Seong], Sherkat, N., Allen, T.,
Contextual focus for improved recognition of hand-filled forms,
ICDAR01(748-752).
IEEE DOI 0109
BibRef
And:
Use of colour in form layout analysis,
ICDAR01(942-946).
IEEE DOI 0109
BibRef

Hirano, T., Okada, Y., Yoda, F.,
Field extraction method from existing forms transmitted by facsimile,
ICDAR01(738-742).
IEEE DOI 0109
BibRef

Shinjo, H., Hadano, E., Marukawa, K., Shima, Y., Sako, H.,
A recursive analysis for form cell recognition,
ICDAR01(694-698).
IEEE DOI 0109
BibRef

Zheng, Y.F.[Ye-Feng], Liu, C.S.[Chang-Song], Ding, X.Q.[Xiao-Qing], Pan, S.Y.[Shi-Yan],
Form frame line detection with directional single-connected chain,
ICDAR01(699-703).
IEEE DOI 0109
BibRef

Llados, J., Lumbreras, F., Chapaprieta, V., Queralt, J.,
ICAR: Identity Card Automatic Reader,
ICDAR01(470-474).
IEEE DOI 0109
BibRef

Chhabra, A.K.,
Anatomy of a hand-filled form reader,
WACV94(195-204).
IEEE Abstract. 0403
BibRef

Fan, K.C.[Kuo-Chin], Wang, Y.K.[Yuan-Kai], Chang, M.L.[Mei-Lin],
Form Document Identification Using Line Structure Based Features,
ICDAR01(704-708).
IEEE DOI 0109
BibRef
Earlier: A1, A3 Only: ICPR98(Vol II: 1098-1100).
IEEE DOI 9808
BibRef

Trupin, E.[Eric], Ribert, A.[Arnaud], Diana, S., Heroux, P.,
Classification Method Study for Automatic Form Class Identification,
ICPR98(Vol I: 926-928).
IEEE DOI 9808
BibRef

Duygulu, P.[Pinar], Dincel, E.[Ebru], Atalay, V.[Volkan],
A Heuristic Algorithm for Hierarchical Representation of Form Documents,
ICPR98(Vol I: 929-931).
IEEE DOI 9808
BibRef

Shinjo, H., Nakashima, K., Koga, M., Marukawa, K., Shima, Y., Hadano, E.,
A Method for Connecting Disappeared Junction Patterns on Frame Lines in Form Documents,
ICDAR97(667-670).
IEEE DOI 9708
BibRef

Bayer, T.A., Mogg-Schneider, H.U.,
A Generic System for Processing Invoices,
ICDAR97(740-744).
IEEE DOI 9708
BibRef

Shamilian, J.H., Wood, T.L., Baird, H.S.,
A Retargetable Table Reader,
ICDAR97(158-163).
IEEE DOI 9708
BibRef

Bohnacker, U., Schacht, J., Yuecel, T.,
Matching Form Lines Based on a Heuristic Search,
ICDAR97(86-90).
IEEE DOI 9708
BibRef

Yoo, J.Y., Kim, M.K., Kwon, Y.B.,
Line Removal and Restoration of Handwritten Characters on the Form Documents,
ICDAR97(128-131).
IEEE DOI 9708
BibRef

Mao, J.C.[Jian-Chang], Lorie, R., Mohiuddin, M.[Moidin],
A System for Automatically Reading IATA Flight Coupons,
ICDAR97(153-157).
IEEE DOI 9708
BibRef

Mao, J.C.[Jian-Chang], Abayan, M., Mohiuddin, M.[Moidin],
A Model-Based Form Processing Sub-System,
ICPR96(III: 691-695).
IEEE DOI 9608
(IBM Almaden Res. Center, USA) BibRef

Mao, J.C.[Jian-Chang], Mohiuddin, K.,
Form dropout using distance transformation,
ICIP95(III: 328-331).
IEEE DOI 9510
BibRef

Arai, H., Odaka, K.,
Form Processing Based on Background Region Analysis,
ICDAR97(164-169).
IEEE DOI 9708
BibRef

Tang, Y.Y., Liu, J.,
Information Acquisition and Storage of Forms in Document Processing,
ICDAR97(170-174).
IEEE DOI 9708
BibRef

Safari, R., Narasimhamurthi, N., Shridhar, M., Ahmadi, M.,
Form Registration: A Computer Vision Approach,
ICDAR97(758-761).
IEEE DOI 9708
BibRef

Aksak, I., Feist, C., Kiiko, V., Knoefel, R., Matsello, V., Oganovskij, V., Schlesinger, M., Schlesinger, D., Stanke, G.,
Extraction of filled-in data from colour forms,
CAIP97(98-105).
Springer DOI 9709
BibRef

Aksak, I., Feist, C., Kijko, V., Knoefel, R., Matsello, V., Oganovskij, V., Schlesinger, M., Schlesinger, D., Stanke, G.,
Detection of the objects with given shape on the grey-valued pictures,
CAIP97(551-558).
Springer DOI 9709
BibRef

Walischewski, H.,
Automatic knowledge acquisition for spatial document interpretation,
ICDAR97(243-247).
IEEE DOI 9708
BibRef

Ting, A., Leung, M.,
Business Form Classification Using Strings,
ICPR96(II: 690-694).
IEEE DOI 9608
(School of Applied Science, SGP) BibRef

Kosiba, D., Kasturi, R.,
Automatic Invoice Interpretation: Invoice Structure Analysis,
ICPR96(III: 721-725).
IEEE DOI 9608
(The Pennsylvania State Univ., USA) BibRef

Hirayama, H.,
Analyzing Form Images By Using Line-Shared-Adjacent Cell Relations,
ICPR96(III: 768-772).
IEEE DOI 9608
(IBM Research, J) BibRef

Shimotsuji, S., Asano, M.,
Form Identification Based On Cell Structure,
ICPR96(III: 793-797).
IEEE DOI 9608
(Toshiba Corp., J) BibRef

Lorie, R., Riyaz, V., Truong, T.,
A System for Automated Data Entry from Forms,
ICPR96(III: 686-690).
IEEE DOI 9608
(IBM Almaden Res. Ctr., USA) BibRef

Garris, M.D.[Michael D.], Grother, P.J.[Patrick J.],
Generalized Form Registration Using Structure-Based Techniques,
SDAIR96(XX) National Institute of Standards and Technology. BibRef 9600

Ihle, T.[Torsten], Schirmer, H.[Helmut], Fuchs, S.[Siegfried],
Interpretation of printed forms for blind people,
CAIP95(550-555).
Springer DOI 9509
BibRef

Leedham, C.G., Monger, D.,
Evaluation of an Interactive Tool for Handwritten Form Description,
ICDAR95(1185-1188). BibRef 9500

Latanzio, B., Garzotto, A.,
Reliable Recognition of Handwritten Marks in Checkboxes,
SDAIR96(XX) Swiss Life Information Systems Research. BibRef 9600

Turolla, E., Belaďd, Y., Belaďd, A.,
Line and cell searching in tables or forms,
CIAP95(509-514).
Springer DOI 9509
BibRef

Chapter on OCR, Document Analysis and Character Recognition Systems continues in
Specific Examples: Extract Titles, Table of Contents, Citation, Information from Papers and Books .


Last update:Mar 16, 2024 at 20:36:19