25.2 Analysis Systems Applied to Documents, Document Analysis

Document Analysis. Application, Document Analysis.
For a comparison of some of these techniques, see also Evaluation of Binarization Methods for Document Images.
See also Historical Document Analysis, Ancient Documents.

Deutsch, S.,
A Note on Some Statistics Concerning Typewritten or Printed Material,
IT(3), No. 2, June 1957, pp. 147-149. BibRef 5706

Schurmann, J., Bartneck, N., Bayer, T.A., Franke, J., Mandler, E., and Oberlander, M.,
Document Analysis: From Pixels to Contents,
PIEEE(80), No. 7, July 1992, pp. 1101-1119.
IEEE Top Reference. In Special issue on OCR. BibRef 9207

Bayer, T.A., Franke, J., Kressel, U., Mandler, E., Oberlaender, M., Schuermann, J.,
Towards the Understanding of Printed Documents,
SDIA92(xx-yy). BibRef 9200

Hershey, A.V.[Allen V.],
A Computer System for Scientific Typography,
CGIP(1), No. 4, December 1972, pp. 373-385.
Elsevier DOI BibRef 7212

Johnston, E.G.[Emily G.],
Printed Text Discrimination,
CGIP(3), No. 1, March 1974, pp. 83-89.
Elsevier DOI 0501
BibRef

Gard, R.L.[Robert L.],
Digital Picture Processing Techniques for the Publishing Industry,
CGIP(5), No. 2, June 1976, pp. 151-171.
Elsevier DOI BibRef 7606

Wong, K.Y., Casey, R.G., and Wahl, F.M.,
Document Analysis System,
IBMRD(26), No. 6, November 1982, pp. 647-656. BibRef 8211

Inagaki, K.[Kosaku], Kato, T.[Toshikazu], Hiroshima, T.[Tadashi], Sakai, T.[Toshiyuki],
MACSYM: A Hierarchical Parallel Image Processing System for Event Driven Pattern Understanding of Documents,
PR(17), No. 1, 1984, pp. 85-108.
Elsevier DOI BibRef 8400

Baird, H.S., and Thompson, K.,
Reading Chess,
PAMI(12), No. 6, June 1990, pp. 552-559.
IEEE DOI BibRef 9006
Earlier: CVWS87(277-282). Skew Correction. Text Analysis. A system that combines several basic ideas and techniques to read the printed text of chess games and recover their meaning. 98% of the games are read correctly, implying a much higher accuracy at the character/word level. Corrects for the skew of the printing. BibRef

Baird, H.S.[Henry S.], Fortune, S.J.[Steven J.], Jones, S.E.[Susan E.],
Image segmenting apparatus and methods,
US_Patent5,430,808, July 4, 1995.
WWW Link. BibRef 9507
Earlier: A1, A3, A2:
Image Segmentation by Shape-Directed Covers,
ICPR90(I: 820-825).
IEEE DOI Application, Document Analysis. BibRef

Srikantan, G., Srihari, S.N.,
A Study Relating Image Sampling Rate and Image Pattern Recognition,
CVPR94(709-712).
IEEE DOI BibRef 9400

Akiyama, T.[Teruo], Hagita, N.[Norihiro],
Automated entry system for printed documents,
PR(23), No. 11, 1990, pp. 1141-1154.
Elsevier DOI Japanese and English, Headlines, text lines, graphics. BibRef 9000

Masuda, I., Hagita, N., Akiyama, T., Takahashi, T., Naito, S.,
Approach to Smart Document Reader System,
CVPR85(550-557). BibRef 8500

Brandt, J.W., Jain, A.K., Algazi, V.R.,
Medial Axis Representation and Encoding of Scanned Documents,
JVCIR(2), 1991, pp. 151-165. BibRef 9100

Story, G.A., O'Gorman, L., Fox, D., Schaper, L.L., and Jagadish, H.V.,
The RightPages Image-Based Electronic Library for Alerting and Browsing,
Computer(25), No. 9, September 1992, pp. 17-26. BibRef 9209

O'Gorman, L.,
Image and document processing techniques for the RightPages electronic library system,
ICPR92(II:260-263).
IEEE DOI 9208
BibRef

Dengel, A.R.[Andreas R.], Bleisinger, R., Hoch, R., Fein, F.[Frank], Hönes, F.[Frank],
From Paper to Office Document Standard Representation,
Computer(25), No. 7, July 1992, pp. 63-67. BibRef 9207

Dengel, A.R.[Andreas R.],
ANASTASIL: A System for Low-Level and High-Level Geometric Analysis of Printed Documents,
SDIA92(xx-yy). BibRef 9200

Maio, D., and Rizzi, S.,
MAP Learning and Clustering in Autonomous Systems,
PAMI(15), No. 12, December 1993, pp. 1286-1297.
IEEE DOI BibRef 9312

Dengel, A.R., and Barth, G.,
High Level Document Analysis Guided by Geometric Aspects,
PRAI(2), No. 4, December 1988, pp. 641-656. Hierarchical document model, document tree. BibRef 8812

de Silva, G.L., Hull, J.J.,
Proper Noun Detection in Document Images,
PR(27), No. 2, February 1994, pp. 311-320.
Elsevier DOI BibRef 9402

Chen, F.R.[Francine R.], Bloomberg, D.S.[Dan S.],
Summarization of Imaged Documents without OCR,
CVIU(70), No. 3, June 1998, pp. 307-320.
DOI Link BibRef 9806
Earlier:
Extraction of Indicative Summary Sentences from Imaged Documents,
ICDAR97(227-232).
IEEE DOI 9708
BibRef
Earlier: A2, A1:
Document Image Summarization without OCR,
ICIP96(II: 229-232).
IEEE DOI BibRef

Chen, F.R.[Francine R.], Bloomberg, D.S.[Dan S.],
Extraction Of Thematically Relevant Text From Images,
SDAIR96(XX) Xerox Palo Alto Research Center. BibRef 9600

Spitz, A.L.[A. Lawrence], Wilcox, L.D.[Lynn D.],
Method and apparatus for classifying documents,
US_Patent5,414,781, May 9, 1995
WWW Link. BibRef 9505

Ozaki, M.[Masaharu],
Method and apparatus for document element classification by analysis of major white region geometry,
US_Patent5,574,802, Nov 12, 1996
WWW Link. BibRef 9611

McLean, G.F.,
Geometric Correction of Digitized Art,
GMIP(58), No. 2, March 1996, pp. 142-154. BibRef 9603

Yamashita, A., Amano, T., Hirayama, Y., Itoh, N., Katoh, S., Mano, T., and Toyokawa, K.,
A document recognition system and its applications,
IBMRD(40), No. 3, May 1996, pp. 341-352.
WWW Link. BibRef 9605

Maderlechner, G.[Gerd], Suda, P.[Peter], Bruckner, T.,
Classification of Documents by Form and Content,
PRL(18), No. 11-13, November 1997, pp. 1225-1231. 9806
BibRef

Nishida, H.[Hirobumi],
A Note on Practical Uses of Gray-Scale Image Analysis in Document Recognition,
PRL(19), No. 9, 31 July 1998, pp. 889-897. BibRef 9807

Nishida, H.[Hirobumi],
Boundary Extraction from Gray-Scale Document Images Based on Surface Data Structures,
GMIP(60), No. 1, January 1998, pp. 35-45. BibRef 9801
Earlier:
Boundary Feature Extraction From Gray-Scale Document Images,
ICDAR97(132-136).
IEEE DOI 9708
BibRef

Chauvet, P., Lopez Krahe, J., Taflin, E., Maître, H.,
System for an intelligent office document analysis, recognition and description,
SP(32), No. 1-2, 1993, pp. 161-190. BibRef 9300

Kundu, S.[Sukhamay],
A better fitness measure of a text-document for a given set of keywords,
PR(33), No. 5, May 2000, pp. 841-848.
Elsevier DOI 0003
BibRef

Kenney, A.R.[Anne R.], and Rieger, O.Y.[Oya Y.], (editors)
Moving Theory into Practice: Digital Imaging for Libraries and Archives,
Mountain View, CA: Research Libraries Group, 2000. ISBN 0-9700225-0-6. A how-to book for moving to the digital world for documents. (Not for analysis of them.) BibRef 0001

Lee, W.L.[Win-Long], Fan, K.C.[Kuo-Chin],
Document image preprocessing based on optimal Boolean filters,
SP(80), No. 1, January 2000, pp. 45-55. 0005
BibRef

Caere,
Company Information.
WWW Link. Vendor, OCR. OCR, document analysis, etc.

ScanSoft,
Company Information.
WWW Link. OCR, Document analysis, etc. Vendor, OCR.

Wenzel, C.[Claudia], Maus, H.[Heiko],
Leveraging corporate context within knowledge-based document analysis and understanding,
IJDAR(3), No. 4, 2001, pp. 248-260.
Springer DOI 0106
BibRef

Chan, W.[Woei], Coghill, G.[George],
Text analysis using local energy,
PR(34), No. 12, December 2001, pp. 2523-2532.
Elsevier DOI 0110
Text in clutter. BibRef

Chang, F.[Fu],
Retrieving Information from Document Images: Problems and Solutions,
IJDAR(4), No. 1, 2001, pp. 46-55.
Springer DOI 0111
BibRef

Le Cun, Y.L.[Yann L.], Bottou, L.[Leon], Bengio, Y.[Yoshua], Haffner, P.,
Gradient-Based Learning applied to Document Recognition,
PIEEE(86), No. 11, November 1998, pp. 2278-2324.
IEEE Top Reference. BibRef 9811

Aiello, M.[Marco], Monz, C.[Christof], Todoran, L.[Leon], Worring, M.[Marcel],
Document understanding for a broad class of documents,
IJDAR(5), No. 1, 2002, pp. 1-16.
PDF File. 0211
BibRef

Juola, P.[Patrick],
Document categorization and evaluation via cross-entropy,
US_Patent6,397,205, May 28, 2002
WWW Link. BibRef 0205

Klein, B.[Bertin], Dengel, A.R.[Andreas R.],
Problem-adaptable document analysis and understanding for high-volume applications,
IJDAR(6), No. 3, March 2004, pp. 167-180.
Springer DOI 0406
BibRef
Earlier: A2, A1:
smartFIX: A Requirements-Driven System for Document Analysis and Understanding,
DAS02(433 ff.).
Springer DOI 0303
BibRef

Dengel, A.R.,
Learning of Pattern-Based Rules for Document Classification,
ICDAR07(123-127).
IEEE DOI 0709
BibRef

ReadSoft International,
2007. Document processing, OCR.
WWW Link. Vendor, Document Analysis. Vendor, OCR.

Aseervatham, S.[Sujeevan], Bennani, Y.[Younes],
Semi-structured document categorization with a semantic kernel,
PR(42), No. 9, September 2009, pp. 2067-2076.
Elsevier DOI 0905
Mercer kernel; Support vector machine; Text categorization; Semantic similarity; Semi-structured data BibRef

Aseervatham, S.[Sujeevan], Antoniadis, A.[Anestis], Gaussier, E.[Eric], Burlet, M.[Michel], Denneulin, Y.[Yves],
A sparse version of the ridge logistic regression for large-scale text categorization,
PRL(32), No. 2, 15 January 2011, pp. 101-106.
Elsevier DOI 1101
Logistic regression; Model selection; Text categorization; Large scale categorization BibRef

Sharan, A.[Aditi], Joshi, M.L.[Manju L.],
An algorithm for finding document concepts using semantic similarities from WordNet ontology,
IJCVR(1), No. 2, 2010, pp. 147-157.
DOI Link 1011
BibRef

Ferilli, S.[Stefano],
Automatic Digital Document Processing and Management: Problems, Algorithms and Techniques,
Springer, 2011, ISBN: 978-0-85729-197-4
WWW Link. 1101
See also Automatic Content-based Indexing of Digital Documents through Intelligent Processing Techniques. BibRef

Bunke, H.[Horst], Riesen, K.[Kaspar],
Recent Advances in Graph-Based Pattern Recognition with Applications in Document Analysis,
PR(44), No. 5, May 2011, pp. 1057-1067.
Elsevier DOI 1101
Graph-based representation; Graph kernel; Graph embedding; Graph classification
See also Recent Advances in Structural Pattern Recognition with Applications to Visual Form Analysis. BibRef

Fischer, A.[Andreas], Riesen, K.[Kaspar], Bunke, H.[Horst],
Graph Similarity Features for HMM-Based Handwriting Recognition in Historical Documents,
FHR10(253-258).
IEEE DOI 1011
BibRef

Tsimboukakis, N., Tambouratzis, G.,
Word-Map Systems for Content-Based Document Classification,
SMC-C(41), No. 5, September 2011, pp. 662-673.
IEEE DOI 1109
BibRef

Medvet, E.[Eric], Bartoli, A.[Alberto], Davanzo, G.[Giorgio],
A probabilistic approach to printed document understanding,
IJDAR(14), No. 4, December 2011, pp. 335-347.
WWW Link. 1112
BibRef

Liu, Q.[Qiong], Liao, C.Y.[Chun-Yuan],
PaperUI,
CBDAR11(83-100).
Springer DOI 1204
Interface concept to use paper as the display and mobile devices as the mouse. A long conceptual discussion. BibRef

Chen, F.[Francine], Girgensohn, A.[Andreas], Cooper, M.[Matthew], Lu, Y.J.[Yi-Juan], Filby, G.[Gerry],
Genre identification for office document search and browsing,
IJDAR(15), No. 3, September 2012, pp. 167-182.
WWW Link. 1209
BibRef

de Oliveira Mendes, A.[António], Torrão Fiadeiro, P.[Paulo], Matos Ramos, A.M.[Ana Maria], Lopes de Sousa, S.C.[Sónia Cristina],
Development of an optical system for analysis of the ink-paper interaction,
MVA(24), No. 8, November 2013, pp. 1733-1750.
Springer DOI 1310
BibRef

Gaceb, D.[Djamel], Eglin, V.[Véronique], Lebourgeois, F.[Frank],
Classification of business documents for real-time application,
RealTimeIP(9), No. 2, June 2014, pp. 329-345.
WWW Link. 1407
BibRef

Gaceb, D.[Djamel], Lebourgeois, F.[Frank], Duong, J.,
Adaptative Smart-Binarization Method: For Images of Business Documents,
ICDAR13(118-122)
IEEE DOI 1312
business data processing BibRef

Liu, D.[Ding], Jiang, M.[Minghu], Yang, X.F.[Xiao-Fang], Li, H.[Hui],
Analyzing documents with Quantum Clustering: A novel pattern recognition algorithm based on quantum mechanics,
PRL(77), No. 1, 2016, pp. 8-13.
Elsevier DOI 1606
Quantum clustering BibRef

Chen, J.[Jin], Lopresti, D.P.[Daniel P.], Nagy, G.[George],
Conservative preprocessing of document images,
IJDAR(19), No. 4, December 2016, pp. 321-333.
Springer DOI 1611
BibRef

Bhushan, S.N.B.[S.N. Bharath], Danti, A.[Ajit],
Classification of text documents based on score level fusion approach,
PRL(94), No. 1, 2017, pp. 118-126.
Elsevier DOI 1708
Text, classification BibRef

Song, L.Y.[Ling-Yun], Liu, J.[Jun], Luo, M.[Minnan], Qian, B.[Buyue], Yang, K.[Kuan],
Sparse Relational Topical Coding on multi-modal data,
PR(72), No. 1, 2017, pp. 368-380.
Elsevier DOI 1708
Multi-modal documents and the links between them. BibRef

Pushpalatha, K., Ananthanarayana, V.S.,
A tree based representation for effective pattern discovery from multimedia documents,
PRL(93), No. 1, 2017, pp. 143-153.
Elsevier DOI 1706
Multimedia document BibRef

Nguyen, K.C.[Kha Cong], Nguyen, C.T.[Cuong Tuan], Nakagawa, M.[Masaki],
Nom document digitalization by deep convolution neural networks,
PRL(133), 2020, pp. 8-16.
Elsevier DOI 2005
BibRef

das Neves Junior, R.B.[Ricardo Batista], Lima, E.[Estanislau], Bezerra, B.L.D.[Byron L.D.], Zanchettin, C.[Cleber], Toselli, A.H.[Alejandro H.],
HU-PageScan: A fully convolutional neural network for document page crop,
IET-IPR(14), No. 15, 15 December 2020, pp. 3890-3898.
DOI Link 2103
BibRef

Zhang, H.[Hao], Chen, B.[Bo], Cong, Y.L.[Yu-Lai], Guo, D.D.[Dan-Dan], Liu, H.W.[Hong-Wei], Zhou, M.Y.[Ming-Yuan],
Deep Autoencoding Topic Model With Scalable Hybrid Bayesian Inference,
PAMI(43), No. 12, December 2021, pp. 4306-4322.
IEEE DOI 2112
Analytical models, Probabilistic logic, Artificial neural networks, Decoding, Bayes methods, feature extraction BibRef

Appalaraju, S.[Srikar], Jasani, B.[Bhavan], Kota, B.U.[Bhargava Urala], Xie, Y.S.[Yu-Sheng], Manmatha, R.,
DocFormer: End-to-End Transformer for Document Understanding,
ICCV21(973-983)
IEEE DOI 2203
Visualization, Computational modeling, Layout, Computer architecture, Transformers, Task analysis, Vision + language BibRef

Mondal, T.[Tanmoy], Das, A.[Abhijit], Ming, Z.[Zuheng],
Exploring multi-tasking learning in document attribute classification,
PRL(157), 2022, pp. 49-59.
Elsevier DOI 2205
Multi-tasks Learning, Multi-instance Learning, Weighted Multi-task Learning, Convolutional Neural Networks, Scanning Resolution Recognition BibRef

Bakkali, S.[Souhail], Ming, Z.[Zuheng], Coustaty, M.[Mickael], Rusiñol, M.[Marçal], Terrades, O.R.[Oriol Ramos],
VLCDoC: Vision-Language contrastive pre-training model for cross-Modal document classification,
PR(139), 2023, pp. 109419.
Elsevier DOI 2304
Multimodal document representation learning, Document classification, Contrastive learning, Self-Attention, Transformers BibRef

Cao, P.F.[Pan-Feng], Wu, J.[Jian],
GraphRevisedIE: Multimodal information extraction with graph-revised network,
PR(140), 2023, pp. 109542.
Elsevier DOI 2305
Document information extraction, Graph convolutional network, Transformer BibRef

Voerman, J.[Joris], Souleiman-Mahamoud, I.[Ibrahim], Coustaty, M.[Mickael], Joseph, A.[Aurélie], Poulain-d'Andecy, V.[Vincent], Ogier, J.M.[Jean-Marc],
Automatic classification of company's document stream: Comparison of two solutions,
PRL(172), 2023, pp. 181-187.
Elsevier DOI 2309
Document processing, Imbalanced classification, Neural network BibRef

Bi, H.Y.[Heng-Yue], Xu, C.H.[Can-Hui], Shi, C.[Cao], Liu, G.Z.[Guo-Zhu], Li, Y.T.[Yu-Teng], Zhang, H.H.[Hong-Hong], Qu, J.[Jing],
SRRV: A Novel Document Object Detector Based on Spatial-Related Relation and Vision,
MultMed(25), 2023, pp. 3788-3798.
IEEE DOI 2310
BibRef

Zhang, Z.R.[Zhen-Rong], Ma, J.F.[Jie-Feng], Du, J.[Jun], Wang, L.C.[Li-Cheng], Zhang, J.S.[Jian-Shu],
Multimodal Pre-Training Based on Graph Attention Network for Document Understanding,
MultMed(25), 2023, pp. 6743-6755.
IEEE DOI 2311
BibRef

Fu, W.L.[Wen-Long], Xue, B.[Bing], Gao, X.Y.[Xiao-Ying], Zhang, M.J.[Meng-Jie],
Genetic Programming for Document Classification: A Transductive Transfer Learning System,
Cyber(54), No. 2, February 2024, pp. 1119-1132.
IEEE DOI 2402
Transfer learning, Training, Training data, Task analysis, Feature extraction, Support vector machines, Data models, transductive transfer learning BibRef

Liu, T.F.[Teng-Fei], Hu, Y.L.[Yong-Li], Gao, J.B.[Jun-Bin], Sun, Y.F.[Yan-Feng], Yin, B.C.[Bao-Cai],
Hierarchical Multi-Modal Prompting Transformer for Multi-Modal Long Document Classification,
CirSysVideo(34), No. 7, July 2024, pp. 6376-6390.
IEEE DOI 2407
Transformers, Task analysis, Feature extraction, Visualization, Adaptation models, Computational modeling, prompt learning BibRef

Xu, Z.W.[Zhe-Wei], Iwaihara, M.[Mizuho],
Confidence-Driven Contrastive Learning for Document Classification without Annotated Data,
IEICE(E107-D), No. 8, August 2024, pp. 1029-1039.
WWW Link. 2408
BibRef


Shrikhande, A.[Aniruddha], Shrikhande, S.[Saachi], Borse, S.[Siddhesh], Madake, J.[Jyoti], Bhatlawande, S.[Shripad],
Enhancing Large Document Organization Through Effective Preprocessing and Data Embedding,
ICCVMI23(1-5)
IEEE DOI 2403
Support vector machines, Dimensionality reduction, Text categorization, Data preprocessing, Organizations, TF-IDF BibRef

He, J.[Jiabang], Wang, L.[Lei], Hu, Y.[Yi], Liu, N.[Ning], Liu, H.[Hui], Xu, X.[Xing], Shen, H.T.[Heng Tao],
ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction,
ICCV23(19428-19437)
IEEE DOI Code:
WWW Link. 2401
BibRef

Cao, H.Y.[Hao-Yu], Bao, C.[Changcun], Liu, C.[Chaohu], Chen, H.[Huang], Yin, K.[Kun], Liu, H.[Hao], Liu, Y.[Yinsong], Jiang, D.Q.[De-Qiang], Sun, X.[Xing],
Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration,
ICCV23(19460-19470)
IEEE DOI 2401
BibRef

Inoue, N.[Naoto], Kikuchi, K.[Kotaro], Simo-Serra, E.[Edgar], Otani, M.[Mayu], Yamaguchi, K.[Kota],
Towards Flexible Multi-modal Document Models,
CVPR23(14287-14296)
IEEE DOI 2309
BibRef

Gemelli, A.[Andrea], Biswas, S.[Sanket], Civitelli, E.[Enrico], Lladós, J.[Josep], Marinai, S.[Simone],
Doc2graph: A Task Agnostic Document Understanding Framework Based on Graph Neural Networks,
TextEvery22(329-344).
Springer DOI 2304
BibRef

Davis, B.[Brian], Morse, B.[Bryan], Price, B.[Brian], Tensmeyer, C.[Chris], Wigington, C.[Curtis], Morariu, V.[Vlad],
End-to-end Document Recognition and Understanding with Dessurt,
TextEvery22(280-296).
Springer DOI 2304
BibRef

Biten, A.F.[Ali Furkan], Tito, R.[Rubèn], Gomez, L.[Lluis], Valveny, E.[Ernest], Karatzas, D.[Dimosthenis],
OCR-IDL: OCR Annotations for Industry Document Library Dataset,
TextEvery22(241-252).
Springer DOI 2304
BibRef

Oussaid, I.[Ismail], Vanhuffel, W.[William], Ratnamogan, P.[Pirashanth], Hajaiej, M.[Mhamed], Mathey, A.[Alexis], Gilles, T.[Thomas],
Information Extraction from Visually Rich Documents with Font Style Embeddings,
ICPR22(1657-1663)
IEEE DOI 2212
Visualization, Computational modeling, Layout, Benchmark testing, Information retrieval, Portable document format BibRef

Kim, G.[Geewook], Hong, T.[Teakgyu], Yim, M.[Moonbin], Nam, J.[JeongYeon], Park, J.[Jinyoung], Yim, J.[Jinyeong], Hwang, W.[Wonseok], Yun, S.[Sangdoo], Han, D.Y.[Dong-Yoon], Park, S.H.[Seung-Hyun],
OCR-Free Document Understanding Transformer,
ECCV22(XXVIII:498-517).
Springer DOI 2211
BibRef

Nicolaieff, L.[Lina], Kandi, M.M.[Mohamed Mehdi], Zegaoui, Y.[Younes], Bortolaso, C.[Christophe],
Intelligent Document Processing with Small and Relevant Training Dataset,
ISCV22(1-7)
IEEE DOI 2208
Training, Location awareness, Annotations, Computer architecture, Object detection, Detectors, Portable document format, Triplet-loss BibRef

Shah, S.[Sarathi], Joshi, M.V.,
Document Language Classification: Hierarchical Model with Deep Learning Approach,
CAIP21(I:372-381).
Springer DOI 2112
BibRef

Dieu, L.T.[Linh Truong], Nguyen, T.T.[Thuan Trong], Vo, N.D.[Nguyen D.], Nguyen, T.V.[Tam V.], Nguyen, K.[Khang],
Parsing Digitized Vietnamese Paper Documents,
CAIP21(I:382-392).
Springer DOI 2112
BibRef

Li, P.Z.[Pei-Zhao], Gu, J.X.[Jiu-Xiang], Kuen, J.[Jason], Morariu, V.I.[Vlad I.], Zhao, H.D.[Han-Dong], Jain, R.[Rajiv], Manjunatha, V.[Varun], Liu, H.F.[Hong-Fu],
SelfDoc: Self-Supervised Document Representation Learning,
CVPR21(5648-5656)
IEEE DOI 2111
Training, Visualization, Semantics, Layout, Linguistics, Pattern recognition BibRef

Zingaro, S.P.[Stefano Pio], Lisanti, G.[Giuseppe], Gabbrielli, M.[Maurizio],
Multimodal Side-Tuning for Document Classification,
ICPR21(5206-5213)
IEEE DOI 2105
Deep learning, Adaptation models, Visualization, Transfer learning, Rigidity, Tuning BibRef

Tropin, D.V.[Daniil V.], Ilyuhin, S.A.[Sergey A.], Nikolaev, D.P.[Dmitry P.], Arlazarov, V.V.[Vladimir V.],
Approach for Document Detection by Contours and Contrasts,
ICPR21(9689-9695)
IEEE DOI 2105
Performance evaluation, Mobile handsets, Pattern recognition, document detection, quadrangle detection, image segmentation BibRef

Naz, T.[Tayyaba], Khan, A.A.[Anam Ahmad], Shafait, F.[Faisal],
Ubiquitous Document Capturing with Deep Learning,
DICTA17(1-8)
IEEE DOI 1804
document image processing, image capture, image enhancement, information retrieval, knowledge management, Neurons BibRef

Li, S., Xiao, T., Li, H., Yang, W., Wang, X.,
Identity-Aware Textual-Visual Matching with Latent Co-attention,
ICCV17(1908-1917)
IEEE DOI 1802
entropy, feature extraction, image matching, text analysis, cross-modal features, easy incorrect matchings, Visualization BibRef

Sicre, R.[Ronan], Awal, A.M.[Ahmad Montaser], Furon, T.[Teddy],
Identity Documents Classification as an Image Classification Problem,
CIAP17(II:602-613).
Springer DOI 1711
BibRef

Das, A., Roy, S., Bhattacharya, U., Parui, S.K.,
Document Image Classification with Intra-Domain Transfer Learning and Stacked Generalization of Deep Convolutional Neural Networks,
ICPR18(3180-3185)
IEEE DOI 1812
Training, Task analysis, Hidden Markov models, Convolutional neural networks, Computational modeling, neural network BibRef

Roy, S., Das, A., Bhattacharya, U.,
Generalized stacking of layerwise-trained Deep Convolutional Neural Networks for document image classification,
ICPR16(1273-1278)
IEEE DOI 1705
Computational modeling, Hidden Markov models, Neural networks, Stacking, Support vector machines, Training, CNN, Deep learning, document classification, supervised layerwise training BibRef

Simon, M.[Marcel], Rodner, E.[Erik], Denzler, J.[Joachim],
Fine-grained classification of identity document types with only one example,
MVA15(126-129)
IEEE DOI 1507
Accuracy BibRef

Tanaka, K., Iwata, M., Kunze, K., Iwamura, M., Kise, K.,
Share Me: A Digital Annotation Sharing Service for Paper Documents with Multiple Clients Support,
ACPR13(779-782)
IEEE DOI 1408
document image processing BibRef

Stamm, K., Liwicki, M., Dengel, A.,
Continuous Partial-Order Planning for Multichannel Document Analysis: A Process-Driven Approach,
ICDAR13(626-630)
IEEE DOI 1312
document handling BibRef

Ho, A.K.N., Ragot, N., Ramel, J.Y., Eglin, V., Sidere, N.,
Document Classification in a Non-stationary Environment: A One-Class SVM Approach,
ICDAR13(616-620)
IEEE DOI 1312
document image processing. What kind of document. BibRef

da Silva Barboza, R., Dueire Lins, R., Marinho de Jesus, D.,
A Color-Based Model to Determine the Age of Documents for Forensic Purposes,
ICDAR13(1350-1354)
IEEE DOI 1312
document image processing BibRef

Huber-Mörk, R.[Reinhold], Schindler, A.[Alexander],
An Image Based Approach for Content Analysis in Document Collections,
ISVC13(II:278-287).
Springer DOI 1311
BibRef
Earlier:
Quality Assurance for Document Image Collections in Digital Preservation,
ACIVS12(108-119).
Springer DOI 1209
BibRef

Damghanian, M.[Mitra], Olsson, R.[Roger], Sjöström, M.[Mårten],
The Sampling Pattern Cube: A Representation and Evaluation Tool for Optical Capturing Systems,
ACIVS12(120-131).
Springer DOI 1209
BibRef

Forcher, B.[Bjorn], Agne, S.[Stefan], Dengel, A.R.[Andreas R.], Gillmann, M.[Michael], Roth-Berghofer, T.[Thomas],
Semantic Logging: Towards Explanation-Aware DAS,
ICDAR11(1140-1144).
IEEE DOI 1111
BibRef

Baird, H.S.[Henry S.],
Document Recognition without Strong Models,
ICDAR11(414-423).
IEEE DOI 1111
General discussion: can high performance happen without detailed domain knowledge? Partly yes. BibRef

Lazzara, G.[Guillaume], Levillain, R.[Roland], Geraud, T.[Thierry], Jacquelet, Y.[Yann], Marquegnies, J.[Julien], Crepin-Leblond, A.[Arthur],
The SCRIBO Module of the Olena Platform: A Free Software Framework for Document Image Analysis,
ICDAR11(252-258).
IEEE DOI 1111
Code, Document Analysis. BibRef

Gangeh, M.J.[Mehrdad J.], Kamel, M.S.[Mohamed S.], Duin, R.P.W.[Robert P.W.],
Random Subspace Method in Text Categorization,
ICPR10(2049-2052).
IEEE DOI 1008
BibRef

Chaudhury, K.[Krishnendu], Jain, A.[Ankur], Thirthala, S.R.[Sri Ram], Sahasranaman, V.[Vivek], Saxena, S.[Shobhit], Mahalingam, S.[Selvam],
Google Newspaper Search: Image Processing and Analysis Pipeline,
ICDAR09(621-625).
IEEE DOI 0907
Scanned older newspapers. BibRef

Terasawa, K., Tanaka, Y.,
Locality Sensitive Pseudo-Code for Document Images,
ICDAR07(73-77).
IEEE DOI 0709
BibRef

Seki, M.[Minenobu], Fujio, M.[Masakazu], Nagasaki, T.[Takeshi], Shinjo, H.[Hiroshi], Marukawa, K.[Katsumi],
Information Management System Using Structure Analysis of Paper/Electronic Documents and Its Applications,
ICDAR07(689-693).
IEEE DOI 0709
BibRef

Boutemedjet, S.[Sabri], Ziou, D.[Djemel],
Visual Aspect: A Unified Content-Based Collaborative Filtering Model for Visual Document Recommendation,
ICIAR06(I: 685-696).
Springer DOI 0610
BibRef

Wen, D.[Di], Ding, X.Q.[Xiao-Qing],
Applying Preattentive Visual Guidance in Document Image Analysis,
IWICPAS06(328-338).
Springer DOI 0608
BibRef

Simske, S.J.[Steven J.], Arnabat, J.[Jordi],
Document Analysis System for Automating Workflows,
DAS06(588-592).
Springer DOI 0602
BibRef
Earlier:
User-Directed Analysis of Scanned Images,
DocEng03(212-221). November 2003. segmentation, pixel classification, scene analysis, text processing, document capture, zoning, click and select,
WWW Link. BibRef
And: TR Hewlett-Packard Labs, TR-233, 2003.
HTML Version. BibRef

Fan, J., Lin, X., Simske, S.J.,
A comprehensive image processing suite for book re-mastering,
ICDAR05(I: 447-451).
IEEE DOI 0508
BibRef

Belaïd, A.[Abdel], Alusse, A.[André],
Toward File Consolidation by Document Categorization,
DAS06(437-448).
Springer DOI 0602
BibRef

Qiang, Q.[Qi], He, Q.M.[Qin-Ming],
A Multiclass Classification Framework for Document Categorization,
DAS06(474-483).
Springer DOI 0602
BibRef

Nagy, G.[George], Lopresti, D.P.[Daniel P.],
Interactive Document Processing and Digital Libraries,
DIAL06(2-11).
IEEE DOI 0604
BibRef

Lopresti, D.P.[Daniel P.],
Exploiting WWW Resources in Experimental Document Analysis Research,
DAS02(532 ff.).
Springer DOI 0303
BibRef

Baird, H.S.[Henry S.], Popat, K.[Kris],
Human Interactive Proofs and Document Image Analysis,
DAS02(507 ff.).
Springer DOI 0303
BibRef

Mikheev, A.[Artem], Vincent, L.[Luc], Hawrylycz, M.[Mike], Bottou, L.[Léon],
Electronic Document Publishing Using DjVu,
DAS02(480 ff.).
Springer DOI 0303
BibRef

Bottou, L.[Leon], Haffner, P.[Patrick], Le Cun, Y.L.[Yann L.],
Efficient conversion of digital documents to multilayer raster formats,
ICDAR01(444-448).
IEEE DOI 0109
BibRef

Bottou, L.[Leon], Haffner, P.[Patrick], Howard, P.G.[Paul G.], Le Cun, Y.L.[Yann L.],
Color Documents on the Web with DjVu,
ICIP99(I:239-243).
IEEE DOI BibRef 9900

Leung, C.C., Kwok, P.C.K., Chan, F.H.Y., Tsui, W.K.,
Normalization of contrast in document images using generalized fuzzy operator with least square method,
ICPR02(III: 115-118).
IEEE DOI 0211
BibRef

Fataicha, Y., Cheriet, M., Nie, J.Y., Suen, C.Y.,
Content analysis in document images: a scale space approach,
ICPR02(III: 335-338).
IEEE DOI 0211
BibRef

Spitz, A.L.,
Progress in document reconstruction,
ICPR02(I: 464-467).
IEEE DOI 0211
BibRef

Torkkola, K.,
Discriminative features for document classification,
ICPR02(I: 472-475).
IEEE DOI 0211
BibRef

Breuel, T.M., Janssen, W.C., Popat, K., Baird, H.S.,
Paper to PDA,
ICPR02(I: 476-479).
IEEE DOI 0211
BibRef

da Cunha Cavalcanti, G.D., de Barros Carvalho, E.C.,
An architecture for document management,
ICIP02(III: 973-976).
IEEE DOI 0210
BibRef

Harlfinger, D., Kotzabassi, S.,
Hidden in Greek Manuscripts,
ICIP01(Hidden in Greek Manuscripts). 0108
Invited Talk. Not in proceedings. BibRef

Redeke, I.,
Image and Graphic Reader,
ICIP01(I: 806-809).
IEEE DOI 0108
BibRef

Roussel, N., Hitz, O., Ingold, R.,
Web-based cooperative document understanding,
ICDAR01(368-373).
IEEE DOI 0109
BibRef

Zaghetto, A.[Alexandre], de Queiroz, R.L.[Ricardo L.],
High quality scanned book compression using pattern matching,
ICIP10(2165-2168).
IEEE DOI 1009
BibRef
Earlier:
Improved layer processing for MRC compression of scanned documents,
ICIP09(1993-1996).
IEEE DOI 0911
BibRef
Earlier:
Iterative pre- and post-processing for MRC layers of scanned documents,
ICIP08(1009-1012).
IEEE DOI 0810
BibRef

de Queiroz, R.L.,
Pre-Processing for MRC Layers of Scanned Images,
ICIP06(3093-3096).
IEEE DOI 0610
BibRef
Earlier:
On Data-filling Algorithms for MRC Layers,
ICIP00(Vol II: 586-589).
IEEE DOI 0008
BibRef

Yamada, K., Ishikawa, K., Nakajima, N.,
A Method of Analyzing the Handling of Paper Documents in Motion Images,
ICPR00(Vol IV: 413-416).
IEEE DOI 0009
BibRef

Yang, Y., Yan, H.,
A Robust Document Processing System Combining Image Segmentation with Content-based Document Compression,
ICPR00(Vol IV: 519-522).
IEEE DOI 0009
BibRef

Pavlidis, T.,
A New Paper/computer Interface: Two-dimensional Symbologies,
ICPR00(Vol II: 145-151).
IEEE DOI 0009
BibRef

Srihari, S.N., and Zack, G.W.,
Document Image Analysis,
ICPR86(434-436). BibRef 8600

Kim, W.Y., Yuan, P.,
A Practical Pattern Recognition System for Translation, Scale and Rotation Invariance,
CVPR94(391-396).
IEEE DOI BibRef 9400

Huttenlocher, D.P.[Daniel P.], Rucklidge, W.J.[William J.],
DigiPaper: A Versatile Color Document Image Representation,
ICIP99(I:219-223).
IEEE DOI BibRef 9900

Tayeb-Bey, S., Saidi, A., Emptoz, H.,
Analysis and Conversion of Documents,
ICPR98(Vol II: 1089-1091).
IEEE DOI 9808
BibRef

Nakajima, N.[Noboru], Tanaka, N.[Naoya], Yamada, K.[Keiji],
Document Reconstruction and Recognition from an Image Sequence,
ICPR98(Vol I: 922-925).
IEEE DOI 9808
BibRef

Li, Y., Lalonde, M., Reiher, E., Rizand, J.F., Zhu, C.J.,
A Knowledge-Based Image Understanding Environment for Document Processing,
ICDAR97(979-983).
IEEE DOI 9708
BibRef

Kauniskangas, H.[Hannu], Sauvola, J.[Jaakko],
An Automated Defect Management for Document Images,
ICPR98(Vol II: 1288-1294).
IEEE DOI 9808
BibRef

Eglin, V.[Véronique], Emptoz, H.[Hubert],
Logarithmic Spiral Grid and Gaze Control for the Development of Strategies of Visual Segmentation on a Document,
ICDAR97(689-692).
IEEE DOI 9708
BibRef

Blaesius, K.H., Grawemeyer, B., John, I., Kuhn, N.,
Knowledge-Based Document Analysis,
ICDAR97(728-731).
IEEE DOI 9708
BibRef

Wenzel, C.,
Supporting Information Extraction from Printed Documents by Lexico-Semantic Pattern Matching,
ICDAR97(732-735).
IEEE DOI 9708
BibRef

Chang, F., Chiu, T.F., Chou, T.R., Lee, M.C., Lu, Y.C., Shuai, T.Y., Tan, T.M., Wu, J.J., Young, C.S.,
A Document Analysis and Recognition System,
ICDAR97(736-739).
IEEE DOI 9708
BibRef

Miyamoto, T., Ishitani, Y., Seino, K., Nakamura, T., Tanabe, Y.,
Analysis of Required Elements for Next-Generation Document Reader on the Basis of User Requirements,
ICDAR97(428-431).
IEEE DOI 9708
BibRef

Bunke, H., Gonin, R., Moeri, D.,
A Tool For Versatile And User-Friendly Document Correction,
ICDAR97(433-438).
IEEE DOI 9708
BibRef

Yamazaki, Y., Komatsu, N.,
A Proposal for a Text-Indicated Writer Verification Method,
ICDAR97(709-713).
IEEE DOI 9708
BibRef

Bapst, F., Zramdini, A.[Abdelwahab], Ingold, R.[Rolf],
A Scenario Model Advocating User-Driven Adaptive Document Recognition Systems,
ICDAR97(745-748).
IEEE DOI 9708
BibRef

Buddrus, F., Bellavia, M.,
Surfing an ODBMS (Maintaining WWW Documents with O2),
ICDAR97(827-830).
IEEE DOI 9708
BibRef

O'Keefe, S.E.M., and Austin, J.,
Document Feature Recognition using a Mesh of Associative Memories,
BMVC96(Poster Session 1). 9608
BibRef
Earlier:
Application of an Associative Memory to the Analysis of Document Fax Images,
BMVC94(xx-yy).
PDF File. 9409
University of York BibRef

Yamada, M., and Hasuike, K.,
Document Image Processing Based on Enhanced Border Following Algorithm,
ICPR90(II: 231-236).
IEEE DOI BibRef 9000

Kubota, K., Iwaki, O., Arakawa, H.,
Document Understanding System,
ICPR84(612-614). BibRef 8400

Nagy, G., Seth, S.C.,
Hierarchical Representation of Optically Scanned Documents,
ICPR84(347-349). BibRef 8400

Doster, W.,
Different States of a Document's Content on Its Way from the Gutenbergian World to the Electronic World,
ICPR84(872-874). BibRef 8400

Moulinier, I.[Isabelle], Raskinis, G.[Gailius], Ganascia, J.G.[Jean-Gabriel],
Text Categorization: A Symbolic Approach,
SDAIR96(XX) University of Paris. Vytautas Magnus University. BibRef 9600

Kutlu, G.[Gokhan], Draper, B.A.[Bruce A.], Moss, E.B.[Eliot B.], Riseman, E.M.[Edward M.],
Support Tools for Visual Information Management,
SDAIR96(XX) University of Massachusetts.
PS File. BibRef 9600

Searls, D.B., Taylor, S.L.,
Document Image Analysis Using Logic-Grammar-Based Syntactic Pattern Recognition,
SDIA92(xx-yy). 0905
BibRef

Chapter on OCR, Document Analysis and Character Recognition Systems continues in
Document Analysis Systems, General, Survey, Evaluation.


Last update: Nov 26, 2024 at 16:40:19