23.2.2.2 Document Layout, Structure Analysis

Document Analysis. Application, Document Layout.

See also Block Segmentation and Text Extraction in Mixed Text/Image Documents.

Goldwasser, S.M., Troxel, D.E.,
Page Composition of Continuous Tone Imagery,
CVGIP(26), No. 1, April 1984, pp. 30-44.
WWW Version. BibRef 8404

Okawa, Y.[Yoshikuni],
A Structural Analysis of Visual Form on Packaging Graphics and Its Use in an Automated Design System,
CVGIP(43), No. 2, August 1988, pp. 265-278.
WWW Version. BibRef 8808
Earlier:
Identification of Packaged-in-a-box Goods for Designing a Part of an Intelligent Cash Register,
ICPR80(150-152). Where to place the graphics so that they do not hide the picture. BibRef

Srihari, S.N., and Govindaraju, V.,
Analysis of Textual Images Using the Hough Transform,
MVA(2), 1989, pp. 141-153. BibRef 8900

Peppers, N.A.[Norman A.], Young, J.R.[James R.], Nishi, H.[Hisami], Ueno, H.[Hiroshi],
Page segmentor,
US_Patent4,817,169, March 28, 1989.
WWW Version. BibRef 8903

O'Gorman, L.,
The Document Spectrum for Page Layout Analysis,
PAMI(15), No. 11, November 1993, pp. 1162-1173.
IEEE Abstract. IEEE Top Reference.
WWW Version. Determine the structure of the document for storage and recognition. (For evaluation: See also Empirical Performance Evaluation Methodology and Its Application to Page Segmentation Algorithms.) BibRef 9311

Krishnamoorthy, M., Nagy, G., Seth, S., and Viswanathan, M.,
Syntactic Segmentation and Labeling of Digitized Pages from Technical Journals,
PAMI(15), No. 7, July 1993, pp. 737-747.
IEEE Abstract. IEEE Top Reference.
WWW Version. A more complete version of the following paper and system. Error correction with backtracking. Computationally complex. Understanding of how documents are put together. BibRef 9307

Hones, F., Lichter, J.,
Layout Extraction of Mixed-Mode Documents,
MVA(7), No. 4, 1994, pp. 237-246. BibRef 9400

Saitoh, T., Yamaai, T., Tachikawa, M.,
Document Image Segmentation and Layout Analysis,
IEICE(Info Sys 77), No. 7, 1994, pp. 778-784. BibRef 9400

Saitoh, T., Pavlidis, T.,
Page segmentation without rectangle assumption,
ICPR92(II:277-280).
WWW Version. 9208 BibRef

Cullen, J.F.[John F.], Ejiri, K.[Koichi],
Segmentation of text, picture and lines of a document image,
US_Patent5,335,290, August 2, 1994.
WWW Version. BibRef 9408

Kopec, G.E., Chou, P.A.,
Document Image Decoding Using Markov Source Models,
PAMI(16), No. 6, June 1994, pp. 602-617.
IEEE Abstract. IEEE Top Reference.
WWW Version. BibRef 9406
Earlier:
Document image decoding,
ICIP94(II: 36-40).
WWW Version. 9411 BibRef
Earlier:
Automatic Generation of Custom Document Image Decoders,
ICDAR93(xx-yy). BibRef

Kam, A.C.,
Heuristic Document Image Decoding Using Markov Source Models,
MIT Masters Thesis, June 1993. BibRef 9306

Kam, A.C., Kopec, G.E.,
Document Image Decoding by Heuristic Search,
PAMI(18), No. 9, September 1996, pp. 945-950.
IEEE Abstract. IEEE Top Reference.
WWW Version. Heuristic Search. BibRef 9609
Earlier:
Separable Source Models for Document Image Decoding,
SPIE(2422), February 1995, pp. 84-97. BibRef

Shiau, J.N.[Jeng-Nan],
Automatic image segmentation for color documents,
US_Patent5,341,226, August 23, 1994.
WWW Version. BibRef 9408

Hayashi, N.[Naoki], Saito, K.[Kazuo],
Document layout processing method and device for carrying out the same,
US_Patent5,379,373, January 3, 1995.
WWW Version. BibRef 9501

Ozaki, M.[Masaharu],
Method and apparatus for document segmentation by background analysis,
US_Patent5,555,556, September 10, 1996.
WWW Version. BibRef 9609

Kopec, G.E.[Gary E.], Lomelin, M.[Mauricio],
Supervised Template Estimation for Document Image Decoding,
PAMI(19), No. 12, December 1997, pp. 1313-1324.
IEEE Abstract. IEEE Top Reference.
WWW Version. 9712 BibRef
Earlier:
Document image Decoding Approach to Character Template Estimation,
ICIP96(II: 213-216).
WWW Version. BibRef
And:
Document Specific Character Template Estimation,
SPIE(2660), 1996, pp. 14-26. Templates for recognizing characters. BibRef

Kopec, G.E.[Gary E.],
Multilevel Character Templates for Document Image Decoding,
SPIE(3027), 1997, pp. xx-yy. BibRef 9700
Earlier:
Document Image Decoding in the Berkeley Digital Library Project,
SPIE(2660), 1996, pp. 2-13. BibRef
And:
Document Image Decoding in the Berkeley Digital Library,
ICIP96(II: 769-772).
WWW Version. BibRef

Dengel, A.R., Dubiel, F.,
Computer Understanding of Document Structure,
IJIST(7), No. 4, Winter 1996, pp. 271-278. 9612 BibRef

Niyogi, D., Srihari, S.N.,
Integrated Approach to Document Decomposition and Structural-Analysis,
IJIST(7), No. 4, Winter 1996, pp. 330-342. 9612 BibRef
Earlier:
Knowledge-Based Derivation of Document Logical Structure,
ICDAR95(472-475). Bottom-up approach. Three levels of rules: knowledge, control, and strategy. Accuracy varies (48-100%). Has 160 rules. BibRef

Niyogi, D., Srihari, S.N.,
A Rule-Based System for Document Understanding,
AAAI-86(789-793). BibRef 8600

Simon, A., Pret, J.C., Johnson, A.P.,
A Fast Algorithm for Bottom-Up Document Layout Analysis,
PAMI(19), No. 3, March 1997, pp. 273-277.
IEEE Abstract. IEEE Top Reference.
WWW Version. 9704 BibRef

Liu, J.M., Tang, Y.Y., Suen, C.Y.,
Chinese Document Layout Analysis Based on Adaptive Split-and-Merge and Qualitative Spatial Reasoning,
PR(30), No. 8, August 1997, pp. 1265-1278.
WWW Version. 9708 BibRef

Tang, Y.Y., Ma, H., Liu, J.M., Li, B.F., Xi, D.H.,
Multiresolution Analysis in Extraction of Reference Lines from Documents with Gray-Level Background,
PAMI(19), No. 8, August 1997, pp. 921-926.
IEEE Abstract. IEEE Top Reference.
WWW Version. 9709 Find reference lines to determine the structure of the document. BibRef

Bayer, T.[Thomas], Kressel, U.[Ulrich], Mogg-Schneider, H.[Heike], Renz, I.[Ingrid],
Categorizing Paper Documents,
CVIU(70), No. 3, June 1998, pp. 299-306.
WWW Version. BibRef 9806

Caelli, T.M.[Terry M.], and Dillon, C.[Craig],
CITE: A Trainable Image Annotation System,
PRL(18), No. 11-13, November 1997, pp. 1247-1252. 9806 BibRef

Dillon, C.[Craig], and Caelli, T.M.[Terry M.],
Learning Image Annotation: The Cite System,
Videre(1), No. 2, Winter 1998, pp. 90-121. Generate automatic annotations. Apply to airports and office scenes. Region and color based analysis.
HTML Version.
PDF Version. BibRef 9800

Ancin, H.[Hakan],
Document segmentation system,
US_Patent5,956,468, September 21, 1999.
WWW Version. Text and graphics. BibRef 9909

Chao, H.[Hui], Bloomberg, D.S.[Dan S.],
Method and system for document segmentation,
US_Patent6,904,170, June 7, 2005.
WWW Version. Projection profiles. BibRef 0506

Li, J., Gray, R.M.,
Context-Based Multiscale Classification of Document Images Using Wavelet Coefficient Distributions,
IP(9), No. 9, September 2000, pp. 1604-1616.
WWW Version. 0008 BibRef

Ageenko, E., Fränti, P.,
Context-based filtering of document images,
PRL(21), No. 6-7, June 2000, pp. 483-491. 0006 BibRef

Lee, K.H.[Kyong-Ho], Choy, Y.C.[Yoon-Chul], Cho, S.B.[Sung-Bae],
Geometric Structure Analysis of Document Images: A Knowledge-Based Approach,
PAMI(22), No. 11, November 2000, pp. 1224-1240.
IEEE Abstract. IEEE Top Reference.
WWW Version. 0012 Journal pages. Combines bottom-up and top-down approaches. Segmentation, then identification. BibRef

Shin, C.[Christian], Doermann, D.[David], Rosenfeld, A.[Azriel],
Classification of document pages using structure-based features,
IJDAR(3), No. 4, 2001, pp. 232-247.
HTML Version. 0106 BibRef

Liang, J.[Jisheng], Phillips, I.T.[Ihsin T.], Haralick, R.M.[Robert M.],
An Optimization Methodology for Document Structure Extraction on Latin Character Documents,
PAMI(23), No. 7, July 2001, pp. 719-734.
IEEE Abstract. IEEE Top Reference.
WWW Version. 0108 BibRef

Chetverikov, D., Liang, J., Komuves, J., Haralick, R.M.,
Zone Classification Using Texture Features,
ICPR96(III: 676-680).
WWW Version. 9608 (Hungarian Academy of Sciences, H) BibRef

Liang, J., Ha, J., Haralick, R.M., and Phillips, I.T.,
Document Layout Structure Extraction Using Bounding Boxes of Different Entities,
WACV96(278-283).
IEEE Abstract. IEEE Top Reference. 9609 BibRef

Haralick, R.M.,
Document Image Analysis: Geometric and Logical Layout,
CVPR94(385-390).
IEEE Abstract. IEEE Top Reference. BibRef 9400

Klink, S.[Stefan], Kieninger, T.[Thomas],
Rule-based document structure understanding with a fuzzy combination of layout and textual features,
IJDAR(4), No. 1, 2001, pp. 18-26.
HTML Version. 0111 BibRef

Altamura, O.[Oronzo], Esposito, F.[Floriana], Malerba, D.[Donato],
Transforming paper documents into XML format with WISDOM++,
IJDAR(4), No. 1, 2001, pp. 2-17.
HTML Version. 0111 BibRef

Lee, S.W.[Seong-Whan], Ryu, D.S.[Dae-Seok],
Parameter-Free Geometric Document Layout Analysis,
PAMI(23), No. 11, November 2001, pp. 1240-1256.
IEEE Abstract. IEEE Top Reference.
WWW Version. 0112 Segment into maximal homogeneous regions, identify as text, graphics, etc. Periodicity measure for text. BibRef

Ryu, D.S., Kang, S.M., Lee, S.W.,
Parameter-independent Geometric Document Layout Analysis,
ICPR00(Vol IV: 397-400).
WWW Version.
HTML Version. 0009 BibRef

Hull, J.J., Lee, D.S.,
Simultaneous Highlighting of Paper and Electronic Documents,
ICPR00(Vol IV: 401-404).
WWW Version.
HTML Version. 0009 BibRef

Acharyya, M.[Mausumi], Kundu, M.K.[Malay K.],
Document image segmentation using wavelet scale-space features,
CirSysVideo(12), No. 12, December 2002, pp. 1117-1127.
IEEE Top Reference. 0301 BibRef
Earlier:
Multiscale Segmentation of Document Images Using M-Band Wavelets,
CAIP01(510 ff.).
HTML Version. 0210 See also adaptive approach to unsupervised texture segmentation using M-Band wavelet transform, An. BibRef

Lee, J.Y.[Ji-Yeon], Park, J.S.[Jeong-Seon], Byun, H.R.[Hye-Ran], Moon, J.[Jongsub], Lee, S.W.[Seong-Whan],
Automatic generation of structured hyperdocuments from document images,
PR(35), No. 2, February 2002, pp. 485-503.
WWW Version. 0201 BibRef

Lee, J.Y., Choi, S.H., Lee, S.W.,
Automatic Generation of Structured Hyperdocuments from Multi-column Document Images,
ICPR00(Vol IV: 422-425).
WWW Version.
HTML Version. 0009 BibRef

Lam, W.[Wai], Han, Y.[Yiqiu],
Automatic textual document categorization based on generalized instance sets and a metamodel,
PAMI(25), No. 5, May 2003, pp. 628-633.
IEEE Abstract. IEEE Top Reference. 0304 Generalized instance set (GIS). BibRef

Bagdanov, A.D.[Andrew D.], Worring, M.[Marcel],
Multiscale Document Description Using Rectangular Granulometries,
IJDAR(6), No. 3, March 2004, pp. 181-191.
WWW Version. 0406 BibRef
Earlier: DAS02(445 ff.).
HTML Version. 0303 BibRef
Earlier:
Fine-grained document genre classification using first order random graphs,
ICDAR01(79-83).
WWW Version. 0109 BibRef

Chang, F.[Fu], Chu, S.Y.[Shih-Yu], Chen, C.Y.[Chi-Yen],
Chinese document layout analysis using an adaptive regrouping strategy,
PR(38), No. 2, February 2005, pp. 261-271.
WWW Version. 0411 BibRef

Wu, C.C.[Chung-Chih], Chou, C.H.[Chien-Hsing], Chang, F.[Fu],
A machine-learning approach for analyzing document layout structures with two reading orders,
PR(41), No. 10, October 2008, pp. 3200-3213.
WWW Version. 0808 Binary decision; Document layout analysis; Reading order; Support vector machine; Taboo box; Textline; Text region BibRef

Ramel, J.Y., Leriche, S., Demonet, M.L., Busson, S.,
User-driven page layout analysis of historical printed books,
IJDAR(9), No. 2-4, April 2007, pp. 243-261.
WWW Version. 0704 BibRef

Altamura, O.[Oronzo], Berardi, M.[Margherita], Ceci, M.[Michelangelo], Malerba, D.[Donato], Varlaro, A.[Antonio],
Using colour information to understand censorship cards of film archives,
IJDAR(9), No. 2-4, April 2007, pp. 281-297.
WWW Version. 0704 BibRef
Earlier: A2, A1, A3, A4, Only:
A color-based layout analysis to process censorship cards of film archives,
ICDAR05(II: 1110-1114).
WWW Version. 0508 BibRef

Natarajan, P.[Prem], Prasad, R.[Rohit], Subramanian, K.[Krishna], Saleem, S.[Shirin], Choi, F.[Fred], Schwartz, R.[Rich],
Finding structure in noisy text: topic classification and unsupervised clustering,
IJDAR(10), No. 3-4, December 2007, pp. 187-198.
WWW Version. 0712 BibRef


Moringen, J.[Jan], Wachsmuth, S.[Sven], Dickinson, S.[Sven], Stevenson, S.[Suzanne],
Learning Visual Compound Models from Parallel Image-Text Datasets,
DAGM08(xx-yy).
WWW Version. 0806 BibRef

Jamieson, M.[Michael], Fazly, A.[Afsaneh], Dickinson, S.[Sven], Stevenson, S.[Suzanne], Wachsmuth, S.[Sven],
Learning Structured Appearance Models from Captioned Images of Cluttered Scenes,
ICCV07(1-8).
WWW Version. 0710 BibRef

Ceci, M.[Michelangelo], Berardi, M.[Margherita], Porcelli, G., Malerba, D.[Donato],
A Data Mining Approach to Reading Order Detection,
ICDAR07(924-928).
WWW Version. 0709 BibRef

Gupta, M.D.[M. Das], Sarkar, P.,
A Shared Parts Model for Document Image Recognition,
ICDAR07(1163-1172).
WWW Version. 0709 BibRef

Kumar, K.S., Kumar, S., Jawahar, C.,
On Segmentation of Documents in Complex Scripts,
ICDAR07(1243-1247).
WWW Version. 0709 BibRef

Drira, F.[Fadoua], le Bourgeois, F.[Frank], Emptoz, H.[Hubert],
A Coupled Mean Shift-Anisotropic Diffusion Approach for Document Image Segmentation and Restoration,
ICDAR07(814-818).
WWW Version. 0709 BibRef

Xia, Y., Xiao, B.H., Wang, C.H., Dai, R.W.,
Integrated Segmentation and Recognition of Mixed Chinese/English Document,
ICDAR07(704-708).
WWW Version. 0709 BibRef

Baird, H.S., Moll, M.A.,
Document Content Inventory and Retrieval,
ICDAR07(93-97).
WWW Version. 0709 BibRef

Gu, G., Han, W.,
Adaptive Window Based Uneven Lighting Document Segmentation,
ICDAR07(223-226).
WWW Version. 0709 BibRef

Cao, H.[Huaigu], Prasad, R.[Rohit], Natarajan, P.[Prem], MacRostie, E.[Ehry],
Robust Page Segmentation Based on Smearing and Error Correction Unifying Top-down and Bottom-up Approaches,
ICDAR07(392-396).
WWW Version. 0709 BibRef

Gao, D., Wang, Y., Hindi, H., Do, M.,
Decompose Document Image Using Integer Linear Programming,
ICDAR07(397-401).
WWW Version. 0709 BibRef

Nicolas, S., Dardenne, J., Paquet, T., Heutte, L.,
Document Image Segmentation Using a 2D Conditional Random Field Model,
ICDAR07(407-411).
WWW Version. 0709 BibRef

Gao, D.S.[Da-Shan], Wang, Y.Z.[Yi-Zhou],
Decomposing Document Images by Heuristic Search,
EMMCVPR07(97-111).
WWW Version. 0708 BibRef

Kumar, K.S.S.[K.S. Sesh], Namboodiri, A.M.[Anoop M.], Jawahar, C.V.,
Learning Segmentation of Documents with Complex Scripts,
ICCVGIP06(749-760).
WWW Version. 0612 BibRef

Hernández-Reyes, E.[Edith], Martínez-Trinidad, J.F., Carrasco-Ochoa, J.A., García-Hernández, R.A.[René A.],
Document Representation Based on Maximal Frequent Sequence Sets,
CIARP06(854-863).
WWW Version. 0611 BibRef

Lakhani, G., Subedi, R.,
Optimal Filling of FG/BG Layers of Compound Document Images,
ICIP06(2273-2276). 0610
WWW Version. BibRef

Mao, S.[Song], Mansukhani, P.[Praveer], Thoma, G.R.[George R.],
Combining Static Classifiers and Class Syntax Models for Logical Entity Recognition in Scanned Historical Documents,
CVPR07(1-8).
WWW Version. 0706 BibRef

Mao, S.[Song], Xu, Z.[Zheng], Tjahjadi, T.[Tardi], Thoma, G.R.[George R.],
Logical Entity Recognition in Multi-Style Document Page Images,
ICPR06(I: 876-879).
WWW Version. 0609 BibRef

Baird, H.S.[Henry S.], Casey, M.R.[Matthew R.],
Towards Versatile Document Analysis Systems,
DAS06(280-290).
WWW Version. 0602 BibRef

Rangoni, Y.[Yves], Belaid, A.[Abdel],
Document Logical Structure Analysis Based on Perceptive Cycles,
DAS06(117-128).
WWW Version. 0602 BibRef
Earlier:
Data categorization for a context return applied to logical document structure recognition,
ICDAR05(I: 297-301).
WWW Version. 0508 BibRef

Bloechle, J.L.[Jean-Luc], Rigamonti, M.[Maurizio], Hadjar, K.[Karim], Lalanne, D.[Denis], Ingold, R.[Rolf],
XCDF: A Canonical and Structured Document Format,
DAS06(141-152).
WWW Version. 0602 BibRef

Sternby, J., Ericsson, A.,
Core points: A framework for structural parameterization,
ICDAR05(I: 217-221).
WWW Version. 0508 BibRef

Lin, X.,
Active document layout synthesis,
ICDAR05(I: 86-90).
WWW Version. 0508 BibRef

Sun, H.M.[Hung-Ming],
Page segmentation for Manhattan and non-Manhattan layout documents via selective CRLA,
ICDAR05(I: 116-120).
WWW Version. 0508 BibRef

Shi, Z.[Zhixin], Govindaraju, V.,
Multi-scale techniques for document page segmentation,
ICDAR05(II: 1020-1024).
WWW Version. 0508 BibRef

Berardi, M.[Margherita], Lapi, M.[Michele], Malerba, D.[Donato],
An Integrated Approach for Automatic Semantic Structure Extraction in Document Images,
DAS04(179-190).
WWW Version. 0505 BibRef

Ceci, M., Berardi, M.[Margherita], Malerba, D.[Donato],
Relational learning techniques for document image understanding: comparing statistical and logical approaches,
ICDAR05(I: 473-482).
WWW Version. 0508 BibRef

Esposito, F., Malerba, D., Semeraro, G., Ferilli, S., Altamura, O., Basile, T.M.A., Berardi, M., Ceci, M., Di Mauro, N.,
Machine learning methods for automatically processing historical documents: from paper acquisition to XML transformation,
DIAL04(328-335).
WWW Version. 0404 BibRef

Malerba, D., Esposito, F., Altamura, O., Ceci, M., Berardi, M.,
Correcting the document layout: a machine learning approach,
ICDAR03(97-102).
IEEE Abstract. IEEE Top Reference. 0311 BibRef

Malerba, D.[Donato], Esposito, F.[Floriana], Lisi, F.A., Altamura, O.[Oronzo],
Automated discovery of dependencies between logical components in document image understanding,
ICDAR01(174-178).
WWW Version. 0109 BibRef

Huang, M., DeMenthon, D.F., Doermann, D., Golebiowski, L., Hamilton, B.A.,
Document ranking by layout relevance,
ICDAR05(I: 362-366).
WWW Version. 0508 BibRef

Waked, B., Suen, C.Y.[Ching Y.], Bergler, S.,
Segmenting document images using diagonal white runs and vertical edges,
ICDAR01(194-199).
WWW Version. 0109 BibRef

Yingsaeree, C., Kawtrakul, A.,
Rule-based middle-level character detection for simplifying Thai document layout analysis,
ICDAR05(II: 888-892).
WWW Version. 0508 BibRef

Nakano, Y.[Yasuaki], Hananoi, T.[Toshihiro], Miyao, H.[Hidetoshi], Maruyama, M.[Minoru], Maruyama, K.I.[Ken-Ichi],
A Document Analysis System Based on Text Line Matching of Multiple OCR Outputs,
DAS04(463-471).
WWW Version. 0505 BibRef

Adam, S.[Sébastien], Rigamonti, M.[Maurizio], Clavier, E.[Eric], Trupin, E.[Eric], Ogier, J.M.[Jean-Marc], Tombre, K.[Karl], Gardes, J.[Joël],
DocMining: A Document Analysis System Builder,
DAS04(472-483).
WWW Version. 0505 BibRef

Carmagnac, F.[Fabien], Héroux, P.[Pierre], Trupin, É.[Éric],
Multi-view HAC for Semi-supervised Document Image Classification,
DAS04(191-200).
WWW Version. 0505 BibRef

Antonacopoulos, A.[Apostolos], Karatzas, D.[Dimosthenis],
Semantics-based content extraction in typewritten historical documents,
ICDAR05(I: 48-53).
WWW Version. 0508 BibRef
Earlier:
A Complete Approach to the Conversion of Typewritten Historical Documents for Digital Archives,
DAS04(90).
WWW Version. 0505 BibRef
And:
Document image analysis for World War II personal records,
DIAL04(336-341).
WWW Version. 0404 See also Colour text segmentation in web images based on human perception. BibRef

Mao, S., Kim, J.W., Thoma, G.R.,
A dynamic feature generation system for automated metadata extraction in preservation of digital materials,
DIAL04(225-232).
WWW Version. 0404 BibRef

Gattani, A., Mukerji, M., Gur, H.,
A fast multifunctional approach for document image analysis,
ICDAR03(1178-1182).
IEEE Abstract. IEEE Top Reference. 0311 BibRef

Hoque, S., Selim, H., Howells, W.G.J., Fairhurst, M.C., Deravi, F.,
SAGENT: a novel technique for document modeling for secure access and distribution,
ICDAR03(1257-1261).
IEEE Abstract. IEEE Top Reference. 0311 BibRef

Howells, W.G.J., Selim, H., Hoque, S., Fairhurst, M.C., Deravi, F.,
The autonomous document object (ADO) model,
ICDAR01(977-981).
WWW Version. 0109 BibRef

Klein, B., Agne, S., Bagdanov, A.D.,
Understanding document analysis and understanding (through modeling),
ICDAR03(1218-1222).
IEEE Abstract. IEEE Top Reference. 0311 BibRef

Breuel, T.M.[Thomas M.],
An algorithm for finding maximal whitespace rectangles at arbitrary orientations for document layout analysis,
ICDAR03(66-70).
IEEE Abstract. IEEE Top Reference. 0311 BibRef
Earlier:
Two Geometric Algorithms for Layout Analysis,
DAS02(188 ff.).
HTML Version. 0303 BibRef

Lee, K.H.[Kyong-Ho], Choy, Y.C.[Yoon-Chul], Cho, S.B.[Sung-Bae], Tang, X.[Xiao], McCrary, V.[Victor],
Document Reverse Engineering: From Paper to XML,
DAS02(503 ff.).
HTML Version. 0303 BibRef

Liang, J.[Jian], Doermann, D.[David],
Logical Labeling of Document Images Using Layout Graph Matching with Adaptive Learning,
DAS02(224 ff.).
HTML Version. 0303 BibRef

Bagdanov, A.D., Worring, M.,
Granulometric analysis of document images,
ICPR02(I: 468-471).
WWW Version. 0211 BibRef

Tam, V., Santoso, A., Setiono, R.,
A comparative study of centroid-based, neighborhood-based and statistical approaches for effective document categorization,
ICPR02(IV: 235-238).
WWW Version. 0211 BibRef

Popat, K., Greene, D.H., Poo, T.L.[Tze-Lei],
Adaptive stack algorithm in document image decoding,
ICPR02(IV: 231-234).
WWW Version. 0211 BibRef

Liang, J.[Jian], Doermann, D., Ma, M.[Matthew], Guo, J.K.,
Page classification through logical labelling,
ICPR02(III: 477-480).
WWW Version. 0211 BibRef

Valveny, E., Marti, E.,
Learning of structural descriptions of graphic symbols using deformable template matching,
ICDAR01(455-459).
WWW Version. 0109 BibRef

Valveny, E., Lamiroy, B.,
Scan-to-XML: automatic generation of browsable technical documents,
ICPR02(III: 188-191).
WWW Version. 0211 BibRef

Duong, J., Emptoz, H., Cote, M.,
Features for printed document image analysis,
ICPR02(III: 245-248).
WWW Version. 0211 BibRef

da Silva, J.M.M.[João Marcelo Monte], Lins, R.D.[Rafael Dueire],
Color Document Synthesis as a Compression Strategy,
ICDAR07(466-470).
WWW Version. 0709 BibRef

Lins, R.D.[Rafael Dueire], da Silva, J.M.M.[João Marcelo Monte],
Generating Color Documents from Segmented and Synthetic Elements,
ICIAR07(1217-1228).
WWW Version. 0708 BibRef

de Mello, C.A.B.[Carlos A. B.],
An Algorithm for Foreground-Background Separation in Low Quality Patrimonial Document Images,
CIARP07(911-920).
WWW Version. 0711 BibRef
Earlier:
Image Segmentation of Historical Documents: Using a Quality Index,
ICIAR04(II: 209-216).
WWW Version. 0409 BibRef

de Mello, C.A.B.[Carlos A.B.], Lins, R.D.[Rafael D.],
Generation of Images of Historical Documents by Composition of their Components,
VI02(45).
PDF Version. 0208 BibRef

Pappas, T., Tseng, S., Kosiba, D.,
A Robust and Efficient Algorithm for Bilevel Document Block Classification,
ICIP01(I: 1122-1125).
IEEE Abstract. IEEE Top Reference. 0108 BibRef

Sylwester, D., Seth, S.,
Adaptive segmentation of document images,
ICDAR01(827-831).
WWW Version. 0109 BibRef

Nagy, G., Kanai, J., Krishnamoorthy, M., Thomas, M., Viswanathan, M.,
Two Complementary Techniques for Digitized Document Analysis,
ACM DPS88(169-176), December 1988. 0101 Top-down/bottom-up. Publication-specific pages. BibRef

Gatos, B., Papamarkos, N.,
Applying fast segmentation techniques at a binary image represented by a set of non-overlapping blocks,
ICDAR01(1147-1151).
WWW Version. 0109 BibRef

Nattee, C., Numao, M.,
Geometric method for document understanding and classification using online machine learning,
ICDAR01(602-606).
WWW Version. 0109 BibRef

Eglin, W., Gagneux, A.,
Visual exploration and functional document labeling,
ICDAR01(816-820).
WWW Version. 0109 BibRef

Kise, K., Miki, Y., Matsumoto, K.,
Backgrounds as Information Carriers for Printed Documents,
ICPR00(Vol IV: 380-384).
WWW Version.
HTML Version. 0009 BibRef

Okun, O., Pietikäinen, M.,
Automatic Ground-truth Generation for Skew-tolerance Evaluation of Document Layout Analysis Methods,
ICPR00(Vol IV: 376-379).
WWW Version.
HTML Version. 0009 BibRef

Maderlechner, G.[Gerd], Panyr, J.[Jiri], Suda, P.[Peter],
Finding Captions in PDF-Documents for Semantic Annotations of Images,
SSPR06(422-430).
WWW Version. 0608 BibRef

Maderlechner, G., Schreyer, A., Suda, P.,
Extraction of Relevant Information from Document Images Using Measures of Visual Attention,
ICPR00(Vol IV: 385-388).
WWW Version.
HTML Version. 0009 BibRef

Watanabe, T., Sobue, T.,
Layout Analysis of Complex Documents,
ICPR00(Vol IV: 447-450).
WWW Version.
HTML Version. 0009 BibRef

Aiyer, A.[Anuradha], Gray, R.M.[Robert M.],
A Fast, Table-Lookup Algorithm for Classifying Document Images,
ICIP99(I:590-594).
IEEE Abstract. IEEE Top Reference. BibRef 9900

Stevens, J., Gee, A., Dance, C.,
Automatic Processing of Document Annotations,
BMVC98(xx-yy). BibRef 9800

Takasu, A.[Atsuhiro],
Document filtering for fast approximate string matching of erroneous text,
ICDAR01(916-920).
WWW Version. 0109 BibRef

Takasu, A.[Atsuhiro],
Probabilistic Interpage Analysis for Article Extraction from Document Images,
ICPR98(Vol I: 932-935).
WWW Version. 9808 BibRef

Leung, M.[Maylor], Twan, T.[Ting],
Linear Layout Processing,
ICPR98(Vol I: 403-405).
WWW Version. 9808 BibRef

Robert, L., Likforman-Sulem, L., Lecolinet, E.,
Image and Text Coupling for Creating Electronic Books from Manuscripts,
ICDAR97(Poste) 9708 BibRef

Hong, T., Srihari, S.N.,
Representing OCRed Documents in HTML,
ICDAR97(Poste) 9708 BibRef

Rus, D.[Daniela], de Santis, P.[Peter],
The Self-Organizing Desk,
IJCAI97(758-763). Extracting and organizing document information given a camera viewing a physical desktop. BibRef 9700

Menier, G., Lorette, G.,
Lexical Analyzer Based on a Self-Organizing Feature Map,
ICDAR97(We-3B) 9708 BibRef

Brugger, R., Zramdini, A.[Abdelwahab], Ingold, R.[Rolf],
Modeling Documents for Structure Recognition Using Generalized N-Grams,
ICDAR97(Mo-2B) 9708 BibRef

Baird, H.S., Gilbert, D., Ittner, D.J.,
A family of European page readers,
ICPR94(B:540-543).
WWW Version. 9410 BibRef

Baird, H.S., Ittner, D.,
Language-Free Layout Analysis,
ICDAR93(336-340). BibRef 9300

Kornai, A., Connell, S.D.,
Statistical Zone Finding,
ICPR96(III: 818-822).
WWW Version. 9608 (IBM Almaden Res. Center, USA) BibRef

Liu, J.M.[Ji-Ming], Tang, Y.Y.[Yuan Y.], He, Q.[Qichao], Suen, C.Y.[Ching Y.],
Adaptive document segmentation and geometric relation labeling: algorithms and experimental results,
ICPR96(III: 763-767).
WWW Version. 9608 (Hong Kong Baptist Univ., HK) BibRef

Ramel, J.Y., Vincent, N., Emptoz, H.,
Combining global and local vision for technical document understanding,
ICPR96(III: 773-777).
WWW Version. 9608 (Laboratoire de Reconnaissance, F) BibRef

Sainz, G., Izquierdo, J., Dimitriadis, Y., Lopez Coronado, J.,
A New Neuro-Fuzzy System for Logical Labeling of Documents,
ICPR96(IV: 431-435).
WWW Version. 9608 (Univ. of Valladolid, E) BibRef

Esposito, F., Malerba, D., Semeraro, G.,
A Knowledge-Based Approach to the Layout Analysis,
ICDAR95(466-471). BibRef 9500
Earlier:
Automated Acquisition of Rules for Document Understanding,
ICDAR93(650-654). Hybrid approach. Independent of document type. For simple layout such as letters. BibRef

Esposito, F., Malerba, D., Semeraro, G., Annese, E., and Scafuro, G.,
An Experimental Page Layout Recognition System for Office Document Automatic Classification: An Integrated Approach for Inductive Generalization,
ICPR90(I: 557-562).
WWW Version. BibRef 9000

Antonacopoulos, A., Ritchings, R.T.,
Flexible page segmentation using the background,
ICPR94(B:339-344).
WWW Version. 9410 BibRef

Bussi, S.[Silvia], Mangili, F.[Fulvia],
A semi-automatic method for form layout description,
CIAP95(539-544).
WWW Version. 9509 BibRef

Tateisi, Y., Itoh, N.,
Using stochastic syntactic analysis for extracting a logical structure from a document image,
ICPR94(B:391-394).
WWW Version. 9410 BibRef

Ciardiello, G., Scafuro, G., de Grandi, M.T., Spada, M.R., Roccotelli, M.P.,
An Experimental System for Office Document Handling and Text Recognition,
ICPR88(739-743).
IEEE Top Reference. BibRef 8800

Meynieux, E., Seisen, S., Tombre, K.,
Bilevel Information Recognition and Coding in Office Paper Documents,
ICPR86(442-445). BibRef 8600

Kida, H., Iwaki, O., Kawada, K.,
Document Recognition System for Office Automation,
ICPR86(446-448). BibRef 8600

Hase, M., Suzuki, G., Itoh, H.,
A Method for Extracting Marked Regions from Document Images,
ICPR86(780-782). BibRef 8600

Derrien-Peden, D.,
Frame-Based System for Macro-Typographical Structure Analysis in Scientific Papers,
ICDAR91(311-319). Gets text in reading order. BibRef 9100

Ingold, R., Armangil, D.,
A Top-down Document Analysis Method for Logical Structure Recognition,
ICDAR91(41-49). BibRef 9100

Zen, H., Ozawa, S.,
Extraction of the Fair Document from Mixed Mode Manuscript,
CVPR85(544-549). BibRef 8500

Chapter on OCR, Document Analysis and Character Recognition Systems continues in
Page Segmentation, General, Evaluations.


Last update: Sep 2, 2008 at 17:29:35