See also Block Segmentation and Text Extraction in Mixed Text/Image Documents.
Goldwasser, S.M.,
Troxel, D.E.,
Page Composition of Continuous Tone Imagery,
CVGIP(26), No. 1, April 1984, pp. 30-44.
WWW Version.
BibRef
8404
Okawa, Y.[Yoshikuni],
A Structural Analysis of Visual Form on Packaging Graphics and Its
Use in an Automated Design System,
CVGIP(43), No. 2, August 1988, pp. 265-278.
WWW Version.
BibRef
8808
Earlier:
Identification of Packaged-in-a-box Goods for Designing a Part of an
Intelligent Cash Register,
ICPR80(150-152).
Where to place the graphics so that they do not hide the picture.
BibRef
Srihari, S.N., and
Govindaraju, V.,
Analysis of Textual Images Using the Hough Transform,
MVA(2), 1989, pp. 141-153.
BibRef
8900
Peppers, N.A.[Norman A.],
Young, J.R.[James R.],
Nishi, H.[Hisami],
Ueno, H.[Hiroshi],
Page segmentor,
US_Patent4,817,169, March 28, 1989.
WWW Version.
BibRef
8903
O'Gorman, L.,
The Document Spectrum for Page Layout Analysis,
PAMI(15), No. 11, November 1993, pp. 1162-1173.
IEEE Abstract. IEEE Top Reference.
WWW Version. Determine the structure of the document for storage and recognition.
(For evaluation, see also Empirical Performance Evaluation Methodology and Its Application to Page Segmentation Algorithms.)
BibRef
9311
Krishnamoorthy, M.,
Nagy, G.,
Seth, S., and
Viswanathan, M.,
Syntactic Segmentation and Labeling of Digitized
Pages from Technical Journals,
PAMI(15), No. 7, July 1993, pp. 737-747.
IEEE Abstract. IEEE Top Reference.
WWW Version. A more complete version of the following paper and system.
Error correction with backtracking. Computationally complex. Understanding
of how documents are put together.
BibRef
9307
Viswanathan, M.,
Analysis of Scanned documents: A Syntactic Approach,
SDIA92(xx-yy).
BibRef
9200
Hones, F.,
Lichter, J.,
Layout Extraction of Mixed-Mode Documents,
MVA(7), No. 4, 1994, pp. 237-246.
BibRef
9400
Saitoh, T.,
Yamaai, T.,
Tachikawa, M.,
Document Image Segmentation and Layout Analysis,
IEICE(Info Sys 77), No. 7, 1994, pp. 778-784.
BibRef
9400
Saitoh, T.,
Pavlidis, T.,
Page segmentation without rectangle assumption,
ICPR92(II:277-280).
IEEE DOI Link
9208
BibRef
Cullen, J.F.[John F.],
Ejiri, K.[Koichi],
Segmentation of text, picture and lines of a document image,
US_Patent5,335,290, August 2, 1994.
WWW Version.
BibRef
9408
And:
US_Patent5,465,304, Nov 7, 1995
WWW Version.
BibRef
Peairs, M.[Mark],
Method of selecting a target document using features of an example page,
US_Patent5,717,940, Feb 10, 1998
WWW Version.
BibRef
9802
Peairs, M.[Mark],
Hull, J.J.[Jonathan J.],
Cullen, J.F.[John F.],
Automatic document classification using text and images,
US_Patent7,039,856, May 2, 2006
WWW Version.
BibRef
0605
Kopec, G.E.,
Chou, P.A.,
Document Image Decoding Using Markov Source Models,
PAMI(16), No. 6, June 1994, pp. 602-617.
IEEE Abstract. IEEE Top Reference.
WWW Version.
BibRef
9406
Earlier:
Document image decoding,
ICIP94(II: 36-40).
IEEE DOI Link
9411
BibRef
Earlier:
Automatic Generation of Custom Document Image Decoders,
ICDAR93(xx-yy).
BibRef
Kam, A.C.,
Heuristic Document Image Decoding Using Markov Source Models,
MIT Masters Thesis, June 1993.
BibRef
9306
Kam, A.C.,
Kopec, G.E.,
Document Image Decoding by Heuristic Search,
PAMI(18), No. 9, September 1996, pp. 945-950.
IEEE Abstract. IEEE Top Reference.
WWW Version.
Heuristic Search.
BibRef
9609
Earlier:
Separable Source Models for Document Image Decoding,
SPIE(2422), February 1995, pp. 84-97.
BibRef
Shiau, J.N.[Jeng-Nan],
Automatic image segmentation for color documents,
US_Patent5,341,226, August 23, 1994.
WWW Version.
BibRef
9408
Hayashi, N.[Naoki],
Saito, K.[Kazuo],
Document layout processing method and device for carrying out the same,
US_Patent5,379,373, January 3, 1995.
WWW Version.
BibRef
9501
Ozaki, M.[Masaharu],
Method and apparatus for document segmentation by background analysis,
US_Patent5,555,556, September 10, 1996
WWW Version.
BibRef
9609
Kopec, G.E.[Gary E.],
Lomelin, M.[Mauricio],
Supervised Template Estimation for Document Image Decoding,
PAMI(19), No. 12, December 1997, pp. 1313-1324.
IEEE Abstract. IEEE Top Reference.
WWW Version.
9712
BibRef
Earlier:
Document Image Decoding Approach to Character Template Estimation,
ICIP96(II: 213-216).
IEEE DOI Link
BibRef
And:
Document Specific Character Template Estimation,
SPIE(2660), 1996, pp. 14-26.
Templates for recognizing characters.
BibRef
Kopec, G.E.[Gary E.],
Multilevel Character Templates for Document Image Decoding,
SPIE(3027), 1997, pp. xx-yy.
BibRef
9700
Earlier:
Document Image Decoding in the Berkeley Digital Library Project,
SPIE(2660), 1996, pp. 2-13.
BibRef
And:
Document Image Decoding in the Berkeley Digital Library,
ICIP96(II: 769-772).
IEEE DOI Link
BibRef
Dengel, A.R.,
Dubiel, F.,
Computer Understanding of Document Structure,
IJIST(7), No. 4, Winter 1996, pp. 271-278.
9612
BibRef
Niyogi, D.,
Srihari, S.N.,
Integrated Approach to Document Decomposition and Structural-Analysis,
IJIST(7), No. 4, Winter 1996, pp. 330-342.
9612
BibRef
Earlier:
Knowledge-Based Derivation of Document Logical Structure,
ICDAR95(472-475).
Bottom-up approach. Three levels of rules: knowledge, control, and
strategy.
Accuracy varies (48-100%). Uses 160 rules.
BibRef
Niyogi, D.,
Srihari, S.N.,
A Rule-Based System for Document Understanding,
AAAI-86(789-793).
BibRef
8600
Simon, A.,
Pret, J.C.,
Johnson, A.P.,
A Fast Algorithm for Bottom-Up Document Layout Analysis,
PAMI(19), No. 3, March 1997, pp. 273-277.
IEEE Abstract. IEEE Top Reference.
WWW Version.
9704
BibRef
Liu, J.M.,
Tang, Y.Y.,
Suen, C.Y.,
Chinese Document Layout Analysis Based on Adaptive Split-and-Merge
and Qualitative Spatial Reasoning,
PR(30), No. 8, August 1997, pp. 1265-1278.
WWW Version.
9708
BibRef
Tang, Y.Y.,
Ma, H.,
Liu, J.M.,
Li, B.F.,
Xi, D.H.,
Multiresolution Analysis in Extraction of Reference Lines from
Documents with Gray-Level Background,
PAMI(19), No. 8, August 1997, pp. 921-926.
IEEE Abstract. IEEE Top Reference.
WWW Version.
9709
Find reference lines to determine the structure of the document.
BibRef
Bayer, T.[Thomas],
Kressel, U.[Ulrich],
Mogg-Schneider, H.[Heike],
Renz, I.[Ingrid],
Categorizing Paper Documents,
CVIU(70), No. 3, June 1998, pp. 299-306.
WWW Version.
BibRef
9806
Caelli, T.M.[Terry M.], and
Dillon, C.[Craig],
CITE: A Trainable Image Annotation System,
PRL(18), No. 11-13, November 1997, pp. 1247-1252.
9806
BibRef
Dillon, C.[Craig], and
Caelli, T.M.[Terry M.],
Learning Image Annotation: The Cite System,
Videre(1), No. 2, Winter 1998, pp. 90-121.
Generates automatic annotations. Applied to airport and office scenes.
Region- and color-based analysis.
HTML Version.
PDF Version.
BibRef
9800
Cooperman, R.S.[Robert S.],
System for document layout analysis,
US_Patent5,784,487, Jul 21, 1998
WWW Version.
BibRef
9807
Ancin, H.[Hakan],
Document segmentation system,
US_Patent5,956,468, September 21, 1999.
WWW Version. Text and graphics.
BibRef
9909
Chao, H.[Hui],
Bloomberg, D.S.[Dan S.],
Method and system for document segmentation,
US_Patent6,904,170, June 7, 2005.
WWW Version. Projection profiles.
BibRef
0506
Bloomberg, D.S.[Dan S.],
Method and article of manufacture for determining whether a
scanned image is an original image or fax image,
US_Patent5,828,771, Oct 27, 1998
WWW Version.
BibRef
9810
Nakayama, T.[Takehiro],
Method and apparatus for document classification from degraded images,
US_Patent5,909,510, Jun 1, 1999
WWW Version.
BibRef
9906
Crabtree, R.N.[Ralph N.],
Peng, A.[Antai],
Knowledge-based document analysis system,
US_Patent5,937,084, Aug 10, 1999
WWW Version.
BibRef
9908
Li, J.,
Gray, R.M.,
Context-Based Multiscale Classification of Document Images Using
Wavelet Coefficient Distributions,
IP(9), No. 9, September 2000, pp. 1604-1616.
IEEE DOI Link
0008
BibRef
Ageenko, E.,
Fränti, P.,
Context-based filtering of document images,
PRL(21), No. 6-7, June 2000, pp. 483-491.
0006
BibRef
Lee, K.H.[Kyong-Ho],
Choy, Y.C.[Yoon-Chul],
Cho, S.B.[Sung-Bae],
Geometric Structure Analysis of Document Images:
A Knowledge-Based Approach,
PAMI(22), No. 11, November 2000, pp. 1224-1240.
IEEE Abstract. IEEE Top Reference.
WWW Version.
0012
Journal pages. Combines bottom-up and top-down approaches. Segmentation, then
identification.
BibRef
Shin, C.[Christian],
Doermann, D.[David],
Rosenfeld, A.[Azriel],
Classification of document pages using structure-based features,
IJDAR(3), No. 4, 2001, pp. 232-247.
HTML Version.
0106
BibRef
Liang, J.S.[Ji-Sheng],
Phillips, I.T.[Ihsin T.],
Haralick, R.M.[Robert M.],
An Optimization Methodology for Document Structure Extraction on Latin
Character Documents,
PAMI(23), No. 7, July 2001, pp. 719-734.
IEEE Abstract. IEEE Top Reference.
WWW Version.
0108
BibRef
Chetverikov, D.,
Liang, J.,
Komuves, J.,
Haralick, R.M.,
Zone Classification Using Texture Features,
ICPR96(III: 676-680).
IEEE DOI Link
9608
(Hungarian Academy of Sciences, H)
BibRef
Liang, J.,
Ha, J.,
Haralick, R.M., and
Phillips, I.T.,
Document Layout Structure Extraction Using Bounding Boxes of
Different Entities,
WACV96(278-283).
IEEE Abstract. IEEE Top Reference.
9609
BibRef
Haralick, R.M.,
Document Image Analysis: Geometric and Logical Layout,
CVPR94(385-390).
IEEE Abstract. IEEE Top Reference.
BibRef
9400
Klink, S.[Stefan],
Kieninger, T.[Thomas],
Rule-based document structure understanding with a fuzzy combination of
layout and textual features,
IJDAR(4), No. 1, 2001, pp. 18-26.
HTML Version.
0111
BibRef
Altamura, O.[Oronzo],
Esposito, F.[Floriana],
Malerba, D.[Donato],
Transforming paper documents into XML format with WISDOM++,
IJDAR(4), No. 1, 2001, pp. 2-17.
HTML Version.
0111
BibRef
Lee, S.W.[Seong-Whan],
Ryu, D.S.[Dae-Seok],
Parameter-Free Geometric Document Layout Analysis,
PAMI(23), No. 11, November 2001, pp. 1240-1256.
IEEE Abstract. IEEE Top Reference.
WWW Version.
0112
Segment into maximal homogeneous regions, identify as text, graphics,
etc. Periodicity measure for text.
BibRef
Ryu, D.S.,
Kang, S.M.,
Lee, S.W.,
Parameter-independent Geometric Document Layout Analysis,
ICPR00(Vol IV: 397-400).
IEEE DOI Link
HTML Version.
0009
BibRef
Hull, J.J.,
Lee, D.S.,
Simultaneous Highlighting of Paper and Electronic Documents,
ICPR00(Vol IV: 401-404).
IEEE DOI Link
HTML Version.
0009
BibRef
Acharyya, M.[Mausumi],
Kundu, M.K.[Malay K.],
Document image segmentation using wavelet scale-space features,
CirSysVideo(12), No. 12, December 2002, pp. 1117-1127.
IEEE Top Reference.
0301
BibRef
Earlier:
Multiscale Segmentation of Document Images Using M-Band Wavelets,
CAIP01(510 ff.).
HTML Version.
0210
See also An adaptive approach to unsupervised texture segmentation using M-Band wavelet transform.
BibRef
Lee, J.Y.[Ji-Yeon],
Park, J.S.[Jeong-Seon],
Byun, H.R.[Hye-Ran],
Moon, J.[Jongsub],
Lee, S.W.[Seong-Whan],
Automatic generation of structured hyperdocuments from document images,
PR(35), No. 2, February 2002, pp. 485-503.
WWW Version.
0201
BibRef
Lee, J.Y.,
Choi, S.H.,
Lee, S.W.,
Automatic Generation of Structured Hyperdocuments from Multi-column
Document Images,
ICPR00(Vol IV: 422-425).
IEEE DOI Link
HTML Version.
0009
BibRef
Lam, W.[Wai],
Han, Y.[Yiqiu],
Automatic textual document categorization based on generalized instance
sets and a metamodel,
PAMI(25), No. 5, May 2003, pp. 628-633.
IEEE Abstract. IEEE Top Reference.
0304
Generalized instance set (GIS).
BibRef
Bagdanov, A.D.[Andrew D.],
Worring, M.[Marcel],
Multiscale Document Description Using Rectangular Granulometries,
IJDAR(6), No. 3, March 2004, pp. 181-191.
Springer DOI Link
0406
BibRef
Earlier:
DAS02(445 ff.).
HTML Version.
0303
BibRef
Earlier:
Fine-grained document genre classification using first order random
graphs,
ICDAR01(79-83).
IEEE DOI Link
0109
BibRef
Chang, F.[Fu],
Chu, S.Y.[Shih-Yu],
Chen, C.Y.[Chi-Yen],
Chinese document layout analysis using an adaptive regrouping strategy,
PR(38), No. 2, February 2005, pp. 261-271.
WWW Version.
0411
BibRef
Wu, C.C.[Chung-Chih],
Chou, C.H.[Chien-Hsing],
Chang, F.[Fu],
A machine-learning approach for analyzing document layout structures
with two reading orders,
PR(41), No. 10, October 2008, pp. 3200-3213.
WWW Version.
0808
Binary decision; Document layout analysis; Reading order;
Support vector machine; Taboo box; Textline; Text region
BibRef
Ramel, J.Y.,
Leriche, S.,
Demonet, M.L.,
Busson, S.,
User-driven page layout analysis of historical printed books,
IJDAR(9), No. 2-4, April 2007, pp. 243-261.
Springer DOI Link
0704
BibRef
Altamura, O.[Oronzo],
Berardi, M.[Margherita],
Ceci, M.[Michelangelo],
Malerba, D.[Donato],
Varlaro, A.[Antonio],
Using colour information to understand censorship cards of film
archives,
IJDAR(9), No. 2-4, April 2007, pp. 281-297.
Springer DOI Link
0704
BibRef
Earlier: A2, A1, A3, A4, Only:
A color-based layout analysis to process censorship cards of film
archives,
ICDAR05(II: 1110-1114).
IEEE DOI Link
0508
BibRef
Natarajan, P.[Prem],
Prasad, R.[Rohit],
Subramanian, K.[Krishna],
Saleem, S.[Shirin],
Choi, F.[Fred],
Schwartz, R.[Rich],
Finding structure in noisy text: topic classification and unsupervised
clustering,
IJDAR(10), No. 3-4, December 2007, pp. 187-198.
Springer DOI Link
0712
BibRef
Cao, H.[Huaigu],
Prasad, R.[Rohit],
Saleem, S.[Shirin],
Natarajan, P.[Premkumar],
Unsupervised HMM Adaptation Using Page Style Clustering,
ICDAR09(1091-1095).
IEEE DOI Link
0907
BibRef
Lemaitre, A.[Aurélie],
Camillerapp, J.[Jean],
Coüasnon, B.[Bertrand],
Multiresolution cooperation makes easier document structure recognition,
IJDAR(11), No. 2, November 2008, pp. xx-yy.
Springer DOI Link
0810
BibRef
Earlier:
Contribution of Multiresolution Description for Archive Document
Structure Recognition,
ICDAR07(247-251).
IEEE DOI Link
0709
BibRef
Lecerf, L.[Loic],
Chidlovskii, B.[Boris],
Scalable Feature Extraction from Noisy Documents,
ICDAR09(361-365).
IEEE DOI Link
0907
Determine frequent patterns for use in layout recognition.
BibRef
Ferilli, S.[Stefano],
Biba, M.[Marenglen],
Esposito, F.[Floriana],
Basile, T.M.A.[Teresa M.A.],
A Distance-Based Technique for Non-Manhattan Layout Analysis,
ICDAR09(231-235).
IEEE DOI Link
0907
BibRef
Smith, R.W.[Raymond W.],
Hybrid Page Layout Analysis via Tab-Stop Detection,
ICDAR09(241-245).
IEEE DOI Link
0907
BibRef
Antonacopoulos, A.[Apostolos],
Bridson, D.[David],
Papadopoulos, C.[Christos],
Pletschacher, S.[Stefan],
A Realistic Dataset for Performance Evaluation of Document Layout
Analysis,
ICDAR09(296-300).
IEEE DOI Link
0907
BibRef
Tatsumi, I.[Itaru],
Habe, H.[Hitoshi],
Kidode, M.[Masatsugu],
Context-oriented Layout Optimization of Large-Print Textbooks,
ICDAR09(1016-1020).
IEEE DOI Link
0907
BibRef
Malleron, V.[Vincent],
Eglin, V.[Véronique],
Emptoz, H.[Hubert],
Dord-Crouslé, S.[Stéphanie],
Régnier, P.[Philippe],
Hierarchical Decomposition of Handwritten Manuscripts Layouts,
CAIP09(221-228).
Springer DOI Link
0909
BibRef
Gordo, A.[Albert],
Valveny, E.[Ernest],
A Rotation Invariant Page Layout Descriptor for Document Classification
and Retrieval,
ICDAR09(481-485).
IEEE DOI Link
0907
BibRef
And:
The Diagonal Split: A Pre-segmentation Step for Page Layout Analysis
and Classification,
IbPRIA09(290-297).
Springer DOI Link
0906
BibRef
Grim, J.[Jiri],
Novovicova, J.[Jana],
Somol, P.[Petr],
Structural poisson mixtures for classification of documents,
ICPR08(1-4).
IEEE DOI Link
0812
BibRef
Agrawal, M.[Mudit],
Doermann, D.[David],
Voronoi++: A Dynamic Page Segmentation Approach Based on Voronoi and
Docstrum Features,
ICDAR09(1011-1015).
IEEE DOI Link
0907
BibRef
Abd-Almageed, W.[Wael],
Agrawal, M.[Mudit],
Seo, W.[Wontaek],
Doermann, D.[David],
Document-zone classification using partial least squares and hybrid
classifiers,
ICPR08(1-4).
IEEE DOI Link
0812
BibRef
Gaceb, D.[Djamel],
Eglin, V.[Véronique],
Le Bourgeois, F.[Frank],
Emptoz, H.[Hubert],
Graph b-Coloring for Automatic Recognition of Documents,
ICDAR09(261-265).
IEEE DOI Link
0907
BibRef
Earlier:
Application of graph coloring in physical layout segmentation,
ICPR08(1-4).
IEEE DOI Link
0812
See also Improvement of postal mail sorting system.
BibRef
Fonseca-Bruzón, A.[Adrian],
Gil-García, R.[Reynaldo],
Pons-Porrata, A.[Aurora],
Using the alpha-beta-Neighborhood for Adaptive Document Filtering,
CIARP08(783-790).
Springer DOI Link
0809
BibRef
Anaya-Sánchez, H.[Henry],
Pons-Porrata, A.[Aurora],
Berlanga-Llavori, R.[Rafael],
A New Document Clustering Algorithm for Topic Discovering and Labeling,
CIARP08(161-168).
Springer DOI Link
0809
BibRef
Pons-Porrata, A.[Aurora],
Gil-García, R.[Reynaldo],
Berlanga-Llavori, R.[Rafael],
Using Typical Testors for Feature Selection in Text Categorization,
CIARP07(643-652).
Springer DOI Link
0711
BibRef
Gil-García, R.[Reynaldo],
Pons-Porrata, A.[Aurora],
Hierarchical Star Clustering Algorithm for Dynamic Document Collections,
CIARP08(187-194).
Springer DOI Link
0809
BibRef
Earlier:
A New Nearest Neighbor Rule for Text Categorization,
CIARP06(814-823).
Springer DOI Link
0611
BibRef
Gil-García, R.J.[Reynaldo J.],
Badía-Contelles, J.M.[Jose M.],
Pons-Porrata, A.[Aurora],
A General Framework for Agglomerative Hierarchical Clustering
Algorithms,
ICPR06(II: 569-572).
WWW Version.
0609
BibRef
Moringen, J.[Jan],
Wachsmuth, S.[Sven],
Dickinson, S.[Sven],
Stevenson, S.[Suzanne],
Learning Visual Compound Models from Parallel Image-Text Datasets,
DAGM08(xx-yy).
Springer DOI Link
0806
BibRef
Jamieson, M.[Michael],
Fazly, A.[Afsaneh],
Dickinson, S.[Sven],
Stevenson, S.[Suzanne],
Wachsmuth, S.[Sven],
Learning Structured Appearance Models from Captioned Images of
Cluttered Scenes,
ICCV07(1-8).
IEEE DOI Link
0710
BibRef
Ceci, M.[Michelangelo],
Berardi, M.[Margherita],
Porcelli, G.,
Malerba, D.[Donato],
A Data Mining Approach to Reading Order Detection,
ICDAR07(924-928).
IEEE DOI Link
0709
BibRef
Gupta, M.D.[M. Das],
Sarkar, P.,
A Shared Parts Model for Document Image Recognition,
ICDAR07(1163-1172).
IEEE DOI Link
0709
BibRef
Dasigi, P.[Praveen],
Jain, R.[Raman],
Jawahar, C.V.,
Document Image Segmentation as a Spectral Partitioning Problem,
ICCVGIP08(305-312).
IEEE DOI Link
0812
BibRef
Kumar, K.S.S.,
Kumar, S.,
Jawahar, C.V.,
On Segmentation of Documents in Complex Scripts,
ICDAR07(1243-1247).
IEEE DOI Link
0709
BibRef
Drira, F.[Fadoua],
Le Bourgeois, F.[Frank],
Emptoz, H.[Hubert],
A Coupled Mean Shift-Anisotropic Diffusion Approach for Document Image
Segmentation and Restoration,
ICDAR07(814-818).
IEEE DOI Link
0709
BibRef
Xia, Y.,
Xiao, B.H.,
Wang, C.H.,
Dai, R.W.,
Integrated Segmentation and Recognition of Mixed Chinese/English
Document,
ICDAR07(704-708).
IEEE DOI Link
0709
BibRef
Baird, H.S.,
Moll, M.A.,
Document Content Inventory and Retrieval,
ICDAR07(93-97).
IEEE DOI Link
0709
BibRef
Gu, G.,
Han, W.,
Adaptive Window Based Uneven Lighting Document Segmentation,
ICDAR07(223-226).
IEEE DOI Link
0709
BibRef
Cao, H.[Huaigu],
Prasad, R.[Rohit],
Natarajan, P.[Prem],
MacRostie, E.[Ehry],
Robust Page Segmentation Based on Smearing and Error Correction
Unifying Top-down and Bottom-up Approaches,
ICDAR07(392-396).
IEEE DOI Link
0709
BibRef
Gao, D.,
Wang, Y.,
Hindi, H.,
Do, M.,
Decompose Document Image Using Integer Linear Programming,
ICDAR07(397-401).
IEEE DOI Link
0709
BibRef
Nicolas, S.,
Dardenne, J.,
Paquet, T.,
Heutte, L.,
Document Image Segmentation Using a 2D Conditional Random Field Model,
ICDAR07(407-411).
IEEE DOI Link
0709
BibRef
Gao, D.S.[Da-Shan],
Wang, Y.Z.[Yi-Zhou],
Decomposing Document Images by Heuristic Search,
EMMCVPR07(97-111).
Springer DOI Link
0708
BibRef
Kumar, K.S.S.[K.S. Sesh],
Namboodiri, A.M.[Anoop M.],
Jawahar, C.V.,
Learning Segmentation of Documents with Complex Scripts,
ICCVGIP06(749-760).
Springer DOI Link
0612
BibRef
Hernández-Reyes, E.[Edith],
Martínez-Trinidad, J.F.,
Carrasco-Ochoa, J.A.,
García-Hernández, R.A.[René A.],
Document Representation Based on Maximal Frequent Sequence Sets,
CIARP06(854-863).
Springer DOI Link
0611
BibRef
Lakhani, G.[Gopal],
Improving Image Decomposition Method of the 3-MRC Coding of Scanned
Compound Document Images,
ICCVGIP08(289-296).
IEEE DOI Link
0812
BibRef
Lakhani, G.,
Subedi, R.,
Optimal Filling of FG/BG Layers of Compound Document Images,
ICIP06(2273-2276).
0610
IEEE DOI Link
BibRef
Mao, S.[Song],
Mansukhani, P.[Praveer],
Thoma, G.R.[George R.],
Combining Static Classifiers and Class Syntax Models for Logical Entity
Recognition in Scanned Historical Documents,
CVPR07(1-8).
IEEE DOI Link
0706
BibRef
Mao, S.[Song],
Xu, Z.[Zheng],
Tjahjadi, T.[Tardi],
Thoma, G.R.[George R.],
Logical Entity Recognition in Multi-Style Document Page Images,
ICPR06(I: 876-879).
WWW Version.
0609
BibRef
Baird, H.S.[Henry S.],
Casey, M.R.[Matthew R.],
Towards Versatile Document Analysis Systems,
DAS06(280-290).
Springer DOI Link
0602
BibRef
Rangoni, Y.[Yves],
Belaid, A.[Abdel],
Document Logical Structure Analysis Based on Perceptive Cycles,
DAS06(117-128).
Springer DOI Link
0602
BibRef
Earlier:
Data categorization for a context return applied to logical document
structure recognition,
ICDAR05(I: 297-301).
IEEE DOI Link
0508
BibRef
Bloechle, J.L.[Jean-Luc],
Lalanne, D.[Denis],
Ingold, R.[Rolf],
OCD: An Optimized and Canonical Document Format,
ICDAR09(236-240).
IEEE DOI Link
0907
BibRef
Bloechle, J.L.[Jean-Luc],
Rigamonti, M.[Maurizio],
Hadjar, K.[Karim],
Lalanne, D.[Denis],
Ingold, R.[Rolf],
XCDF: A Canonical and Structured Document Format,
DAS06(141-152).
Springer DOI Link
0602
BibRef
Sternby, J.,
Ericsson, A.,
Core points: A framework for structural parameterization,
ICDAR05(I: 217-221).
IEEE DOI Link
0508
BibRef
Lin, X.,
Active document layout synthesis,
ICDAR05(I: 86-90).
IEEE DOI Link
0508
BibRef
Sun, H.M.[Hung-Ming],
Page segmentation for Manhattan and non-Manhattan layout documents via
selective CRLA,
ICDAR05(I: 116-120).
IEEE DOI Link
0508
BibRef
Shi, Z.[Zhixin],
Govindaraju, V.,
Multi-scale techniques for document page segmentation,
ICDAR05(II: 1020-1024).
IEEE DOI Link
0508
BibRef
Berardi, M.[Margherita],
Lapi, M.[Michele],
Malerba, D.[Donato],
An Integrated Approach for Automatic Semantic Structure Extraction in
Document Images,
DAS04(179-190).
WWW Version.
0505
BibRef
Ceci, M.,
Berardi, M.[Margherita],
Malerba, D.[Donato],
Relational learning techniques for document image understanding:
comparing statistical and logical approaches,
ICDAR05(I: 473-482).
IEEE DOI Link
0508
BibRef
Esposito, F.,
Malerba, D.,
Semeraro, G.,
Ferilli, S.,
Altamura, O.,
Basile, T.M.A.,
Berardi, M.,
Ceci, M.,
di Mauro, N.,
Machine learning methods for automatically processing historical
documents: from paper acquisition to XML transformation,
DIAL04(328-335).
IEEE DOI Link
0404
BibRef
Malerba, D.,
Esposito, F.,
Altamura, O.,
Ceci, M.,
Berardi, M.,
Correcting the document layout: a machine learning approach,
ICDAR03(97-102).
IEEE Abstract. IEEE Top Reference.
0311
BibRef
Malerba, D.[Donato],
Esposito, F.[Floriana],
Lisi, F.A.,
Altamura, O.[Oronzo],
Automated discovery of dependencies between logical components in
document image understanding,
ICDAR01(174-178).
IEEE DOI Link
0109
BibRef
Huang, M.,
DeMenthon, D.F.,
Doermann, D.,
Golebiowski, L.,
Hamilton, B.A.,
Document ranking by layout relevance,
ICDAR05(I: 362-366).
IEEE DOI Link
0508
BibRef
Waked, B.,
Suen, C.Y.[Ching Y.],
Bergler, S.,
Segmenting document images using diagonal white runs and vertical edges,
ICDAR01(194-199).
IEEE DOI Link
0109
BibRef
Yingsaeree, C.,
Kawtrakul, A.,
Rule-based middle-level character detection for simplifying Thai
document layout analysis,
ICDAR05(II: 888-892).
IEEE DOI Link
0508
BibRef
Nakano, Y.[Yasuaki],
Hananoi, T.[Toshihiro],
Miyao, H.[Hidetoshi],
Maruyama, M.[Minoru],
Maruyama, K.I.[Ken-Ichi],
A Document Analysis System Based on Text Line Matching of Multiple OCR
Outputs,
DAS04(463-471).
WWW Version.
0505
BibRef
Adam, S.[Sébastien],
Rigamonti, M.[Maurizio],
Clavier, E.[Eric],
Trupin, E.[Eric],
Ogier, J.M.[Jean-Marc],
Tombre, K.[Karl],
Gardes, J.[Joël],
DocMining: A Document Analysis System Builder,
DAS04(472-483).
WWW Version.
0505
BibRef
Carmagnac, F.[Fabien],
Héroux, P.[Pierre],
Trupin, É.[Éric],
Multi-view HAC for Semi-supervised Document Image Classification,
DAS04(191-200).
WWW Version.
0505
BibRef
Antonacopoulos, A.[Apostolos],
Karatzas, D.[Dimosthenis],
Semantics-based content extraction in typewritten historical documents,
ICDAR05(I: 48-53).
IEEE DOI Link
0508
BibRef
Earlier:
A Complete Approach to the Conversion of Typewritten Historical
Documents for Digital Archives,
DAS04(90).
WWW Version.
0505
BibRef
And:
Document image analysis for World War II personal records,
DIAL04(336-341).
IEEE DOI Link
0404
See also Colour text segmentation in web images based on human perception.
BibRef
Mao, S.,
Kim, J.W.,
Thoma, G.R.,
A dynamic feature generation system for automated metadata extraction
in preservation of digital materials,
DIAL04(225-232).
IEEE DOI Link
0404
BibRef
Gattani, A.,
Mukerji, M.,
Gur, H.,
A fast multifunctional approach for document image analysis,
ICDAR03(1178-1182).
IEEE Abstract. IEEE Top Reference.
0311
BibRef
Hoque, S.,
Selim, H.,
Howells, W.G.J.,
Fairhurst, M.C.,
Deravi, F.,
SAGENT: a novel technique for document modeling for secure access and
distribution,
ICDAR03(1257-1261).
IEEE Abstract. IEEE Top Reference.
0311
BibRef
Howells, W.G.J.,
Selim, H.,
Hoque, S.,
Fairhurst, M.C.,
Deravi, F.,
The autonomous document object (ADO) model,
ICDAR01(977-981).
IEEE DOI Link
0109
BibRef
Klein, B.,
Agne, S.,
Bagdanov, A.D.,
Understanding document analysis and understanding (through modeling),
ICDAR03(1218-1222).
IEEE Abstract. IEEE Top Reference.
0311
BibRef
Breuel, T.M.[Thomas M.],
An algorithm for finding maximal whitespace rectangles at arbitrary
orientations for document layout analysis,
ICDAR03(66-70).
IEEE Abstract. IEEE Top Reference.
0311
BibRef
Earlier:
Two Geometric Algorithms for Layout Analysis,
DAS02(188 ff.).
HTML Version.
0303
BibRef
Lee, K.H.[Kyong-Ho],
Choy, Y.C.[Yoon-Chul],
Cho, S.B.[Sung-Bae],
Tang, X.[Xiao],
McCrary, V.[Victor],
Document Reverse Engineering: From Paper to XML,
DAS02(503 ff.).
HTML Version.
0303
BibRef
Liang, J.[Jian],
Doermann, D.[David],
Logical Labeling of Document Images Using Layout Graph Matching with
Adaptive Learning,
DAS02(224 ff.).
HTML Version.
0303
BibRef
Bagdanov, A.D.,
Worring, M.,
Granulometric analysis of document images,
ICPR02(I: 468-471).
IEEE DOI Link
0211
BibRef
Tam, V.,
Santoso, A.,
Setiono, R.,
A comparative study of centroid-based, neighborhood-based and
statistical approaches for effective document categorization,
ICPR02(IV: 235-238).
IEEE DOI Link
0211
BibRef
Popat, K.,
Greene, D.H.,
Poo, T.L.[Tze-Lei],
Adaptive stack algorithm in document image decoding,
ICPR02(IV: 231-234).
IEEE DOI Link
0211
BibRef
Liang, J.[Jian],
Doermann, D.,
Ma, M.[Matthew],
Guo, J.K.,
Page classification through logical labelling,
ICPR02(III: 477-480).
IEEE DOI Link
0211
BibRef
Valveny, E.,
Marti, E.,
Learning of structural descriptions of graphic symbols using deformable
template matching,
ICDAR01(455-459).
IEEE DOI Link
0109
BibRef
Valveny, E.,
Lamiroy, B.,
Scan-to-XML: automatic generation of browsable technical documents,
ICPR02(III: 188-191).
IEEE DOI Link
0211
BibRef
Duong, J.,
Emptoz, H.,
Cote, M.,
Features for printed document image analysis,
ICPR02(III: 245-248).
IEEE DOI Link
0211
BibRef
da Silva, J.M.M.[João Marcelo Monte],
Lins, R.D.[Rafael Dueire],
Color Document Synthesis as a Compression Strategy,
ICDAR07(466-470).
IEEE DOI Link
0709
BibRef
Lins, R.D.[Rafael Dueire],
da Silva, J.M.M.[João Marcelo Monte],
Generating Color Documents from Segmented and Synthetic Elements,
ICIAR07(1217-1228).
Springer DOI Link
0708
BibRef
de Mello, C.A.B.[Carlos A. B.],
An Algorithm for Foreground-Background Separation in Low Quality
Patrimonial Document Images,
CIARP07(911-920).
Springer DOI Link
0711
BibRef
Earlier:
Image Segmentation of Historical Documents: Using a Quality Index,
ICIAR04(II: 209-216).
WWW Version.
0409
BibRef
de Mello, C.A.B.[Carlos A.B.],
Lins, R.D.[Rafael D.],
Generation of Images of Historical Documents by Composition of their
Components,
VI02(45).
PDF Version.
0208
BibRef
Pappas, T.,
Tseng, S.,
Kosiba, D.,
A Robust and Efficient Algorithm for Bilevel Document Block
Classification,
ICIP01(I: 1122-1125).
IEEE Abstract. IEEE Top Reference.
0108
BibRef
Sylwester, D.,
Seth, S.,
Adaptive segmentation of document images,
ICDAR01(827-831).
IEEE DOI Link
0109
BibRef
Nagy, G.,
Kanai, J.,
Krishnamoorthy, M.,
Thomas, M.,
Viswanathan, M.,
Two Complementary Techniques for Digitized Document Analysis,
ACM DPS88(169-176), December 1988.
0101
Top-down/bottom-up. Publication-specific pages.
BibRef
Gatos, B.,
Papamarkos, N.,
Applying fast segmentation techniques at a binary image represented by
a set of non-overlapping blocks,
ICDAR01(1147-1151).
IEEE DOI Link
0109
BibRef
Nattee, C.,
Numao, M.,
Geometric method for document understanding and classification using
online machine learning,
ICDAR01(602-606).
IEEE DOI Link
0109
BibRef
Eglin, V.,
Gagneux, A.,
Visual exploration and functional document labeling,
ICDAR01(816-820).
IEEE DOI Link
0109
BibRef
Kise, K.,
Miki, Y.,
Matsumoto, K.,
Backgrounds as Information Carriers for Printed Documents,
ICPR00(Vol IV: 380-384).
IEEE DOI Link
HTML Version.
0009
BibRef
Okun, O.,
Pietikäinen, M.,
Automatic Ground-truth Generation for Skew-tolerance Evaluation of
Document Layout Analysis Methods,
ICPR00(Vol IV: 376-379).
IEEE DOI Link
HTML Version.
0009
BibRef
Maderlechner, G.[Gerd],
Panyr, J.[Jiri],
Suda, P.[Peter],
Finding Captions in PDF-Documents for Semantic Annotations of Images,
SSPR06(422-430).
Springer DOI Link
0608
BibRef
Maderlechner, G.,
Schreyer, A.,
Suda, P.,
Extraction of Relevant Information from Document Images Using Measures
of Visual Attention,
ICPR00(Vol IV: 385-388).
IEEE DOI Link
HTML Version.
0009
BibRef
Watanabe, T.,
Sobue, T.,
Layout Analysis of Complex Documents,
ICPR00(Vol IV: 447-450).
IEEE DOI Link
HTML Version.
0009
BibRef
Aiyer, A.[Anuradha],
Gray, R.M.[Robert M.],
A Fast, Table-Lookup Algorithm for Classifying Document Images,
ICIP99(I:590-594).
IEEE Abstract. IEEE Top Reference.
BibRef
9900
Stevens, J.,
Gee, A.,
Dance, C.,
Automatic Processing of Document Annotations,
BMVC98(xx-yy).
BibRef
9800
Takasu, A.[Atsuhiro],
Document filtering for fast approximate string matching of erroneous
text,
ICDAR01(916-920).
IEEE DOI Link
0109
BibRef
Takasu, A.[Atsuhiro],
Probabilistic Interpage Analysis for Article Extraction
from Document Images,
ICPR98(Vol I: 932-935).
IEEE DOI Link
9808
BibRef
Leung, M.[Maylor],
Twan, T.[Ting],
Linear Layout Processing,
ICPR98(Vol I: 403-405).
IEEE DOI Link
9808
BibRef
Robert, L.,
Likforman-Sulem, L.,
Lecolinet, E.,
Image and Text Coupling for Creating Electronic Books from Manuscripts,
ICDAR97(Poste)
9708
BibRef
Hong, T.,
Srihari, S.N.,
Representing OCRed Documents in HTML,
ICDAR97(Poste)
9708
BibRef
Rus, D.[Daniela],
de Santis, P.[Peter],
The Self-Organizing Desk,
IJCAI97(758-763).
Extracts and organizes document information given a camera viewing a
physical desktop.
BibRef
9700
Menier, G.,
Lorette, G.,
Lexical Analyzer Based on a Self-Organizing Feature Map,
ICDAR97(We-3B)
9708
BibRef
Brugger, R.,
Zramdini, A.[Abdelwahab],
Ingold, R.[Rolf],
Modeling Documents for Structure Recognition Using Generalized N-Grams,
ICDAR97(Mo-2B)
9708
BibRef
Baird, H.S.,
Gilbert, D.,
Ittner, D.J.,
A family of European page readers,
ICPR94(B:540-543).
IEEE DOI Link
9410
BibRef
Baird, H.S.,
Ittner, D.,
Language-Free Layout Analysis,
ICDAR93(336-340).
BibRef
9300
Kornai, A.,
Connell, S.D.,
Statistical Zone Finding,
ICPR96(III: 818-822).
IEEE DOI Link
9608
(IBM Almaden Res. Center, USA)
BibRef
Liu, J.M.[Ji-Ming],
Tang, Y.Y.[Yuan Y.],
He, Q.[Qichao],
Suen, C.Y.[Ching Y.],
Adaptive document segmentation and geometric relation labeling:
algorithms and experimental results,
ICPR96(III: 763-767).
IEEE DOI Link
9608
(Hong Kong Baptist Univ., HK)
BibRef
Ramel, J.Y.,
Vincent, N.,
Emptoz, H.,
Combining global and local vision for technical document understanding,
ICPR96(III: 773-777).
IEEE DOI Link
9608
(Laboratoire de Reconnaissance, F)
BibRef
Sainz, G.,
Izquierdo, J.,
Dimitriadis, Y.,
Lopez Coronado, J.,
A New Neuro-Fuzzy System for Logical Labeling of Documents,
ICPR96(IV: 431-435).
IEEE DOI Link
9608
(Univ. of Valladolid, E)
BibRef
Esposito, F.,
Malerba, D.,
Semeraro, G.,
A Knowledge-Based Approach to the Layout Analysis,
ICDAR95(466-471).
BibRef
9500
Earlier:
Automated Acquisition of Rules for Document Understanding,
ICDAR93(650-654).
Hybrid approach. Independent of document type.
For simple layouts such as letters.
BibRef
Esposito, F.,
Malerba, D.,
Semeraro, G.,
Annese, E., and
Scafuro, G.,
An Experimental Page Layout Recognition System for Office
Document Automatic Classification: An Integrated Approach
for Inductive Generalization,
ICPR90(I: 557-562).
IEEE DOI Link
BibRef
9000
Antonacopoulos, A.,
Ritchings, R.T.,
Flexible page segmentation using the background,
ICPR94(B:339-344).
IEEE DOI Link
9410
BibRef
Bussi, S.[Silvia],
Mangili, F.[Fulvia],
A semi-automatic method for form layout description,
CIAP95(539-544).
Springer DOI Link
9509
BibRef
Tateisi, Y.,
Itoh, N.,
Using stochastic syntactic analysis for extracting a logical structure
from a document image,
ICPR94(B:391-394).
IEEE DOI Link
9410
BibRef
Ciardiello, G.,
Scafuro, G.,
de Grandi, M.T.,
Spada, M.R.,
Roccotelli, M.P.,
An Experimental System for Office Document Handling and
Text Recognition,
ICPR88(739-743).
IEEE Top Reference.
BibRef
8800
Meynieux, E.,
Seisen, S.,
Tombre, K.,
Bilevel Information Recognition and Coding in Office Paper Documents,
ICPR86(442-445).
BibRef
8600
Kida, H.,
Iwaki, O.,
Kawada, K.,
Document Recognition System for Office Automation,
ICPR86(446-448).
BibRef
8600
Hase, M.,
Suzuki, G.,
Itoh, H.,
A Method for Extracting Marked Regions from Document Images,
ICPR86(780-782).
BibRef
8600
Derrien-Peden, D.,
Frame-Based System for Macro-Typographical Structure Analysis in
Scientific Papers,
ICDAR91(311-319).
Gets text in reading order.
BibRef
9100
Ingold, R.,
Armangil, D.,
A Top-down Document Analysis Method for Logical Structure Recognition,
ICDAR91(41-49).
BibRef
9100
Zen, H.,
Ozawa, S.,
Extraction of the Fair Document from Mixed Mode Manuscript,
CVPR85(544-549).
BibRef
8500
Chapter on OCR, Document Analysis and Character Recognition Systems continues in
Page Segmentation, General, Evaluations.