23.2.2.1.1 Specific Examples: Extract Titles, Table of Contents, Citation, Information from Papers and Books

Document Analysis. Form Analysis.

Chen, W.Y.[Wei-Yuan], Chen, S.Y.[Shu-Yuan],
Adaptive Page Segmentation for Color Technical Journals' Cover Images,
IVC(16), No. 12-13, 24 August 1998, pp. 855-877.
WWW Version. BibRef 9808

Sobottka, K., Kronenberg, H., Perroud, T., Bunke, H.,
Text Extraction from Colored Book and Journal Covers,
IJDAR(2), No. 4, 1999, pp. xx-yy. 0008
BibRef

Belaïd, A.,
Recognition of table of contents for electronic library consulting,
IJDAR(4), No. 1, 2001, pp. 35-45.
HTML Version. 0111
BibRef

Belaïd, A., Pierron, L., Valverde, N.,
Part-of-speech Tagging for Table of Contents Recognition,
ICPR00(Vol IV: 451-454).
IEEE DOI Link 0009
BibRef

Belaïd, A., Chenevoy, Y.,
Document Analysis for Retrospective Conversion of Library Reference Catalogues,
ICDAR97(Tu-3C) 9708
In program, not in proceedings. BibRef

Besagni, D., Belaïd, A.,
Citation recognition for scientific publications in digital libraries,
DIAL04(244-252).
IEEE DOI Link 0404
BibRef

Besagni, D., Belaïd, A., Benet, N.,
A segmentation method for bibliographic references by contextual tagging of fields,
ICDAR03(384-388).
IEEE Abstract. 0311
BibRef

Parmentier, F., Belaïd, A.,
Logical Structure Recognition of Scientific Bibliographic References,
ICDAR97(1072-1076).
IEEE DOI Link 9708
BibRef

Chenevoy, Y., Belaïd, A.,
Hypothesis Management for Structured Document Recognition,
ICDAR91(121-129). Top-down, written in Lisp. BibRef 9100

Belaïd, A.,
Retrospective Document Conversion Application to the Library Domain,
IJDAR(1), No. 3, 1998, pp. 319-330. BibRef 9800

Wang, S.H.[Shu-Hua], Cao, Y.[Yang], Cai, S.J.[Shi-Jie],
Using citing information to understand the logical structure of document images,
IJDAR(4), No. 1, 2001, pp. 27-34.
HTML Version. 0111
BibRef

Naoi, S.[Satoshi], Katsuyama, Y.[Yutaka], Takebe, H.[Hiroaki],
Apparatus and method for extracting management information from image,
US_Patent6,721,463, Apr 13, 2004
WWW Version. BibRef 0404
And: US_Patent6,704,450, Mar 9, 2004
WWW Version. BibRef
And: US_Patent6,327,387, Dec 4, 2001
WWW Version. E.g. title. BibRef

Lin, X.F.[Xiao-Fan], Xiong, Y.[Yan],
Detection and analysis of table of contents based on content association,
IJDAR(8), No. 2-3, June 2006, pp. 132-143.
Springer DOI Link 0606
BibRef

Mandal, S., Chowdhury, S.P., Das, A.K., Chanda, B.[Bhabatosh],
A simple and effective table detection system from document images,
IJDAR(8), No. 2-3, June 2006, pp. 172-182.
Springer DOI Link 0606
BibRef
Earlier:
Detection and Segmentation of Table of Contents and Index Pages from Document Images,
DIAL06(70-81).
IEEE DOI Link 0604
BibRef
Earlier:
A Complete System for Detection and Identification of Tabular Structures from Document Images,
ICIAR04(II: 217-225).
WWW Version. 0409
BibRef
Earlier:
Automated detection and segmentation of table of contents page and index pages from document images,
CIAP03(213-218).
IEEE Abstract. 0310
BibRef
And:
Automated detection and segmentation of table of contents page from document images,
ICDAR03(398-402).
IEEE Abstract. 0311
See also Segmentation of Text and Graphics from Document Images. BibRef

Mandal, S., Chowdhury, S.P., Das, A.K., Chanda, B.,
A hierarchical method for automated identification and segmentation of forms,
ICDAR05(II: 705-709).
IEEE DOI Link 0508
BibRef
Earlier:
Automated segmentation of math-zones from document images,
ICDAR03(755-759).
IEEE Abstract. 0311
BibRef

Xiao, Y.[Yi], Yan, H.[Hong],
Location of title and author regions in document images based on the Delaunay triangulation,
IVC(22), No. 4, 1 April 2004, pp. 319-329.
WWW Version. 0402
See also Text region extraction in a document image based on the Delaunay tessellation. BibRef

Staelin, C.[Carl], Elad, M.[Michael], Greig, D.[Darryl], Shmueli, O.[Oded], Vans, M.[Marie],
Biblio: Automatic Meta-Data Extraction,
IJDAR(10), No. 2, November 2007, pp. 113-126.
Springer DOI Link 0711
BibRef

Déjean, H.[Hervé], Meunier, J.L.[Jean-Luc],
On tables of contents and how to recognize them,
IJDAR(12), No. 1, May 2009, pp. xx-yy.
Springer DOI Link 0905
BibRef

Déjean, H.[Hervé], Meunier, J.L.[Jean-Luc],
A System for Converting PDF Documents into Structured XML Format,
DAS06(129-140).
Springer DOI Link 0602
BibRef

Meunier, J.L.,
Optimized XY-cut for determining a page reading order,
ICDAR05(I: 347-351).
IEEE DOI Link 0508
BibRef

Pruteanu-Malinici, I.[Iulian], Ren, L.[Lu], Paisley, J.[John], Wang, E.[Eric], Carin, L.[Lawrence],
Hierarchical Bayesian Modeling of Topics in Time-Stamped Documents,
PAMI(32), No. 6, June 2010, pp. 996-1011.
IEEE DOI Link 1004
Model topics in a document sequence with known dates. Infer changes in topic weights over time. BibRef

Zou, J.[Jie], Le, D.[Daniel], Thoma, G.R.[George R.],
Locating and parsing bibliographic references in HTML medical articles,
IJDAR(13), No. 2, June 2010, pp. xx-yy.
Springer DOI Link 1007
BibRef

Bratus, S.[Sergey], Rumshisky, A.[Anna], Khrabrov, A.[Alexy], Magar, R.[Rajendra], Thompson, P.[Paul],
Domain-specific entity extraction from noisy, unstructured data using ontology-guided search,
IJDAR(14), No. 2, June 2011, pp. 201-211.
WWW Version. 1106
BibRef

Vanetti, M.[Marco], Gallo, I.[Ignazio], Nodari, A.[Angelo],
Gas meter reading from real world images using a multi-net system,
PRL(34), No. 5, 1 April 2013, pp. 519-526.
Elsevier DOI Link 1303
Object detection; Object segmentation; Text localization; OCR; Neural networks; Multi-net system BibRef


Talker, L.[Lior], Moses, Y.[Yael],
Viewpoint-independent book spine segmentation,
WACV14(453-460)
IEEE DOI Link 1406
Active contours BibRef

Noguchi, S.[Shohei], Yamada, M.[Masahiro], Watanabe, Y.[Yoshihiro], Ishikawa, M.[Masatoshi],
Real-time 3D page tracking and book status recognition for high-speed book digitization based on adaptive capturing,
WACV14(137-144)
IEEE DOI Link 1406
Books BibRef

Kim, J.W.[Jong-Woo], Le, D.X., Thoma, G.R.,
Identification of Investigator Name Zones Using SVM Classifiers and Heuristic Rules,
ICDAR13(140-144)
IEEE DOI Link 1312
Automated extraction of listed names of medical papers. BibRef

Wu, Z.[Zhaohui], Mitra, P., Giles, C.L.,
Table of Contents Recognition and Extraction for Heterogeneous Book Documents,
ICDAR13(1205-1209)
IEEE DOI Link 1312
document image processing BibRef

You, D.K.[Dae-Keun], Antani, S.[Sameer], Demner-Fushman, D.[Dina], Govindaraju, V.[Venu], Thoma, G.R.[George R.],
Detecting Figure-Panel Labels in Medical Journal Articles Using MRF,
ICDAR11(967-971).
IEEE DOI Link 1111
BibRef

Gao, L.C.[Liang-Cai], Zhong, Y.[Yuan], Tang, Y.M.[Ying-Min], Tang, Z.[Zhi], Lin, X.F.[Xiao-Fan], Hu, X.[Xuan],
Metadata Extraction System for Chinese Books,
ICDAR11(749-753).
IEEE DOI Link 1111
BibRef

Yalniz, I.Z.[Ismet Zeki], Manmatha, R.,
A Fast Alignment Scheme for Automatic OCR Evaluation of Books,
ICDAR11(754-758).
IEEE DOI Link 1111
BibRef

Fowers, S.G., Lee, D.J.[Dah-Jye], Xiong, G.M.[Guang-Ming],
Improved library shelf reading using color feature matching of book-spine images,
ICARCV10(2160-2165).
IEEE DOI Link 1109
BibRef

Zhang, Z.Y.[Zhi-Yuan], Qi, K.Y.[Kai-Yue], Chen, K.[Kai], Li, C.X.[Chen-Xuan], Chen, J.B.[Jian-Bo], Guan, H.B.[Hai-Bing],
A Novel System for Robust Text Location and Recognition of Book Covers,
ACCV09(II: 608-617).
Springer DOI Link 0909
BibRef

Doucet, A.[Antoine], Kazai, G.[Gabriella], Colutto, S., Muhlberger, G.,
ICDAR 2013 Competition on Book Structure Extraction,
ICDAR13(1438-1443)
IEEE DOI Link 1312
electronic publishing BibRef

Doucet, A.[Antoine], Kazai, G.[Gabriella], Meunier, J.L.[Jean-Luc],
ICDAR 2011 Book Structure Extraction Competition,
ICDAR11(1501-1505).
IEEE DOI Link 1111
BibRef

Doucet, A.[Antoine], Kazai, G.[Gabriella], Dresevic, B.[Bodin], Uzelac, A.[Aleksandar], Radakovic, B.[Bogdan], Todic, N.[Nikola],
ICDAR 2009 Book Structure Extraction Competition,
ICDAR09(1408-1412).
IEEE DOI Link 0907
Goal: construct hyperlinked tables of contents for a collection of 1,000 digitized books. BibRef

Gao, L.C.[Liang-Cai], Tang, Z.[Zhi], Lin, X.F.[Xiao-Fan], Tao, X.[Xin], Chu, Y.M.[Yi-Min],
Analysis of Book Documents' Table of Content Based on Clustering,
ICDAR09(911-915).
IEEE DOI Link 0907
BibRef

Marinai, S.[Simone],
Metadata Extraction from PDF Papers for Digital Library Ingest,
ICDAR09(251-255).
IEEE DOI Link 0907
BibRef

Wei, W., King, I., Lee, J.H-M.,
Bibliographic Attributes Extraction with Layer-upon-Layer Tagging,
ICDAR07(804-808).
IEEE DOI Link 0709
BibRef

van Beusekom, J., Keysers, D., Shafait, F., Breuel, T.M.,
Example-Based Logical Labeling of Document Title Page Images,
ICDAR07(919-923).
IEEE DOI Link 0709
BibRef

Kwon, Y.B., Park, J.,
Implementation of Content Analysis System for Recognition of Journals' Table of Contents,
ICDAR07(1018-1022).
IEEE DOI Link 0709
BibRef

Yacoub, S., Peiro, J.A.,
Identification of document structure and table of content in magazine archives,
ICDAR05(II: 1253-1257).
IEEE DOI Link 0508
BibRef

Mao, S.[Song], Rosenfeld, A., Kanungo, T.,
Stochastic attributed K-D tree modeling of technical paper title pages,
ICIP03(I: 533-536).
IEEE Abstract. 0312
BibRef

Le Bourgeois, F.[Frank], Emptoz, H.[Hubert], Bensafi, S.S.,
Document understanding using probabilistic relaxation: Application on tables of contents of periodicals,
ICDAR01(508-512).
IEEE DOI Link 0109
BibRef

Iwata, K., Yamamoto, K., Yasuda, M., Kato, K., Ishida, M., Murata, K.,
Book cover identification by using four directional features field for a small-scale library system,
ICDAR01(582-586).
IEEE DOI Link 0109
BibRef

Yang, H.[Hua], Onda, N., Kashimura, M., Ozawa, S.,
Extraction of bibliography information based on image of book cover,
CIAP99(921-926).
IEEE DOI Link 9909
BibRef

Akiyama, Y., Ito, M.,
Book Recognition from Color Images of Book Shelves,
MVA98(xx-yy). BibRef 9800

Hsu, W.H.[Wen-Hsing], Hwang, H.Y.[Hui-Yu], Chen, Y.S.[Yung-Sheng],
Locating Book Backs in a Bookrack Image,
ICPR98(Vol II: 1822-1824).
IEEE DOI Link 9808
BibRef

Lin, C.C.[Chun Chen], Niwa, Y., Narita, S.,
Logical structure analysis of book document images using contents information,
ICDAR97(1048-1054).
IEEE DOI Link 9708
BibRef

Luo, Q.[Qin], Watanabe, T., Nakayama, T.,
Identifying contents page of documents,
ICPR96(III: 696-700).
IEEE DOI Link 0509
BibRef

Antoine, D., Collin, S., Tombre, K.,
Analysis of Technical Documents: The REDRAW System,
SDIA92(xx-yy). 0905
BibRef

Masini, G., Tombre, K.,
Discrete relaxation applied to interpretation of technical documents,
ICPR90(I: 706-708).
IEEE DOI Link 9006
BibRef

Chapter on OCR, Document Analysis and Character Recognition Systems continues in
Document Layout, Document Segmentation, Page Layout, Structure Analysis.


Last update: Oct 15, 2014 at 21:10:33