23.2.1 Document Analysis Systems, General, Survey, Evaluation

Chapter Contents (Back)
Survey, Document Analysis. Document Analysis.

Warped Documents, IUPR,
WWW Version. Dataset, Documents. See also University of Kaiserslautern, IUPR.

Doermann, D.[David],
Document Understanding Bibliography,
UMD--TR3775, March 1997.
WWW Version. BibRef 9703

Doermann, D.[David],
Document Image Understanding - 1997,
UMD--TR3936, October 1998.
WWW Version. The bibliography is available online:
WWW Version. Bibliography. BibRef 9810

Hou, H.S.,
Digital Document Processing,
John Wileyand Sons, New York, 1983. BibRef 8300

O'Gorman, L., Kasturi, R.,
Document Image Analysis,
Los Alamitos, CA: Computer Society Press1995. Collection of papers. BibRef 9500

Kasturi, R., O'Gorman, L., (Eds.)
Special Issue: Document Image Analysis Techniques,
MVA(6), No. 2-3, Spring/Summer, 1993, pp. 67-178. BibRef 9300

Kasturi, R., O'Gorman, L.,
Document Image Analysis: A Bibliography,
MVA(5), 1992, pp. 231-243. BibRef 9200

Kasturi, R., O'Gorman, L., Govindaraju, V.,
Document image analysis: a primer,
Sadhana(27), No. 1, 2002, pp. 3-22.
Springer DOI Link 1108

Dengel, A.R., and Spitz, A.L., (Eds.),
Document Analysis Systems,
World ScientificSingapore, 1995. BibRef 9500

Bokser, M.,
Omnidocument Technologies,
PIEEE(80), No. 7, July 1992, pp. 1066-1078.
IEEE Top Reference. BibRef 9207

Loce, R.P., Haralick, R.M., Vincent, L.M.,
Digital Document Imaging,
JEI(5), No. 2, April 1996, pp. 117-118. 9609

Srihari, S.N., Niyogi, D.,
Special Issue: Document Analysis And Recognition - Guests Editors Introduction,
IJIST(7), No. 4, Winter 1996, pp. 269-270. 9612

Farrow, G.S.D., Xydeas, C.S., Oakley, J.P., Khorabi, A., Prelcic, N.G.,
A comparison of system architectures for intelligent document understanding,
SP:IC(9), No. 1, November 1996, pp. 1-19.
WWW Version. BibRef 9611

Doermann, D.[David], Rivlin, E.[Ehud], Rosenfeld, A.[Azriel],
The Function of Documents,
IVC(16), No. 11, August 1 1998, pp. 799-814.
WWW Version. 9808
Earlier: A1, A3, A2: ICDAR97(1077-1081).
IEEE DOI Link 9708
And: A1, A3, A2: UMDTR3697, October 1996.
WWW Version. Given that documents have purpose, the style is important. BibRef

Sauvola, J., Haapakoski, S., Kauniskangas, H., Seppänen, T.[Tapio], Pietikäinen, M., Doermann, D.,
A Distributed Management System for Testing Document Image Analysis Algorithms,
IEEE DOI Link 9708

Doermann, D.[David],
Document Understanding Research at Maryland,
ARPA94(II:817-826). BibRef 9400

Kanai, J.[Junichi], Baird, H.S.[Henry S.],
Special Issue on Document Image Understanding and Retrieval,
CVIU(70), No. 3, June 1998, pp. 285-286.
DOI Link BibRef 9806

Sauvola, J., Kauniskangas, H.,
Media Team Document Database II,
WWW Version. Dataset, Document Analysis. BibRef 9900

Nagy, G.[George],
Twenty Years of Document Image Analysis in PAMI,
PAMI(22), No. 1, January 2000, pp. 38-62.
IEEE DOI Link 0003
Survey, Document Analysis. Part of the 20 year issue. Good survey and discussion of the different aspects of analyzing documents. BibRef

Chhabra, A.K.[Atul K.], Dori, D.[Dov], Tombre, K.[Karl],
Special Issue Preface,
IJDAR(3), No. 2, 2000, pp. 57-57. 0101
Understanding Graphics. BibRef

Liang, J.S.[Ji-Sheng], Phillips, I.T.[Ihsin T.], Haralick, R.M.[Robert M.],
Performance Evaluation of Document Structure Extraction Algorithms,
CVIU(84), No. 1, October 2001, pp. 144-159.
DOI Link 0203
Evaluation, Page Segmentation. Comparison of a number of methods. BibRef

Liang, J., Rogers, R., Haralick, R.M., Phillips, I.T.,
UW-ISL Document Image Analysis Toolbox: An Experimental Environment,
IEEE DOI Link 9708
1800 images to use for comparison of algorithms. BibRef

Das, A.K.[Amit Kumar], Saha, S.K.[Sanjoy Kumar], Chanda, B.[Bhabatosh],
An empirical measure of the performance of a document image segmentation algorithm,
IJDAR(4), No. 3, 2002, pp. 183-190.
HTML Version. 0205

Marinai, S.[Simone], Gori, M.[Marco], Soda, G.[Giovanni],
Artificial Neural Networks for Document Analysis and Recognition,
PAMI(27), No. 1, January 2005, pp. 23-35.
IEEE Abstract. 0412
Survey, Document Segmentation. Survey of document segmentation tasks using connectionist approaches. Anaylsis of potential. BibRef

Todoran, L.[Leon], Worring, M.[Marcel], Smeulders, A.W.M.[Arnold W. M.],
The UvA color document dataset,
IJDAR(7), No. 4, September 2005, pp. 228-240.
Springer DOI Link 0512
Dataset, Documents. BibRef
Data GroundTruth, Complexity, and Evaluation Measures for Color Document Analysis,
DAS02(519 ff.).
HTML Version. 0303

Rahman, A.F.R.[A. Fuad R.], Klein, B.[Bertin],
Special issue on detection and understanding of tables and forms for document processing applications,
IJDAR(8), No. 2-3, June 2006, pp. 65-65.
Springer DOI Link 0606

Embley, D.W.[David W.], Hurst, M.[Matthew], Lopresti, D.P.[Daniel P.], Nagy, G.[George],
Table-processing paradigms: a research survey,
IJDAR(8), No. 2-3, June 2006, pp. 66-86.
Springer DOI Link 0606
Survey, Document Segmentation. BibRef

Antonacopoulos, A.[Apostolos], Downton, A.C.[Andy C.],
Special issue on the analysis of historical documents,
IJDAR(9), No. 2-4, April 2007, pp. 75-77.
Springer DOI Link 0704

Chaudhuri, B.B., (Ed.),
Digital Document Processing: Major Directions and Recent Advances,
Springer2007. ISBN 978-1-84628-501-1.
WWW Version. Includes: Document structure analysis followed by OCR of Japanese, Tibetan and Indian printed scripts. Online and offline handwritten text recognition approaches; Japanese postal and Arabic check processing; Document image quality modelling, mathematical expression recognition, graphics recognition, document information retrieval, super resolution text, metadata extraction in digital library; Biometric and forensic aspects: individuality of handwriting detection; Web document analysis, text and hypertext mining and bank check data mining. BibRef 0700

Knoblock, C.[Craig], Lopresti, D.[Daniel], Roy, S.[Shourya], Subramaniam, L.V.[L. Venkata],
Special Issue on Noisy Text Analytics,
IJDAR(10), No. 3-4, December 2007, pp. 127-128.
Springer DOI Link 0712

Lopresti, D.[Daniel], Roy, S.[Shourya], Schulz, K.[Klaus], Subramaniam, L.V.[L. Venkata],
Special Issue on Noisy Text Analytics, II,
IJDAR(12), No. 3, September 2009, pp. xx-yy.
Springer DOI Link 0911

Lopresti, D.[Daniel], Roy, S.[Shourya], Schulz, K.[Klaus], Subramaniam, L.V.[L. Venkata],
Special Issue on Noisy Text Analytics, III,
IJDAR(14), No. 2, June 2011, pp. 111-112.
WWW Version. 1106

Gamera project,
WWW Version. Code, Document Analysis. A framework for the creation of structured document analysis applications by domain experts. BibRef 0700

Doucet, A.[Antoine], Kazai, G.[Gabriella], Dresevic, B.[Bodin], Uzelac, A.[Aleksandar], Radakovic, B.[Bogdan], Todic, N.[Nikola],
Setting up a competition framework for the evaluation of structure extraction from OCR-ed books,
IJDAR(14), No. 1, March 2011, pp. 45-52.
WWW Version. 1103

Marinai, S.[Simone], Karatzas, D.[Dimosthenis],
Report from the AND 2009 working group on noisy text datasets,
IJDAR(14), No. 2, June 2011, pp. 113-116.
WWW Version. 1106

Reffle, U.[Ulrich], Ringlstetter, C.[Christoph],
Unsupervised profiling of OCRed historical documents,
PR(46), No. 5, May 2013, pp. 1346-1357.
Elsevier DOI Link 1302
Error detection and error correction; Processing of historical documents; OCR postprocessing; Statistical learning BibRef

Bukhari, S.S.[Syed Saqib], Shafait, F.[Faisal], Breuel, T.M.[Thomas M.],
The IUPR Dataset of Camera-Captured Document Images,
Springer DOI Link 1204
Dataset, Document Images. BibRef

Nagy, R.[Robert], Dicker, A.[Anders], Meyer-Wegener, K.[Klaus],
NEOCR: A Configurable Dataset for Natural Image Text Recognition,
Springer DOI Link 1204
Dataset, Natural Image Text. BibRef

Lamiroy, B.[Bart], Lopresti, D.[Daniel],
An Open Architecture for End-to-End Document Analysis Benchmarking,
IEEE DOI Link 1111

Clausner, C., Pletschacher, S., Antonacopoulos, A.,
Aletheia: An Advanced Document Layout and Text Ground-Truthing System for Production Environments,
IEEE DOI Link 1111

Lamiroy, B.[Bart], Lopresti, D.[Daniel], Sun, T.[Tao],
Document Analysis Algorithm Contributions in End-to-End Applications: Report on the ICDAR 2011 Contest,
IEEE DOI Link 1111

Dong, X., Majewicz, P., McNutt, G., Bouman, C.A., Allebach, J.P., Pollak, I.,
A Document Page Classification Algorithm in Copy Pipeline,
ICIP07(III: 237-240).
IEEE DOI Link 0709

Bridson, D., Antonacopoulos, A.,
A geometric approach for accurate and efficient performance evaluation of layout analysis methods,
IEEE DOI Link 0812
Earlier: A2, A1:
Performance Analysis Framework for Layout Analysis Methods,
IEEE DOI Link 0709

Belaïd, A.[Abdel], Rangoni, Y.[Yves], Falk, I.,
XML Data Representation in Document Image Analysis,
IEEE DOI Link 0709

Heroux, P.[Pierre], Barbu, E., Adam, S.[Sebastien], Trupin, E.[Eric],
Automatic Ground-truth Generation for Document Image Analysis and Understanding,
IEEE DOI Link 0709

Costa e Silva, A.[Ana],
New Metrics for Evaluating Performance in Document Analysis Tasks: Application to the Table Case,
IEEE DOI Link 0709

Ferrer, M., Valveny, E.,
Combination of OCR Engines for Page Segmentation Based on Performance Evaluation,
IEEE DOI Link 0709

Miyazaki, J.,
Documents and services: from the historical points of document media as extension of human body,
ICDAR05(II: 811).
IEEE DOI Link 0508

Zhang, B.,
Construction of handwriting databases using transcript-based mapping,
IEEE DOI Link 0404

Baird, H.S.,
Difficult and urgent open problems in document image analysis for libraries,
IEEE DOI Link 0404
Digital libraries and document image analysis,
IEEE Abstract. 0311

Dengel, A.R.,
Making documents work: challenges for document understanding,
IEEE Abstract. 0311

Antonacopoulos, A., Karatzas, D., Bridson, D.,
Ground Truth for Layout Analysis Performance Evaluation,
Springer DOI Link 0602

Antonacopoulos, A., Meng, H.,
A Ground-Truthing Tool for Layout Analysis Performance Evaluation,
DAS02(236 ff.).
HTML Version. 0303

Abak, A.T., Baris, U., Sankur, B.,
The performance evaluation of thresholding algorithms for optical character recognition,
IEEE DOI Link 9708

Chapter on OCR, Document Analysis and Character Recognition Systems continues in
Form and Layout Analysis .

Last update:Apr 12, 2014 at 21:44:02