메뉴 건너뛰기




Volumn 2005, Issue , 2005, Pages 162-166

A corpus for comparative evaluation of OCR software and postcorrection techniques

Author keywords

Comparative evaluation; Cyrillic documents; Ground truth data; Meta data; Mixed alphabet documents; Optical character recognition; Postcorrection of OCR results; Public corpora

Indexed keywords

COMPARATIVE EVALUATION; CYRILLIC DOCUMENTS; IMAGE FILES; MIXED ALPHABET DOCUMENTS; POSTCORRECTION TECHNIQUES; PUBLIC CORPORA;

EID: 33947382234     PISSN: 15205363     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICDAR.2005.6     Document Type: Conference Paper
Times cited : (15)

References (14)
  • 4
    • 21844453170 scopus 로고    scopus 로고
    • Data sets for OCR and document image understanding research
    • Horst Bunke and Patrick S.P. Wang, editors, World Scientific
    • Isabelle Guyon, Robert M. Haralick, Jonathan J. Hull, and Ihsin T. Phillips. Data sets for OCR and document image understanding research. In Horst Bunke and Patrick S.P. Wang, editors, Handbook of Character Recognition and Document Image Analysis, pages 779-799. World Scientific, 1997.
    • (1997) Handbook of Character Recognition and Document Image Analysis , pp. 779-799
    • Guyon, I.1    Haralick, R.M.2    Hull, J.J.3    Phillips, I.T.4
  • 5
    • 33947357854 scopus 로고    scopus 로고
    • OCR accuracy produced by the current DOE document conversion system
    • ISRI, Technical Report 2002-06, Information Science Research Institute University of Nevada Las Vegas
    • ISRI. OCR accuracy produced by the current DOE document conversion system. Technical Report 2002-06, Information Science Research Institute University of Nevada Las Vegas, 2002.
    • (2002)
  • 6
    • 0003559057 scopus 로고
    • Henry Kucera and W. Nelson Francis, editors, Brown University Press
    • Henry Kucera and W. Nelson Francis, editors. Computational Aspects of Present-Day American English. Brown University Press, 1967.
    • (1967) Computational Aspects of Present-Day American English
  • 7
    • 0026979939 scopus 로고
    • Techniques for automatically correcting words in texts
    • Karen Kukich. Techniques for automatically correcting words in texts. ACM Computing Surveys, pages 377-439, 1992.
    • (1992) ACM Computing Surveys , pp. 377-439
    • Kukich, K.1
  • 8
    • 0001311748 scopus 로고    scopus 로고
    • The TREC-5 confusion track: Comparing retrieval methods for scanned text
    • Paul B. Kantor and Ellen M. Voorhees. The TREC-5 confusion track: Comparing retrieval methods for scanned text. Information Retrieval, 2(2/3):165-176, 2000.
    • (2000) Information Retrieval , vol.2 , Issue.2-3 , pp. 165-176
    • Kantor, P.B.1    Voorhees, E.M.2
  • 10
    • 33947366084 scopus 로고
    • University of Washington English/Japanese document image database II: A database of document images for OCR research. CD-ROM
    • Ihsin T. Phillips and Robert M. Haralick. University of Washington English/Japanese document image database II: A database of document images for OCR research. CD-ROM, 1995.
    • (1995) Phillips and Robert M. Haralick
    • Ihsin, T.1
  • 11
    • 0003638961 scopus 로고    scopus 로고
    • The fifth annual test of OCR accuracy
    • Technical Report TR-96-01, Information Science Research Institute University of Nevada Las Vegas
    • Stephen V. Rice, Frank R. Jenkins, and Thomas A. Nartker. The fifth annual test of OCR accuracy. Technical Report TR-96-01, Information Science Research Institute University of Nevada Las Vegas, 1996.
    • (1996)
    • Rice, S.V.1    Jenkins, F.R.2    Nartker, T.A.3
  • 12
    • 0038821141 scopus 로고    scopus 로고
    • The ISRI analytic tools for OCR evaluation
    • Technical Report TR-96-02, Information Science Research Institute University of Nevada Las Vegas
    • Stephen V. Rice and Thomas A. Nartker. The ISRI analytic tools for OCR evaluation. Technical Report TR-96-02, Information Science Research Institute University of Nevada Las Vegas, 1996.
    • (1996)
    • Rice, S.V.1    Nartker, T.A.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.