메뉴 건너뛰기




Volumn , Issue , 2011, Pages 67-74

Performing information extraction to improve OCR error detection in semi-structured historical documents

Author keywords

error detection; hidden Markov model; information extraction; OCR; optical character recognition; semi structured text

Indexed keywords

ALTERNATIVE APPROACH; DETECTION APPROACH; DICTIONARY MATCHING; DOCUMENT IMAGES; F-MEASURE; GENERAL APPROACH; HISTORICAL DOCUMENTS; INFORMATION EXTRACTION; POST PROCESS; SEMI-STRUCTURED; SEMI-STRUCTURED TEXT; WORD MODELS;

EID: 80054779191     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2037342.2037354     Document Type: Conference Paper
Times cited : (3)

References (19)
  • 4
    • 0031189914 scopus 로고    scopus 로고
    • Multitask Learning
    • R. Caruana. Multitask learning. Machine Learning, 28:41-75, 1997. (Pubitemid 127507169)
    • (1997) Machine Learning , vol.28 , Issue.1 , pp. 41-75
    • Caruana, R.1
  • 6
    • 77949843125 scopus 로고    scopus 로고
    • Efficient automatic OCR word validation using word partial format derivation and language model
    • L. Likforman-Sulem and G. Agam, editors, San Jose, California, USA, SPIE
    • S. Chen, D. Misra, and G. R. Thoma. Efficient automatic OCR word validation using word partial format derivation and language model. In L. Likforman-Sulem and G. Agam, editors, Document Recognition and Retrieval XVII, volume 7534, pages 1-10, San Jose, California, USA, 2010. SPIE.
    • (2010) Document Recognition and Retrieval XVII , vol.7534 , pp. 1-10
    • Chen, S.1    Misra, D.2    Thoma, G.R.3
  • 10
    • 0026979939 scopus 로고
    • Techniques for automatically correcting words in text
    • DOI 10.1145/146370.146380
    • K. Kukich. Techniques for automatically correcting words in text. ACM Computing Surveys, 24:377-439, 1992. (Pubitemid 23687641)
    • (1992) ACM Computing Surveys , vol.24 , Issue.4 , pp. 377-439
    • Kukich, K.1
  • 11
  • 14
    • 0034315648 scopus 로고    scopus 로고
    • OCR error correction of an inflectional Indian language using morphological parsing
    • U. Pal, P. K. Kundu, and B. B. Chaudhuri. OCR error correction of an inflectional indian language using morphological parsing. Journal of Information Science and Engineering, 16:903-922, 2000. (Pubitemid 32032383)
    • (2000) Journal of Information Science and Engineering , vol.16 , Issue.6 , pp. 903-922
    • Pal, U.1    Kundu, P.K.2    Chaudhuri, B.B.3
  • 15
    • 0024610919 scopus 로고
    • A tutorial on hidden markov models and selected applications in speech recognition
    • L. R. Rabiner. A tutorial on hidden markov models and selected applications in speech recognition. Proceedings of the IEEE, 77:257-286, 1989.
    • (1989) Proceedings of the IEEE , vol.77 , pp. 257-286
    • Rabiner, L.R.1
  • 17
    • 80054791171 scopus 로고
    • Press of the Nichols Print, Haverhill, Massachusetts, USA
    • J. B. White. Barber Genealogy. Press of the Nichols Print, Haverhill, Massachusetts, USA, 1908.
    • (1908) Barber Genealogy
    • White, J.B.1
  • 18
    • 33947320566 scopus 로고    scopus 로고
    • Hidden markov model variants and their application
    • S. Winters-Hilt. Hidden markov model variants and their application. BMC Bioinformatics, 7, 2006.
    • (2006) BMC Bioinformatics , pp. 7
    • Winters-Hilt, S.1
  • 19
    • 38549178489 scopus 로고    scopus 로고
    • A novel, fast, HMM-with-Duration implementation - For application with a new, pattern recognition informed, nanopore detector
    • S. Winters-Hilt and C. Baribault. A novel, fast, HMM-with-Duration implementation - for application with a new, pattern recognition informed, nanopore detector. BMC Bioinformatics, 8, 2007.
    • (2007) BMC Bioinformatics , pp. 8
    • Winters-Hilt, S.1    Baribault, C.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.