메뉴 건너뛰기




Volumn , Issue , 2010, Pages 415-420

Omni font OCR error correction with effect on retrieval

Author keywords

Arabic text; Error correction; Information retrieval; Language modeling; OCR

Indexed keywords

ARABIC TEXTS; CHARACTER LEVEL; CHARACTER MODELS; CORRECTION TECHNIQUES; EDIT DISTANCE; ERROR MODEL; LANGUAGE MODEL; LANGUAGE MODELING; OCR; PRINTED DOCUMENTS; PRINTED MATERIALS; RETRIEVAL EFFECTIVENESS; TEMPORAL COVERAGE;

EID: 79851475788     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ISDA.2010.5687228     Document Type: Conference Paper
Times cited : (3)

References (32)
  • 2
    • 0032687697 scopus 로고    scopus 로고
    • Stemming methodologies over individual query words for arabic information retrieval
    • Abu-Salem, H., M. Al-Omari, and M. Evens. Stemming Methodologies Over Individual Query Words for Arabic Information Retrieval. JASIS, 50(6) (1999) 524-529.
    • (1999) JASIS , vol.50 , Issue.6 , pp. 524-529
    • Abu-Salem, H.1    Al-Omari, M.2    Evens, M.3
  • 5
    • 79851483662 scopus 로고    scopus 로고
    • SRILM - An extensible lamguage modeling toolkit
    • Andreas Stolcke. SRILM - An Extensible Lamguage Modeling Toolkit. Proceedings of the Workshop SMT. (2002)
    • (2002) Proceedings of the Workshop SMT
    • Stolcke, A.1
  • 6
    • 84982481712 scopus 로고
    • Comparing words, stems, and roots as index terms in an arabic information retrieval system
    • Al-Kharashi, I. and M Evens. Comparing Words, Stems, and Roots as Index Terms in an Arabic Information Retrieval System. JASIS 45(8) (1994) 548-560.
    • (1994) JASIS , vol.45 , Issue.8 , pp. 548-560
    • Al-Kharashi, I.1    Evens, M.2
  • 8
    • 84946076597 scopus 로고    scopus 로고
    • An improved error model for noisy channel spelling correction
    • Brill, E. and R. Moore. An improved error model for noisy channel spelling correction. In the proceedings ACL 2000. (2000).
    • (2000) The Proceedings ACL 2000
    • Brill, E.1    Moore, R.2
  • 9
    • 0002589728 scopus 로고
    • Probability scoringfor spelling correction
    • Church, K. and W. Gale. "Probability Scoringfor Spelling Correction." Statistics and Computing, 1: 93-103 (1991).
    • (1991) Statistics and Computing , vol.1 , pp. 93-103
    • Church, K.1    Gale, W.2
  • 10
    • 0036993294 scopus 로고    scopus 로고
    • Term selection for searching printed arabic
    • Darwish, K. and D. Oard. Term Selection for Searching Printed Arabic. In SIGIR-2002 (2002).
    • (2002) SIGIR-2002
    • Darwish, K.1    Oard, D.2
  • 11
    • 33749610227 scopus 로고    scopus 로고
    • CLIR experiments at maryland for TREC 2002: Evidence combination for arabic-english retrieval
    • Gaithersburg, MD
    • Darwish, K. and D. Oard. CLIR Experiments at Maryland for TREC 2002: Evidence Combination for Arabic-English Retrieval. In TREC-2002, Gaithersburg, MD (2002).
    • (2002) TREC-2002
    • Darwish, K.1    Oard, D.2
  • 12
    • 84884575840 scopus 로고    scopus 로고
    • A morphologically sensitive clustering algorithm for identifying arabic roots
    • Hong Kong
    • De Roeck, A. and W. Al-Fares. A Morphologically Sensitive Clustering Algorithm for Identifying Arabic Roots. In the 38th Annual Meeting of the ACL, Hong Kong, (2000).
    • (2000) The 38th Annual Meeting of the ACL
    • De Roeck, A.1    Al-Fares, W.2
  • 13
    • 84937308282 scopus 로고
    • Detection of spelling errors in swedish not using a word list en clair
    • Domeij, R., J. Hollman, V. Kann. Detection of spelling errors in Swedish not using a word list en clair. Journal of Quantitative Linguistics (1994) 195-201.
    • (1994) Journal of Quantitative Linguistics , pp. 195-201
    • Domeij, R.1    Hollman, J.2    Kann, V.3
  • 14
    • 12344328287 scopus 로고    scopus 로고
    • TREC 2002 cross-lingual retrieval at BBN
    • Gaithersburg, MD
    • Fraser, A., J. Xu, and R. Weischedel. TREC 2002 Cross-lingual Retrieval at BBN. In TREC-2002. Gaithersburg, MD (2002).
    • (2002) TREC-2002
    • Fraser, A.1    Xu, J.2    Weischedel, R.3
  • 15
    • 1542369989 scopus 로고    scopus 로고
    • The TREC-2001 cross-language information retrieval track: Searching arabic using english, french or arabic queries
    • Gaithersburg, MD
    • Gey, F. and D. Oard. The TREC-2001 Cross-Language Information Retrieval Track: Searching Arabic Using English, French or Arabic Queries. In TREC-2001, Gaithersburg, MD (2001).
    • (2001) TREC-2001
    • Gey, F.1    Oard, D.2
  • 16
    • 0000957913 scopus 로고    scopus 로고
    • Probabilistic retrieval of OCR-degraded text using N-grams
    • Harding, S., W. Croft, and C. Weir. Probabilistic Retrieval of OCR-degraded Text Using N-Grams. In ECDL'97 (1997).
    • (1997) ECDL'97
    • Harding, S.1    Croft, W.2    Weir, C.3
  • 19
    • 0036995440 scopus 로고    scopus 로고
    • Improving stemming for arabic information retrieval: Light stemming and cooccurrence analysis
    • Larkey, L., L. Ballesteros, and M. Connell. Improving stemming for Arabic information retrieval: light stemming and cooccurrence analysis. In proceedings SIGIR'02. (2002).
    • (2002) Proceedings SIGIR'02
    • Larkey, L.1    Ballesteros, L.2    Connell, M.3
  • 21
    • 37049008029 scopus 로고    scopus 로고
    • Arabic OCR error correction using character segment correction, language modeling, and shallow morphology
    • Magdy, W. and K. Darwish. Arabic OCR Error Correction Using Character Segment Correction, Language Modeling, and Shallow Morphology. In EMNLP 2006, pages 408 - 414 (2006)
    • (2006) EMNLP 2006 , pp. 408-414
    • Magdy, W.1    Darwish, K.2
  • 22
    • 37049021085 scopus 로고    scopus 로고
    • Arabic. Word-based correction for retrieval of arabic OCR degraded documents
    • Magdy, W. and K. Darwish. Arabic. Word-Based Correction for Retrieval of Arabic OCR Degraded Documents. In SPIRE (2006)
    • (2006) SPIRE
    • Magdy, W.1    Darwish, K.2
  • 23
    • 1942515617 scopus 로고    scopus 로고
    • JHU/APL at TREC 2001: Experiments in filtering and in arabic, video, and web retrieval
    • Gaithersburg, MD
    • Mayfield, J., P. McNamee, C. Costello, C. Piatko, and A. Banerjee. JHU/APL at TREC 2001: Experiments in Filtering and in Arabic, Video, and Web Retrieval. In TREC-2001. Gaithersburg, MD (2001).
    • (2001) TREC-2001
    • Mayfield, J.1    McNamee, P.2    Costello, C.3    Piatko, C.4    Banerjee, A.5
  • 24
    • 28844497373 scopus 로고    scopus 로고
    • JHU/APL at TREC 2002: Experiments in filtering and arabic retrieval
    • Gaithersburg, MD
    • McNamee, P., C. Piatko, and J. Mayfield. JHU/APL at TREC 2002: Experiments in Filtering and Arabic Retrieval. In TREC-2002, Gaithersburg, MD (2002).
    • (2002) TREC-2002
    • McNamee, P.1    Piatko, C.2    Mayfield, J.3
  • 25
    • 50849122138 scopus 로고    scopus 로고
    • Arabic treebank: Part 1 - 10Kword english translation
    • Moussa B., M. Maamouri, H. Jin, A. Bies, X. Ma. Arabic Treebank: Part 1 - 10Kword English Translation. LDC (2003).
    • (2003) LDC
    • Moussa, B.1    Maamouri, M.2    Jin, H.3    Bies, A.4    Ma, X.5
  • 26
    • 33646736698 scopus 로고    scopus 로고
    • The TREC 2002 arabic/English CLIR track
    • Gaithersburg, MD
    • Oard, D. and F. Gey. The TREC 2002 Arabic/English CLIR Track. In TREC-2002, Gaithersburg, MD (2002).
    • (2002) TREC-2002
    • Oard, D.1    Gey, F.2
  • 27
    • 0002692959 scopus 로고    scopus 로고
    • Error-tolerant finite state recognition with applications to morphological analysis and spelling correction
    • Oflazer, K. Error-Tolerant Finite State Recognition with Applications to Morphological Analysis and Spelling Correction. Computational Linguistics 22(1), 73-90 (1996).
    • (1996) Computational Linguistics 22(1) , pp. 73-90
    • Oflazer, K.1
  • 28
    • 84885608872 scopus 로고    scopus 로고
    • Information retrieval system evaluation: Effort, sensitivity, and reliability
    • Sanderson, M. and J. Zobel. Information Retrieval System Evaluation: Effort, Sensitivity, and Reliability. In SIGIR'05, (2005).
    • (2005) SIGIR'05
    • Sanderson, M.1    Zobel, J.2
  • 30
    • 37049010935 scopus 로고    scopus 로고
    • Efficient generation and ranking of spelling error corrections
    • Tillenius, M., Efficient generation and ranking of spelling error corrections. NADA (1996).
    • (1996) NADA
    • Tillenius, M.1
  • 32
    • 85093280076 scopus 로고    scopus 로고
    • Factored language models and generalized parallel backoff
    • Bilmes J. A. and K. Kirchhoff. Factored Language Models and Generalized Parallel Backoff. In HLT 2003
    • HLT 2003
    • Bilmes, J.A.1    Kirchhoff, K.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.