Volume 32, Issue 3, 2006, Pages 295-340

Orthographic errors in Web pages: Toward cleaner Web corpora

Author keywords

[No Author keywords available]

Indexed keywords

ERRORS; LINGUISTICS

EID: 33748650310     PISSN: 08912017     EISSN: 15309312     Source Type: Journal    
DOI: 10.1162/coli.2006.32.3.295     Document Type: Article
Times cited: 37

References (43)
  • 1
    • 0032181559
    • Efficient error-correcting Viterbi parsing
    • Amengual, Juan Carlos and Enrique Vidal. 1998. Efficient error-correcting Viterbi parsing. IEEE Transactions on PAMI, 20(10):1-109.
    • (1998) IEEE Transactions on PAMI , vol.20 , Issue.10 , pp. 1-109
    • Amengual, J.C.1    Vidal, E.2
  • 2
    • 84977940268
    • BootCaT: Bootstrapping corpora and terms from the web
    • Lisbon
    • Baroni, Marco and Silvia Bernardini. 2004. BootCaT: Bootstrapping corpora and terms from the web. In Proceedings of LREC 2004, pages 1313-1316, Lisbon.
    • (2004) Proceedings of LREC 2004 , pp. 1313-1316
    • Baroni, M.1    Bernardini, S.2
  • 3
    • 0032626065
    • Generating translation lexica from multilingual texts
    • Boutsis, Sotiris, Stelios Piperidis, and Iason Demiros. 1999. Generating translation lexica from multilingual texts. Applied Artificial Intelligence, 13(6):583-606.
    • (1999) Applied Artificial Intelligence , vol.13 , Issue.6 , pp. 583-606
    • Boutsis, S.1    Piperidis, S.2    Demiros, I.3
  • 4
    • 0038589165
    • The anatomy of a large-scale hypertextual Web search engine
    • Brin, Sergey and Lawrence Page. 1998. The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems, 30:107-117.
    • (1998) Computer Networks and ISDN Systems , vol.30 , pp. 107-117
    • Brin, S.1    Page, L.2
  • 7
    • 0002911416
    • Introduction to the special issue on computational linguistics using large corpora
    • Church, Kenneth W. and Robert L. Mercer. 1993. Introduction to the special issue on computational linguistics using large corpora. Computational Linguistics, 19(1):1-24.
    • (1993) Computational Linguistics , vol.19 , Issue.1 , pp. 1-24
    • Church, K.W.1    Mercer, R.L.2
  • 9
    • 85055298348
    • Accurate models for the statistics of surprise and coincidence
    • Dunning, Ted. 1993. Accurate models for the statistics of surprise and coincidence. Computational Linguistics, 19(1):61-74.
    • (1993) Computational Linguistics , vol.19 , Issue.1 , pp. 61-74
    • Dunning, T.1
  • 10
    • 33748649543
    • Learning to classify documents according to genre
    • IJCAI-03 Workshop on Computational Approaches to Text Style and Synthesis, Acapulco. (in press)
    • Finn, Aidan and Nicholas Kushmerick. 2003. Learning to classify documents according to genre. In IJCAI-03 Workshop on Computational Approaches to Text Style and Synthesis, Acapulco. Journal of the American Society for Information Science and Technology (in press).
    • (2003) Journal of the American Society for Information Science and Technology
    • Finn, A.1    Kushmerick, N.2
  • 11
    • 85105789584
    • Facilitating the compilation and dissemination of ad-hoc web corpora
    • Guy Aston, Silvia Bernardini, and Dominic Stewart, editors, number 17 in Studies in Corpus Linguistics. John Benjamins Publishing Company, Amsterdam
    • Fletcher, William H. 2004a. Facilitating the compilation and dissemination of ad-hoc web corpora. In Guy Aston, Silvia Bernardini, and Dominic Stewart, editors, Corpora and Language Learners, number 17 in Studies in Corpus Linguistics. John Benjamins Publishing Company, Amsterdam.
    • (2004) Corpora and Language Learners
    • Fletcher, W.H.1
  • 12
    • 28244479582
    • Making the web more useful as a source for linguistic corpora
    • U. Connor and T. Upton, editors, Rodopi, Amsterdam
    • Fletcher, William H. 2004b. Making the web more useful as a source for linguistic corpora. In U. Connor and T. Upton, editors, Corpus Linguistics in North America 2002. Rodopi, Amsterdam.
    • (2004) Corpus Linguistics in North America 2002
    • Fletcher, W.H.1
  • 25
    • 0026979939
    • Techniques for automatically correcting words in texts
    • Kukich, Karen. 1992. Techniques for automatically correcting words in texts. ACM Computing Surveys, 24(4):377-439.
    • (1992) ACM Computing Surveys , vol.24 , Issue.4 , pp. 377-439
    • Kukich, K.1
  • 32
    • 0030245363
    • From HMMs to segment models: A unified view of stochastic modeling for speech recognition
    • Ostendorf, Mari, Vassilios V. Digalakis, and Owen A. Kimball. 1996. From HMMs to segment models: A unified view of stochastic modeling for speech recognition. IEEE Transactions on Speech and Audio Processing, 4(5):360-378.
    • (1996) IEEE Transactions on Speech and Audio Processing , vol.4 , Issue.5 , pp. 360-378
    • Ostendorf, M.1    Digalakis, V.V.2    Kimball, O.A.3
  • 42
    • 0345016957
    • wEBMT: Developing and validating an example-based machine translation system using the world wide web
    • Way, Andy and Nano Gough. 2003. wEBMT: Developing and validating an example-based machine translation system using the world wide web. Computational Linguistics - Special Issue on the Web as Corpus, 29(3):421-458.
    • (2003) Computational Linguistics - Special Issue on the Web as Corpus , vol.29 , Issue.3 , pp. 421-458
    • Way, A.1    Gough, N.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.