Volume 1, 2005, Pages 764-768

Language identification in Web pages

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; COMPUTER PROGRAMMING LANGUAGES; HEURISTIC METHODS; SEARCH ENGINES

EID: 33644526889     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1066677.1066852     Document Type: Conference Paper
Times cited: 82

References (28)
  • 10
    • 0028911698
    • Gauging similarity with n-grams: Language independent categorization of text
    • M. Damashek. Gauging similarity with n-grams: language independent categorization of text. Science, 267(5199):843-848, 1995.
    • (1995) Science, vol. 267, no. 5199, pp. 843-848
    • Damashek, M.
  • 11
    • 0003984557
    • Statistical identification of language
    • T. Dunning. Statistical identification of language. Technical Report MCCS 94-273, New Mexico State University, 1994.
    • (1994) Technical Report MCCS 94-273
    • Dunning, T.
  • 12
    • 0000803388
    • The population frequencies of species and the estimation of population parameters
    • I. J. Good. The population frequencies of species and the estimation of population parameters. Biometrika, 40:237-264, 1953.
    • (1953) Biometrika, vol. 40, pp. 237-264
    • Good, I.J.
  • 14
    • 33644534588
    • Language identification for the automatic grapheme-to-phoneme conversion of foreign words in a German text-to-speech system
    • P. Henrich. Language identification for the automatic grapheme-to-phoneme conversion of foreign words in a German text-to-speech system. In Proceedings of Eurospeech 1989, European Speech Communication and Technology, pages 220-223, September 1989.
    • (1989) Proceedings of Eurospeech 1989, European Speech Communication and Technology, pp. 220-223
    • Henrich, P.
  • 15
    • 33644547428
    • Information space based on HTML structure
    • C. Hill. Information space based on HTML structure. In E. M. Voorhees and D. K. Harman, editors, Proceedings of TREC-9, the 9th Text REtrieval Conference. Department of Commerce of National Institute of Standards and Technology, 2000.
    • (2000) Proceedings of TREC-9, the 9th Text REtrieval Conference
    • Hill, C.
  • 20
    • 3843127500
    • Character n-gram tokenization for European language text retrieval
    • P. McNamee and J. Mayfield. Character n-gram tokenization for European language text retrieval. Information Retrieval, 7, April 2004.
    • (2004) Information Retrieval, vol. 7
    • McNamee, P.; Mayfield, J.
  • 21
    • 0003268207
    • Performance and scalability of a large-scale n-gram based information retrieval system
    • E. Miller, D. Shen, J. Liu, and C. Nicholas. Performance and scalability of a large-scale n-gram based information retrieval system. Journal of Digital Information, 1(21), 2000.
    • (2000) Journal of Digital Information, vol. 1, no. 21
    • Miller, E.; Shen, D.; Liu, J.; Nicholas, C.
  • 23
    • 1542340317
    • N-gram term weighting: A comparative analysis
    • C. Pearce and B. Rye. N-gram term weighting: A comparative analysis. Technical Report TR-R52-001-98, National Security Agency Technical, January 1998.
    • (1998) Technical Report TR-R52-001-98
    • Pearce, C.; Rye, B.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.