메뉴 건너뛰기




Volumn , Issue , 2011, Pages 553-561

Cross-domain Feature Selection for Language Identification

Author keywords

[No Author keywords available]

Indexed keywords

FEATURE EXTRACTION;

EID: 85099685560     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (123)

References (36)
  • 9
    • 0028911698 scopus 로고
    • Gauging similarity with n-grams: Language-independent categorization of text
    • Marc Darnashek. 1995. Gauging similarity with n-grams: Language-independent categorization of text. Science, 267:843-848.
    • (1995) Science , vol.267 , pp. 843-848
    • Darnashek, Marc1
  • 13
    • 0003984557 scopus 로고
    • Technical Report MCCS 940-273, Computing Research Laboratory, New Mexico State University
    • Ted Dunning. 1994. Statistical identification of language. Technical Report MCCS 940-273, Computing Research Laboratory, New Mexico State University.
    • (1994) Statistical identification of language
    • Dunning, Ted1
  • 14
    • 2942731012 scopus 로고    scopus 로고
    • An Extensive Empirical Study of Feature Selection Metrics for Text Classification
    • George Forman. 2003. An Extensive Empirical Study of Feature Selection Metrics for Text Classification. Journal of Machine Learning Research, 3(7-8):1289-1305.
    • (2003) Journal of Machine Learning Research , vol.3 , Issue.7-8 , pp. 1289-1305
    • Forman, George1
  • 15
    • 20344402818 scopus 로고    scopus 로고
    • Building Minority Language Corpora by Learning to Generate Web Search Queries
    • Rayid Ghani, Rosie Jones, and Dunja Mladenic. 2004. Building Minority Language Corpora by Learning to Generate Web Search Queries. Knowledge and Information Systems, 7(1):56-83.
    • (2004) Knowledge and Information Systems , vol.7 , Issue.1 , pp. 56-83
    • Ghani, Rayid1    Jones, Rosie2    Mladenic, Dunja3
  • 17
    • 49949150022 scopus 로고
    • Language identification in the limit
    • E. Mark Gold. 1967. Language identification in the limit. Information and Control, 5:447-474.
    • (1967) Information and Control , vol.5 , pp. 447-474
    • Mark Gold, E.1
  • 29
    • 3843127500 scopus 로고    scopus 로고
    • Character Ngram Tokenization for European Language Text Retrieval
    • Paul McNamee and James Mayfield. 2004. Character Ngram Tokenization for European Language Text Retrieval. Information Retrieval, 7(1-2):73-97.
    • (2004) Information Retrieval , vol.7 , Issue.1-2 , pp. 73-97
    • McNamee, Paul1    Mayfield, James2
  • 31
    • 33744584654 scopus 로고
    • Induction of Decision Trees
    • October
    • J.R. Quinlan. 1986. Induction of Decision Trees. Machine Learning, 1(1):81-106, October.
    • (1986) Machine Learning , vol.1 , Issue.1 , pp. 81-106
    • Quinlan, J.R.1
  • 33
    • 20344398381 scopus 로고    scopus 로고
    • Software
    • Gertjan van Noord, 1997. TextCat. Software available at http://odur.let.rug.nl/~vannoord/TextCat/.
    • (1997) TextCat
    • van Noord, Gertjan1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.