



2012, Pages 25-30

langid.py: An off-the-shelf language identification tool

Author keywords

[No Author keywords available]

Indexed keywords

DESIGN AND IMPLEMENTATIONS; DOCUMENT DATASETS; EMPIRICAL COMPARISON; END-USERS; HIGH-ACCURACY; IDENTIFICATION TOOLS; LANGUAGE IDENTIFICATION; MICRO-BLOG; TRAINING DATA;

EID: 85118481535     PISSN: 0736587X     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (568)

References (18)
  • 1
    • 0016518897
    • Efficient string matching: an aid to bibliographic search
    • June
    • Alfred V. Aho and Margaret J. Corasick. 1975. Efficient string matching: an aid to bibliographic search. Communications of the ACM, 18(6):333-340, June.
    • (1975) Communications of the ACM , vol.18 , Issue.6 , pp. 333-340
    • Aho, Alfred V.1    Corasick, Margaret J.2
  • 2
    • 84858394637
    • Language identification: The long and the short of the matter
    • Los Angeles, USA
    • Timothy Baldwin and Marco Lui. 2010. Language identification: The long and the short of the matter. In Proceedings of NAACL HLT 2010, pages 229-237, Los Angeles, USA.
    • (2010) Proceedings of NAACL HLT 2010 , pp. 229-237
    • Baldwin, Timothy1    Lui, Marco2
  • 6
    • 77958609427
    • Language identification of search engine queries
    • Singapore
    • Hakan Ceylan and Yookyung Kim. 2009. Language identification of search engine queries. In Proceedings of ACL2009, pages 1066-1074, Singapore.
    • (2009) Proceedings of ACL2009 , pp. 1066-1074
    • Ceylan, Hakan1    Kim, Yookyung2
  • 7
    • 2942731012
    • An Extensive Empirical Study of Feature Selection Metrics for Text Classification
    • October
    • George Forman. 2003. An Extensive Empirical Study of Feature Selection Metrics for Text Classification. Journal of Machine Learning Research, 3(7-8):1289-1305, October.
    • (2003) Journal of Machine Learning Research , vol.3 , Issue.7-8 , pp. 1289-1305
    • Forman, George1
  • 8
    • 20344402818
    • Building Minority Language Corpora by Learning to Generate Web Search Queries
    • February
    • Rayid Ghani, Rosie Jones, and Dunja Mladenic. 2004. Building Minority Language Corpora by Learning to Generate Web Search Queries. Knowledge and Information Systems, 7(1):56-83, February.
    • (2004) Knowledge and Information Systems , vol.7 , Issue.1 , pp. 56-83
    • Ghani, Rayid1    Jones, Rosie2    Mladenic, Dunja3
  • 9
    • 79958704049
    • A Fine-Grained Model for Language Identification
    • Harald Hammarstrom. 2007. A Fine-Grained Model for Language Identification. In Proceedings of iNEWS07, pages 14-20.
    • (2007) Proceedings of iNEWS07 , pp. 14-20
    • Hammarstrom, Harald1
  • 10
    • 44949230930
    • Europarl: A parallel corpus for statistical machine translation
    • Philipp Koehn. 2005. Europarl: A parallel corpus for statistical machine translation. MT summit, 11.
    • (2005) MT summit , pp. 11
    • Koehn, Philipp1
  • 13
    • 33744584654
    • Induction of Decision Trees
    • October
    • J.R. Quinlan. 1986. Induction of Decision Trees. Machine Learning, 1(1):81-106, October.
    • (1986) Machine Learning , vol.1 , Issue.1 , pp. 81-106
    • Quinlan, J.R.1
  • 15
    • 78049262443
    • News from OPUS - A Collection of Multilingual Parallel Corpora with Tools and Interfaces
    • Jörg Tiedemann. 2009. News from OPUS - A Collection of Multilingual Parallel Corpora with Tools and Interfaces. Recent Advances in Natural Language Processing, V:237-248.
    • (2009) Recent Advances in Natural Language Processing , vol.V , pp. 237-248
    • Tiedemann, Jörg1
  • 16
    • 84886880139
    • Graph-Based N-gram Language Identification on Short Texts
    • The Hague, Netherlands
    • Erik Tromp and Mykola Pechenizkiy. 2011. Graph-Based N-gram Language Identification on Short Texts. In Proceedings of Benelearn 2011, pages 27-35, The Hague, Netherlands.
    • (2011) Proceedings of Benelearn 2011 , pp. 27-35
    • Tromp, Erik1    Pechenizkiy, Mykola2
  • 18
    • 0003141935
    • A comparative study on feature selection in text categorization
    • Yiming Yang and Jan O. Pedersen. 1997. A comparative study on feature selection in text categorization. In Proceedings of ICML 97.
    • (1997) Proceedings of ICML 97
    • Yang, Yiming1    Pedersen, Jan O.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.