메뉴 건너뛰기




Volumn , Issue , 2014, Pages 29-35

{bs,hr,sr}WaC – Web corpora of Bosnian, Croatian and Serbian

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATIONAL LINGUISTICS; MODELING LANGUAGES;

EID: 85088131964     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (135)

References (16)
  • 1
    • 85000897241 scopus 로고    scopus 로고
    • The SETimes.HR linguistically annotated corpus of Croatian
    • [Agić and Ljubešić2014]
    • [Agić and Ljubešić2014] Željko Agić and Nikola Ljubešić. 2014. The SETimes.HR linguistically annotated corpus of Croatian. In Proceedings of LREC 2014.
    • (2014) Proceedings of LREC 2014
    • Agić, Željko1    Ljubešić, Nikola2
  • 5
    • 70350686154 scopus 로고    scopus 로고
    • The WaCky wide web: a collection of very large linguistically processed web-crawled corpora
    • [Baroni et al.2009] pages
    • [Baroni et al.2009] Marco Baroni, Silvia Bernardini, Adriano Ferraresi, and Eros Zanchetta. 2009. The WaCky wide web: a collection of very large linguistically processed web-crawled corpora. Language Resources and Evaluation, pages 209–226.
    • (2009) Language Resources and Evaluation , pp. 209-226
    • Baroni, Marco1    Bernardini, Silvia2    Ferraresi, Adriano3    Zanchetta, Eros4
  • 9
    • 77950904942 scopus 로고    scopus 로고
    • Boilerplate detection using shallow text features
    • [Kohlschütter et al.2010] Brian D. Davison, Torsten Suel, Nick Craswell, and Bing Liu, editors, pages ACM
    • [Kohlschütter et al.2010] Christian Kohlschütter, Peter Fankhauser, and Wolfgang Nejdl. 2010. Boilerplate detection using shallow text features. In Brian D. Davison, Torsten Suel, Nick Craswell, and Bing Liu, editors, WSDM, pages 441–450. ACM.
    • (2010) WSDM , pp. 441-450
    • Kohlschütter, Christian1    Fankhauser, Peter2    Nejdl, Wolfgang3
  • 11
    • 85118481535 scopus 로고    scopus 로고
    • langid.py: An off-the-shelf language identification tool
    • [Lui and Baldwin2012] pages
    • [Lui and Baldwin2012] Marco Lui and Timothy Baldwin. 2012. langid.py: An off-the-shelf language identification tool. In ACL (System Demonstrations), pages 25–30.
    • (2012) ACL (System Demonstrations) , pp. 25-30
    • Lui, Marco1    Baldwin, Timothy2
  • 15
    • 84897949455 scopus 로고    scopus 로고
    • Efficient web crawling for large text corpora
    • [Suchomel and Pomikálek2012] Serge Sharoff Adam Kilgarriff, editor, pages Lyon
    • [Suchomel and Pomikálek2012] Vít Suchomel and Jan Pomikálek. 2012. Efficient web crawling for large text corpora. In Serge Sharoff Adam Kilgarriff, editor, Proceedings of the seventh Web as Corpus Workshop (WAC7), pages 39–43, Lyon.
    • (2012) Proceedings of the seventh Web as Corpus Workshop (WAC7) , pp. 39-43
    • Suchomel, Vít1    Pomikálek, Jan2
  • 16
    • 84876815126 scopus 로고    scopus 로고
    • Efficient discrimination between closely related languages
    • [Tiedemann and Ljubešić2012] pages Mumbai, India
    • [Tiedemann and Ljubešić2012] Jörg Tiedemann and Nikola Ljubešić. 2012. Efficient discrimination between closely related languages. In Proceedings of COLING 2012, pages 2619–2634, Mumbai, India.
    • (2012) Proceedings of COLING 2012 , pp. 2619-2634
    • Tiedemann, Jörg1    Ljubešić, Nikola2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.