



Volume , Issue , 2013, Pages 43-51

A modular open-source focused crawler for mining monolingual and bilingual corpora from the web

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATIONAL LINGUISTICS; INFORMATION RETRIEVAL SYSTEMS; TEXT PROCESSING;

EID: 85121796154     PISSN: 0736587X     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 26

References (25)
  • 8
    • 33745870172
    • Discovering parallel text from the World Wide Web
    • Jisong Chen, Rowena Chau, and Chung-Hsing Yeh. 2004. Discovering parallel text from the World Wide Web. In Proceedings of ACSW Frontiers '04, volume 32, pages 157-161, Darlinghurst, Australia.
    • (2004) Proceedings of ACSW Frontiers '04 , vol.32 , pp. 157-161
    • Chen, Jisong1    Chau, Rowena2    Yeh, Chung-Hsing3
  • 10
    • 85044811944
    • Combining Content-Based and URL-Based Heuristics to Harvest Aligned Bitexts from Multilingual Sites with Bitextor
    • Miquel Esplà-Gomis and Mikel L. Forcada. 2010. Combining Content-Based and URL-Based Heuristics to Harvest Aligned Bitexts from Multilingual Sites with Bitextor. The Prague Bulletin of Mathematical Linguistics, 93:77-86.
    • (2010) The Prague Bulletin of Mathematical Linguistics , vol.93 , pp. 77-86
    • Esplà-Gomis, Miquel1    Forcada, Mikel L.2
  • 12
    • 0344154403
    • Introduction to the special issue on the web as corpus
    • Adam Kilgarriff and Gregory Grefenstette. 2003. Introduction to the special issue on the web as corpus. Computational Linguistics, 29(3):333-348.
    • (2003) Computational Linguistics , vol.29 , Issue.3 , pp. 333-348
    • Kilgarriff, Adam1    Grefenstette, Gregory2
  • 16
    • 85001022730
    • Domain adaptation of statistical machine translation using web-crawled resources: A case study
    • Domain adaptation of statistical machine translation using web-crawled resources: A case study. In Proceedings of the 16th Annual Conference of EAMT, pages 145-152, Trento, Italy.
    • Proceedings of the 16th Annual Conference of EAMT , pp. 145-152
  • 17
    • 61949425675
    • Web page classification: Features and algorithms
    • Xiaoguang Qi and Brian D. Davison. 2009. Web page classification: Features and algorithms. ACM Computing Surveys, 41:11-31.
    • (2009) ACM Computing Surveys , vol.41 , pp. 11-31
    • Qi, Xiaoguang1    Davison, Brian D.2
  • 19
    • 78649256542
    • A DOM tree alignment model for mining parallel data from the web
    • Lei Shi, Cheng Niu, Ming Zhou, and Jianfeng Gao. 2006. A DOM tree alignment model for mining parallel data from the web. In COLING/ACL-2006, pages 489-496.
    • (2006) COLING/ACL-2006 , pp. 489-496
    • Shi, Lei1    Niu, Cheng2    Zhou, Ming3    Gao, Jianfeng4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.