-
2
-
-
84977940268
-
Bootcat: Bootstrapping corpora and terms from the web
-
Marco Baroni and Silvia Bernardini. 2004. Bootcat: Bootstrapping corpora and terms from the web. In Proceedings of LREC 2004, pages 1313-1316.
-
(2004)
Proceedings of LREC 2004
, pp. 1313-1316
-
-
Baroni, M.1
Bernardini, S.2
-
3
-
-
85037352302
-
Cleaneval: A competition for cleaning webpages
-
Marrakech. ELRA
-
Marco Baroni, Francis Chantree, Adam Kilgarriff, and Serge Sharoff. 2008. Cleaneval: A competition for cleaning webpages. In Proceedings of LREC 2008, pages 638-643, Marrakech. ELRA.
-
(2008)
Proceedings of LREC 2008
, pp. 638-643
-
-
Baroni, M.1
Chantree, F.2
Kilgarriff, A.3
Sharoff, S.4
-
4
-
-
70350686154
-
The wacky wide web: A collection of very large linguistically processed web-crawled corpora
-
Marco Baroni, Silvia Bernardini, Adriano Ferraresi, and Eros Zanchetta. 2009. The wacky wide web: A collection of very large linguistically processed web-crawled corpora. Language Resources and Evaluation, 43 (3): 209-226.
-
(2009)
Language Resources and Evaluation
, vol.43
, Issue.3
, pp. 209-226
-
-
Baroni, M.1
Bernardini, S.2
Ferraresi, A.3
Zanchetta, E.4
-
5
-
-
0004278262
-
-
Technical Note 1997-115 SRC, Palo Alto, July 25
-
Andrei Z. Broder, Steven C. Glassman, Mark S. Manasse, and Geoffrey Zweig. 1997. Syntactic clustering of the web. Technical Note 1997-115, SRC, Palo Alto, July 25.
-
(1997)
Syntactic Clustering of the Web
-
-
Broder, A.Z.1
Glassman, S.C.2
Manasse, M.S.3
Zweig, G.4
-
6
-
-
84926125761
-
Measuring web-corpus randomness: A progress report
-
Marco Baroni and Silvia Bernardini, editorss, Bologna
-
Massimiliano Ciamarita and Marco Baroni. 2006. Measuring web-corpus randomness: A progress report. In Marco Baroni and Silvia Bernardini, editors, Wacky! Working papers on the Web as Corpus. GEDIT, Bologna.
-
(2006)
Wacky! Working Papers on the Web As Corpus. GEDIT
-
-
Ciamarita, M.1
Baroni, M.2
-
7
-
-
70350700772
-
Experience building a large corpus for Chinese lexicon construction
-
Marco Baroni and Silvia Bernardini, editors, Bologna
-
Thomas Emerson and John O'Neil. 2006. Experience building a large corpus for Chinese lexicon construction. In Marco Baroni and Silvia Bernardini, editors, Wacky! Working papers on the Web as Corpus. GEDIT, Bologna.
-
(2006)
Wacky! Working Papers on the Web As Corpus. GEDIT
-
-
Emerson, T.1
O'Neil, J.2
-
8
-
-
0344154403
-
Introduction to the special issue on the web as corpus
-
Adam Kilgarriff and Gregory Grefenstette. 2003. Introduction to the special issue on the web as corpus. Computational Linguistics, 29: 333-347.
-
(2003)
Computational Linguistics
, vol.29
, pp. 333-347
-
-
Kilgarriff, A.1
Grefenstette, G.2
-
9
-
-
34548080780
-
An introduction to information retrieval
-
Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schütze. 2009. An Introduction to Information Retrieval. CUP, Cambridge. Steffen Nissen. 2005. Neural Networks made simple. Software 2. 0, 2: 14-19.
-
(2009)
CUP, Cambridge. Steffen Nissen. 2005. Neural Networks Made Simple. Software 2. 0, 2
, pp. 14-19
-
-
Manning, C.D.1
Raghavan, P.2
Schütze, H.3
-
10
-
-
0003676885
-
-
Technical Report TR-CSE-03-01 Center for Research in Computing Technology, Harvard University, Harvard
-
Michael O. Rabin. 1981. Fingerprinting by random polynomials. Technical Report TR-CSE-03-01, Center for Research in Computing Technology, Harvard University, Harvard.
-
(1981)
Fingerprinting by Random Polynomials
-
-
Rabin, M.O.1
-
11
-
-
42649127636
-
Creating general-purpose corpora using automated search engine queries
-
Marco Baroni and Silvia Bernardini, editors. GEDIT, Bologna
-
Serge Sharoff. 2006. Creating general-purpose corpora using automated search engine queries. In Marco Baroni and Silvia Bernardini, editors, Wacky! Working papers on the Web as Corpus. GEDIT, Bologna.
-
(2006)
Wacky! Working Papers on the Web As Corpus
-
-
Sharoff, S.1
-
12
-
-
1542310280
-
Text classification and segmentation using minimum cross entropy
-
William J. Tehan. 2000. Text classification and segmentation using minimum cross entropy. In In Proceeding of RIAO.
-
(2000)
Proceeding of RIAO
-
-
Tehan, W.J.1
|