-
1
-
-
37249002913
-
The effect of corpus size in combining supervised and unsupervised training for disambiguation
-
Michaela Atterer and Hinrich Schütze. 2006. The effect of corpus size in combining supervised and unsupervised training for disambiguation. In Proc. of COLING-ACL'06, pages 25-32.
-
(2006)
Proc. of COLING-ACL'06
, pp. 25-32
-
-
Atterer, M.1
Schütze, H.2
-
2
-
-
34548477786
-
Mitigating the paucity-of-data problem: Exploring the effect of training corpus size on classifier performance for natural language processing
-
Michele Banko and Eric Brill. 2001a. Mitigating the paucity-of-data problem: Exploring the effect of training corpus size on classifier performance for natural language processing. In Proc. of HLT'01.
-
(2001)
Proc. of HLT'01
-
-
Banko, M.1
Brill, E.2
-
3
-
-
0345570094
-
Scaling to very very large corpora for natural language disambiguation
-
Michele Banko and Eric Brill. 2001b. Scaling to very very large corpora for natural language disambiguation. In Proc. of ACL'01, pages 26-33.
-
(2001)
Proc. of ACL'01
, pp. 26-33
-
-
Banko, M.1
Brill, E.2
-
4
-
-
80053375619
-
Large language models in machine translation
-
Thorsten Brants, Ashok C. Popat, Peng Xu, Franz J. Och, and Jeffrey Dean. 2007. Large language models in machine translation. In Proc. of EMNLP-CoNLL'07, pages 858-867.
-
(2007)
Proc. of EMNLP-CoNLL'07
, pp. 858-867
-
-
Brants, T.1
Popat, A.C.2
Xu, P.3
Och, F.J.4
Dean, J.5
-
5
-
-
80053415351
-
Using web-search results to measure word-group similarity
-
Ann Gledson and John Keane. 2008. Using web-search results to measure word-group similarity. In Proc. of COLING'08, pages 281-288.
-
(2008)
Proc. of COLING'08
, pp. 281-288
-
-
Gledson, A.1
Keane, J.2
-
7
-
-
60049084774
-
Creating open language resources for Hungarian
-
Peter Halacsy, Andras Kornai, Laszlo Nemeth, Andras Rung, Istvan Szakadat, and Viktor Tron. 2004. Creating open language resources for Hungarian. In Proc. of LREC'04, pages 203-210.
-
(2004)
Proc. of LREC'04
, pp. 203-210
-
-
Halacsy, P.1
Kornai, A.2
Nemeth, L.3
Rung, A.4
Szakadat, I.5
Tron, V.6
-
9
-
-
84858385981
-
A fully-lexicalized probabilistic model for Japanese syntactic and case structure analysis
-
Daisuke Kawahara and Sadao Kurohashi. 2006a. A fully-lexicalized probabilistic model for Japanese syntactic and case structure analysis. In Proc. of HLT-NAACL'06, pages 176-183.
-
(2006)
Proc. of HLT-NAACL'06
, pp. 176-183
-
-
Kawahara, D.1
Kurohashi, S.2
-
10
-
-
78751686756
-
Case frame compilation from the web using high-performance computing
-
Daisuke Kawahara and Sadao Kurohashi. 2006b. Case frame compilation from the web using high-performance computing. In Proc. of LREC'06, pages 1344-1347.
-
(2006)
Proc. of LREC'06
, pp. 1344-1347
-
-
Kawahara, D.1
Kurohashi, S.2
-
11
-
-
77952990361
-
Probabilistic coordination disambiguation in a fully-lexicalized Japanese parser
-
Daisuke Kawahara and Sadao Kurohashi. 2007. Probabilistic coordination disambiguation in a fully-lexicalized Japanese parser. In Proc. of EMNLP-CoNLL'07, pages 306-314.
-
(2007)
Proc. of EMNLP-CoNLL'07
, pp. 306-314
-
-
Kawahara, D.1
Kurohashi, S.2
-
12
-
-
77952971817
-
Toward text understanding: Integrating relevance-tagged corpora and automatically constructed case frames
-
Daisuke Kawahara, Ryohei Sasano, and Sadao Kurohashi. 2004. Toward text understanding: Integrating relevance-tagged corpora and automatically constructed case frames. In Proc. of LREC'04, pages 1833-1836.
-
(2004)
Proc. of LREC'04
, pp. 1833-1836
-
-
Kawahara, D.1
Sasano, R.2
Kurohashi, S.3
-
13
-
-
0344154403
-
Introduction to the Special Issue on the Web as Corpus
-
DOI 10.1162/089120103322711569
-
Adam Kilgarriff and Gregory Grefenstette. 2003. Introduction to the special issue on the web as corpus. Computational Linguistics, 29(3):333-347.
-
(2003)
Computational Linguistics
, vol.29
, Issue.3
, pp. 333-347
-
-
Kilgarriff, A.1
Grefenstette, G.2
-
16
-
-
47349129451
-
Web text corpus for natural language processing
-
Vinci Liu and James R. Curran. 2006. Web text corpus for natural language processing. In Proc. of EACL'06, pages 233-240.
-
(2006)
Proc. of EACL'06
, pp. 233-240
-
-
Liu, V.1
Curran, J.R.2
-
17
-
-
77953016465
-
Kotonoha, the corpus development project of the National Institute for Japanese Language
-
Kikuo Maekawa. 2006. Kotonoha, the corpus development project of the National Institute for Japanese Language. In Proc. of the 13th NIJL International Symposium, pages 55-62.
-
(2006)
Proc. of the 13th NIJL International Symposium
, pp. 55-62
-
-
Maekawa, K.1
-
19
-
-
33645971389
-
Using the web in machine learning for other-anaphora resolution
-
Natalia N. Modjeska, Katja Markert, and Malvina Nissim. 2003. Using the web in machine learning for other-anaphora resolution. In Proc. of EMNLP-2003, pages 176-183.
-
(2003)
Proc. of EMNLP-2003
, pp. 176-183
-
-
Modjeska, N.N.1
Markert, K.2
Nissim, M.3
-
20
-
-
84962711699
-
A study of using search engine page hits as a proxy for n-gram frequencies
-
Preslav Nakov and Marti Hearst. 2005. A study of using search engine page hits as a proxy for n-gram frequencies. In Proc. of RANLP'05.
-
(2005)
Proc. of RANLP'05
-
-
Nakov, P.1
Hearst, M.2
-
21
-
-
84859913003
-
Solving relational similarity problems using the web as a corpus
-
Preslav Nakov and Marti A. Hearst. 2008. Solving relational similarity problems using the web as a corpus. In Proc. of ACL-HLT'08, pages 452-460.
-
(2008)
Proc. of ACL-HLT'08
, pp. 452-460
-
-
Nakov, P.1
Hearst, M.A.2
-
22
-
-
77952977421
-
Japanese named entity recognition using structural natural language processing
-
Ryohei Sasano and Sadao Kurohashi. 2008. Japanese named entity recognition using structural natural language processing. In Proc. of IJCNLP'08, pages 607-612.
-
(2008)
Proc. of IJCNLP'08
, pp. 607-612
-
-
Sasano, R.1
Kurohashi, S.2
-
23
-
-
80053432315
-
A fully-lexicalized probabilistic model for Japanese zero anaphora resolution
-
Ryohei Sasano, Daisuke Kawahara, and Sadao Kurohashi. 2008. A fully-lexicalized probabilistic model for Japanese zero anaphora resolution. In Proc. of COLING'08, pages 769-776.
-
(2008)
Proc. of COLING'08
, pp. 769-776
-
-
Sasano, R.1
Kawahara, D.2
Kurohashi, S.3
-
24
-
-
84859884966
-
Semi-supervised sequential labeling and segmentation using giga-word scale unlabeled data
-
Jun Suzuki and Hideki Isozaki. 2008. Semi-supervised sequential labeling and segmentation using giga-word scale unlabeled data. In Proceedings of ACL-HLT'08, pages 665-673.
-
(2008)
Proceedings of ACL-HLT'08
, pp. 665-673
-
-
Suzuki, J.1
Isozaki, H.2
-
25
-
-
33645999475
-
-
Bunruigoihyo. Dainippon Tosho (in Japanese)
-
The National Language Institute for Japanese Language. 2004. Bunruigoihyo. Dainippon Tosho. (In Japanese).
-
(2004)
Bunruigoihyo
-
-
-
26
-
-
0345570088
-
Exploiting the WWW as a corpus to resolve PP attachment ambiguities
-
Martin Volk. 2001. Exploiting the WWW as a corpus to resolve PP attachment ambiguities. In Proc. of the Corpus Linguistics, pages 601-606.
-
(2001)
Proc. of the Corpus Linguistics
, pp. 601-606
-
-
Volk, M.1