-
1
-
-
84859921107
-
A high-performance semi-supervised learning method for text chunking
-
Ando, R., & Zhang, T. (2005). A high-performance semi-supervised learning method for text chunking. ACL.
-
(2005)
ACL
-
-
Ando, R.1
Zhang, T.2
-
2
-
-
79959407847
-
Neural net language models
-
Bengio, Y. (2008). Neural net language models. Scholarpedia, 3, 3881.
-
(2008)
Scholarpedia
, vol.3
, pp. 3881
-
-
Bengio, Y.1
-
4
-
-
0142166851
-
A neural probabilistic language model
-
Bengio, Y., Ducharme, R., Vincent, P., & Jauvin, C. (2003). A neural probabilistic language model. Journal of Machine Learning Research, 3, 1137-1155.
-
(2003)
Journal of Machine Learning Research
, vol.3
, pp. 1137-1155
-
-
Bengio, Y.1
Ducharme, R.2
Vincent, P.3
Jauvin, C.4
-
5
-
-
71149116544
-
Curriculum learning
-
Bengio, Y., Louradour, J., Collobert, R., & Weston, J. (2009). Curriculum learning. ICML.
-
(2009)
ICML
-
-
Bengio, Y.1
Louradour, J.2
Collobert, R.3
Weston, J.4
-
6
-
-
10944221006
-
Quick training of probabilistic neural nets by importance sampling
-
Bengio, Y., & Sénécal, J.-S. (2003). Quick training of probabilistic neural nets by importance sampling. AISTATS.
-
(2003)
AISTATS
-
-
Bengio, Y.1
Sénécal, J.-S.2
-
7
-
-
0141607824
-
Latent Dirichlet allocation
-
Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent Dirichlet allocation. Journal of Machine Learning Research, 3, 993-1022.
-
(2003)
Journal of Machine Learning Research
, vol.3
, pp. 993-1022
-
-
Blei, D. M.1
Ng, A. Y.2
Jordan, M. I.3
-
8
-
-
85022919385
-
Class-based n-gram models of natural language
-
Brown, P. F., deSouza, P. V., Mercer, R. L., Pietra, V. J. D., & Lai, J. C. (1992). Class-based n-gram models of natural language. Computational Linguistics, 18, 467-479.
-
(1992)
Computational Linguistics
, vol.18
, pp. 467-479
-
-
Brown, P. F.1
deSouza, P. V.2
Mercer, R. L.3
Pietra, V. J. D.4
Lai, J. C.5
-
9
-
-
84866875199
-
Improving generative statistical parsing with semi-supervised word clustering
-
Candito, M., & Crabbé, B. (2009). Improving generative statistical parsing with semi-supervised word clustering. IWPT (pp. 138-141).
-
(2009)
IWPT
, pp. 138-141
-
-
Candito, M.1
Crabbé, B.2
-
10
-
-
56449095373
-
A unified architecture for natural language processing: Deep neural networks with multitask learning
-
Collobert, R., & Weston, J. (2008). A unified architecture for natural language processing: Deep neural networks with multitask learning. ICML.
-
(2008)
ICML
-
-
Collobert, R.1
Weston, J.2
-
11
-
-
80053402648
-
Semi-supervised semantic role labeling using the Latent Words Language Model
-
Deschacht, K., & Moens, M.-F. (2009). Semi-supervised semantic role labeling using the Latent Words Language Model. EMNLP (pp. 21-29).
-
(2009)
EMNLP
, pp. 21-29
-
-
Deschacht, K.1
Moens, M.-F.2
-
12
-
-
85031726785
-
Using latent semantic analysis to improve access to textual information
-
ACM
-
Dumais, S. T., Furnas, G. W., Landauer, T. K., Deerwester, S., & Harshman, R. (1988). Using latent semantic analysis to improve access to textual information. SIGCHI Conference on Human Factors in Computing Systems (pp. 281-285). ACM.
-
(1988)
SIGCHI Conference on Human Factors in Computing Systems
, pp. 281-285
-
-
Dumais, S. T.1
Furnas, G. W.2
Landauer, T. K.3
Deerwester, S.4
Harshman, R.5
-
13
-
-
0027636611
-
Learning and development in neural networks: The importance of starting small
-
Elman, J. L. (1993). Learning and development in neural networks: The importance of starting small. Cognition, 48, 781-799.
-
(1993)
Cognition
, vol.48
, pp. 781-799
-
-
Elman, J. L.1
-
14
-
-
84874625722
-
Enhancing unlexicalized parsing performance using a wide coverage lexicon, fuzzy tag-set mapping, and EM-HMM-based lexical probabilities
-
Goldberg, Y., Tsarfaty, R., Adler, M., & Elhadad, M. (2009). Enhancing unlexicalized parsing performance using a wide coverage lexicon, fuzzy tag-set mapping, and EM-HMM-based lexical probabilities. EACL.
-
(2009)
EACL
-
-
Goldberg, Y.1
Tsarfaty, R.2
Adler, M.3
Elhadad, M.4
-
16
-
-
0002646713
-
Contextual relations of words in Grimm tales, analyzed by self-organizing map
-
ICANN
-
Honkela, T., Pulkki, V., & Kohonen, T. (1995). Contextual relations of words in Grimm tales, analyzed by self-organizing map. ICANN.
-
(1995)
-
-
Honkela, T.1
Pulkki, V.2
Kohonen, T.3
-
17
-
-
80053028502
-
Distributional representations for handling sparsity in supervised sequence labeling
-
Huang, F., & Yates, A. (2009). Distributional representations for handling sparsity in supervised sequence labeling. ACL.
-
(2009)
ACL
-
-
Huang, F.1
Yates, A.2
-
18
-
-
0031625017
-
Dimensionality reduction by random mapping: Fast similarity computation for clustering
-
Kaski, S. (1998). Dimensionality reduction by random mapping: Fast similarity computation for clustering. IJCNN (pp. 413-418).
-
(1998)
IJCNN
, pp. 413-418
-
-
Kaski, S.1
-
19
-
-
80053551637
-
Simple semi-supervised dependency parsing
-
Koo, T., Carreras, X., & Collins, M. (2008). Simple semi-supervised dependency parsing. ACL (pp. 595-603).
-
(2008)
ACL
, pp. 595-603
-
-
Koo, T.1
Carreras, X.2
Collins, M.3
-
20
-
-
80053344402
-
An effective two-stage model for exploiting non-local dependencies in named entity recognition
-
Krishnan, V., & Manning, C. D. (2006). An effective two-stage model for exploiting non-local dependencies in named entity recognition. COLING-ACL.
-
(2006)
COLING-ACL
-
-
Krishnan, V.1
Manning, C. D.2
-
21
-
-
80053431219
-
An introduction to latent semantic analysis
-
Landauer, T. K., Foltz, P. W., & Laham, D. (1998). An introduction to latent semantic analysis. Discourse Processes, 259-284.
-
(1998)
Discourse Processes
, pp. 259-284
-
-
Landauer, T. K.1
Foltz, P. W.2
Laham, D.3
-
22
-
-
34547991287
-
Semi-supervised sequence modeling with syntactic topic models
-
Li, W., & McCallum, A. (2005). Semi-supervised sequence modeling with syntactic topic models. AAAI.
-
(2005)
AAAI
-
-
Li, W.1
McCallum, A.2
-
24
-
-
85185398851
-
Phrase clustering for discriminative learning
-
Lin, D., & Wu, X. (2009). Phrase clustering for discriminative learning. ACL-IJCNLP (pp. 1030-1038).
-
(2009)
ACL-IJCNLP
, pp. 1030-1038
-
-
Lin, D.1
Wu, X.2
-
25
-
-
0000458784
-
Producing high-dimensional semantic spaces from lexical co-occurrence
-
Lund, K., & Burgess, C. (1996). Producing high-dimensional semantic spaces from lexical co-occurrence. Behavior Research Methods, Instruments, & Computers, 28, 203-208.
-
(1996)
Behavior Research Methods, Instruments, & Computers
, vol.28
, pp. 203-208
-
-
Lund, K.1
Burgess, C.2
-
26
-
-
0001860276
-
Semantic and associative priming in high-dimensional semantic space
-
Lund, K., Burgess, C., & Atchley, R. A. (1995). Semantic and associative priming in high-dimensional semantic space. Cognitive Science Proceedings, LEA (pp. 660-665).
-
(1995)
Cognitive Science Proceedings, LEA
, pp. 660-665
-
-
Lund, K.1
Burgess, C.2
Atchley, R. A.3
-
27
-
-
0032049073
-
Algorithms for bigram and trigram word clustering
-
Martin, S., Liermann, J., & Ney, H. (1998). Algorithms for bigram and trigram word clustering. Speech Communication, 24, 19-37.
-
(1998)
Speech Communication
, vol.24
, pp. 19-37
-
-
Martin, S.1
Liermann, J.2
Ney, H.3
-
28
-
-
85117730830
-
Name tagging with word clusters and discriminative training
-
Miller, S., Guinness, J., & Zamanian, A. (2004). Name tagging with word clusters and discriminative training. HLT-NAACL (pp. 337-342).
-
(2004)
HLT-NAACL
, pp. 337-342
-
-
Miller, S.1
Guinness, J.2
Zamanian, A.3
-
29
-
-
67650453038
-
Three new graphical models for statistical language modelling
-
Mnih, A., & Hinton, G. E. (2007). Three new graphical models for statistical language modelling. ICML.
-
(2007)
ICML
-
-
Mnih, A.1
Hinton, G. E.2
-
30
-
-
84858779990
-
A scalable hierarchical distributed language model
-
Mnih, A., & Hinton, G. E. (2009). A scalable hierarchical distributed language model. NIPS (pp. 1081-1088).
-
(2009)
NIPS
, pp. 1081-1088
-
-
Mnih, A.1
Hinton, G. E.2
-
31
-
-
34547997987
-
Hierarchical probabilistic neural network language model
-
Morin, F., & Bengio, Y. (2005). Hierarchical probabilistic neural network language model. AISTATS.
-
(2005)
AISTATS
-
-
Morin, F.1
Bengio, Y.2
-
32
-
-
85123966307
-
Distributional clustering of English words
-
Pereira, F., Tishby, N., & Lee, L. (1993). Distributional clustering of English words. ACL (pp. 183-190).
-
(1993)
ACL
, pp. 183-190
-
-
Pereira, F.1
Tishby, N.2
Lee, L.3
-
33
-
-
84862300668
-
Design challenges and misconceptions in named entity recognition
-
Ratinov, L., & Roth, D. (2009). Design challenges and misconceptions in named entity recognition. CoNLL.
-
(2009)
CoNLL
-
-
Ratinov, L.1
Roth, D.2
-
38
-
-
0346346018
-
Introduction to the CoNLL-2000 shared task: Chunking
-
CoNLL
-
Sang, E. T., & Buchholz, S. (2000). Introduction to the CoNLL-2000 shared task: Chunking. CoNLL.
-
(2000)
-
-
Sang, E. T.1
Buchholz, S.2
-
39
-
-
0036293862
-
Connectionist language modeling for large vocabulary continuous speech recognition
-
Orlando, Florida
-
Schwenk, H., & Gauvain, J.-L. (2002). Connectionist language modeling for large vocabulary continuous speech recognition. International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 765-768). Orlando, Florida.
-
(2002)
International Conference on Acoustics, Speech and Signal Processing (ICASSP)
, pp. 765-768
-
-
Schwenk, H.1
Gauvain, J.-L.2
-
40
-
-
85043116988
-
Shallow parsing with conditional random fields
-
Sha, F., & Pereira, F. C. N. (2003). Shallow parsing with conditional random fields. HLT-NAACL.
-
(2003)
HLT-NAACL
-
-
Sha, F.1
Pereira, F. C. N.2
-
41
-
-
84858423134
-
From baby steps to leapfrog: How “less is more” in unsupervised dependency parsing
-
Spitkovsky, V., Alshawi, H., & Jurafsky, D. (2010). From baby steps to leapfrog: How “less is more” in unsupervised dependency parsing. NAACL-HLT.
-
(2010)
NAACL-HLT
-
-
Spitkovsky, V.1
Alshawi, H.2
Jurafsky, D.3
-
42
-
-
84859884966
-
Semi-supervised sequential labeling and segmentation using giga-word scale unlabeled data
-
Suzuki, J., & Isozaki, H. (2008). Semi-supervised sequential labeling and segmentation using giga-word scale unlabeled data. ACL-08: HLT (pp. 665-673).
-
(2008)
ACL-08: HLT
, pp. 665-673
-
-
Suzuki, J.1
Isozaki, H.2
-
43
-
-
80053399428
-
An empirical study of semi-supervised structured conditional models for dependency parsing
-
Suzuki, J., Isozaki, H., Carreras, X., & Collins, M. (2009). An empirical study of semi-supervised structured conditional models for dependency parsing. EMNLP.
-
(2009)
EMNLP
-
-
Suzuki, J.1
Isozaki, H.2
Carreras, X.3
Collins, M.4
-
44
-
-
84859979112
-
A preliminary evaluation of word representations for named-entity recognition
-
Turian, J., Ratinov, L., Bengio, Y., & Roth, D. (2009). A preliminary evaluation of word representations for named-entity recognition. NIPS Workshop on Grammar Induction, Representation of Language and Language Learning.
-
(2009)
NIPS Workshop on Grammar Induction, Representation of Language and Language Learning
-
-
Turian, J.1
Ratinov, L.2
Bengio, Y.3
Roth, D.4
-
46
-
-
2242436639
-
Hierarchical clustering of words
-
Ushioda, A. (1996). Hierarchical clustering of words. COLING (pp. 1159-1162).
-
(1996)
COLING
, pp. 1159-1162
-
-
Ushioda, A.1
-
49
-
-
84859968879
-
Towards explicit semantic features using independent component analysis
-
Stockholm, Sweden: Swedish Institute of Computer Science
-
Väyrynen, J. J., Honkela, T., & Lindqvist, L. (2007). Towards explicit semantic features using independent component analysis. Proceedings of the Workshop Semantic Content Acquisition and Representation (SCAR). Stockholm, Sweden: Swedish Institute of Computer Science.
-
(2007)
Proceedings of the Workshop Semantic Content Acquisition and Representation (SCAR)
-
-
Väyrynen, J. J.1
Honkela, T.2
Lindqvist, L.3
-
50
-
-
78649933846
-
Software framework for topic modelling with large corpora
-
Řehůřek, R., & Sojka, P. (2010). Software framework for topic modelling with large corpora. LREC.
-
(2010)
LREC
-
-
Řehůřek, R.1
Sojka, P.2
-
51
-
-
84919495233
-
A robust risk minimization based named entity recognition system
-
Zhang, T., & Johnson, D. (2003). A robust risk minimization based named entity recognition system. CoNLL.
-
(2003)
CoNLL
-
-
Zhang, T.1
Johnson, D.2
-
52
-
-
84859984064
-
Multilingual dependency learning: a huge feature engineering method to semantic dependency parsing
-
Zhao, H., Chen, W., Kit, C., & Zhou, G. (2009). Multilingual dependency learning: a huge feature engineering method to semantic dependency parsing. CoNLL (pp. 55-60).
-
(2009)
CoNLL
, pp. 55-60
-
-
Zhao, H.1
Chen, W.2
Kit, C.3
Zhou, G.4