-
1
-
-
0032264186
-
Distributional clustering of words for text classification
-
Baker, D. and McCallum, A. (1998). Distributional clustering of words for text classification. In SIGIR'98.
-
(1998)
SIGIR'98
-
-
Baker, D.1
McCallum, A.2
-
2
-
-
84899005563
-
A neural probabilistic language model
-
Leen, T., Dietterich, T., and Tresp, V., editors MIT Press
-
Bengio, Y., Ducharme, R., and Vincent, P. (2001). A neural probabilistic language model. In Leen, T., Dietterich, T., and Tresp, V., editors, Advances in Neural Information Processing Systems 13, pages 933-938. MIT Press.
-
(2001)
Advances in Neural Information Processing Systems
, vol.13
, pp. 933-938
-
-
Bengio, Y.1
Ducharme, R.2
Vincent, P.3
-
3
-
-
0142166851
-
A neural probabilistic language model
-
Bengio, Y., Ducharme, R., Vincent, P., and Jauvin, C. (2003). A neural probabilistic language model. Journal of Machine Learning Research, 3:1137-1155.
-
(2003)
Journal of Machine Learning Research
, vol.3
, pp. 1137-1155
-
-
Bengio, Y.1
Ducharme, R.2
Vincent, P.3
Jauvin, C.4
-
4
-
-
10944221006
-
Quick training of probabilistic neural nets by importance sampling
-
Bengio, Y. and Senécal, J.-S. (2003). Quick training of probabilistic neural nets by importance sampling. In Proceedings of AISTATS'2003.
-
(2003)
Proceedings of AISTATS'2003
-
-
Bengio, Y.1
Senécal, J.-S.2
-
5
-
-
0002652285
-
-
A maximum entropy approach to natural language processing
-
Berger, A., Della Pietra, S., and Della Pietra, V. (1996). A maximum entropy approach to natural language processing. Computational Linguistics, 22:39-71.
-
(1996)
Computational Linguistics
, vol.22
, pp. 39-71
-
-
Berger, A.1
Della Pietra, S.2
Della Pietra, V.3
-
6
-
-
85022919385
-
Class-based n-gram models of natural language
-
Brown, P., Pietra, V. D., DeSouza, P., Lai, J., and Mercer, R. (1992). Class-based n-gram models of natural language. Computational Linguistics, 18:467-479.
-
(1992)
Computational Linguistics
, vol.18
, pp. 467-479
-
-
Brown, P.1
Pietra, V.D.2
DeSouza, P.3
Lai, J.4
Mercer, R.5
-
7
-
-
84989525001
-
Indexing by latent semantic analysis
-
Deerwester, S., Dumais, S., Furnas, G., Landauer, T., and Harshman, R. (1990). Indexing by latent semantic analysis. Journal of the American Society for Information Science, 41(6):391-407.
-
(1990)
Journal of the American Society for Information Science
, vol.41
, Issue.6
, pp. 391-407
-
-
Deerwester, S.1
Dumais, S.2
Furnas, G.3
Landauer, T.4
Harshman, R.5
-
8
-
-
26444565569
-
Finding structure in time
-
Elman, J. (1990). Finding structure in time. Cognitive Science, 14:179-211.
-
(1990)
Cognitive Science
, vol.14
, pp. 179-211
-
-
Elman, J.1
-
13
-
-
0008602090
-
Training products of experts by minimizing contrastive divergence
-
University College London
-
Hinton, G. (2000). Training products of experts by minimizing contrastive divergence. Technical Report GCNU TR 2000-004, Gatsby Unit, University College London.
-
(2000)
Technical Report GCNU TR 2000-004, Gatsby Unit
-
-
Hinton, G.1
-
14
-
-
0019114666
-
Interpolated estimation of Markov source parameters from sparse data
-
Gelsema, E. S. and Kanal, L. N., editors North-Holland, Amsterdam
-
Jelinek, F. and Mercer, R. L. (1980). Interpolated estimation of Markov source parameters from sparse data. In Gelsema, E. S. and Kanal, L. N., editors, Pattern Recognition in Practice. North-Holland, Amsterdam.
-
(1980)
Pattern Recognition in Practice
-
-
Jelinek, F.1
Mercer, R.L.2
-
15
-
-
0023312404
-
Estimation of probabilities from sparse data for the language model component of a speech recognizer
-
Katz, S. M. (1987). Estimation of probabilities from sparse data for the language model component of a speech recognizer. IEEE Transactions on Acoustics, Speech, and Signal Processing, ASSP-35(3):400-401.
-
(1987)
IEEE Transactions on Acoustics, Speech, and Signal Processing
, vol.ASSP-35
, Issue.3
, pp. 400-401
-
-
Katz, S.M.1
-
16
-
-
34248842385
-
Natural language processing with modular neural networks and distributed lexicon
-
Miikkulainen, R. and Dyer, M. (1991). Natural language processing with modular neural networks and distributed lexicon. Cognitive Science, 15:343-399.
-
(1991)
Cognitive Science
, vol.15
, pp. 343-399
-
-
Miikkulainen, R.1
Dyer, M.2
-
18
-
-
0031628780
-
Comparison of part-of-speech and automatically derived category-based language models for speech recognition
-
Niesler, T., Whittaker, E., and Woodland, P. (1998). Comparison of part-of-speech and automatically derived category-based language models for speech recognition. In International Conference on Acoustics, Speech, and Signal Processing, pages 177-180.
-
(1998)
International Conference on Acoustics, Speech, and Signal Processing
, pp. 177-180
-
-
Niesler, T.1
Whittaker, E.2
Woodland, P.3
-
19
-
-
85123966307
-
Distributional clustering of English words
-
Columbus, Ohio
-
Pereira, F., Tishby, N., and Lee, L. (1993). Distributional clustering of English words. In 30th Annual Meeting of the Association for Computational Linguistics, pages 183-190, Columbus, Ohio.
-
(1993)
30th Annual Meeting of the Association for Computational Linguistics
, pp. 183-190
-
-
Pereira, F.1
Tishby, N.2
Lee, L.3
-
20
-
-
45549117987
-
Term weighting approaches in automatic text retrieval
-
Salton, G. and Buckley, C. (1988). Term weighting approaches in automatic text retrieval. Information Processing and Management, 24(5):513-523.
-
(1988)
Information Processing and Management
, vol.24
, Issue.5
, pp. 513-523
-
-
Salton, G.1
Buckley, C.2
-
22
-
-
0142161367
-
Word space
-
Giles, C., Hanson, S., and Cowan, J., editors San Mateo CA. Morgan Kaufmann
-
Schutze, H. (1993). Word space. In Giles, C., Hanson, S., and Cowan, J., editors, Advances in Neural Information Processing Systems 5, pages 895-902, San Mateo CA. Morgan Kaufmann.
-
(1993)
Advances in Neural Information Processing Systems
, vol.5
, pp. 895-902
-
-
Schutze, H.1
-
23
-
-
10944267136
-
Efficient training of large neural networks for language modeling
-
Schwenk, H. (2004). Efficient training of large neural networks for language modeling. In IEEE Joint Conference on Neural Networks.
-
(2004)
IEEE Joint Conference on Neural Networks
-
-
Schwenk, H.1
-
24
-
-
0036293862
-
Connectionist language modeling for large vocabulary continuous speech recognition
-
Orlando, Florida
-
Schwenk, H. and Gauvain, J.-L. (2002). Connectionist language modeling for large vocabulary continuous speech recognition. In International Conference on Acoustics, Speech, and Signal Processing, pages 765-768, Orlando, Florida.
-
(2002)
International Conference on Acoustics, Speech, and Signal Processing
, pp. 765-768
-
-
Schwenk, H.1
Gauvain, J.-L.2
-
26
-
-
33645488707
-
Training connectionist models for the structured language model
-
Xu, P., Emami, A., and Jelinek, F. (2003). Training connectionist models for the structured language model. In Empirical Methods in Natural Language Processing, EMNLP'2003.
-
(2003)
Empirical Methods in Natural Language Processing, EMNLP'2003
-
-
Xu, P.1
Emami, A.2
Jelinek, F.3