-
1
-
-
0028392483
-
Learning long-term dependencies with gradient descent is difficult
-
Yoshua Bengio, Patrice Simard, and Paolo Frasconi. 1994. Learning long-term dependencies with gradient descent is difficult. Neural Networks, IEEE Transactions on, 5(2):157-166.
-
(1994)
Neural Networks, IEEE Transactions on
, vol.5
, Issue.2
, pp. 157-166
-
-
Bengio, Y.1
Simard, P.2
Frasconi, P.3
-
2
-
-
0142166851
-
A neural probabilistic language model
-
Yoshua Bengio, Réjean Ducharme, Pascal Vincent, and Christian Janvin. 2003. A neural probabilistic language model. The Journal of Machine Learning Research, 3:1137-1155.
-
(2003)
The Journal of Machine Learning Research
, vol.3
, pp. 1137-1155
-
-
Bengio, Y.1
Ducharme, R.2
Vincent, P.3
Janvin, C.4
-
4
-
-
84961291190
-
Learning phrase representations using RNN encoder-decoder for statistical machine translation
-
Kyunghyun Cho, Bart van Merrienboer, Caglar Gulcehre, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. 2014. Learning phrase representations using RNN encoder-decoder for statistical machine translation. In Proceedings of EMNLP.
-
(2014)
Proceedings of EMNLP
-
-
Cho, K.1
Van Merrienboer, B.2
Gulcehre, C.3
Bougares, F.4
Schwenk, H.5
Bengio, Y.6
-
6
-
-
80053558787
-
Natural language processing (almost) from scratch
-
Ronan Collobert, Jason Weston, Leon Bottou, Michael Karlen, Koray Kavukcuoglu, and Pavel Kuksa. 2011. Natural language processing (almost) from scratch. The Journal of Machine Learning Research, 12:2493-2537.
-
(2011)
The Journal of Machine Learning Research
, vol.12
, pp. 2493-2537
-
-
Collobert, R.1
Weston, J.2
Bottou, L.3
Karlen, M.4
Kavukcuoglu, K.5
Kuksa, P.6
-
7
-
-
84941010245
-
-
arXiv preprint arXiv:1406.3830
-
Misha Denil, Alban Demiraj, Nal Kalchbrenner, Phil Blunsom, and Nando de Freitas. 2014. Modelling, visualising and summarising documents with a single convolutional neural network. arXiv preprint arXiv:1406.3830.
-
(2014)
Modelling, Visualising and Summarising Documents with a Single Convolutional Neural Network
-
-
Denil, M.1
Demiraj, A.2
Kalchbrenner, N.3
Blunsom, P.4
De Freitas, N.5
-
8
-
-
80052250414
-
Adaptive subgradient methods for online learning and stochastic optimization
-
John Duchi, Elad Hazan, and Yoram Singer. 2011. Adaptive subgradient methods for online learning and stochastic optimization. The Journal of Machine Learning Research, 12:2121-2159.
-
(2011)
The Journal of Machine Learning Research
, vol.12
, pp. 2121-2159
-
-
Duchi, J.1
Hazan, E.2
Singer, Y.3
-
9
-
-
85057230110
-
Hierarchical recurrent neural networks for long-term dependencies
-
Citeseer
-
Salah El Hihi and Yoshua Bengio. 1995. Hierarchical recurrent neural networks for long-term dependencies. In NIPS, pages 493-499. Citeseer.
-
(1995)
NIPS
, pp. 493-499
-
-
El Hihi, S.1
Bengio, Y.2
-
10
-
-
26444565569
-
Finding structure in time
-
Jeffrey L Elman. 1990. Finding structure in time. Cognitive Science, 14(2):179-211.
-
(1990)
Cognitive Science
, vol.14
, Issue.2
, pp. 179-211
-
-
Elman, J.L.1
-
18
-
-
84919829999
-
Distributed representations of sentences and documents
-
Quoc V. Le and Tomas Mikolov. 2014. Distributed representations of sentences and documents. In Proceedings of ICML.
-
(2014)
Proceedings of ICML
-
-
Le, Q.V.1
Mikolov, T.2
-
20
-
-
84859023447
-
Learning word vectors for sentiment analysis
-
Association for Computational Linguistics
-
Andrew L Maas, Raymond E Daly, Peter T Pham, Dan Huang, Andrew Y Ng, and Christopher Potts. 2011. Learning word vectors for sentiment analysis. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1, pages 142-150. Association for Computational Linguistics.
-
(2011)
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies
, vol.1
, pp. 142-150
-
-
Maas, A.L.1
Daly, R.E.2
Pham, P.T.3
Huang, D.4
Ng, A.Y.5
Potts, C.6
-
23
-
-
84898956512
-
Distributed representations of words and phrases and their compositionality
-
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean. 2013b. Distributed representations of words and phrases and their compositionality. In NIPS, pages 3111-3119.
-
(2013)
NIPS
, pp. 3111-3119
-
-
Mikolov, T.1
Sutskever, I.2
Chen, K.3
Corrado, G.S.4
Dean, J.5
-
24
-
-
80053288309
-
Composition in distributional models of semantics
-
Jeff Mitchell and Mirella Lapata. 2010. Composition in distributional models of semantics. Cognitive Science, 34(8):1388-1429.
-
(2010)
Cognitive Science
, vol.34
, Issue.8
, pp. 1388-1429
-
-
Mitchell, J.1
Lapata, M.2
-
25
-
-
0025519291
-
Recursive distributed representations
-
Jordan B Pollack. 1990. Recursive distributed representations. Artificial Intelligence, 46(1):77-105.
-
(1990)
Artificial Intelligence
, vol.46
, Issue.1
, pp. 77-105
-
-
Pollack, J.B.1
-
27
-
-
80053261327
-
Semi-supervised recursive autoencoders for predicting sentiment distributions
-
Richard Socher, Jeffrey Pennington, Eric H Huang, Andrew Y Ng, and Christopher D Manning. 2011b. Semi-supervised recursive autoencoders for predicting sentiment distributions. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, pages 151-161.
-
(2011)
Proceedings of the Conference on Empirical Methods in Natural Language Processing
, pp. 151-161
-
-
Socher, R.1
Pennington, J.2
Huang, E.H.3
Ng, A.Y.4
Manning, C.D.5
-
29
-
-
84926358845
-
Recursive deep models for semantic compositionality over a sentiment treebank
-
Richard Socher, Alex Perelygin, Jean Y Wu, Jason Chuang, Christopher D Manning, Andrew Y Ng, and Christopher Potts. 2013. Recursive deep models for semantic compositionality over a sentiment treebank. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP).
-
(2013)
Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP)
-
-
Socher, R.1
Perelygin, A.2
Wu, J.Y.3
Chuang, J.4
Manning, C.D.5
Ng, A.Y.6
Potts, C.7
-
33
-
-
0025503558
-
Backpropagation through time: What it does and how to do it
-
Paul J Werbos. 1990. Backpropagation through time: what it does and how to do it. Proceedings of the IEEE, 78(10):1550-1560.
-
(1990)
Proceedings of the IEEE
, vol.78
, Issue.10
, pp. 1550-1560
-
-
Werbos, P.J.1
-
34
-
-
84951321278
-
Spoken language understanding using long short-term memory neural networks
-
Kaisheng Yao, Baolin Peng, Yu Zhang, Dong Yu, Geoffrey Zweig, and Yangyang Shi. 2014. Spoken language understanding using long short-term memory neural networks. IEEE SLT.
-
(2014)
IEEE SLT
-
-
Yao, K.1
Peng, B.2
Zhang, Y.3
Yu, D.4
Zweig, G.5
Shi, Y.6