-
1
-
-
84922389693
-
-
Technical report, arXiv preprint arXiv: 1409.0473
-
Bahdanau, Dzmitry, Cho, Kyunghyun, and Bengio, Yoshua. Neural machine translation by jointly learning to align and translate. Technical report, arXiv preprint arXiv: 1409.0473, 2014.
-
(2014)
Neural Machine Translation by Jointly Learning to Align and Translate
-
-
Bahdanau, D.1
Cho, K.2
Bengio, Y.3
-
2
-
-
84897544737
-
Theano: New features and speed improvements
-
Bastien, Frédéric, Lamblin, Pascal, Pascanu, Razvan, Bergstra, James, Goodfellow, Ian J., Bergeron, Arnaud, Bouchard, Nicolas, and Bengio, Yoshua. Theano: new features and speed improvements. Deep Learning and Unsupervised Feature Learning NIPS 2012 Workshop, 2012.
-
(2012)
Deep Learning and Unsupervised Feature Learning NIPS 2012 Workshop
-
-
Bastien, F.1
Lamblin, P.2
Pascanu, R.3
Bergstra, J.4
Goodfellow, I.J.5
Bergeron, A.6
Bouchard, N.7
Bengio, Y.8
-
3
-
-
0028392483
-
Learning long-term dependencies with gradient descent is difficult
-
Bengio, Yoshua, Simard, Patrice, and Frasconi, Paolo. Learning long-term dependencies with gradient descent is difficult. IEEE Transactions on Neural Networks, 5 (2): 157-166, 1994.
-
(1994)
IEEE Transactions on Neural Networks
, vol.5
, Issue.2
, pp. 157-166
-
-
Bengio, Y.1
Simard, P.2
Frasconi, P.3
-
4
-
-
84899005563
-
A neural probabilistic language model
-
Bengio, Yoshua, Ducharme, Rejean, and Vincent, Pascal. A neural probabilistic language model. In Adv. Neural Inf. Proc. Sys. 13, pp. 932-938, 2001.
-
(2001)
Adv. Neural Inf. Proc. Sys.
, vol.13
, pp. 932-938
-
-
Bengio, Y.1
Ducharme, R.2
Vincent, P.3
-
5
-
-
84919728106
-
-
arXiv preprint arXiv:1406.1078
-
Cho, Kyunghyun, Van Merriënboer, Bart, Gulcehre, Caglar, Bougares, Fethi, Schwenk, Holger, and Bengio, Yoshua. Learning phrase representations using rnn encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078, 2014.
-
(2014)
Learning Phrase Representations Using Rnn Encoder-decoder for Statistical Machine Translation
-
-
Cho, K.1
Van Merriënboer, B.2
Gulcehre, C.3
Bougares, F.4
Schwenk, H.5
Bengio, Y.6
-
6
-
-
85057230110
-
Hierarchical recurrent neural networks for long-term dependencies
-
Citeseer
-
El Hihi, Salah and Bengio, Yoshua. Hierarchical recurrent neural networks for long-term dependencies. In Advances in Neural Information Processing Systems, pp. 493-499. Citeseer, 1995.
-
(1995)
Advances in Neural Information Processing Systems
, pp. 493-499
-
-
El Hihi, S.1
Bengio, Y.2
-
7
-
-
0034293152
-
Learning to forget: Continual prediction with LSTM
-
Gers, Felix A., Schmidhuber, Jürgen, and Cummins, Fred A. Learning to forget: Continual prediction with LSTM. Neural Computation, 12(10):2451-2471, 2000.
-
(2000)
Neural Computation
, vol.12
, Issue.10
, pp. 2451-2471
-
-
Gers, F.A.1
Schmidhuber, J.2
Cummins, F.A.3
-
8
-
-
84893401626
-
-
arXiv preprint arXiv: 1308.4214
-
Goodfellow, Ian J., Warde-Farley, David, Lamblin, Pascal, Dumoulin, Vincent, Mirza, Mehdi, Pascanu, Razvan, Bergstra, James, Bastien, Frédéric, and Bengio, Yoshua. Pylearn2: a machine learning research library. arXiv preprint arXiv: 1308.4214, 2013.
-
(2013)
Pylearn2: A Machine Learning Research Library
-
-
Goodfellow, I.J.1
Warde-Farley, D.2
Lamblin, P.3
Dumoulin, V.4
Mirza, M.5
Pascanu, R.6
Bergstra, J.7
Bastien, F.8
Bengio, Y.9
-
12
-
-
0003575034
-
-
Diploma thesis, Institut fur Informatik, Lehrstuhl Prof. Brauer, Technische Universität München, URL
-
Hochreiter, Sepp. Untersuchungen zu dynamischen neuronalen Netzen. Diploma thesis, Institut fur Informatik, Lehrstuhl Prof. Brauer, Technische Universität München, 1991. URL http://www7. informatik.tu-muenchen.de/-Ehochreit.
-
(1991)
Untersuchungen zu Dynamischen Neuronalen Netzen
-
-
Hochreiter, S.1
-
17
-
-
84919782249
-
A clockwork rnn
-
Koutník, Jan, Greff, Klaus, Gomez, Faustino, and Schmidhuber, Jürgen. A clockwork rnn. In Proceedings of the 31st International Conference on Machine Learning (ICML'14), 2014.
-
(2014)
Proceedings of the 31st International Conference on Machine Learning (ICML'14)
-
-
Koutník, J.1
Greff, K.2
Gomez, F.3
Schmidhuber, J.4
-
19
-
-
84897527816
-
-
Preprint
-
Mikolov, Tomas, Sutskever, Ilya, Deoras, Anoop, Le, Hai-Son, Kombrink, Stefan, and Cernocky, J. Subword language modeling with neural networks. Preprint, 2012.
-
(2012)
Subword Language Modeling with Neural Networks
-
-
Mikolov, T.1
Sutskever, I.2
Deoras, A.3
Le, H.-S.4
Kombrink, S.5
Cernocky, J.6
-
20
-
-
0001033889
-
Learning complex, extended sequences using the principle of history compression
-
Schmidhuber, Jürgen. Learning complex, extended sequences using the principle of history compression. Neural Computation, 4(2):234-242, 1992.
-
(1992)
Neural Computation
, vol.4
, Issue.2
, pp. 234-242
-
-
Schmidhuber, J.1
-
21
-
-
84937961845
-
Deep networks with internal selective attention through feedback connections
-
Stollenga, Marijn F, Masci, Jonathan, Gomez, Faustino, and Schmidhuber, Jürgen. Deep networks with internal selective attention through feedback connections. In Advances in Neural Information Processing Systems, pp. 3545-3553, 2014.
-
(2014)
Advances in Neural Information Processing Systems
, pp. 3545-3553
-
-
Stollenga, M.F.1
Masci, J.2
Gomez, F.3
Schmidhuber, J.4
-
22
-
-
80053459857
-
Generating text with recurrent neural networks
-
Sutskever, Ilya, Martens, James, and Hinton, Geoffrey E. Generating text with recurrent neural networks. In Proceedings of the 28th International Conference on Machine Learning (ICML'11), pp. 1017-1024, 2011.
-
(2011)
Proceedings of the 28th International Conference on Machine Learning (ICML'11)
, pp. 1017-1024
-
-
Sutskever, I.1
Martens, J.2
Hinton, G.E.3
-
23
-
-
84928547704
-
Sequence to sequence learning with neural networks
-
Sutskever, Ilya, Vinyals, Oriol, and Le, Quoc VV. Sequence to sequence learning with neural networks. In Advances in Neural Information Processing Systems, pp. 3104-3112, 2014.
-
(2014)
Advances in Neural Information Processing Systems
, pp. 3104-3112
-
-
Sutskever, I.1
Vinyals, O.2
Le, Q.V.V.3
|