-
1
-
-
85083953689
-
Neural machine translation by jointly learning to align and translate
-
Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. Neural machine translation by jointly learning to align and translate. In Proceedings of the ICLR 2015, 2015.
-
(2015)
Proceedings of the ICLR 2015
-
-
Bahdanau, D.1
Cho, K.2
Bengio, Y.3
-
2
-
-
0020970738
-
Neuronlike adaptive elements that can solve difficult learning control problems
-
Andrew G Barto, Richard S Sutton, and Charles W Anderson. Neuronlike adaptive elements that can solve difficult learning control problems. Systems, Man and Cybernetics, IEEE Transactions on, (5):834-846, 1983.
-
(1983)
Systems, Man and Cybernetics, IEEE Transactions on
, Issue.5
, pp. 834-846
-
-
Barto, A.G.1
Sutton, R.S.2
Anderson, C.W.3
-
4
-
-
84959100558
-
Report on the 11th iwslt evaluation campaign
-
Mauro Cettolo, Jan Niehues, Sebastian Stüker, Luisa Bentivogli, and Marcello Federico. Report on the 11th iwslt evaluation campaign. In Proc. of IWSLT, 2014.
-
(2014)
Proc. Of IWSLT
-
-
Cettolo, M.1
Niehues, J.2
Stüker, S.3
Bentivogli, L.4
Federico, M.5
-
5
-
-
84994328213
-
-
arXiv preprint
-
William Chan, Navdeep Jaitly, Quoc V Le, and Oriol Vinyals. Listen, attend and spell. arXiv preprint arXiv:1508.01211, 2015.
-
(2015)
Listen, Attend and Spell
-
-
Chan, W.1
Jaitly, N.2
Le, Q.V.3
Vinyals, O.4
-
6
-
-
84943795466
-
-
arXiv preprint
-
Ciprian Chelba, Tomas Mikolov, Mike Schuster, Qi Ge, Thorsten Brants, Phillipp Koehn, and Tony Robinson. One billion word benchmark for measuring progress in statistical language modeling. arXiv preprint arXiv:1312.3005, 2013.
-
(2013)
One Billion Word Benchmark for Measuring Progress in Statistical Language Modeling
-
-
Chelba, C.1
Mikolov, T.2
Schuster, M.3
Ge, Q.4
Brants, T.5
Koehn, P.6
Robinson, T.7
-
7
-
-
84961291190
-
-
arXiv preprint
-
Kyunghyun Cho, Bart Van Merriënboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. Learning phrase representations using rnn encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078, 2014.
-
(2014)
Learning Phrase Representations Using Rnn Encoder-Decoder for Statistical Machine Translation
-
-
Cho, K.1
Van Merriënboer, B.2
Gulcehre, C.3
Bahdanau, D.4
Bougares, F.5
Schwenk, H.6
Bengio, Y.7
-
8
-
-
84986286501
-
Attention-based models for speech recognition
-
abs/1506.07503
-
Jan Chorowski, Dzmitry Bahdanau, Dmitriy Serdyuk, KyungHyun Cho, and Yoshua Bengio. Attention-based models for speech recognition. CoRR, abs/1506.07503, 2015. URL http://arxiv.org/abs/1506.07503.
-
(2015)
CoRR
-
-
Chorowski, J.1
Bahdanau, D.2
Serdyuk, D.3
Cho, K.4
Bengio, Y.5
-
10
-
-
67349244372
-
Search-based structured prediction
-
Hal Daumé Iii, John Langford, and Daniel Marcu. Search-based structured prediction. Machine learning, 75(3):297-325, 2009.
-
(2009)
Machine Learning
, vol.75
, Issue.3
, pp. 297-325
-
-
Iii, H.D.1
Langford, J.2
Marcu, D.3
-
11
-
-
84959236502
-
Long-term recurrent convolutional networks for visual recognition and description
-
Jeffrey Donahue, Lisa Anne Hendricks, Sergio Guadarrama, Marcus Rohrbach, Subhashini Venu-gopalan, Kate Saenko, and Trevor Darrell. Long-term recurrent convolutional networks for visual recognition and description. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2625-2634, 2015.
-
(2015)
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
, pp. 2625-2634
-
-
Donahue, J.1
Hendricks, L.A.2
Guadarrama, S.3
Rohrbach, M.4
Venu-Gopalan, S.5
Saenko, K.6
Darrell, T.7
-
12
-
-
0001492251
-
Minimum bayes-risk automatic speech recognition
-
Vaibhava Goel and William J Byrne. Minimum bayes-risk automatic speech recognition. Computer Speech & Language, 14(2):115-135, 2000.
-
(2000)
Computer Speech & Language
, vol.14
, Issue.2
, pp. 115-135
-
-
Goel, V.1
Byrne, W.J.2
-
19
-
-
84965135289
-
-
arXiv preprint
-
Timothy P Lillicrap, Jonathan J Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, and Daan Wierstra. Continuous control with deep reinforcement learning. arXiv preprint arXiv:1509.02971, 2015.
-
(2015)
Continuous Control with Deep Reinforcement Learning
-
-
Lillicrap, T.P.1
Hunt, J.J.2
Pritzel, A.3
Heess, N.4
Erez, T.5
Tassa, Y.6
Silver, D.7
Wierstra, D.8
-
21
-
-
72449136767
-
Structured prediction with reinforcement learning
-
Francis Maes, Ludovic Denoyer, and Patrick Gallinari. Structured prediction with reinforcement learning. Machine learning, 77(2-3):271-301, 2009.
-
(2009)
Machine Learning
, vol.77
, Issue.2-3
, pp. 271-301
-
-
Maes, F.1
Denoyer, L.2
Gallinari, P.3
-
23
-
-
84924051598
-
Human-level control through deep reinforcement learning
-
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A Rusu, Joel Veness, Marc G Bellemare, Alex Graves, Martin Riedmiller, Andreas K Fidjeland, Georg Ostrovski, et al. Human-level control through deep reinforcement learning. Nature, 518(7540):529-533, 2015.
-
(2015)
Nature
, vol.518
, Issue.7540
, pp. 529-533
-
-
Mnih, V.1
Kavukcuoglu, K.2
Silver, D.3
Rusu, A.A.4
Veness, J.5
Bellemare, M.G.6
Graves, A.7
Riedmiller, M.8
Fidjeland, A.K.9
Ostrovski, G.10
-
24
-
-
0141596576
-
Policy invariance under reward transformations: Theory and application to reward shaping
-
Andrew Y Ng, Daishi Harada, and Stuart Russell. Policy invariance under reward transformations: Theory and application to reward shaping. In ICML, volume 99, pp. 278-287, 1999.
-
(1999)
ICML
, vol.99
, pp. 278-287
-
-
Ng, A.Y.1
Harada, D.2
Russell, S.3
-
25
-
-
84944098666
-
Minimum error rate training in statistical machine translation
-
Association for Computational Linguistics
-
Franz Josef Och. Minimum error rate training in statistical machine translation. In Proceedings of the 41st Annual Meeting on Association for Computational Linguistics-Volume 1, pp. 160-167. Association for Computational Linguistics, 2003.
-
(2003)
Proceedings of the 41st Annual Meeting on Association for Computational Linguistics-
, vol.1
, pp. 160-167
-
-
Och, F.J.1
-
26
-
-
85133336275
-
BLEU: A method for automatic evaluation of machine translation
-
Association for Computational Linguistics
-
Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. Bleu: a method for automatic evaluation of machine translation. In Proceedings of the 40th annual meeting on association for computational linguistics, pp. 311-318. Association for Computational Linguistics, 2002.
-
(2002)
Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
, pp. 311-318
-
-
Papineni, K.1
Roukos, S.2
Ward, T.3
Zhu, W.-J.4
-
31
-
-
85019788709
-
-
arXiv preprint
-
Shiqi Shen, Yong Cheng, Zhongjun He, Wei He, Hua Wu, Maosong Sun, and Yang Liu. Minimum risk training for neural machine translation. arXiv preprint arXiv:1512.02433, 2015.
-
(2015)
Minimum Risk Training for Neural Machine Translation
-
-
Shen, S.1
Cheng, Y.2
He, Z.3
He, W.4
Wu, H.5
Sun, M.6
Liu, Y.7
-
32
-
-
84928547704
-
Sequence to sequence learning with neural networks
-
December 8-13 2014, Montreal, Quebec, Canada
-
Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. Sequence to sequence learning with neural networks. In Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, December 8-13 2014, Montreal, Quebec, Canada, pp. 3104-3112, 2014.
-
(2014)
Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014
, pp. 3104-3112
-
-
Sutskever, I.1
Vinyals, O.2
Le, Q.V.3
-
33
-
-
33847202724
-
Learning to predict by the methods of temporal differences
-
Richard S Sutton. Learning to predict by the methods of temporal differences. Machine learning, 3 (1):9-44, 1988.
-
(1988)
Machine Learning
, vol.3
, Issue.1
, pp. 9-44
-
-
Sutton, R.S.1
-
35
-
-
84898939480
-
Policy gradient methods for reinforcement learning with function approximation
-
Richard S Sutton, David A McAllester, Satinder P Singh, Yishay Mansour, et al. Policy gradient methods for reinforcement learning with function approximation. In NIPS, volume 99, pp. 1057-1063, 1999.
-
(1999)
NIPS
, vol.99
, pp. 1057-1063
-
-
Sutton, R.S.1
McAllester, D.A.2
Singh, S.P.3
Mansour, Y.4
-
37
-
-
0000985504
-
Td-gammon, a self-teaching backgammon program, achieves master-level play
-
Gerald Tesauro. Td-gammon, a self-teaching backgammon program, achieves master-level play. Neural computation, 6(2):215-219, 1994.
-
(1994)
Neural Computation
, vol.6
, Issue.2
, pp. 215-219
-
-
Tesauro, G.1
-
39
-
-
0031143730
-
An analysis of temporal-difference learning with function approximation
-
John N Tsitsiklis and Benjamin Van Roy. An analysis of temporal-difference learning with function approximation. Automatic Control, IEEE Transactions on, 42(5):674-690, 1997.
-
(1997)
Automatic Control, IEEE Transactions on
, vol.42
, Issue.5
, pp. 674-690
-
-
Tsitsiklis, J.N.1
Van Roy, B.2
-
40
-
-
84962564531
-
-
cs, stat, June
-
Bart van Merriënboer, Dzmitry Bahdanau, Vincent Dumoulin, Dmitriy Serdyuk, David Warde-Farley, Jan Chorowski, and Yoshua Bengio. Blocks and fuel: Frameworks for deep learning. arXiv:1506.00619 [cs, stat], June 2015.
-
(2015)
Blocks and Fuel: Frameworks for Deep Learning
-
-
Van Merriënboer, B.1
Bahdanau, D.2
Dumoulin, V.3
Serdyuk, D.4
Warde-Farley, D.5
Chorowski, J.6
Bengio, Y.7
-
41
-
-
84946747440
-
Show and tell: A neural image caption generator
-
Oriol Vinyals, Alexander Toshev, Samy Bengio, and Dumitru Erhan. Show and tell: A neural image caption generator. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3156-3164, 2015.
-
(2015)
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
, pp. 3156-3164
-
-
Vinyals, O.1
Toshev, A.2
Bengio, S.3
Erhan, D.4
-
42
-
-
85017437235
-
An investigation of imitation learning algorithms for structured prediction
-
Citeseer
-
Andreas Vlachos. An investigation of imitation learning algorithms for structured prediction. In EWRL, pp. 143-154. Citeseer, 2012.
-
(2012)
EWRL
, pp. 143-154
-
-
Vlachos, A.1
-
43
-
-
0000337576
-
Simple statistical gradient-following algorithms for connectionist reinforcement learning
-
Ronald J Williams. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine learning, 8(3-4):229-256, 1992.
-
(1992)
Machine Learning
, vol.8
, Issue.3-4
, pp. 229-256
-
-
Williams, R.J.1
-
45
-
-
85018271332
-
-
arXiv preprint
-
Yonghui Wu, Mike Schuster, Zhifeng Chen, Quoc V Le, Mohammad Norouzi, Wolfgang Macherey, Maxim Krikun, Yuan Cao, Qin Gao, Klaus Macherey, et al. Google's neural machine translation system: Bridging the gap between human and machine translation. arXiv preprint arXiv:1609.08144, 2016.
-
(2016)
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
-
-
Wu, Y.1
Schuster, M.2
Chen, Z.3
Le, Q.V.4
Norouzi, M.5
Macherey, W.6
Krikun, M.7
Cao, Y.8
Gao, Q.9
Macherey, K.10
-
46
-
-
84970002232
-
Show, attend and tell: Neural image caption generation with visual attention
-
Lille, France, 6-11 July 2015
-
Kelvin Xu, Jimmy Ba, Ryan Kiros, Kyunghyun Cho, Aaron C. Courville, Ruslan Salakhutdinov, Richard S. Zemel, and Yoshua Bengio. Show, attend and tell: Neural image caption generation with visual attention. In Proceedings of the 32nd International Conference on Machine Learning, ICML 2015, Lille, France, 6-11 July 2015, pp. 2048-2057, 2015.
-
(2015)
Proceedings of the 32nd International Conference on Machine Learning, ICML 2015
, pp. 2048-2057
-
-
Xu, K.1
Ba, J.2
Kiros, R.3
Cho, K.4
Courville, A.C.5
Salakhutdinov, R.6
Zemel, R.S.7
Bengio, Y.8
|