SCOPUS Information Search Platform

5th International Conference on Learning Representations, ICLR 2017 - Conference Track Proceedings

Volume , Issue , 2017, Pages

An actor-critic algorithm for sequence prediction

(8) Bahdanau, Dzmitry a Brakel, Philemon a Xu, Kelvin a Goyal, Anirudh a Courville, Aaron a,c Lowe, Ryan b Pineau, Joelle b,c Bengio, Yoshua a,c

a UNIVERSITÉ DE MONTRÉAL (Canada)

b MCGILL UNIVERSITY (Canada)

c CIFAR ^*

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATIONAL LINGUISTICS; COMPUTER AIDED LANGUAGE TRANSLATION; MACHINE LEARNING; MODELING LANGUAGES; NATURAL LANGUAGE PROCESSING SYSTEMS;

ACTOR-CRITIC ALGORITHM; ACTOR-CRITIC METHODS; DIALOGUE MODELLING; MACHINE TRANSLATIONS; NATURAL LANGUAGE GENERATION; SEQUENCE PREDICTION; TRAINING AND TESTING; TRAINING PROCEDURES;

REINFORCEMENT LEARNING;

EID: 85088229482
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: None
Document Type: Conference Paper

Times cited : (346)

References (47)

1
- 85083953689
- Neural machine translation by jointly learning to align and translate
- Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. Neural machine translation by jointly learning to align and translate. In Proceedings of the ICLR 2015, 2015.
- (2015) Proceedings of the ICLR 2015
- Bahdanau, D.¹ Cho, K.² Bengio, Y.³

2
- 0020970738
- Neuronlike adaptive elements that can solve difficult learning control problems
- Andrew G Barto, Richard S Sutton, and Charles W Anderson. Neuronlike adaptive elements that can solve difficult learning control problems. Systems, Man and Cybernetics, IEEE Transactions on, (5):834-846, 1983.
- (1983) Systems, Man and Cybernetics, IEEE Transactions on , Issue.5 , pp. 834-846
- Barto, A.G.¹ Sutton, R.S.² Anderson, C.W.³

3
- 85011805705
- arXiv preprint
- Samy Bengio, Oriol Vinyals, Navdeep Jaitly, and Noam Shazeer. Scheduled sampling for sequence prediction with recurrent neural networks. arXiv preprint arXiv:1506.03099, 2015.
- (2015) Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks
- Bengio, S.¹ Vinyals, O.² Jaitly, N.³ Shazeer, N.⁴

4
- 84959100558
- Report on the 11th IWSLT evaluation campaign
- Mauro Cettolo, Jan Niehues, Sebastian Stüker, Luisa Bentivogli, and Marcello Federico. Report on the 11th IWSLT evaluation campaign. In Proc. of IWSLT, 2014.
- (2014) Proc. of IWSLT
- Cettolo, M.¹ Niehues, J.² Stüker, S.³ Bentivogli, L.⁴ Federico, M.⁵

5
- 84994328213
- arXiv preprint
- William Chan, Navdeep Jaitly, Quoc V Le, and Oriol Vinyals. Listen, attend and spell. arXiv preprint arXiv:1508.01211, 2015.
- (2015) Listen, Attend and Spell
- Chan, W.¹ Jaitly, N.² Le, Q.V.³ Vinyals, O.⁴

6
- 84943795466
- arXiv preprint
- Ciprian Chelba, Tomas Mikolov, Mike Schuster, Qi Ge, Thorsten Brants, Philipp Koehn, and Tony Robinson. One billion word benchmark for measuring progress in statistical language modeling. arXiv preprint arXiv:1312.3005, 2013.
- (2013) One Billion Word Benchmark for Measuring Progress in Statistical Language Modeling
- Chelba, C.¹ Mikolov, T.² Schuster, M.³ Ge, Q.⁴ Brants, T.⁵ Koehn, P.⁶ Robinson, T.⁷

7
- 84961291190
- arXiv preprint
- Kyunghyun Cho, Bart Van Merriënboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078, 2014.
- (2014) Learning Phrase Representations Using RNN Encoder-Decoder for Statistical Machine Translation
- Cho, K.¹ Van Merriënboer, B.² Gulcehre, C.³ Bahdanau, D.⁴ Bougares, F.⁵ Schwenk, H.⁶ Bengio, Y.⁷

8
- 84986286501
- Attention-based models for speech recognition
- abs/1506.07503
- Jan Chorowski, Dzmitry Bahdanau, Dmitriy Serdyuk, KyungHyun Cho, and Yoshua Bengio. Attention-based models for speech recognition. CoRR, abs/1506.07503, 2015. URL http://arxiv.org/abs/1506.07503.
- (2015) CoRR
- Chorowski, J.¹ Bahdanau, D.² Serdyuk, D.³ Cho, K.⁴ Bengio, Y.⁵

9
- 31844433245
- Learning as search optimization: Approximate large margin methods for structured prediction
- Hal Daumé III and Daniel Marcu. Learning as search optimization: Approximate large margin methods for structured prediction. In Proceedings of the 22nd international conference on Machine learning, pp. 169-176. ACM, 2005.
- (2005) Proceedings of the 22nd International Conference on Machine Learning , pp. 169-176
- Daumé, H.¹ Marcu, D.²

10
- 67349244372
- Search-based structured prediction
- Hal Daumé III, John Langford, and Daniel Marcu. Search-based structured prediction. Machine learning, 75(3):297-325, 2009.
- (2009) Machine Learning , vol.75 , Issue.3 , pp. 297-325
- Daumé, H.¹ Langford, J.² Marcu, D.³

11
- 84959236502
- Long-term recurrent convolutional networks for visual recognition and description
- Jeffrey Donahue, Lisa Anne Hendricks, Sergio Guadarrama, Marcus Rohrbach, Subhashini Venugopalan, Kate Saenko, and Trevor Darrell. Long-term recurrent convolutional networks for visual recognition and description. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2625-2634, 2015.
- (2015) Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pp. 2625-2634
- Donahue, J.¹ Hendricks, L.A.² Guadarrama, S.³ Rohrbach, M.⁴ Venugopalan, S.⁵ Saenko, K.⁶ Darrell, T.⁷

12
- 0001492251
- Minimum Bayes-risk automatic speech recognition
- Vaibhava Goel and William J Byrne. Minimum Bayes-risk automatic speech recognition. Computer Speech & Language, 14(2):115-135, 2000.
- (2000) Computer Speech & Language , vol.14 , Issue.2 , pp. 115-135
- Goel, V.¹ Byrne, W.J.²

13
- 84957716354
- arXiv preprint
- Awni Y Hannun, Andrew L Maas, Daniel Jurafsky, and Andrew Y Ng. First-pass large vocabulary continuous speech recognition using bi-directional recurrent DNNs. arXiv preprint arXiv:1408.2873, 2014.
- (2014) First-Pass Large Vocabulary Continuous Speech Recognition Using Bi-Directional Recurrent DNNs
- Hannun, A.Y.¹ Maas, A.L.² Jurafsky, D.³ Ng, A.Y.⁴

14
- 85162488701
- Direct loss minimization for structured prediction
- Tamir Hazan, Joseph Keshet, and David A McAllester. Direct loss minimization for structured prediction. In Advances in Neural Information Processing Systems, pp. 1594-1602, 2010.
- (2010) Advances in Neural Information Processing Systems , pp. 1594-1602
- Hazan, T.¹ Keshet, J.² McAllester, D.A.³

15
- 0031573117
- Long short-term memory
- Sepp Hochreiter and Jürgen Schmidhuber. Long short-term memory. Neural computation, 9(8):1735-1780, 1997.
- (1997) Neural Computation , vol.9 , Issue.8 , pp. 1735-1780
- Hochreiter, S.¹ Schmidhuber, J.²

16
- 84946734827
- Deep visual-semantic alignments for generating image descriptions
- Andrej Karpathy and Li Fei-Fei. Deep visual-semantic alignments for generating image descriptions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3128-3137, 2015.
- (2015) Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pp. 3128-3137
- Karpathy, A.¹ Fei-Fei, L.²

17
- 85083951076
- Adam: A method for stochastic optimization
- Diederik P Kingma and Jimmy Ba. Adam: A method for stochastic optimization. In International Conference on Learning Representations, 2015.
- (2015) International Conference on Learning Representations
- Kingma, D.P.¹ Ba, J.²

18
- 84944113729
- arXiv preprint
- Ryan Kiros, Ruslan Salakhutdinov, and Richard S Zemel. Unifying visual-semantic embeddings with multimodal neural language models. arXiv preprint arXiv:1411.2539, 2014.
- (2014) Unifying Visual-Semantic Embeddings with Multimodal Neural Language Models
- Kiros, R.¹ Salakhutdinov, R.² Zemel, R.S.³

19
- 84965135289
- arXiv preprint
- Timothy P Lillicrap, Jonathan J Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, and Daan Wierstra. Continuous control with deep reinforcement learning. arXiv preprint arXiv:1509.02971, 2015.
- (2015) Continuous Control with Deep Reinforcement Learning
- Lillicrap, T.P.¹ Hunt, J.J.² Pritzel, A.³ Heess, N.⁴ Erez, T.⁵ Tassa, Y.⁶ Silver, D.⁷ Wierstra, D.⁸

20
- 85016508365
- Automatic evaluation of summaries using n-gram co-occurrence statistics
- Association for Computational Linguistics
- Chin-Yew Lin and Eduard Hovy. Automatic evaluation of summaries using n-gram co-occurrence statistics. In Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology-Volume 1, pp. 71-78. Association for Computational Linguistics, 2003.
- (2003) Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology , vol.1 , pp. 71-78
- Lin, C.-Y.¹ Hovy, E.²

21
- 72449136767
- Structured prediction with reinforcement learning
- Francis Maes, Ludovic Denoyer, and Patrick Gallinari. Structured prediction with reinforcement learning. Machine learning, 77(2-3):271-301, 2009.
- (2009) Machine Learning , vol.77 , Issue.2-3 , pp. 271-301
- Maes, F.¹ Denoyer, L.² Gallinari, P.³

22
- 0004059199
- MIT press
- W Thomas Miller, Paul J Werbos, and Richard S Sutton. Neural networks for control. MIT press, 1995.
- (1995) Neural Networks for Control
- Miller, W.T.¹ Werbos, P.J.² Sutton, R.S.³

23
- 84924051598
- Human-level control through deep reinforcement learning
- Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A Rusu, Joel Veness, Marc G Bellemare, Alex Graves, Martin Riedmiller, Andreas K Fidjeland, Georg Ostrovski, et al. Human-level control through deep reinforcement learning. Nature, 518(7540):529-533, 2015.
- (2015) Nature , vol.518 , Issue.7540 , pp. 529-533
- Mnih, V.¹ Kavukcuoglu, K.² Silver, D.³ Rusu, A.A.⁴ Veness, J.⁵ Bellemare, M.G.⁶ Graves, A.⁷ Riedmiller, M.⁸ Fidjeland, A.K.⁹ Ostrovski, G.¹⁰

24
- 0141596576
- Policy invariance under reward transformations: Theory and application to reward shaping
- Andrew Y Ng, Daishi Harada, and Stuart Russell. Policy invariance under reward transformations: Theory and application to reward shaping. In ICML, volume 99, pp. 278-287, 1999.
- (1999) ICML , vol.99 , pp. 278-287
- Ng, A.Y.¹ Harada, D.² Russell, S.³

25
- 84944098666
- Minimum error rate training in statistical machine translation
- Association for Computational Linguistics
- Franz Josef Och. Minimum error rate training in statistical machine translation. In Proceedings of the 41st Annual Meeting on Association for Computational Linguistics-Volume 1, pp. 160-167. Association for Computational Linguistics, 2003.
- (2003) Proceedings of the 41st Annual Meeting on Association for Computational Linguistics , vol.1 , pp. 160-167
- Och, F.J.¹

26
- 85133336275
- BLEU: A method for automatic evaluation of machine translation
- Association for Computational Linguistics
- Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. BLEU: a method for automatic evaluation of machine translation. In Proceedings of the 40th annual meeting on association for computational linguistics, pp. 311-318. Association for Computational Linguistics, 2002.
- (2002) Proceedings of the 40th Annual Meeting on Association for Computational Linguistics , pp. 311-318
- Papineni, K.¹ Roukos, S.² Ward, T.³ Zhu, W.-J.⁴

27
- 84994149921
- arXiv preprint
- Marc'Aurelio Ranzato, Sumit Chopra, Michael Auli, and Wojciech Zaremba. Sequence level training with recurrent neural networks. arXiv preprint arXiv:1511.06732, 2015.
- (2015) Sequence Level Training with Recurrent Neural Networks
- Ranzato, M.¹ Chopra, S.² Auli, M.³ Zaremba, W.⁴

28
- 84899437369
- arXiv preprint
- Stéphane Ross, Geoffrey J Gordon, and J Andrew Bagnell. A reduction of imitation learning and structured prediction to no-regret online learning. arXiv preprint arXiv:1011.0686, 2010.
- (2010) A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning
- Ross, S.¹ Gordon, G.J.² Bagnell, J.A.³

29
- 85011836388
- arXiv preprint
- Alexander M Rush, Sumit Chopra, and Jason Weston. A neural attention model for abstractive sentence summarization. arXiv preprint arXiv:1509.00685, 2015.
- (2015) A Neural Attention Model for Abstractive Sentence Summarization
- Rush, A.M.¹ Chopra, S.² Weston, J.³

30
- 0031268931
- Bidirectional recurrent neural networks
- Mike Schuster and Kuldip K Paliwal. Bidirectional recurrent neural networks. Signal Processing, IEEE Transactions on, 45(11):2673-2681, 1997.
- (1997) Signal Processing, IEEE Transactions on , vol.45 , Issue.11 , pp. 2673-2681
- Schuster, M.¹ Paliwal, K.K.²

31
- 85019788709
- arXiv preprint
- Shiqi Shen, Yong Cheng, Zhongjun He, Wei He, Hua Wu, Maosong Sun, and Yang Liu. Minimum risk training for neural machine translation. arXiv preprint arXiv:1512.02433, 2015.
- (2015) Minimum Risk Training for Neural Machine Translation
- Shen, S.¹ Cheng, Y.² He, Z.³ He, W.⁴ Wu, H.⁵ Sun, M.⁶ Liu, Y.⁷

32
- 84928547704
- Sequence to sequence learning with neural networks
- December 8-13 2014, Montreal, Quebec, Canada
- Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. Sequence to sequence learning with neural networks. In Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, December 8-13 2014, Montreal, Quebec, Canada, pp. 3104-3112, 2014.
- (2014) Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014 , pp. 3104-3112
- Sutskever, I.¹ Vinyals, O.² Le, Q.V.³

33
- 33847202724
- Learning to predict by the methods of temporal differences
- Richard S Sutton. Learning to predict by the methods of temporal differences. Machine learning, 3(1):9-44, 1988.
- (1988) Machine Learning , vol.3 , Issue.1 , pp. 9-44
- Sutton, R.S.¹

34
- 0004102479
- MIT Press Cambridge
- Richard S Sutton and Andrew G Barto. Introduction to reinforcement learning, volume 135. MIT Press Cambridge, 1998.
- (1998) Introduction to Reinforcement Learning , vol.135
- Sutton, R.S.¹ Barto, A.G.²

35
- 84898939480
- Policy gradient methods for reinforcement learning with function approximation
- Richard S Sutton, David A McAllester, Satinder P Singh, Yishay Mansour, et al. Policy gradient methods for reinforcement learning with function approximation. In NIPS, volume 99, pp. 1057-1063, 1999.
- (1999) NIPS , vol.99 , pp. 1057-1063
- Sutton, R.S.¹ McAllester, D.A.² Singh, S.P.³ Mansour, Y.⁴

36
- 0003617454
- Richard Stuart Sutton. Temporal credit assignment in reinforcement learning. 1984.
- (1984) Temporal Credit Assignment in Reinforcement Learning
- Sutton, R.S.¹

37
- 0000985504
- TD-Gammon, a self-teaching backgammon program, achieves master-level play
- Gerald Tesauro. TD-Gammon, a self-teaching backgammon program, achieves master-level play. Neural computation, 6(2):215-219, 1994.
- (1994) Neural Computation , vol.6 , Issue.2 , pp. 215-219
- Tesauro, G.¹

38
- 84979557463
- arXiv e-prints, abs/1605.02688, May
- Theano Development Team. Theano: A Python framework for fast computation of mathematical expressions. arXiv e-prints, abs/1605.02688, May 2016. URL http://arxiv.org/abs/1605.02688.
- (2016) Theano: A Python Framework for Fast Computation of Mathematical Expressions

39
- 0031143730
- An analysis of temporal-difference learning with function approximation
- John N Tsitsiklis and Benjamin Van Roy. An analysis of temporal-difference learning with function approximation. Automatic Control, IEEE Transactions on, 42(5):674-690, 1997.
- (1997) Automatic Control, IEEE Transactions on , vol.42 , Issue.5 , pp. 674-690
- Tsitsiklis, J.N.¹ Van Roy, B.²

40
- 84962564531
- cs, stat, June
- Bart van Merriënboer, Dzmitry Bahdanau, Vincent Dumoulin, Dmitriy Serdyuk, David Warde-Farley, Jan Chorowski, and Yoshua Bengio. Blocks and fuel: Frameworks for deep learning. arXiv:1506.00619 [cs, stat], June 2015.
- (2015) Blocks and Fuel: Frameworks for Deep Learning
- Van Merriënboer, B.¹ Bahdanau, D.² Dumoulin, V.³ Serdyuk, D.⁴ Warde-Farley, D.⁵ Chorowski, J.⁶ Bengio, Y.⁷

41
- 84946747440
- Show and tell: A neural image caption generator
- Oriol Vinyals, Alexander Toshev, Samy Bengio, and Dumitru Erhan. Show and tell: A neural image caption generator. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3156-3164, 2015.
- (2015) Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pp. 3156-3164
- Vinyals, O.¹ Toshev, A.² Bengio, S.³ Erhan, D.⁴

42
- 85017437235
- An investigation of imitation learning algorithms for structured prediction
- Citeseer
- Andreas Vlachos. An investigation of imitation learning algorithms for structured prediction. In EWRL, pp. 143-154. Citeseer, 2012.
- (2012) EWRL , pp. 143-154
- Vlachos, A.¹

43
- 0000337576
- Simple statistical gradient-following algorithms for connectionist reinforcement learning
- Ronald J Williams. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine learning, 8(3-4):229-256, 1992.
- (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 229-256
- Williams, R.J.¹

44
- 85018916044
- arXiv preprint
- Sam Wiseman and Alexander M Rush. Sequence-to-sequence learning as beam-search optimization. arXiv preprint arXiv:1606.02960, 2016.
- (2016) Sequence-to-Sequence Learning as Beam-Search Optimization
- Wiseman, S.¹ Rush, A.M.²

45
- 85018271332
- arXiv preprint
- Yonghui Wu, Mike Schuster, Zhifeng Chen, Quoc V Le, Mohammad Norouzi, Wolfgang Macherey, Maxim Krikun, Yuan Cao, Qin Gao, Klaus Macherey, et al. Google's neural machine translation system: Bridging the gap between human and machine translation. arXiv preprint arXiv:1609.08144, 2016.
- (2016) Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
- Wu, Y.¹ Schuster, M.² Chen, Z.³ Le, Q.V.⁴ Norouzi, M.⁵ Macherey, W.⁶ Krikun, M.⁷ Cao, Y.⁸ Gao, Q.⁹ Macherey, K.¹⁰

46
- 84970002232
- Show, attend and tell: Neural image caption generation with visual attention
- Lille, France, 6-11 July 2015
- Kelvin Xu, Jimmy Ba, Ryan Kiros, Kyunghyun Cho, Aaron C. Courville, Ruslan Salakhutdinov, Richard S. Zemel, and Yoshua Bengio. Show, attend and tell: Neural image caption generation with visual attention. In Proceedings of the 32nd International Conference on Machine Learning, ICML 2015, Lille, France, 6-11 July 2015, pp. 2048-2057, 2015.
- (2015) Proceedings of the 32nd International Conference on Machine Learning, ICML 2015 , pp. 2048-2057
- Xu, K.¹ Ba, J.² Kiros, R.³ Cho, K.⁴ Courville, A.C.⁵ Salakhutdinov, R.⁶ Zemel, R.S.⁷ Bengio, Y.⁸

47
- 85030473604
- arXiv preprint
- Wojciech Zaremba, Tomas Mikolov, Armand Joulin, and Rob Fergus. Learning simple algorithms from examples. arXiv preprint arXiv:1511.07275, 2015.
- (2015) Learning Simple Algorithms from Examples
- Zaremba, W.¹ Mikolov, T.² Joulin, A.³ Fergus, R.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.