SCOPUS 정보 검색 플랫폼

54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers

Volumn 3, Issue , 2016, Pages 1621-1630

Deep reinforcement learning with a natural language action space

(7) He, Ji a Chen, Jianshu b He, Xiaodong b Gao, Jianfeng b Li, Lihong b Dengt, Li b Ostendor, Mari a

a University of Washington (United States)

b MICROSOFT RESEARCH (United States)

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATIONAL LINGUISTICS; DEEP NEURAL NETWORKS; INTERFACES (COMPUTER); MACHINE LEARNING; NETWORK ARCHITECTURE; VECTOR SPACES;

ACTION DESCRIPTIONS; ACTION SPACES; INTERACTION FUNCTIONS; NATURAL LANGUAGES; NOVEL ARCHITECTURE; Q-FUNCTIONS; Q-LEARNING; RELEVANCE NETWORKS;

REINFORCEMENT LEARNING;

EID: 85011842203 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.18653/v1/p16-1153 Document Type: Conference Paper

Times cited : (192)

References (25)

1
- 61049182272
- Pearson Education
- E. Adams. 2014. Fundamentals of game design. Pearson Education.
- (2014) Fundamentals of Game Design
- Adams, E.¹

2
- 80051494649
- Reinforcement learning for mapping instructions to actions
- August
- S. R. K. Branavan, H. Chen, L. Zettlemoyer, and R. Barzilay. 2009. Reinforcement learning for mapping instructions to actions. In Proc. of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th IJCNLP, pages 82-90, August.
- (2009) Proc. of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th IJCNLP , pp. 82-90
- Branavan, S.R.K.¹ Chen, H.² Zettlemoyer, L.³ Barzilay, R.⁴

3
- 84859015643
- Learning to win by reading manuals in amonte-carlo framework
- Association for Computational Linguistics
- S. R. K. Branavan, D. Silver, and R. Barzilay. 2011. Learning to win by reading manuals in amonte-carlo framework. In Proc. of the Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1, pages 268-277. Association for Computational Linguistics.
- (2011) Proc. of the Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-volume 1 , pp. 268-277
- Branavan, S.R.K.¹ Silver, D.² Barzilay, R.³

4
- 56449095373
- A unified architecture for natural language processing: Deep neural networks with multitask learning
- ACM
- R. Collobert and J. Weston. 2008. A unified architecture for natural language processing: Deep neural networks with multitask learning. In Proc. of the 25th International Conference on Machine learning, pages 160-167. ACM.
- (2008) Proc. of the 25th International Conference on Machine Learning , pp. 160-167
- Collobert, R.¹ Weston, J.²

5
- 84055222005
- Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
- G. E. Dahl, D. Yu, L. Deng, and A. Acero. 2012. Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition. Audio, Speech, and Language Processing, IEEE Transactions on, 20(1):30-42.
- (2012) Audio, Speech, and Language Processing, IEEE Transactions on , vol.20 , Issue.1 , pp. 30-42
- Dahl, G.E.¹ Yu, D.² Deng, L.³ Acero, A.⁴

6
- 85032751458
- Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
- G. Hinton, L. Deng, D. Yu, G. E. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. N. Sainath, and B. Kingsbury. 2012. Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups. IEEE Signal Process. Mag., 29(6):82-97.
- (2012) IEEE Signal Process. Mag. , vol.29 , Issue.6 , pp. 82-97
- Hinton, G.¹ Deng, L.² Yu, D.³ Dahl, G.E.⁴ Mohamed, A.⁵ Jaitly, N.⁶ Senior, A.⁷ Vanhoucke, V.⁸ Nguyen, P.⁹ Sainath, T.N.¹⁰ Kingsbury, B.¹¹

7
- 84889566627
- Learning deep structured semantic models for web search using clickthrough data
- ACM
- P-S. Huang, X. He, J. Gao, L. Deng, A. Acero, and L. Heck. 2013. Learning deep structured semantic models for web search using clickthrough data. In Proc. of the ACM International Conference on Information & Knowledge Management, pages 2333-2338. ACM.
- (2013) Proc. of the ACM International Conference on Information & Knowledge Management , pp. 2333-2338
- Huang, P.-S.¹ He, X.² Gao, J.³ Deng, L.⁴ Acero, A.⁵ Heck, L.⁶

8
- 84965153327
- Skipthought vectors
- R. Kiros, Y. Zhu, R. R. Salakhutdinov, R. Zemel, R. Urtasun, A. Torralba, and S. Fidler. 2015. Skipthought vectors. In Advances in Neural Information Processing Systems, pages 3276-3284.
- (2015) Advances in Neural Information Processing Systems , pp. 3276-3284
- Kiros, R.¹ Zhu, Y.² Salakhutdinov, R.R.³ Zemel, R.⁴ Urtasun, R.⁵ Torralba, A.⁶ Fidler, S.⁷

9
- 84876231242
- Imagenet classification with deep convolutional neural networks
- A. Krizhevsky, I. Sutskever, and G. E. Hinton. 2012. Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems, pages 1097-1105.
- (2012) Advances in Neural Information Processing Systems , pp. 1097-1105
- Krizhevsky, A.¹ Sutskever, I.² Hinton, G.E.³

10
- 84919829999
- Distributed representations of sentences and documents
- Q. V. Le and T. Mikolov. 2014. Distributed representations of sentences and documents. In International Conference on Machine Learning.
- (2014) International Conference on Machine Learning
- Le, Q.V.¹ Mikolov, T.²

11
- 84930630277
- Deep learning
- Y. LeCun, Y. Bengio, and G. Hinton. 2015. Deep learning. Nature, 521(7553):436-444.
- (2015) Nature , vol.521 , Issue.7553 , pp. 436-444
- LeCun, Y.¹ Bengio, Y.² Hinton, G.³

12
- 70450186275
- Reinforcement learning for spoken dialog management using least-squares policy iteration and fast feature selection
- L. Li, J. D. Williams, and S. Balakrishnan. 2009. Reinforcement learning for spoken dialog management using least-squares policy iteration and fast feature selection. In Proceedings of the Tenth Annual Conference of the International Speech Communication Association (INTERSPEECH-09), page 24752478.
- (2009) Proceedings of the Tenth Annual Conference of the International Speech Communication Association (INTERSPEECH-09) , pp. 24752478
- Li, L.¹ Williams, J.D.² Balakrishnan, S.³

13
- 85083953657
- Continuous control with deep reinforcement learning
- T. P. Lillicrap, J. J. Hunt, A. Pritzel, N. Heess, T. Erez, Y. Tassa, D. Silver, and D. Wierstra. 2016. Continuous control with deep reinforcement learning. In International Conference on Learning Representations.
- (2016) International Conference on Learning Representations
- Lillicrap, T.P.¹ Hunt, J.J.² Pritzel, A.³ Heess, N.⁴ Erez, T.⁵ Tassa, Y.⁶ Silver, D.⁷ Wierstra, D.⁸

14
- 0003673017
- Technical report, DTIC Document
- L-J. Lin. 1993. Reinforcement learning for robots using neural networks. Technical report, DTIC Document.
- (1993) Reinforcement Learning for Robots Using Neural Networks
- Lin, L.-J.¹

15
- 84959874994
- Effective approaches to attention-based neural machine translation
- September
- M-T. Luong, H. Pham, and C. D. Manning. 2015. Effective approaches to attention-based neural machine translation. In Proc. of the 2015 Conference on Empirical Methods in Natural Language Processing, pages 1412-1421, September.
- (2015) Proc. of the 2015 Conference on Empirical Methods in Natural Language Processing , pp. 1412-1421
- Luong, M.-T.¹ Pham, H.² Manning, C.D.³

16
- 85011904626
- NIPS Deep Learning Workshop, December
- V. Mnih, K. Kavukcuoglu, D. Silver, A. Graves, I. Antonoglou, D. Wierstra, and M. Riedmiller. 2013. Playing Atari with Deep Reinforcement Learning. NIPS Deep Learning Workshop, December.
- (2013) Playing Atari with Deep Reinforcement Learning
- Mnih, V.¹ Kavukcuoglu, K.² Silver, D.³ Graves, A.⁴ Antonoglou, I.⁵ Wierstra, D.⁶ Riedmiller, M.⁷

17
- 84924051598
- Humanlevel control through deep reinforcement learning
- V. Mnih, K. Kavukcuoglu, D. Silver, A. A. Rusu, J. Veness, M. G. Bellemare, A. Graves, M. Riedmiller, A. K. Fidjeland, G. Ostrovski, et al. 2015. Humanlevel control through deep reinforcement learning. Nature, 518(7540):529-533.
- (2015) Nature , vol.518 , Issue.7540 , pp. 529-533
- Mnih, V.¹ Kavukcuoglu, K.² Silver, D.³ Rusu, A.A.⁴ Veness, J.⁵ Bellemare, M.G.⁶ Graves, A.⁷ Riedmiller, M.⁸ Fidjeland, A.K.⁹ Ostrovski, G.¹⁰

18
- 84959861546
- Language understanding for text-based games using deep reinforcement learning
- September
- K. Narasimhan, T. Kulkarni, and R. Barzilay. 2015. Language understanding for text-based games using deep reinforcement learning. In Proc. of the 2015 Conference on Empirical Methods in Natural Language Processing, pages 1-11, September.
- (2015) Proc. of the 2015 Conference on Empirical Methods in Natural Language Processing , pp. 1-11
- Narasimhan, K.¹ Kulkarni, T.² Barzilay, R.³

19
- 85011918181
- arXiv preprint arXiv:1602.02261
- R. Nogueira and K. Cho. 2016. Webnav: A new largescale task for natural language based sequential decision making. arXiv preprint arXiv:1602.02261.
- (2016) Webnav: A New Largescale Task for Natural Language Based Sequential Decision Making
- Nogueira, R.¹ Cho, K.²

20
- 33846263279
- Automatic learning of dialogue strategy using dialogue simulation and reinforcement learning
- K. Schettler and S. Young. 2002. Automatic learning of dialogue strategy using dialogue simulation and reinforcement learning. In Proc. of the second International Conference on Human Language Technology Research, pages 12-19.
- (2002) Proc. of the Second International Conference on Human Language Technology Research , pp. 12-19
- Schettler, K.¹ Young, S.²

21
- 84928547704
- Sequence to sequence learning with neural networks
- I. Sutskever, O. Vinyals, and Q. V. Le. 2014. Sequence to sequence learning with neural networks. In Advances in Neural Information Processing Systems, pages 3104-3112.
- (2014) Advances in Neural Information Processing Systems , pp. 3104-3112
- Sutskever, I.¹ Vinyals, O.² Le, Q.V.³

22
- 0004102479
- MIT press Cambridge
- R. S. Sutton and A. G. Barto. 1998. Reinforcement learning: An introduction, volume 1. MIT press Cambridge.
- (1998) Reinforcement Learning: An Introduction , vol.1
- Sutton, R.S.¹ Barto, A.G.²

23
- 0029276036
- Temporal difference learning and td-gammon
- G. Tesauro. 1995. Temporal difference learning and td-gammon. Communications of the ACM, 38(3):58-68.
- (1995) Communications of the ACM , vol.38 , Issue.3 , pp. 58-68
- Tesauro, G.¹

24
- 34249833101
- Q-learning
- C. JCH Watkins and P. Dayan. 1992. Q-learning. Machine learning, 8(3-4):279-292.
- (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.²

25
- 84876682878
- Pomdp-based statistical spoken dialog systems: A review
- S. Young, M. Gasic, B. Thomson, and J. D. Williams. 2013. Pomdp-based statistical spoken dialog systems: A review. Proceedings of the IEEE, 101(5):1160-1179.
- (2013) Proceedings of the IEEE , vol.101 , Issue.5 , pp. 1160-1179
- Young, S.¹ Gasic, M.² Thomson, B.³ Williams, J.D.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.