SCOPUS Information Retrieval Platform

Conference Proceedings - EMNLP 2015: Conference on Empirical Methods in Natural Language Processing

Volume , Issue , 2015, Pages 1-11

Language understanding for text-based games using deep reinforcement learning

(3) Narasimhan, Karthik (a); Kulkarni, Tejas D (a); Barzilay, Regina (a)

(a) MASSACHUSETTS INSTITUTE OF TECHNOLOGY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

INTERACTIVE COMPUTER GRAPHICS; MACHINE LEARNING; NATURAL LANGUAGE PROCESSING SYSTEMS; REINFORCEMENT LEARNING; SEMANTICS; VIRTUAL REALITY;

ACTION POLICIES; GAME PLAYERS; LANGUAGE BARRIERS; LANGUAGE UNDERSTANDING; LEARNING CONTROL; STATE REPRESENTATION; VECTOR REPRESENTATIONS; VIRTUAL WORLDS;

DEEP LEARNING;

EID: 84959861546 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.18653/v1/d15-1001 Document Type: Conference Paper

Times cited : (337)

References (29)

1
- 84865771838
- High-level reinforcement learning in strategy games
- International Foundation for Autonomous Agents and Multiagent Systems
- Christopher Amato and Guy Shani. 2010. High-level reinforcement learning in strategy games. In Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: Volume 1, pages 75-82. International Foundation for Autonomous Agents and Multiagent Systems.
- (2010) Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems , vol.1 , pp. 75-82
- Amato, C.¹ Shani, G.²

2
- 35048863614
- Adventure games: A challenge for cognitive robotics
- Eyal Amir and Patrick Doyle. 2002. Adventure games: A challenge for cognitive robotics. In Proc. Int. Cognitive Robotics Workshop, pages 148-155.
- (2002) Proc. Int. Cognitive Robotics Workshop , pp. 148-155
- Amir, E.¹ Doyle, P.²

3
- 84959904704
- Alignment-based compositional semantics for instruction following
- Jacob Andreas and Dan Klein. 2015. Alignment-based compositional semantics for instruction following. In Proceedings of the Conference on Empirical Methods in Natural Language Processing.
- (2015) Proceedings of the Conference on Empirical Methods in Natural Language Processing
- Andreas, J.¹ Klein, D.²

4
- 84898906433
- Weakly supervised learning of semantic parsers for mapping instructions to actions
- Yoav Artzi and Luke Zettlemoyer. 2013. Weakly supervised learning of semantic parsers for mapping instructions to actions. Transactions of the Association for Computational Linguistics, 1(1):49-62.
- (2013) Transactions of the Association for Computational Linguistics , vol.1 , Issue.1 , pp. 49-62
- Artzi, Y.¹ Zettlemoyer, L.²

5
- 84859963996
- Reading between the lines: Learning to map high-level instructions to commands
- Association for Computational Linguistics
- S R K Branavan, Luke S Zettlemoyer, and Regina Barzilay. 2010. Reading between the lines: Learning to map high-level instructions to commands. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pages 1268-1277. Association for Computational Linguistics.
- (2010) Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics , pp. 1268-1277
- Branavan, S.R.K.¹ Zettlemoyer, L.S.² Barzilay, R.³

6
- 84859015643
- Learning to win by reading manuals in a monte-carlo framework
- Association for Computational Linguistics
- S R K Branavan, David Silver, and Regina Barzilay. 2011a. Learning to win by reading manuals in a monte-carlo framework. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1, pages 268-277. Association for Computational Linguistics.
- (2011) Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies , vol.1 , pp. 268-277
- Branavan, S.R.K.¹ Silver, D.² Barzilay, R.³

7
- 84959894860
- AAAI Press/International Joint Conferences on Artificial Intelligence
- S R K Branavan, David Silver, and Regina Barzilay. 2011b. Non-linear monte-carlo search in Civilization II. AAAI Press/International Joint Conferences on Artificial Intelligence.
- (2011) Non-linear Monte-carlo Search in Civilization II
- Branavan, S.R.K.¹ Silver, D.² Barzilay, R.³

8
- 0037888731
- Mudding: Social phenomena in text-based virtual realities
- Pavel Curtis. 1992. Mudding: Social phenomena in text-based virtual realities. High noon on the electronic frontier: Conceptual issues in cyberspace, pages 347-374.
- (1992) High Noon on the Electronic Frontier: Conceptual Issues in Cyberspace , pp. 347-374
- Curtis, P.¹

9
- 1142293279
- Being-in-the-world
- Mark A DePristo and Robert Zubek. 2001. being-in-the-world. In Proceedings of the 2001 AAAI Spring Symposium on Artificial Intelligence and Interactive Entertainment, pages 31-34.
- (2001) Proceedings of the 2001 AAAI Spring Symposium on Artificial Intelligence and Interactive Entertainment , pp. 31-34
- DePristo, M.A.¹ Zubek, R.²

10
- 80053432060
- Reading to learn: Constructing features from semantic abstracts
- Singapore, August. Association for Computational Linguistics
- Jacob Eisenstein, James Clarke, Dan Goldwasser, and Dan Roth. 2009. Reading to learn: Constructing features from semantic abstracts. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, pages 958-967, Singapore, August. Association for Computational Linguistics.
- (2009) Proceedings of the Conference on Empirical Methods in Natural Language Processing , pp. 958-967
- Eisenstein, J.¹ Clarke, J.² Goldwasser, D.³ Roth, D.⁴

11
- 84883072695
- Speaking with your sidekick: Understanding situated speech in computer role playing games
- R. Michael Young and John E. Laird, editors, June 1-5, 2005, Marina del Rey, California, USA, AAAI Press
- Peter Gorniak and Deb Roy. 2005. Speaking with your sidekick: Understanding situated speech in computer role playing games. In R. Michael Young and John E. Laird, editors, Proceedings of the First Artificial Intelligence and Interactive Digital Entertainment Conference, June 1-5, 2005, Marina del Rey, California, USA, pages 57-62. AAAI Press.
- (2005) Proceedings of the First Artificial Intelligence and Interactive Digital Entertainment Conference , pp. 57-62
- Gorniak, P.¹ Roy, D.²

12
- 0031573117
- Long short-term memory
- Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural computation, 9(8):1735-1780.
- (1997) Neural Computation , vol.9 , Issue.8 , pp. 1735-1780
- Hochreiter, S.¹ Schmidhuber, J.²

13
- 77951550992
- Toward understanding natural language directions
- IEEE
- Thomas Kollar, Stefanie Tellex, Deb Roy, and Nicholas Roy. 2010. Toward understanding natural language directions. In Human-Robot Interaction (HRI), 2010 5th ACM/IEEE International Conference on, pages 259-266. IEEE.
- (2010) Human-robot Interaction (HRI), 2010 5th ACM/IEEE International Conference On , pp. 259-266
- Kollar, T.¹ Tellex, S.² Roy, D.³ Roy, N.⁴

14
- 84883060087
- Evolving large-scale neural networks for vision-based reinforcement learning
- ACM
- Jan Koutník, Giuseppe Cuccu, Jürgen Schmidhuber, and Faustino Gomez. 2013. Evolving large-scale neural networks for vision-based reinforcement learning. In Proceedings of the 15th annual conference on Genetic and evolutionary computation, pages 1061-1068. ACM.
- (2013) Proceedings of the 15th Annual Conference on Genetic and Evolutionary Computation , pp. 1061-1068
- Koutník, J.¹ Cuccu, G.² Schmidhuber, J.³ Gomez, F.⁴

15
- 84906930657
- Learning to automatically solve algebra word problems
- Nate Kushman, Yoav Artzi, Luke Zettlemoyer, and Regina Barzilay. 2014. Learning to automatically solve algebra word problems. ACL (1), pages 271-281.
- (2014) ACL , Issue.1 , pp. 271-281
- Kushman, N.¹ Artzi, Y.² Zettlemoyer, L.³ Barzilay, R.⁴

16
- 84923510502
- Learning to parse natural language commands to a robot control system
- Springer
- Cynthia Matuszek, Evan Herbst, Luke Zettlemoyer, and Dieter Fox. 2013. Learning to parse natural language commands to a robot control system. In Experimental Robotics, pages 403-415. Springer.
- (2013) Experimental Robotics , pp. 403-415
- Matuszek, C.¹ Herbst, E.² Zettlemoyer, L.³ Fox, D.⁴

17
- 85083951332
- arXiv preprint arXiv: 1301.3781
- Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. arXiv preprint arXiv: 1301.3781.
- (2013) Efficient Estimation of Word Representations in Vector Space
- Mikolov, T.¹ Chen, K.² Corrado, G.³ Dean, J.⁴

18
- 84924051598
- Human-level control through deep reinforcement learning
- 02
- Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A. Rusu, Joel Veness, Marc G. Bellemare, Alex Graves, Martin Riedmiller, Andreas K. Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran, Daan Wierstra, Shane Legg, and Demis Hassabis. 2015. Human-level control through deep reinforcement learning. Nature, 518(7540):529-533, 02.
- (2015) Nature , vol.518 , Issue.7540 , pp. 529-533
- Mnih, V.¹ Kavukcuoglu, K.² Silver, D.³ Rusu, A.A.⁴ Veness, J.⁵ Bellemare, M.G.⁶ Graves, A.⁷ Riedmiller, M.⁸ Fidjeland, A.K.⁹ Ostrovski, G.¹⁰ Petersen, S.¹¹ Beattie, C.¹² Sadik, A.¹³ Antonoglou, I.¹⁴ King, H.¹⁵ Kumaran, D.¹⁶ Wierstra, D.¹⁷ Legg, S.¹⁸ Hassabis, D.¹⁹

19
- 0027684215
- Prioritized sweeping: Reinforcement learning with less data and less time
- Andrew W Moore and Christopher G Atkeson. 1993. Prioritized sweeping: Reinforcement learning with less data and less time. Machine Learning, 13(1):103-130.
- (1993) Machine Learning , vol.13 , Issue.1 , pp. 103-130
- Moore, A.W.¹ Atkeson, C.G.²

20
- 84961289992
- GloVe: Global vectors for word representation
- Jeffrey Pennington, Richard Socher, and Christopher D Manning. 2014. GloVe: Global vectors for word representation. Proceedings of the Empirical Methods in Natural Language Processing (EMNLP 2014), 12.
- (2014) Proceedings of the Empirical Methods in Natural Language Processing (EMNLP 2014) , pp. 12
- Pennington, J.¹ Socher, R.² Manning, C.D.³

21
- 84880900542
- Reinforcement learning of local shape in the game of Go
- David Silver, Richard S Sutton, and Martin Müller. 2007. Reinforcement learning of local shape in the game of Go. In IJCAI, volume 7, pages 1053-1058.
- (2007) IJCAI , vol.7 , pp. 1053-1058
- Silver, D.¹ Sutton, R.S.² Müller, M.³

22
- 84928547704
- Sequence to sequence learning with neural networks
- Ilya Sutskever, Oriol Vinyals, and Quoc V Le. 2014. Sequence to sequence learning with neural networks. In Advances in Neural Information Processing Systems, pages 3104-3112.
- (2014) Advances in Neural Information Processing Systems , pp. 3104-3112
- Sutskever, I.¹ Vinyals, O.² Le, Q.V.³

23
- 0003420416
- MIT Press
- Richard S Sutton and Andrew G Barto. 1998. Introduction to reinforcement learning. MIT Press.
- (1998) Introduction to Reinforcement Learning
- Sutton, R.S.¹ Barto, A.G.²

24
- 84867399396
- Reinforcement learning in games
- Springer
- Istvan Szita. 2012. Reinforcement learning in games. In Reinforcement Learning, pages 539-577. Springer.
- (2012) Reinforcement Learning , pp. 539-577
- Szita, I.¹

25
- 84943797465
- Improved semantic representations from tree-structured long short-term memory networks
- Beijing, China, July. Association for Computational Linguistics
- Kai Sheng Tai, Richard Socher, and Christopher D. Manning. 2015. Improved semantic representations from tree-structured long short-term memory networks. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 1556-1566, Beijing, China, July. Association for Computational Linguistics.
- (2015) Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers) , pp. 1556-1566
- Tai, K.S.¹ Socher, R.² Manning, C.D.³

26
- 84893343292
- Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude
- Tijmen Tieleman and Geoffrey Hinton. 2012. Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude. COURSERA: Neural Networks for Machine Learning, 4.
- (2012) COURSERA: Neural Networks for Machine Learning , pp. 4
- Tieleman, T.¹ Hinton, G.²

27
- 57249084011
- Visualizing data using t-SNE
- Laurens Van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE. Journal of Machine Learning Research, 9:2579-2605.
- (2008) Journal of Machine Learning Research , vol.9 , pp. 2579-2605
- Van Der Maaten, L.¹ Hinton, G.²

28
- 84859945237
- Learning to follow navigational directions
- Association for Computational Linguistics
- Adam Vogel and Dan Jurafsky. 2010. Learning to follow navigational directions. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pages 806-814. Association for Computational Linguistics.
- (2010) Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics , pp. 806-814
- Vogel, A.¹ Jurafsky, D.²

29
- 34249833101
- Q-learning
- Christopher JCH Watkins and Peter Dayan. 1992. Q-learning. Machine learning, 8(3-4):279-292.
- (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.