-
1
-
-
84865771838
-
High-level reinforcement learning in strategy games
-
International Foundation for Autonomous Agents and Multiagent Systems
-
Christopher Amato and Guy Shani. 2010. High-level reinforcement learning in strategy games. In Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: Volume 1, pages 75-82. International Foundation for Autonomous Agents and Multiagent Systems.
-
(2010)
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems
, vol.1
, pp. 75-82
-
-
Amato, C.1
Shani, G.2
-
2
-
-
35048863614
-
Adventure games: A challenge for cognitive robotics
-
Eyal Amir and Patrick Doyle. 2002. Adventure games: A challenge for cognitive robotics. In Proc. Int. Cognitive Robotics Workshop, pages 148-155.
-
(2002)
Proc. Int. Cognitive Robotics Workshop
, pp. 148-155
-
-
Amir, E.1
Doyle, P.2
-
10
-
-
80053432060
-
Reading to learn: Constructing features from semantic abstracts
-
Singapore, August. Association for Computational Linguistics
-
Jacob Eisenstein, James Clarke, Dan Goldwasser, and Dan Roth. 2009. Reading to learn: Constructing features from semantic abstracts. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, pages 958-967, Singapore, August. Association for Computational Linguistics.
-
(2009)
Proceedings of the Conference on Empirical Methods in Natural Language Processing
, pp. 958-967
-
-
Eisenstein, J.1
Clarke, J.2
Goldwasser, D.3
Roth, D.4
-
11
-
-
84883072695
-
Speaking with your sidekick: Understanding situated speech in computer role playing games
-
R. Michael Young and John E. Laird, editors, June 1-5, 2005, Marina del Rey, California, USA, AAAI Press
-
Peter Gorniak and Deb Roy. 2005. Speaking with your sidekick: Understanding situated speech in computer role playing games. In R. Michael Young and John E. Laird, editors, Proceedings of the First Artificial Intelligence and Interactive Digital Entertainment Conference, June 1-5, 2005, Marina del Rey, California, USA, pages 57-62. AAAI Press.
-
(2005)
Proceedings of the First Artificial Intelligence and Interactive Digital Entertainment Conference
, pp. 57-62
-
-
Gorniak, P.1
Roy, D.2
-
13
-
-
77951550992
-
Toward understanding natural language directions
-
IEEE
-
Thomas Kollar, Stefanie Tellex, Deb Roy, and Nicholas Roy. 2010. Toward understanding natural language directions. In Human-Robot Interaction (HRI), 2010 5th ACM/IEEE International Conference on, pages 259-266. IEEE.
-
(2010)
Human-robot Interaction (HRI), 2010 5th ACM/IEEE International Conference On
, pp. 259-266
-
-
Kollar, T.1
Tellex, S.2
Roy, D.3
Roy, N.4
-
15
-
-
84906930657
-
Learning to automatically solve algebra word problems
-
Nate Kushman, Yoav Artzi, Luke Zettlemoyer, and Regina Barzilay. 2014. Learning to automatically solve algebra word problems. ACL (1), pages 271-281.
-
(2014)
ACL
, Issue.1
, pp. 271-281
-
-
Kushman, N.1
Artzi, Y.2
Zettlemoyer, L.3
Barzilay, R.4
-
16
-
-
84923510502
-
Learning to parse natural language commands to a robot control system
-
Springer
-
Cynthia Matuszek, Evan Herbst, Luke Zettlemoyer, and Dieter Fox. 2013. Learning to parse natural language commands to a robot control system. In Experimental Robotics, pages 403-415. Springer.
-
(2013)
Experimental Robotics
, pp. 403-415
-
-
Matuszek, C.1
Herbst, E.2
Zettlemoyer, L.3
Fox, D.4
-
18
-
-
84924051598
-
Human-level control through deep reinforcement learning
-
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A. Rusu, Joel Veness, Marc G. Bellemare, Alex Graves, Martin Riedmiller, Andreas K. Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran, Daan Wierstra, Shane Legg, and Demis Hassabis. 2015. Human-level control through deep reinforcement learning. Nature, 518(7540):529-533, 02.
-
(2015)
Nature
, vol.518
, Issue.7540
, pp. 529-533
-
-
Mnih, V.1
Kavukcuoglu, K.2
Silver, D.3
Rusu, A.A.4
Veness, J.5
Bellemare, M.G.6
Graves, A.7
Riedmiller, M.8
Fidjeland, A.K.9
Ostrovski, G.10
Petersen, S.11
Beattie, C.12
Sadik, A.13
Antonoglou, I.14
King, H.15
Kumaran, D.16
Wierstra, D.17
Legg, S.18
Hassabis, D.19
-
19
-
-
0027684215
-
Prioritized sweeping: Reinforcement learning with less data and less time
-
Andrew W Moore and Christopher G Atkeson. 1993. Prioritized sweeping: Reinforcement learning with less data and less time. Machine Learning, 13(1):103-130.
-
(1993)
Machine Learning
, vol.13
, Issue.1
, pp. 103-130
-
-
Moore, A.W.1
Atkeson, C.G.2
-
21
-
-
84880900542
-
Reinforcement learning of local shape in the game of Go
-
David Silver, Richard S Sutton, and Martin Müller. 2007. Reinforcement learning of local shape in the game of Go. In IJCAI, volume 7, pages 1053-1058.
-
(2007)
IJCAI
, vol.7
, pp. 1053-1058
-
-
Silver, D.1
Sutton, R.S.2
Müller, M.3
-
24
-
-
84867399396
-
Reinforcement learning in games
-
Springer
-
Istvan Szita. 2012. Reinforcement learning in games. In Reinforcement Learning, pages 539-577. Springer.
-
(2012)
Reinforcement Learning
, pp. 539-577
-
-
Szita, I.1
-
25
-
-
84943797465
-
Improved semantic representations from tree-structured long short-term memory networks
-
Beijing, China, July. Association for Computational Linguistics
-
Kai Sheng Tai, Richard Socher, and Christopher D. Manning. 2015. Improved semantic representations from tree-structured long short-term memory networks. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 1556-1566, Beijing, China, July. Association for Computational Linguistics.
-
(2015)
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
, pp. 1556-1566
-
-
Tai, K.S.1
Socher, R.2
Manning, C.D.3
-
26
-
-
84893343292
-
Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude
-
Tijmen Tieleman and Geoffrey Hinton. 2012. Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude. COURSERA: Neural Networks for Machine Learning, 4.
-
(2012)
COURSERA: Neural Networks for Machine Learning
, vol.4
-
-
Tieleman, T.1
Hinton, G.2