-
1
-
-
0003942195
-
-
Byte Books, Peterborough, NH
-
James S. Albus. Brains, Behavior, and Robotics. Byte Books, Peterborough, NH, 1981.
-
(1981)
Brains, Behavior, and Robotics
-
-
Albus, J.S.1
-
3
-
-
84947424101
-
Evolving team Darwin United
-
Minoru Asada and Hiroaki Kitano, editors, Springer Verlag, Berlin
-
David Andre and Astro Teller. Evolving team Darwin United. In Minoru Asada and Hiroaki Kitano, editors, RoboCup-98: Robot Soccer World Cup II, pages 346-351. Springer Verlag, Berlin, 1999.
-
(1999)
RoboCup-98: Robot Soccer World Cup II
, pp. 346-351
-
-
Andre, D.1
Teller, A.2
-
4
-
-
0006221144
-
Vision-based behavior acquisition for a shooting robot by using a reinforcement learning
-
Minoru Asada, Shoichi Noda, Sukoya Tawaratsumida, and Koh Hosoda. Vision-based behavior acquisition for a shooting robot by using a reinforcement learning. In Proc. of IAPR/IEEE Workshop on Visual Behaviors-1994, pages 112-118, 1994.
-
(1994)
Proc. of IAPR/IEEE Workshop on Visual Behaviors-1994
, pp. 112-118
-
-
Asada, M.1
Noda, S.2
Tawaratsumida, S.3
Hosoda, K.4
-
5
-
-
85150714688
-
Reinforcement learning methods for continuous-time Markov decision problems
-
G. Tesauro, D. Touretzky, and T. Leen, editors, San Mateo, CA, Morgan Kaufmann
-
Steven J. Bradtke and Michael O. Duff. Reinforcement learning methods for continuous-time Markov decision problems. In G. Tesauro, D. Touretzky, and T. Leen, editors, Advances in Neural Information Processing Systems, volume 7, pages 393-400, San Mateo, CA, 1995. Morgan Kaufmann.
-
(1995)
Advances in Neural Information Processing Systems
, vol.7
, pp. 393-400
-
-
Bradtke, S.J.1
Duff, M.O.2
-
6
-
-
34848884838
-
-
Mao Chen, Ehsan Foroughi, Fredrik Heintz, Spiros Kapetanakis, Kostas Kostiadis, Johan Kummeneje, Itsuki Noda, Oliver Obst, Patrick Riley, Timo Steffens, Yi Wang, and Xiang Yin. Users manual: RoboCup soccer server manual for soccer server version 7.07 and later, 2003. Available at http://sourceforge. net/projects/sserver/.
-
Mao Chen, Ehsan Foroughi, Fredrik Heintz, Spiros Kapetanakis, Kostas Kostiadis, Johan Kummeneje, Itsuki Noda, Oliver Obst, Patrick Riley, Timo Steffens, Yi Wang, and Xiang Yin. Users manual: RoboCup soccer server manual for soccer server version 7.07 and later, 2003. Available at http://sourceforge. net/projects/sserver/.
-
-
-
-
7
-
-
24844475430
-
Robot Shaping: Developing Situated Agents through Learning
-
Technical Report TR-92-040, International Computer Science Institute, Berkeley, CA
-
Marco Colombetti and Marco Dorigo. Robot Shaping: Developing Situated Agents through Learning. Technical Report TR-92-040, International Computer Science Institute, Berkeley, CA, 1993.
-
(1993)
-
-
Colombetti, M.1
Dorigo, M.2
-
8
-
-
85156187730
-
Improving elevator performance using reinforcement learning
-
D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, editors, Cambridge, MA, MIT Press
-
Robert H. Crites and Andrew G. Barto. Improving elevator performance using reinforcement learning. In D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, editors, Advances in Neural Information Processing Systems 8, pages 1017-1023, Cambridge, MA, 1996. MIT Press.
-
(1996)
Advances in Neural Information Processing Systems 8
, pp. 1017-1023
-
-
Crites, R.H.1
Barto, A.G.2
-
9
-
-
0043247546
-
Accelerating reinforcement learning by composing solutions of automatically identified subtasks
-
Chris Drummond. Accelerating reinforcement learning by composing solutions of automatically identified subtasks. Journal of Artificial Intelligence Research, 16:59-104, 2002.
-
(2002)
Journal of Artificial Intelligence Research
, vol.16
, pp. 59-104
-
-
Drummond, C.1
-
10
-
-
22944468731
-
Approximate policy iteration with a policy language bias
-
Sebastian Thrun, Lawrence Saul, and Bernhard Schölkopf, editors, MIT Press, Cambridge, MA
-
Alan Fern, Sungwook Yoon, and Robert Givan. Approximate policy iteration with a policy language bias. In Sebastian Thrun, Lawrence Saul, and Bernhard Schölkopf, editors, Advances in Neural Information Processing Systems 16. MIT Press, Cambridge, MA, 2004.
-
(2004)
Advances in Neural Information Processing Systems 16
-
-
Fern, A.1
Yoon, S.2
Givan, R.3
-
13
-
-
84880803349
-
Generalizing plans to new environments in relational mdps
-
Acapulco, Mexico, August
-
Carlos Guestrin, Daphne Koller, Chris Gearhart, and Neal Kanodia. Generalizing plans to new environments in relational mdps. In International Joint Conference on Artificial Intelligence (IJCAI-03), Acapulco, Mexico, August 2003.
-
(2003)
International Joint Conference on Artificial Intelligence (IJCAI-03)
-
-
Guestrin, C.1
Koller, D.2
Gearhart, C.3
Kanodia, N.4
-
17
-
-
0003496531
-
-
MIT Press, Cambridge, MA, USA, ISBN 0-262-13328-8
-
Kishan Mehrotra, Chilukuri K. Mohan, and Sanjay Ranka. Elements of Artificial Neural Networks. MIT Press, Cambridge, MA, USA, 1997. ISBN 0-262-13328-8.
-
(1997)
Elements of Artificial Neural Networks
-
-
Mehrotra, K.1
Mohan, C.K.2
Ranka, S.3
-
18
-
-
0032021222
-
Soccer server: A tool for research on multiagent systems
-
Itsuki Noda, Hitoshi Matsubara, Kazuo Hiraki, and Ian Frank. Soccer server: A tool for research on multiagent systems. Applied Artificial Intelligence, 12:233-250, 1998.
-
(1998)
Applied Artificial Intelligence
, vol.12
, pp. 233-250
-
-
Noda, I.1
Matsubara, H.2
Hiraki, K.3
Frank, I.4
-
21
-
-
84867471400
-
Karlsruhe brainstormers - a reinforcement learning approach to robotic soccer
-
Peter Stone, Tucker Balch, and Gerhard Kraetszchmar, editors, Springer Verlag, Berlin
-
Martin Riedmiller, Author Merke, David Meier, Andreas Hoffman, Alex Sinner, Ortwin Thate, and Ralf Ehrmann. Karlsruhe brainstormers - a reinforcement learning approach to robotic soccer. In Peter Stone, Tucker Balch, and Gerhard Kraetszchmar, editors, RoboCup-2000: Robot Soccer World Cup IV, pages 367-372. Springer Verlag, Berlin, 2001.
-
(2001)
RoboCup-2000: Robot Soccer World Cup IV
, pp. 367-372
-
-
Riedmiller, M.1
Merke, A.2
Meier, D.3
Hoffman, A.4
Sinner, A.5
Thate, O.6
Ehrmann, R.7
-
22
-
-
0003636089
-
On-line Q-learning using connectionist systems
-
Engineering Department, Cambridge University
-
Gavin Rummery and Mahesan Niranjan. On-line Q-learning using connectionist systems. Technical Report CUED/F-INFENG-RT 116, Engineering Department, Cambridge University, 1994.
-
(1994)
Technical Report CUED/F-INFENG-RT
, vol.116
-
-
Rummery, G.1
Niranjan, M.2
-
24
-
-
0001027894
-
Transfer of learning by composing solutions of elemental sequential tasks
-
Satinder P. Singh. Transfer of learning by composing solutions of elemental sequential tasks. Machine Learning, 8:323-339, 1992.
-
(1992)
Machine Learning
, vol.8
, pp. 323-339
-
-
Singh, S.P.1
-
25
-
-
0029753630
-
Reinforcement learning with replacing eligibility traces
-
Satinder P. Singh and Richard S. Sutton. Reinforcement learning with replacing eligibility traces. Machine Learning, 22:123-158, 1996.
-
(1996)
Machine Learning
, vol.22
, pp. 123-158
-
-
Singh, S.P.1
Sutton, R.S.2
-
28
-
-
84867470253
-
Keepaway soccer: A machine learning testbed
-
Andreas Birk, Silvia Coradeschi, and Satoshi Tadokoro, editors, RoboCup-2001: Robot Soccer World Cup V, of, Springer Verlag, Berlin
-
Peter Stone and Richard S. Sutton. Keepaway soccer: a machine learning testbed. In Andreas Birk, Silvia Coradeschi, and Satoshi Tadokoro, editors, RoboCup-2001: Robot Soccer World Cup V, volume 2377 of Lecture Notes in Artificial Intelligence, pages 214-223. Springer Verlag, Berlin, 2002.
-
(2002)
Lecture Notes in Artificial Intelligence
, vol.2377
, pp. 214-223
-
-
Stone, P.1
Sutton, R.S.2
-
29
-
-
27544506565
-
Reinforcement learning for RoboCup-soccer keepaway
-
Peter Stone, Richard S. Sutton, and Gregory Kuhlmann. Reinforcement learning for RoboCup-soccer keepaway. Adaptive Behavior, 13(3):165-188, 2005.
-
(2005)
Adaptive Behavior
, vol.13
, Issue.3
, pp. 165-188
-
-
Stone, P.1
Sutton, R.S.2
Kuhlmann, G.3
-
30
-
-
37249034293
-
Keepaway soccer: From machine learning testbed to benchmark
-
Itsuki Noda, Adam Jacoff, Ansgar Bredenfeld, and Yasutake Takahashi, editors, Springer Verlag, Berlin
-
Peter Stone, Gregory Kuhlmann, Matthew E. Taylor, and Yaxin Liu. Keepaway soccer: From machine learning testbed to benchmark. In Itsuki Noda, Adam Jacoff, Ansgar Bredenfeld, and Yasutake Takahashi, editors, RoboCup-2005: Robot Soccer World Cup IX, volume 4020, pages 93-105. Springer Verlag, Berlin, 2006.
-
(2006)
RoboCup-2005: Robot Soccer World Cup IX
, vol.4020
, pp. 93-105
-
-
Stone, P.1
Kuhlmann, G.2
Taylor, M.E.3
Liu, Y.4
-
33
-
-
27544473171
-
Behavior transfer for value-function-based reinforcement learning
-
Frank Dignum, Virginia Dignum, Sven Koenig, Sarit Kraus, Munindar P. Singh, and Michael Wooldridge, editors, New York, NY, July, ACM Press
-
Matthew E. Taylor and Peter Stone. Behavior transfer for value-function-based reinforcement learning. In Frank Dignum, Virginia Dignum, Sven Koenig, Sarit Kraus, Munindar P. Singh, and Michael Wooldridge, editors, The Fourth International Joint Conference on Autonomous Agents and Multiagent Systems, pages 53-59, New York, NY, July 2005. ACM Press.
-
(2005)
The Fourth International Joint Conference on Autonomous Agents and Multiagent Systems
, pp. 53-59
-
-
Taylor, M.E.1
Stone, P.2
-
37
-
-
34848888015
-
-
Gerald Tesauro. TD-Gammon, a self-teaching backgammon program, achieves master-level play. Neural Computation, 6(2):215-219, 1994.
-
Gerald Tesauro. TD-Gammon, a self-teaching backgammon program, achieves master-level play. Neural Computation, 6(2):215-219, 1994.
-
-
-
-
40
-
-
34547994508
-
Multi-task reinforcement learning: A hierarchical bayesian approach
-
New York, NY, USA, ACM Press
-
Aaron Wilson, Alan Fern, Soumya Ray, and Prasad Tadepalli. Multi-task reinforcement learning: a hierarchical bayesian approach. In ICML '07: Proceedings of the 24th international conference on Machine learning, pages 1015-1022, New York, NY, USA, 2007. ACM Press.
-
(2007)
ICML '07: Proceedings of the 24th international conference on Machine learning
, pp. 1015-1022
-
-
Wilson, A.1
Fern, A.2
Ray, S.3
Tadepalli, P.4
|