-
1
-
-
78049298143
-
-
The MathWorks. http://www.mathworks.com.
-
The MathWorks
-
-
-
3
-
-
0013109127
-
-
Teambots, 2000. http://www.cs.cmu.edu/-trb/Teambots/Domains/SoccerBots.
-
(2000)
Teambots
-
-
-
5
-
-
49649148257
-
A Theory of Cerebellar Function
-
J.S. Albus, A Theory of Cerebellar Function, Mathematical Biosciences 10 (1971), 25-61.
-
(1971)
Mathematical Biosciences
, vol.10
, pp. 25-61
-
-
Albus, J.S.1
-
7
-
-
85013568384
-
-
Princeton University Press, Princeton, NJ
-
R. Bellman, Dynamic Programming, Princeton University Press, Princeton, NJ, 1957.
-
(1957)
Dynamic Programming
-
-
Bellman, R.1
-
8
-
-
0028388685
-
TD(λ) Converges with Probability 1
-
Peter Dayan and Terrence J. Sejnowski, TD(λ) Converges with Probability 1, Machine Learning 14(1) (1994), 295-301.
-
(1994)
Machine Learning
, vol.14
, Issue.1
, pp. 295-301
-
-
Dayan, P.1
Sejnowski, T.J.2
-
9
-
-
36348936344
-
Learning a Partial Behavior for a Competitive Robotic Soccer Agent
-
Thomas Gabel and Martin A. Riedmiller, Learning a Partial Behavior for a Competitive Robotic Soccer Agent, KI 20(2) (2006), 18-23.
-
(2006)
KI
, vol.20
, Issue.2
, pp. 18-23
-
-
Gabel, T.1
Riedmiller, M.A.2
-
12
-
-
0032329151
-
A Roadmap of Agent Research and Development
-
Nicholas R. Jennings, Katia Sycara and Michael Wooldridge, A Roadmap of Agent Research and Development, Autonomous Agents and Multi-Agent Systems 1(1) (1998), 7-38.
-
(1998)
Autonomous Agents and Multi-Agent Systems
, vol.1
, Issue.1
, pp. 7-38
-
-
Jennings, N.R.1
Sycara, K.2
Wooldridge, M.3
-
13
-
-
0012075670
-
Towards a Life-Long Learning Soccer Agent
-
Fukuoka, Japan
-
A. Kleiner, M. Dietl and B. Nebel, Towards a Life-Long Learning Soccer Agent, In Proc. Int. RoboCup Symposium 02, pages 119-127, Fukuoka, Japan, 2002.
-
(2002)
Proc. Int. RoboCup Symposium 02
, pp. 119-127
-
-
Kleiner, A.1
Dietl, M.2
Nebel, B.3
-
14
-
-
33750805750
-
Teamwork and Simulation in Hybrid Cognitive Architecture
-
B. Gabrys et al., Springer-Verlag Berlin Heidelberg
-
Jinsong Leng, Colin Fyfe and Lakhmi Jain, Teamwork and Simulation in Hybrid Cognitive Architecture, in: Proceeding in 10th Knowledge-Based Intelligent Information and Engineering Systems, LNCS 4252, B. Gabrys et al., Springer-Verlag Berlin Heidelberg, 2006, pp. 472-478.
-
(2006)
Proceeding in 10th Knowledge-Based Intelligent Information and Engineering Systems, LNCS 4252
, pp. 472-478
-
-
Leng, J.1
Fyfe, C.2
Jain, L.3
-
15
-
-
38049144717
-
Reinforcement Learning of Competitive Skills with Soccer Agents
-
B. Apolloni et al., Springer-Verlag Berlin Heidelberg
-
Jinsong Leng, Colin Fyfe and Lakhmi Jain, Reinforcement Learning of Competitive Skills with Soccer Agents, in:Proceeding in 11th International Conference on Knowledge-Based and Intelligent Information and Engineering Systems, LNAI 4692, B. Apolloni et al., Springer-Verlag Berlin Heidelberg, 2007.
-
(2007)
Proceeding in 11th International Conference on Knowledge-Based and Intelligent Information and Engineering Systems, LNAI 4692
-
-
Leng, J.1
Fyfe, C.2
Jain, L.3
-
16
-
-
64349090183
-
Convergence Analysis on Temporal Difference Learning
-
press
-
Jinsong Leng, Lakhmi Jain and Colin Fyfe, Convergence Analysis on Temporal Difference Learning, International Journal of Innovative computing, Information and Control 5(2) (2009), in press.
-
(2009)
International Journal of Innovative computing, Information and Control
, vol.5
, Issue.2
-
-
Leng, J.1
Jain, L.2
Fyfe, C.3
-
18
-
-
84867463287
-
Karlsruhe Brainstormers - A Reinforcement Learning Approach to Robotic Soccer
-
LNAI, Springer Berlin / Heidelberg
-
A. Merke and M. Riedmiller, Karlsruhe Brainstormers - A Reinforcement Learning Approach to Robotic Soccer, RoboCup 2001, volume LNAI 2377, pages 435-440. Springer Berlin / Heidelberg, 2002.
-
(2002)
RoboCup 2001
, vol.2377
, pp. 435-440
-
-
Merke, A.1
Riedmiller, M.2
-
23
-
-
26944466214
-
Function Approximation via Tile Coding: Automating Parameter Choice
-
Berlin, Springer-Verlag
-
Alexander A. Sherstov and Peter Stone, Function Approximation via Tile Coding: Automating Parameter Choice, SARA 2005. LNCS 3607, pages 194-205, Berlin, 2005. Springer-Verlag.
-
(2005)
SARA 2005. LNCS 3607
, pp. 194-205
-
-
Sherstov, A.A.1
Stone, P.2
-
24
-
-
33750709709
-
-
PhD thesis, School of Electrical and Information Engineering, University of South Australia
-
Christos Sioutis, Reasoning and Learning for Intelligent Agents, PhD thesis, School of Electrical and Information Engineering, University of South Australia, 2005.
-
(2005)
Reasoning and Learning for Intelligent Agents
-
-
Sioutis, C.1
-
25
-
-
0003328519
-
TPOT-RL: Team-partitioned, opaque-transition reinforcement learning
-
Berlin, Springer Verlag
-
P. Stone and M. Veloso, TPOT-RL: Team-partitioned, opaque-transition reinforcement learning, RoboCup 98: Robot Soccer World Cup II, pages 221-236, Berlin, 1998. Springer Verlag.
-
(1998)
RoboCup 98: Robot Soccer World Cup II
, pp. 221-236
-
-
Stone, P.1
Veloso, M.2
-
26
-
-
37249034293
-
GregoryKuhlmann, MatthewE. Taylor andYaxin Liu, Keepaway Soccer: From Machine Learning Testbed to Benchmark
-
Itsuki Noda, Adam Jacoff, Ansgar Bredenfeld and Yasutake Takahashi, eds, Berlin, Springer Verlag
-
Peter Stone, GregoryKuhlmann, MatthewE. Taylor andYaxin Liu, Keepaway Soccer: From Machine Learning Testbed to Benchmark, in: RoboCup-2005: Robot Soccer World Cup IX, (Vol. 4020,) Itsuki Noda, Adam Jacoff, Ansgar Bredenfeld and Yasutake Takahashi, eds, Berlin, 2006. Springer Verlag, pp. 93-105.
-
(2006)
RoboCup-2005: Robot Soccer World Cup IX
, vol.4020
, pp. 93-105
-
-
Stone, P.1
-
27
-
-
0013528313
-
Scaling Reinforcement Learning Toward RoboCup Soccer
-
Morgan Kaufmann, San Francisco, CA
-
Peter Stone and Richard S. Sutton, Scaling Reinforcement Learning Toward RoboCup Soccer, Proc. 18th International Conf. on Machine Learning, pages 537-544. Morgan Kaufmann, San Francisco, CA, 2001.
-
(2001)
Proc. 18th International Conf. on Machine Learning
, pp. 537-544
-
-
Stone, P.1
Sutton, R.S.2
-
28
-
-
33847202724
-
Learning to Predict by the Method of Temporal Differences
-
R.S. Sutton, Learning to Predict by the Method of Temporal Differences, Machine Learning 3 (1988), 9-44.
-
(1988)
Machine Learning
, vol.3
, pp. 9-44
-
-
Sutton, R.S.1
-
29
-
-
85156221438
-
Generalization in Reinforcement Learning: Successful Examples using Sparse Coarse Coding
-
D.S. Touretzky, M.C. Mozer and M.E. Hasselmo, eds, Cambridge, MA. MIT Press
-
R.S. Sutton, Generalization in Reinforcement Learning: Successful Examples using Sparse Coarse Coding, in: Advances in Neural Information Processing Systems: Proceedings of the 1995 Conference, D.S. Touretzky, M.C. Mozer and M.E. Hasselmo, eds, Cambridge, MA. MIT Press, 1995, pp. 1038-1044.
-
(1995)
Advances in Neural Information Processing Systems: Proceedings of the 1995 Conference
, pp. 1038-1044
-
-
Sutton, R.S.1
-
31
-
-
0004049893
-
-
PhD thesis, Cambridge University, Cambridge, England
-
C. J. C. H. Watkins, Learning from Delayed Rewards, PhD thesis, Cambridge University, Cambridge, England, 1989.
-
(1989)
Learning from Delayed Rewards
-
-
Watkins, C.J.C.H.1
-
33
-
-
33646714634
-
Evolutionary Function Approximation for Reinforcement Learning
-
Shimon Whiteson and Peter Stone, Evolutionary Function Approximation for Reinforcement Learning, Journal of Machine Learning Research 7 (2006), 877-917.
-
(2006)
Journal of Machine Learning Research
, vol.7
, pp. 877-917
-
-
Whiteson, S.1
Stone, P.2
-
34
-
-
84972263711
-
Intelligent Agents: Theory and Practice
-
Michael Wooldridge and Nick Jennings, Intelligent Agents: Theory and Practice, Knowledge Engineering Review 10(2) (1995), 115-152.
-
(1995)
Knowledge Engineering Review
, vol.10
, Issue.2
, pp. 115-152
-
-
Wooldridge, M.1
Jennings, N.2
|