SCOPUS 정보 검색 플랫폼

Multiagent and Grid Systems

Volumn 4, Issue 4, 2008, Pages 415-436

Simulation and reinforcement learning with soccer agents

(3) Leng, Jinsong a Fyfe, Colin b Jain, Lakhmi a

a UNIVERSITY OF SOUTH AUSTRALIA (Australia)

b UNIVERSITY OF THE WEST OF SCOTLAND (United Kingdom)

Author keywords

Agent; Decision making; Simulation

Indexed keywords

AGENTS; APPROXIMATION ALGORITHMS; ARTIFICIAL INTELLIGENCE; AUTONOMOUS AGENTS; COOPERATIVE COMMUNICATION; DECISION MAKING; DISTRIBUTED COMPUTER SYSTEMS; EFFICIENCY; INTELLIGENT AGENTS; MULTI AGENT SYSTEMS; OPTIMIZATION; REINFORCEMENT LEARNING; SPORTS; STOCHASTIC SYSTEMS;

COMMUNICATION AND COLLABORATIONS; CONTINUOUS STATE-ACTION SPACES; COOPERATIVE LEARNING; DYNAMIC ENVIRONMENTS; OPTIMISATION TECHNIQUES; SIMULATION; SIMULATION ENVIRONMENT; UNCERTAIN ENVIRONMENTS;

LEARNING ALGORITHMS;

EID: 85006272607 PISSN: 15741702 EISSN: 18759076 Source Type: Journal
DOI: 10.3233/MGS-2008-4407 Document Type: Article

Times cited : (11)

References (34)

1
- 78049298143
- The MathWorks. http://www.mathworks.com.
- The MathWorks

2
- 77957915787
- Technical report, Unreal Tournament Manual
- InfoGrames Epic Games and Digital Entertainment. Technical report, Unreal Tournament Manual, 2000.
- (2000) InfoGrames Epic Games and Digital Entertainment

3
- 0013109127
- Teambots, 2000. http://www.cs.cmu.edu/-trb/Teambots/Domains/SoccerBots.
- (2000) Teambots

4
- 85013586387
- Technical report, Robocup
- Humaniod Kid and Medium Size League, Rules and Setup for Osaka 2005. Technical report, Robocup, 2005.
- (2005) Rules and Setup for Osaka 2005
- Kid, H.¹

5
- 49649148257
- A Theory of Cerebellar Function
- J.S. Albus, A Theory of Cerebellar Function, Mathematical Biosciences 10 (1971), 25-61.
- (1971) Mathematical Biosciences , vol.10 , pp. 25-61
- Albus, J.S.¹

6
- 0001700171
- A Markovian Decision Process
- R. Bellman, A Markovian Decision Process, Journal of Mathematics and Mechanics 6 (1957).
- (1957) Journal of Mathematics and Mechanics , vol.6
- Bellman, R.¹

7
- 85013568384
- Princeton University Press, Princeton, NJ
- R. Bellman, Dynamic Programming, Princeton University Press, Princeton, NJ, 1957.
- (1957) Dynamic Programming
- Bellman, R.¹

8
- 0028388685
- TD(λ) Converges with Probability 1
- Peter Dayan and Terrence J. Sejnowski, TD(λ) Converges with Probability 1, Machine Learning 14(1) (1994), 295-301.
- (1994) Machine Learning , vol.14 , Issue.1 , pp. 295-301
- Dayan, P.¹ Sejnowski, T.J.²

9
- 36348936344
- Learning a Partial Behavior for a Competitive Robotic Soccer Agent
- Thomas Gabel and Martin A. Riedmiller, Learning a Partial Behavior for a Competitive Robotic Soccer Agent, KI 20(2) (2006), 18-23.
- (2006) KI , vol.20 , Issue.2 , pp. 18-23
- Gabel, T.¹ Riedmiller, M.A.²

10
- 84888630832
- Kluwer Academic Publishers
- Abhijit Gosavi, Simulation-based Optimization: parametric optimization techniques and reinforcement learning, Kluwer Academic Publishers, 2003.
- (2003) Simulation-based Optimization: Parametric optimization techniques and reinforcement learning
- Gosavi, A.¹

11
- 0003644124
- MIT Press, Cambridge
- R.A. Howard, Dynamic Programming and Markov Processes, MIT Press, Cambridge, 1960.
- (1960) Dynamic Programming and Markov Processes
- Howard, R.A.¹

12
- 0032329151
- A Roadmap of Agent Research and Development
- Nicholas R. Jennings, Katia Sycara and Michael Wooldridge, A Roadmap of Agent Research and Development, Autonomous Agents and Multi-Agent Systems 1(1) (1998), 7-38.
- (1998) Autonomous Agents and Multi-Agent Systems , vol.1 , Issue.1 , pp. 7-38
- Jennings, N.R.¹ Sycara, K.² Wooldridge, M.³

13
- 0012075670
- Towards a Life-Long Learning Soccer Agent
- Fukuoka, Japan
- A. Kleiner, M. Dietl and B. Nebel, Towards a Life-Long Learning Soccer Agent, In Proc. Int. RoboCup Symposium 02, pages 119-127, Fukuoka, Japan, 2002.
- (2002) Proc. Int. RoboCup Symposium 02 , pp. 119-127
- Kleiner, A.¹ Dietl, M.² Nebel, B.³

14
- 33750805750
- Teamwork and Simulation in Hybrid Cognitive Architecture
- B. Gabrys et al., Springer-Verlag Berlin Heidelberg
- Jinsong Leng, Colin Fyfe and Lakhmi Jain, Teamwork and Simulation in Hybrid Cognitive Architecture, in: Proceeding in 10th Knowledge-Based Intelligent Information and Engineering Systems, LNCS 4252, B. Gabrys et al., Springer-Verlag Berlin Heidelberg, 2006, pp. 472-478.
- (2006) Proceeding in 10th Knowledge-Based Intelligent Information and Engineering Systems, LNCS 4252 , pp. 472-478
- Leng, J.¹ Fyfe, C.² Jain, L.³

15
- 38049144717
- Reinforcement Learning of Competitive Skills with Soccer Agents
- B. Apolloni et al., Springer-Verlag Berlin Heidelberg
- Jinsong Leng, Colin Fyfe and Lakhmi Jain, Reinforcement Learning of Competitive Skills with Soccer Agents, in:Proceeding in 11th International Conference on Knowledge-Based and Intelligent Information and Engineering Systems, LNAI 4692, B. Apolloni et al., Springer-Verlag Berlin Heidelberg, 2007.
- (2007) Proceeding in 11th International Conference on Knowledge-Based and Intelligent Information and Engineering Systems, LNAI 4692
- Leng, J.¹ Fyfe, C.² Jain, L.³

16
- 64349090183
- Convergence Analysis on Temporal Difference Learning
- press
- Jinsong Leng, Lakhmi Jain and Colin Fyfe, Convergence Analysis on Temporal Difference Learning, International Journal of Innovative computing, Information and Control 5(2) (2009), in press.
- (2009) International Journal of Innovative computing, Information and Control , vol.5 , Issue.2
- Leng, J.¹ Jain, L.² Fyfe, C.³

17
- 0004272772
- Cambridge University Press
- David J. C. Mackay, Information Theory, Inference, and Learning Algorithms, Cambridge University Press, 2003.
- (2003) Information Theory, Inference, and Learning Algorithms
- David, J.¹ Mackay, C.²

18
- 84867463287
- Karlsruhe Brainstormers - A Reinforcement Learning Approach to Robotic Soccer
- LNAI, Springer Berlin / Heidelberg
- A. Merke and M. Riedmiller, Karlsruhe Brainstormers - A Reinforcement Learning Approach to Robotic Soccer, RoboCup 2001, volume LNAI 2377, pages 435-440. Springer Berlin / Heidelberg, 2002.
- (2002) RoboCup 2001 , vol.2377 , pp. 435-440
- Merke, A.¹ Riedmiller, M.²

19
- 85013623145
- John Wiley & Sons Inc
- M.L. Puterman, Markovian Decision Problems, John Wiley & Sons Inc, 1994.
- (1994) Markovian Decision Problems
- Puterman, M.L.¹

20
- 85013587228
- A Scoring Policy for Simulated Soccer Agents using Reinforcement Learning
- New Zealand
- Azam Rabiee and Nasser Ghasem-Aghaee, A Scoring Policy for Simulated Soccer Agents using Reinforcement Learning, 2nd International Conference on Autonomous Robots and Agents, New Zealand, 2004.
- (2004) 2nd International Conference on Autonomous Robots and Agents
- Rabiee, A.¹ Ghasem-Aghaee, N.²

21
- 0004080531
- New York: Wiley
- Reuven Y. Rubinstein, Simulation and the Monte Carlo Method, New York: Wiley, 1981.
- (1981) Simulation and the Monte Carlo Method
- Rubinstein, R.Y.¹

22
- 0003584577
- Prentice-Hall, Englewood Cliffs, NJ
- S. Russell and P. Norvig, Artificial Intelligence: A Modern Approach, Prentice-Hall, Englewood Cliffs, NJ, 1995.
- (1995) Artificial Intelligence: A Modern Approach
- Russell, S.¹ Norvig, P.²

23
- 26944466214
- Function Approximation via Tile Coding: Automating Parameter Choice
- Berlin, Springer-Verlag
- Alexander A. Sherstov and Peter Stone, Function Approximation via Tile Coding: Automating Parameter Choice, SARA 2005. LNCS 3607, pages 194-205, Berlin, 2005. Springer-Verlag.
- (2005) SARA 2005. LNCS 3607 , pp. 194-205
- Sherstov, A.A.¹ Stone, P.²

24
- 33750709709
- PhD thesis, School of Electrical and Information Engineering, University of South Australia
- Christos Sioutis, Reasoning and Learning for Intelligent Agents, PhD thesis, School of Electrical and Information Engineering, University of South Australia, 2005.
- (2005) Reasoning and Learning for Intelligent Agents
- Sioutis, C.¹

25
- 0003328519
- TPOT-RL: Team-partitioned, opaque-transition reinforcement learning
- Berlin, Springer Verlag
- P. Stone and M. Veloso, TPOT-RL: Team-partitioned, opaque-transition reinforcement learning, RoboCup 98: Robot Soccer World Cup II, pages 221-236, Berlin, 1998. Springer Verlag.
- (1998) RoboCup 98: Robot Soccer World Cup II , pp. 221-236
- Stone, P.¹ Veloso, M.²

26
- 37249034293
- GregoryKuhlmann, MatthewE. Taylor andYaxin Liu, Keepaway Soccer: From Machine Learning Testbed to Benchmark
- Itsuki Noda, Adam Jacoff, Ansgar Bredenfeld and Yasutake Takahashi, eds, Berlin, Springer Verlag
- Peter Stone, GregoryKuhlmann, MatthewE. Taylor andYaxin Liu, Keepaway Soccer: From Machine Learning Testbed to Benchmark, in: RoboCup-2005: Robot Soccer World Cup IX, (Vol. 4020,) Itsuki Noda, Adam Jacoff, Ansgar Bredenfeld and Yasutake Takahashi, eds, Berlin, 2006. Springer Verlag, pp. 93-105.
- (2006) RoboCup-2005: Robot Soccer World Cup IX , vol.4020 , pp. 93-105
- Stone, P.¹

27
- 0013528313
- Scaling Reinforcement Learning Toward RoboCup Soccer
- Morgan Kaufmann, San Francisco, CA
- Peter Stone and Richard S. Sutton, Scaling Reinforcement Learning Toward RoboCup Soccer, Proc. 18th International Conf. on Machine Learning, pages 537-544. Morgan Kaufmann, San Francisco, CA, 2001.
- (2001) Proc. 18th International Conf. on Machine Learning , pp. 537-544
- Stone, P.¹ Sutton, R.S.²

28
- 33847202724
- Learning to Predict by the Method of Temporal Differences
- R.S. Sutton, Learning to Predict by the Method of Temporal Differences, Machine Learning 3 (1988), 9-44.
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, R.S.¹

29
- 85156221438
- Generalization in Reinforcement Learning: Successful Examples using Sparse Coarse Coding
- D.S. Touretzky, M.C. Mozer and M.E. Hasselmo, eds, Cambridge, MA. MIT Press
- R.S. Sutton, Generalization in Reinforcement Learning: Successful Examples using Sparse Coarse Coding, in: Advances in Neural Information Processing Systems: Proceedings of the 1995 Conference, D.S. Touretzky, M.C. Mozer and M.E. Hasselmo, eds, Cambridge, MA. MIT Press, 1995, pp. 1038-1044.
- (1995) Advances in Neural Information Processing Systems: Proceedings of the 1995 Conference , pp. 1038-1044
- Sutton, R.S.¹

30
- 0004102479
- MIT Press
- R.S. Sutton and A.G. Barto, Reinforcement Learning: An Introduction, MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

31
- 0004049893
- PhD thesis, Cambridge University, Cambridge, England
- C. J. C. H. Watkins, Learning from Delayed Rewards, PhD thesis, Cambridge University, Cambridge, England, 1989.
- (1989) Learning from Delayed Rewards
- Watkins, C.J.C.H.¹

32
- 1142280955
- Concurrent Layered Learning
- Australia
- Shimon Whiteson and Peter Stone, Concurrent Layered Learning, In Proceeding of the Second International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS 03), pages 193-200, Australia, 2003.
- (2003) Proceeding of the Second International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS 03) , pp. 193-200
- Whiteson, S.¹ Stone, P.²

33
- 33646714634
- Evolutionary Function Approximation for Reinforcement Learning
- Shimon Whiteson and Peter Stone, Evolutionary Function Approximation for Reinforcement Learning, Journal of Machine Learning Research 7 (2006), 877-917.
- (2006) Journal of Machine Learning Research , vol.7 , pp. 877-917
- Whiteson, S.¹ Stone, P.²

34
- 84972263711
- Intelligent Agents: Theory and Practice
- Michael Wooldridge and Nick Jennings, Intelligent Agents: Theory and Practice, Knowledge Engineering Review 10(2) (1995), 115-152.
- (1995) Knowledge Engineering Review , vol.10 , Issue.2 , pp. 115-152
- Wooldridge, M.¹ Jennings, N.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.