SCOPUS Information Search Platform

Advances in Complex Systems

Volume 14, Issue 2, 2011, Pages 251-278

An empirical study of potential-based reward shaping and advice in complex, multi-agent systems

(3) Devlin, Sam a Kudenko, Daniel a Grześ, Marek b

a UNIVERSITY OF YORK (United Kingdom)

b UNIVERSITY OF WATERLOO (Canada)

Author keywords

multi agent; Reinforcement learning; reward shaping

Indexed keywords

EID: 79955403826 PISSN: 02195259 EISSN: None Source Type: Journal
DOI: 10.1142/S0219525911002998 Document Type: Conference Paper

Times cited : (84)

References (37)

1
- 84899963942
- Social reward shaping in the prisoner's dilemma
- Babes, M., de Cote, E. and Littman, M., Social reward shaping in the prisoner's dilemma, in Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems, Vol. 3 (2008), pp. 1389-1392.
- (2008) Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems , vol.3 , pp. 1389-1392
- Babes, M.¹ De Cote, E.² Littman, M.³

2
- 0002852356
- Learning roles: Behavioral diversity in robot teams
- Balch, T., Learning roles: Behavioral diversity in robot teams, in AAAI Workshop on Multiagent Learning (1997).
- (1997) AAAI Workshop on Multiagent Learning
- Balch, T.¹

3
- 0003565783
- Athena Scientific, 3rd edn.
- Bertsekas, D. P., Dynamic Programming and Optimal Control (2 Vol Set) (Athena Scientific, 3rd edn., 2007).
- (2007) Dynamic Programming and Optimal Control (2 Vol Set)
- Bertsekas, D.P.¹

4
- 0004106775
- D. C. Heath & Co.
- Binmore, K., Fun and Games - A Text on Game Theory (D. C. Heath & Co., 1991).
- (1991) Fun and Games - A Text on Game Theory
- Binmore, K.¹

5
- 40949147745
- A comprehensive survey of multi-agent reinforcement learning
- Busoniu, L., Babuska, R. and De Schutter, B., A comprehensive survey of multi-agent reinforcement learning, IEEE Trans. Syst. Man Cyb. C. 38 (2008) 156.
- (2008) IEEE Trans. Syst. Man Cyb. C. , vol.38 , pp. 156
- Busoniu, L.¹ Babuska, R.² De Schutter, B.³

6
- 0031630561
- The dynamics of reinforcement learning in cooperative multiagent systems
- Claus, C. and Boutilier, C., The dynamics of reinforcement learning in cooperative multiagent systems, in Proceedings of the National Conference on Artificial Intelligence (1998), pp. 746-752.
- (1998) Proceedings of the National Conference on Artificial Intelligence , pp. 746-752
- Claus, C.¹ Boutilier, C.²

7
- 84856908031
- Reinforcement learning in robocup keepaway with partial observability
- Devlin, S., Grześ, M. and Kudenko, D., Reinforcement learning in robocup keepaway with partial observability, in IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, 2009. WI-IAT'09 (2009).
- (2009) IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, 2009. WI-IAT'09
- Devlin, S.¹ Grześ, M.² Kudenko, D.³

8
- 84899455116
- Theoretical considerations of potential-based reward shaping for multi-agent systems
- Devlin, S. and Kudenko, D., Theoretical considerations of potential-based reward shaping for multi-agent systems, in Proceedings of The Tenth Annual International Conference on Autonomous Agents and Multiagent Systems (AAMAS) (2011).
- (2011) Proceedings of the Tenth Annual International Conference on Autonomous Agents and Multiagent Systems (AAMAS)
- Devlin, S.¹ Kudenko, D.²

9
- 1942484477
- Principled methods for advising reinforcement learning agents
- Wiewiora, E., Cottrell, G. and Elkan, C., Principled methods for advising reinforcement learning agents, in Proceedings of the Twentieth International Conference on Machine Learning (2003).
- (2003) Proceedings of the Twentieth International Conference on Machine Learning
- Wiewiora, E.¹ Cottrell, G.² Elkan, C.³

10
- 0004260007
- MIT Press, Cambridge, MA
- Fudenberg, D. and Tirole, J., Game Theory (MIT Press, Cambridge, MA, 1991).
- (1991) Game Theory
- Fudenberg, D.¹ Tirole, J.²

11
- 58849111871
- Multigrid reinforcement learning with reward shaping
- Grześ, M. and Kudenko, D., Multigrid reinforcement learning with reward shaping, Artificial Neural Networks-ICANN 2008 (2008), pp. 357-366.
- (2008) Artificial Neural Networks-ICANN 2008 , pp. 357-366
- Grześ, M.¹ Kudenko, D.²

12
- 78650499444
- Plan-based reward shaping for reinforcement learning
- IEEE
- Grześ, M. and Kudenko, D., Plan-based reward shaping for reinforcement learning, in Proceedings of the 4th IEEE International Conference on Intelligent Systems (IS'08) (IEEE, 2008), pp. 22-29.
- (2008) Proceedings of the 4th IEEE International Conference on Intelligent Systems (IS'08) , pp. 22-29
- Grześ, M.¹ Kudenko, D.²

13
- 4644369748
- Nash Q-learning for general-sum stochastic games
- Hu, J. and Wellman, M., Nash Q-learning for general-sum stochastic games, J. Mach. Learn. Res. 4 (2003) 1039-1069.
- (2003) J. Mach. Learn. Res. , vol.4 , pp. 1039-1069
- Hu, J.¹ Wellman, M.²

14
- 84899897564
- A new perspective to the keepaway soccer: The takers
- Iscen, A. and Erogul, U., A new perspective to the keepaway soccer: the takers, in Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems, Vol. 3 (2008), pp. 1341-1344.
- (2008) Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems , vol.3 , pp. 1341-1344
- Iscen, A.¹ Erogul, U.²

15
- 77950988223
- Learning complementary multiagent behaviors: A case study
- RoboCup 2009: Robot Soccer World Cup XIII, eds. Baltes, J., Lagoudakis, M., Naruse, T. and Ghidary, S., Springer Berlin/Heidelberg
- Kalyanakrishnan, S. and Stone, P., Learning complementary multiagent behaviors: A case study, in RoboCup 2009: Robot Soccer World Cup XIII, eds. Baltes, J., Lagoudakis, M., Naruse, T. and Ghidary, S., Lecture Notes in Computer Science, Vol. 5949 (Springer Berlin/Heidelberg, 2010), pp. 153-165.
- (2010) Lecture Notes in Computer Science , vol.5949 , pp. 153-165
- Kalyanakrishnan, S.¹ Stone, P.²

16
- 0029732210
- Creating advice-taking reinforcement learners
- Maclin, R. and Shavlik, J., Creating advice-taking reinforcement learners, Lect. Notes Artif. Int. (1996) 251-281. (Pubitemid 126724368)
- (1996) Machine Learning , vol.22 , Issue.1-3 , pp. 251-281
- Maclin, R.¹ Shavlik, J.W.²

17
- 34547964974
- Automatic shaping and decomposition of reward functions
- ACM
- Marthi, B., Automatic shaping and decomposition of reward functions, in Proceedings of the 24th International Conference on Machine Learning (ACM, 2007), p. 608.
- (2007) Proceedings of the 24th International Conference on Machine Learning , pp. 608
- Marthi, B.¹

18
- 77950915046
- Decentralized learning in wireless sensor networks
- Mihaylov, M., Tuyls, K. and Nowé, A., Decentralized learning in wireless sensor networks, Adaptive and Learning Agents (2009), pp. 60-73.
- (2009) Adaptive and Learning Agents , pp. 60-73
- Mihaylov, M.¹ Tuyls, K.² Nowé, A.³

19
- 62949148941
- A study of reinforcement learning in a new multiagent domain
- Min, H., Zeng, J., Chen, J. and Zhu, J., A Study of reinforcement learning in a new multiagent domain, in IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, 2008. WI-IAT'08, Vol. 2 (2008).
- (2008) IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, 2008. WI-IAT'08 , vol.2
- Min, H.¹ Zeng, J.² Chen, J.³ Zhu, J.⁴

20
- 0001730497
- Non-cooperative games
- Nash, J., Non-cooperative games, Ann. Math. 54 (1951) 286-295.
- (1951) Ann. Math. , vol.54 , pp. 286-295
- Nash, J.¹

21
- 0141596576
- Policy invariance under reward transformations: Theory and application to reward shaping
- Ng, A. Y., Harada, D. and Russell, S. J., Policy invariance under reward transformations: Theory and application to reward shaping, in Proceedings of the 16th International Conference on Machine Learning (1999), pp. 278-287.
- (1999) Proceedings of the 16th International Conference on Machine Learning , pp. 278-287
- Ng, A.Y.¹ Harada, D.² Russell, S.J.³

22
- 34447553096
- Reinforcement learning for humanoid robotics
- Peters, J., Vijayakumar, S. and Schaal, S., Reinforcement learning for humanoid robotics, in Proceedings of Humanoids2003, Third IEEE-RAS International Conference on Humanoid Robots (2003).
- (2003) Proceedings of Humanoids2003, Third IEEE-RAS International Conference on Humanoid Robots
- Peters, J.¹ Vijayakumar, S.² Schaal, S.³

23
- 0003998452
- John Wiley & Sons, Inc., New York, NY, USA
- Puterman, M. L., Markov Decision Processes: Discrete Stochastic Dynamic Programming (John Wiley & Sons, Inc., New York, NY, USA, 1994).
- (1994) Markov Decision Processes: Discrete Stochastic Dynamic Programming
- Puterman, M.L.¹

24
- 1642401055
- Learning to drive a bicycle using reinforcement learning and shaping
- Randløv, J. and Alstrom, P., Learning to drive a bicycle using reinforcement learning and shaping, in Proceedings of the 15th International Conference on Machine Learning (1998), pp. 463-471.
- (1998) Proceedings of the 15th International Conference on Machine Learning , pp. 463-471
- Randløv, J.¹ Alstrom, P.²

25
- 34147161536
- If multi-agent learning is the answer, what is the question?
- DOI 10.1016/j.artint.2006.02.006, PII S0004370207000495, Foundations of Multi-Agent Learning
- Shoham, Y., Powers, R. and Grenager, T., If multi-agent learning is the answer, what is the question? Artif. Intell. 171 (2007) 365-377. (Pubitemid 46802421)
- (2007) Artificial Intelligence , vol.171 , Issue.7 , pp. 365-377
- Shoham, Y.¹ Powers, R.² Grenager, T.³

26
- 37249034293
- Keepaway soccer: From machine learning testbed to benchmark
- RoboCup 2005: Robot Soccer World Cup IX
- Stone, P., Kuhlmann, G., Taylor, M. E. and Liu, Y., Keepaway soccer: From machine learning testbed to benchmark, in RoboCup-2005: Robot Soccer World Cup IX, eds. Noda, I., Jacoff, A., Bredenfeld, A. and Takahashi, Y., Vol. 4020 (Springer-Verlag, Berlin, 2006), pp. 93-105. (Pubitemid 350278772)
- (2006) Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , vol.4020 LNAI , pp. 93-105
- Stone, P.¹ Kuhlmann, G.² Taylor, M.E.³ Liu, Y.⁴

27
- 27544506565
- Reinforcement learning for RoboCup-soccer keepaway
- Stone, P., Sutton, R. S. and Kuhlmann, G., Reinforcement learning for RoboCup-soccer keepaway, Adapt. Behav. 13 (2005) 165-188.
- (2005) Adapt. Behav. , vol.13 , pp. 165-188
- Stone, P.¹ Sutton, R.S.² Kuhlmann, G.³

28
- 85156221438
- Generalization in reinforcement learning: Successful examples using sparse coarse coding
- Sutton, R., Generalization in reinforcement learning: Successful examples using sparse coarse coding, Adv. Neur. In. (1996) 1038-1044.
- (1996) Adv. Neur. In. , pp. 1038-1044
- Sutton, R.¹

29
- 0003617454
- Ph.D. Thesis, Department of Computer Science, University of Massachusetts, Amherst
- Sutton, R. S., Temporal Credit Assignment in Reinforcement Learning, Ph.D. Thesis, Department of Computer Science, University of Massachusetts, Amherst (1984).
- (1984) Temporal Credit Assignment in Reinforcement Learning
- Sutton, R.S.¹

30
- 0004102479
- MIT Press
- Sutton, R. S. and Barto, A. G., Reinforcement Learning: An Introduction (MIT Press, 1998).
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

31
- 85152198941
- Multi-agent reinforcement learning: Independent vs. cooperative agents
- Tan, M., Multi-agent reinforcement learning: Independent vs. cooperative agents, in Proceedings of the Tenth International Conference on Machine Learning, Vol. 337 (1993).
- (1993) Proceedings of the Tenth International Conference on Machine Learning , vol.337
- Tan, M.¹

32
- 70349592320
- Learning from actions not taken in multiagent systems
- Tumer, K. and Khani, N., Learning from actions not taken in multiagent systems, Adv. Complex Syst. 12 (2009) 455-473.
- (2009) Adv. Complex Syst. , vol.12 , pp. 455-473
- Tumer, K.¹ Khani, N.²

33
- 85158118268
- Collective intelligence and Braess' paradox
- Tumer, K. and Wolpert, D., Collective intelligence and Braess' paradox, in Proceedings of the National Conference on Artificial Intelligence (2000), pp. 104-109.
- (2000) Proceedings of the National Conference on Artificial Intelligence , pp. 104-109
- Tumer, K.¹ Wolpert, D.²

34
- 27744448185
- Reinforcement learning to play an optimal Nash equilibrium in team Markov games
- Wang, X. and Sandholm, T., Reinforcement learning to play an optimal Nash equilibrium in team Markov games, Adv. Neur. In. (2003) 1603-1610.
- (2003) Adv. Neur. In. , pp. 1603-1610
- Wang, X.¹ Sandholm, T.²

35
- 27344453198
- Potential-based shaping and Q-value initialization are equivalent
- Wiewiora, E., Potential-based shaping and Q-value initialization are equivalent, J. Artif. Intell. Res. 19 (2003) 205-208. (Pubitemid 41525920)
- (2003) Journal of Artificial Intelligence Research , vol.19 , pp. 205-208
- Wiewiora, E.¹

36
- 0004320981
- An introduction to collective intelligence
- NASA Ames Research Center
- Wolpert, D. and Tumer, K., An introduction to collective intelligence, Technical Report cs.LG/9908014, NASA Ames Research Center (1999).
- (1999) Technical Report cs.LG/9908014
- Wolpert, D.¹ Tumer, K.²

37
- 0004285157
- John Wiley and Sons
- Wooldridge, M., An Introduction to MultiAgent Systems (John Wiley and Sons, 2002).
- (2002) An Introduction to MultiAgent Systems
- Wooldridge, M.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.