메뉴 건너뛰기




Volumn 14, Issue 2, 2011, Pages 251-278

An empirical study of potential-based reward shaping and advice in complex, multi-agent systems

Author keywords

multi agent; Reinforcement learning; reward shaping

Indexed keywords


EID: 79955403826     PISSN: 02195259     EISSN: None     Source Type: Journal    
DOI: 10.1142/S0219525911002998     Document Type: Conference Paper
Times cited : (84)

References (37)
  • 13
    • 4644369748 scopus 로고    scopus 로고
    • Nash Q-learning for general-sum stochastic games
    • Hu, J. andWellman, M., Nash Q-learning for general-sum stochastic games, J. Mach. Learn. Res. 4 (2003) 1039-1069.
    • (2003) J. Mach. Learn. Res. , vol.4 , pp. 1039-1069
    • Hu, J.1    Wellman, M.2
  • 15
    • 77950988223 scopus 로고    scopus 로고
    • Learning complementary multiagent behaviors: A case study
    • RoboCup 2009: Robot Soccer World Cup XIII, eds. Baltes, J., Lagoudakis, M., Naruse, T. and Ghidary, S., Springer Berlin/Heidelberg
    • Kalyanakrishnan, S. and Stone, P., Learning complementary multiagent behaviors: A case study, in RoboCup 2009: Robot Soccer World Cup XIII, eds. Baltes, J., Lagoudakis, M., Naruse, T. and Ghidary, S., Lecture Notes in Computer Science, Vol. 5949 (Springer Berlin/Heidelberg, 2010), pp. 153-165.
    • (2010) Lecture Notes in Computer Science , vol.5949 , pp. 153-165
    • Kalyanakrishnan, S.1    Stone, P.2
  • 16
    • 0029732210 scopus 로고    scopus 로고
    • Creating advice-taking reinforcement learners
    • Maclin, R. and Shavlik, J., Creating advice-taking reinforcement learners, Lect. Notes Artif. Int. (1996) 251-281. (Pubitemid 126724368)
    • (1996) Machine Learning , vol.22 , Issue.1-3 , pp. 251-281
    • Maclin, R.1    Shavlik, J.W.2
  • 20
    • 0001730497 scopus 로고
    • Non-cooperative games
    • Nash, J., Non-cooperative games, Ann. Math. 54 (1951) 286-295.
    • (1951) Ann. Math. , vol.54 , pp. 286-295
    • Nash, J.1
  • 25
    • 34147161536 scopus 로고    scopus 로고
    • If multi-agent learning is the answer, what is the question?
    • DOI 10.1016/j.artint.2006.02.006, PII S0004370207000495, Foundations of Multi-Agent Learning
    • Shoham, Y., Powers, R. and Grenager, T., If multi-agent learning is the answer, what is the question? Artif. Intell. 171 (2007) 365-377. (Pubitemid 46802421)
    • (2007) Artificial Intelligence , vol.171 , Issue.7 , pp. 365-377
    • Shoham, Y.1    Powers, R.2    Grenager, T.3
  • 27
    • 27544506565 scopus 로고    scopus 로고
    • Reinforcement learning for RoboCupsoccer keepaway
    • Stone, P., Sutton, R. S. and Kuhlmann, G., Reinforcement learning for RoboCupsoccer keepaway, Adapt. Behav. 13 (2005) 165-188.
    • (2005) Adapt. Behav. , vol.13 , pp. 165-188
    • Stone, P.1    Sutton, R.S.2    Kuhlmann, G.3
  • 28
    • 85156221438 scopus 로고    scopus 로고
    • Generalization in reinforcement learning: Successful examples using sparse coarse coding
    • Sutton, R., Generalization in reinforcement learning: Successful examples using sparse coarse coding, Adv. Neur. In. (1996) 1038-1044.
    • (1996) Adv. Neur. In. , pp. 1038-1044
    • Sutton, R.1
  • 29
  • 32
    • 70349592320 scopus 로고    scopus 로고
    • Learning from actions not taken in multiagent systems
    • Tumer, K. and Khani, N., Learning from actions not taken in multiagent systems, Adv. Complex Syst. 12 (2009) 455-473.
    • (2009) Adv. Complex Syst. , vol.12 , pp. 455-473
    • Tumer, K.1    Khani, N.2
  • 34
    • 27744448185 scopus 로고    scopus 로고
    • Reinforcement learning to play an optimal Nash equilibrium in team Markov games
    • Wang, X. and Sandholm, T., Reinforcement learning to play an optimal Nash equilibrium in team Markov games, Adv. Neur. In. (2003) 1603-1610.
    • (2003) Adv. Neur. In. , pp. 1603-1610
    • Wang, X.1    Sandholm, T.2
  • 35
    • 27344453198 scopus 로고    scopus 로고
    • Potential-based shaping and Q-value initialization are equivalent
    • Wiewiora, E., Potential-based shaping and Q-value initialization are equivalent, J. Artif. Intell. Res. 19 (2003) 205-208. (Pubitemid 41525920)
    • (2003) Journal of Artificial Intelligence Research , vol.19 , pp. 205-208
    • Wiewiora, E.1
  • 36
    • 0004320981 scopus 로고    scopus 로고
    • An introduction to collective intelligence
    • NASA Ames Research Center
    • Wolpert, D. and Tumer, K., An introduction to collective intelligence, Technical Report cs.LG/9908014, NASA Ames Research Center (1999).
    • (1999) Technical Report cs.LG/9908014
    • Wolpert, D.1    Tumer, K.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.