SCOPUS Information Retrieval Platform

Volume , Issue , 2009, Pages 369-376

Dynamic analysis of multiagent Q-learning with ε-greedy exploration

Author keywords

[No Author keywords available]

Indexed keywords

CONTINUOUS TIME; GREEDY EXPLORATION; MULTI-AGENT; Q-LEARNING; Q-LEARNING AGENTS; SYSTEM OF DIFFERENCE EQUATIONS; DIFFERENCE EQUATIONS; DYNAMIC ANALYSIS; EDUCATION; INTELLIGENT AGENTS; ROBOT LEARNING; MATHEMATICAL MODELS

EID: 71149097863 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited: 106

References (18)

2
- 0031281590
- Learning through reinforcement and replicator dynamics
- Borgers, T., & Sarin, R. (1997). Learning through reinforcement and replicator dynamics. Journal of Economic Theory, 77, 1-14.
- (1997) Journal of Economic Theory , vol.77 , pp. 1-14
- Borgers, T.¹ Sarin, R.²

3
- 0033876515
- The O.D.E. method for convergence of stochastic approximation and reinforcement learning
- Borkar, V. S., & Meyn, S. P. (2000). The O.D.E. method for convergence of stochastic approximation and reinforcement learning. SIAM Journal on Control and Optimization, 38, 447-469.

4
- 0003781528
- New York: John Wiley and Sons
- Bush, R. R., & Mosteller, F. (1955). Stochastic models for learning. New York: John Wiley and Sons.
- (1955) Stochastic models for learning
- Bush, R.R.¹ Mosteller, F.²

6
- 84880861539
- Predicting and preventing coordination problems in cooperative Q-learning systems
- Fulda, N., & Ventura, D. (2007). Predicting and preventing coordination problems in cooperative Q-learning systems. Proceedings of the Twentieth International Joint Conference on Artificial Intelligence (IJCAI'07) (pp. 780-785).
- (2007) Proceedings of the Twentieth International Joint Conference on Artificial Intelligence (IJCAI'07) , pp. 780-785
- Fulda, N.¹ Ventura, D.²

9
- 58149457775
- Optimal local basis: A reinforcement learning approach for face recognition
- Harandi, M. T., Ahmadabadi, M. N., & Araabi, B. N. (2008). Optimal local basis: A reinforcement learning approach for face recognition. International Journal of Computer Vision, 81, 191-204.
- (2008) International Journal of Computer Vision , vol.81 , pp. 191-204
- Harandi, M.T.¹ Ahmadabadi, M.N.² Araabi, B.N.³

10
- 0003532627
- Cambridge University Press
- Hofbauer, J., & Sigmund, K. (1998). Evolutionary games and population dynamics. Cambridge University Press.
- (1998) Evolutionary games and population dynamics
- Hofbauer, J.¹ Sigmund, K.²

12
- 33645029191
- Individual Q-learning in normal form games
- Leslie, D. S., & Collins, E. J. (2005). Individual Q-learning in normal form games. SIAM Journal on Control and Optimization, 44, 495-514.
- (2005) SIAM Journal on Control and Optimization , vol.44 , pp. 495-514
- Leslie, D.S.¹ Collins, E.J.²

13
- 26444601262
- Cooperative multi-agent learning: The state of the art
- Panait, L., & Luke, S. (2005). Cooperative multi-agent learning: The state of the art. Autonomous Agents and Multi-Agent Systems, 11, 387-434.
- (2005) Autonomous Agents and Multi-Agent Systems , vol.11 , pp. 387-434
- Panait, L.¹ Luke, S.²

14
- 41549123971
- Theoretical advantages of lenient learners: An evolutionary game theoretic perspective
- Panait, L., Tuyls, K., & Luke, S. (2008). Theoretical advantages of lenient learners: An evolutionary game theoretic perspective. Journal of Machine Learning Research, 9, 423-457.
- (2008) Journal of Machine Learning Research , vol.9 , pp. 423-457
- Panait, L.¹ Tuyls, K.² Luke, S.³

15
- 0004102479
- Cambridge, MA: MIT Press
- Sutton, R. S., & Barto, A. G. (1998). Reinforcement learning: An introduction. Cambridge, MA: MIT Press.
- (1998) Reinforcement learning: An introduction
- Sutton, R.S.¹ Barto, A.G.²

17
- 0346502047
- Predicting the expected behavior of agents that learn about agents: The CLRI framework
- Vidal, J. M., & Durfee, E. H. (2003). Predicting the expected behavior of agents that learn about agents: The CLRI framework. Autonomous Agents and Multi-Agent Systems, 6, 77-107.
- (2003) Autonomous Agents and Multi-Agent Systems , vol.6 , pp. 77-107
- Vidal, J.M.¹ Durfee, E.H.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.