Volume , Issue , 2009, Pages 369-376

Dynamic analysis of multiagent Q-learning with ε-greedy exploration

Author keywords

[No Author keywords available]

Indexed keywords

CONTINUOUS TIME; GREEDY EXPLORATION; MULTI-AGENT; Q-LEARNING; Q-LEARNING AGENTS; SYSTEM OF DIFFERENCE EQUATIONS;

EID: 71149097863     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (106)

References (18)
  • 2
    • 0031281590
    • Learning through reinforcement and replicator dynamics
    • Borgers, T., & Sarin, R. (1997). Learning through reinforcement and replicator dynamics. Journal of Economic Theory, 77, 1-14.
    • (1997) Journal of Economic Theory , vol.77 , pp. 1-14
    • Borgers, T.1    Sarin, R.2
  • 3
    • 0033876515
    • The O.D.E. method for convergence of stochastic approximation and reinforcement learning
    • Borkar, V. S., & Meyn, S. P. (2000). The O.D.E. method for convergence of stochastic approximation and reinforcement learning. SIAM Journal on Control and Optimization, 38, 447-469.
  • 11
    • 70049111791
    • Learning teaching strategies in an adaptive and intelligent educational system through reinforcement learning
    • in press
    • Iglesias, A., Martínez, P., Aler, R., & Fernández, F. (2008). Learning teaching strategies in an adaptive and intelligent educational system through reinforcement learning. Applied Intelligence, in press.
    • (2008) Applied Intelligence
    • Iglesias, A.1    Martínez, P.2    Aler, R.3    Fernández, F.4
  • 13
    • 26444601262
    • Cooperative multi-agent learning: The state of the art
    • Panait, L., & Luke, S. (2005). Cooperative multi-agent learning: The state of the art. Autonomous Agents and Multi-Agent Systems, 11, 387-434.
    • (2005) Autonomous Agents and Multi-Agent Systems , vol.11 , pp. 387-434
    • Panait, L.1    Luke, S.2
  • 14
    • 41549123971
    • Theoretical advantages of lenient learners: An evolutionary game theoretic perspective
    • Panait, L., Tuyls, K., & Luke, S. (2008). Theoretical advantages of lenient learners: An evolutionary game theoretic perspective. Journal of Machine Learning Research, 9, 423-457.
    • (2008) Journal of Machine Learning Research , vol.9 , pp. 423-457
    • Panait, L.1    Tuyls, K.2    Luke, S.3
  • 17
    • 0346502047
    • Predicting the expected behavior of agents that learn about agents: The CLRI framework
    • Vidal, J. M., & Durfee, E. H. (2003). Predicting the expected behavior of agents that learn about agents: The CLRI framework. Autonomous Agents and Multi-Agent Systems, 6, 77-107.
    • (2003) Autonomous Agents and Multi-Agent Systems , vol.6 , pp. 77-107
    • Vidal, J.M.1    Durfee, E.H.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.