Volume 1, 2000, Pages 272-276

A novel multi-agent Q-learning algorithm in cooperative multi-agent system

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER SIMULATION; LEARNING ALGORITHMS; LEARNING SYSTEMS; OPTIMIZATION

EID: 0034591615     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 6

References (7)
  • 1
    • 0008614785
    • The dynamics of reinforcement learning in cooperative multi-agent systems
    • Providence
    • C. Claus and C. Boutilier. The Dynamics of Reinforcement Learning in Cooperative Multi-agent Systems. AAAI-97 Work. Multi-agent Learning, 1997, pp. 13-18, Providence.
    • (1997) AAAI-97 Work. Multi-agent Learning , pp. 13-18
    • Claus, C.1    Boutilier, C.2
  • 2
  • 3
    • 0040235739
    • Self-fulfilling bias in multi-agent learning
    • July
    • J.L. Hu and M.P. Wellman. Self-fulfilling bias in multi-agent learning. Proc. ICMAS-98, July 1998, pp. 118-125.
    • (1998) Proc. ICMAS-98 , pp. 118-125
    • Hu, J.L.1    Wellman, M.P.2
  • 4
    • 85152198941
    • Multi-agent reinforcement learning: Independent vs. cooperative agents
    • Amherst, MA
    • th Intl. Conf. on Machine Learning, 1993, pp. 330-337, Amherst, MA.
    • (1993) th Intl. Conf. on Machine Learning , pp. 330-337
    • Tan, M.1
  • 6
    • 0022738693
    • Decentralized learning in Markov chains
    • R.M. Wheeler and K.S. Narendra. Decentralized learning in Markov chains. IEEE Trans. Aut. Control, 1986, 31:519-526.
    • (1986) IEEE Trans. Aut. Control , vol.31 , pp. 519-526
    • Wheeler, R.M.1    Narendra, K.S.2
  • 7
    • 0000221289
    • Rational learning leads to Nash equilibrium
    • E. Kalai and E. Lehrer. Rational learning leads to Nash equilibrium. Econometrica, 1993, 61(5):1019-1045.
    • (1993) Econometrica , vol.61 , Issue.5 , pp. 1019-1045
    • Kalai, E.1    Lehrer, E.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.