Volume 14, Issue 3, 2007, Pages 239-269

Exploring selfish reinforcement learning in repeated games with stochastic rewards

Author keywords

Learning automata; Multi agent reinforcement learning; Non zero sum games

EID: 34247642270     PISSN: 13872532     EISSN: 15737454     Source Type: Journal    
DOI: 10.1007/s10458-006-9007-0     Document Type: Article
Times cited: 43

References (26)
  • 1
    • 0002430114
    • Aumann, R. (1974). Subjectivity and correlation in randomized strategies. Journal of Mathematical Economics, 1, 67-96.
  • 8
    • 4644369748
    • Hu, J., & Wellman, M. (2003). Nash Q-learning for general-sum stochastic games. Journal of Machine Learning Research, 4, 1039-1069.
  • 10
  • 20
    • 0028423534
    • Sastry, P., Phansalkar, V., & Thathachar, M. (1994). Decentralized learning of Nash equilibria in multi-person stochastic games with incomplete information. IEEE Transactions on Systems, Man, and Cybernetics, 24(5), 769-777.
  • 23
    • 0028497630
    • Tsitsiklis, J. (1994). Asynchronous stochastic approximation and Q-learning. Machine Learning, 16, 185-202.
  • 26
    • 7044229393
    • Verbeeck, K., Nowé, A., & Parent, J. (2002). Homo egualis reinforcement learning agents for load balancing. In Proceedings of the 1st NASA workshop on radical agent concepts, pp. 81-91. Springer-Verlag LNAI 2564.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.