SCOPUS 정보 검색 플랫폼

Autonomous Agents and Multi-Agent Systems

Volumn 14, Issue 3, 2007, Pages 239-269

Exploring selfish reinforcement learning in repeated games with stochastic rewards

(4) Verbeeck, Katja a Nowé, Ann a Parent, Johan a Tuyls, Karl b

a VRIJE UNIVERSITEIT BRUSSEL (Belgium)

b MAASTRICHT UNIVERSITY (Netherlands)

Author keywords

Learning automata; Multi agent reinforcement learning; Non zero sum games

Indexed keywords

EID: 34247642270 PISSN: 13872532 EISSN: 15737454 Source Type: Journal
DOI: 10.1007/s10458-006-9007-0 Document Type: Article

Times cited : (43)

References (26)

1
- 0002430114
- Subjectivity and correlation in randomized strategies
- Aumann, R. (1974). Subjectivity and correlation in randomized strategies. Journal of Mathematical Economics, 1, 67-96.
- (1974) Journal of Mathematical Economics , vol.1 , pp. 67-96
- Aumann, R.¹

2
- 4644369644
- Learning to coordinate efficiently: A model-based approach
- Brafman, R., & Tennenholtz, M. (2003). Learning to coordinate efficiently: A model-based approach. Journal on Artificial Intelligence Research (JAIR), 19, 11-23.
- (2003) Journal on Artificial Intelligence Research (JAIR) , vol.19 , pp. 11-23
- Brafman, R.¹ Tennenholtz, M.²

3
- 34247620375
- Baselines for joint-action reinforcement learning of coordination in cooperative multi-agent systems
- Carpenter, M., & Kudenko, D. (2004). Baselines for joint-action reinforcement learning of coordination in cooperative multi-agent systems. In Proceedings of the 4th symposium on adaptive agents and multi-agent systems, (AISB04) Society for the study of Artificial Intelligence and Simulation of Behaviour (pp. 10-19).
- (2004) Proceedings of the 4th symposium on adaptive agents and multi-agent systems, (AISB04) Society for the study of Artificial Intelligence and Simulation of Behaviour , pp. 10-19
- Carpenter, M.¹ Kudenko, D.²

4
- 0031630561
- The dynamics of reinforcement learning in cooperative multiagent systems
- Claus, C., & Boutilier, C. (1998). The dynamics of reinforcement learning in cooperative multiagent systems. In Proceedings of the 15th national conference on artificial intelligence (pp. 746-752).
- (1998) Proceedings of the 15th national conference on artificial intelligence , pp. 746-752
- Claus, C.¹ Boutilier, C.²

5
- 0003635251
- MIT Press
- Geist, A., & Beguelin, A. (1994). PVM: Parallel virtual machine. MIT Press.
- (1994) PVM: Parallel virtual machine
- Geist, A.¹ Beguelin, A.²

6
- 0003860985
- Princeton, New Jersey: Princeton University Press
- Gintis, H. (2000). Game theory evolving: A problem-centered introduction to modeling strategic behavior. Princeton, New Jersey: Princeton University Press.
- (2000) Game theory evolving: A problem-centered introduction to modeling strategic behavior
- Gintis, H.¹

7
- 1942517280
- Correlated q-learning
- Greenwald, A., & Hall, K. (2003). Correlated q-learning. In Proceedings of the twentieth international conference on machine learning (pp. 242-249).
- (2003) Proceedings of the twentieth international conference on machine learning , pp. 242-249
- Greenwald, A.¹ Hall, K.²

8
- 4644369748
- Nash q-learning for general-sum stochastic games
- Hu, J., & Wellman, M. (2003). Nash q-learning for general-sum stochastic games. Journal of Machine Learning Research, 4, 1039-1069.
- (2003) Journal of Machine Learning Research , vol.4 , pp. 1039-1069
- Hu, J.¹ Wellman, M.²

9
- 0036932299
- Reinforcement learning of coordination in cooperative multi-agent systems
- Kapetanakis, S., & Kudenko, D. (2002). Reinforcement learning of coordination in cooperative multi-agent systems. In Proceedings of the 18th national conference on artificial intelligence (pp. 326-331).
- (2002) Proceedings of the 18th national conference on artificial intelligence , pp. 326-331
- Kapetanakis, S.¹ Kudenko, D.²

10
- 4143053349
- Learning to coordinate using commitment sequences in cooperative multi-agent systems
- Society for the study of Artificial Intelligence and Simulation of Behaviour
- Kapetanakis, S., Kudenko, D., & Strens, M. (2003). Learning to coordinate using commitment sequences in cooperative multi-agent systems. In Proceedings of the 3rd symposium on adaptive agents and multi-agent systems, (AISB03) Society for the study of Artificial Intelligence and Simulation of Behaviour.
- (2003) Proceedings of the 3rd symposium on adaptive agents and multi-agent systems, (AISB03)
- Kapetanakis, S.¹ Kudenko, D.² Strens, M.³

11
- 0012286079
- An algorithm for distributed reinforcement learning in cooperative multi-agent systems
- Lauer, M., & Riedmiller, M. (2000). An algorithm for distributed reinforcement learning in cooperative multi-agent systems. In Proceedings of the 17th international conference on machine learning (pp. 535-542).
- (2000) Proceedings of the 17th international conference on machine learning , pp. 535-542
- Lauer, M.¹ Riedmiller, M.²

12
- 0000707715
- Markov games as a framework for multi-agent reinforcement learning
- Littman, M. (1994). Markov games as a framework for multi-agent reinforcement learning. In Proceedings of the 11th international conference on machine learning (pp. 322-328).
- (1994) Proceedings of the 11th international conference on machine learning , pp. 322-328
- Littman, M.¹

13
- 0242466944
- Friend-or-foe q-learning in general-sum games
- Littman, M. (2001). Friend-or-foe q-learning in general-sum games. In Proceedings of the 18th international conference on machine learning (pp. 157-163).
- (2001) Proceedings of the 18th international conference on machine learning , pp. 157-163
- Littman, M.¹

14
- 0001961616
- A generalized reinforcement-learning model: Convergence and applications
- Littman, M., & Szepesvári, C. (1996). A generalized reinforcement-learning model: Convergence and applications. In Proceedings of the 13th international conference on machine learning (pp. 310-318).
- (1996) Proceedings of the 13th international conference on machine learning , pp. 310-318
- Littman, M.¹ Szepesvári, C.²

15
- 0003891507
- Prentice-Hall International, Inc
- Narendra, K., & Thathachar, M. (1989). Learning automata: An introduction. Prentice-Hall International, Inc.
- (1989) Learning automata: An introduction
- Narendra, K.¹ Thathachar, M.²

16
- 0002021736
- Equilibrium points in n-person games
- Nash, J. (1950). Equilibrium points in n-person games. Proceedings of the national academy of siences 36, 48-49.
- (1950) Proceedings of the national academy of siences , vol.36 , pp. 48-49
- Nash, J.¹

17
- 84948131383
- Social agents playing a periodical policy
- Freiburg, Germany: Springer-Verlag LNAI2168
- Nowé, A., Parent, J., & Verbeeck, K. (2001). Social agents playing a periodical policy. In Proceedings of the 12th European conference on machine learning pp. 382-393. Freiburg, Germany: Springer-Verlag LNAI2168.
- (2001) Proceedings of the 12th European conference on machine learning , pp. 382-393
- Nowé, A.¹ Parent, J.² Verbeeck, K.³

18
- 0003427725
- Cambridge, MA: MIT Press
- Osborne, J., & Rubinstein, A. (1994). A course in game theory. Cambridge, MA: MIT Press.
- (1994) A course in game theory
- Osborne, J.¹ Rubinstein, A.²

19
- 0004151788
- Cambridge, MA: MIT Press
- Samuelson, L. (1997). Evolutionary games and equilibrium selection. Cambridge, MA: MIT Press.
- (1997) Evolutionary games and equilibrium selection
- Samuelson, L.¹

20
- 0028423534
- Decentralized learning of nash equilibria in multi-person stochastic games with incomplete information
- Sastry, P., Phansalkar, V., & Thathachar, M. (1994). Decentralized learning of nash equilibria in multi-person stochastic games with incomplete information. IEEE Transactions on Systems, Man, and Cybernetics, 24(5), 769-777.
- (1994) IEEE Transactions on Systems, Man, and Cybernetics , vol.24 , Issue.5 , pp. 769-777
- Sastry, P.¹ Phansalkar, V.² Thathachar, M.³

21
- 0004102479
- Cambridge, MA: MIT Press
- Sutton, R., & Barto, A. (1998). Reinforcement learning: An introduction. Cambridge, MA: MIT Press.
- (1998) Reinforcement learning: An introduction
- Sutton, R.¹ Barto, A.²

22
- 0036894214
- Varieties of learning automata: An overview
- Thathachar, M., & Sastry, P. (2002). Varieties of learning automata: An overview. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 32(6), 711-722.
- (2002) IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics , vol.32 , Issue.6 , pp. 711-722
- Thathachar, M.¹ Sastry, P.²

23
- 0028497630
- Asynchronous stochastic approximation and q-learning
- Tsitsiklis, J. (1994). Asynchronous stochastic approximation and q-learning. Machine Learning, 16, 185-202.
- (1994) Machine Learning , vol.16 , pp. 185-202
- Tsitsiklis, J.¹

24
- 34247615600
- PhD Thesis, Computational Modeling Lab, Vrije Universiteit Brussel, Belgium
- Tuyls, K. (2004). Multiagent reinforcement learning: A game theoretic approach. PhD Thesis, Computational Modeling Lab, Vrije Universiteit Brussel, Belgium.
- (2004) Multiagent reinforcement learning: A game theoretic approach
- Tuyls, K.¹

25
- 33644810504
- PhD Thesis, Computational Modeling Lab, Vrije Universiteit Brussel, Belgium
- Verbeeck, K. (2004). Coordinated exploration in multi-agent reinforcement learning. PhD Thesis, Computational Modeling Lab, Vrije Universiteit Brussel, Belgium.
- (2004) Coordinated exploration in multi-agent reinforcement learning
- Verbeeck, K.¹

26
- 7044229393
- Homo egualis reinforcement learning agents for load balancing
- Proceedings of the 1st NASA workshop on radical agent concepts, pp, Springer-Verlag
- Verbeeck, K., Nowé, A., & Parent, J. (2002). Homo egualis reinforcement learning agents for load balancing. In Proceedings of the 1st NASA workshop on radical agent concepts, pp. 81-91. Springer-Verlag LNAI 2564.
- (2002) LNAI , vol.2564 , pp. 81-91
- Verbeeck, K.¹ Nowé, A.² Parent, J.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.