SCOPUS 정보 검색 플랫폼

IJCAI International Joint Conference on Artificial Intelligence

Volumn , Issue , 2005, Pages 817-822

Learning against opponents with bounded memory

(2) Powers, Rob a Shoham, Yoav a

a Stanford University (United States)

Author keywords

[No Author keywords available]

Indexed keywords

BEST RESPONSE; BOUNDED MEMORY; EMPIRICAL TEST; MULTI-AGENT SETTING; SELF-PLAY;

MULTI AGENT SYSTEMS;

ALGORITHMS;

EID: 33745609272 PISSN: 10450823 EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (85)

References (25)

1
- 84936824515
- Basic Books, New York
- Robert Axelrod. The Evolution of Cooperation. Basic Books, New York, 1984.
- (1984) The Evolution of Cooperation
- Axelrod, R.¹

2
- 0036531878
- Multiagent learning using a variable learning rate
- Michael Bowling and Manuela Veloso. Multiagent learning using a variable learning rate. Artificial Intelligence, 136:215-250, 2002.
- (2002) Artificial Intelligence , vol.136 , pp. 215-250
- Bowling, M.¹ Veloso, M.²

3
- 84899027977
- Convergence and no-regret in multiagent learning
- MIT Press
- Michael Bowling. Convergence and no-regret in multiagent learning. In Advances in Neural Information Processing Systems 17. MIT Press, 2005.
- (2005) Advances in Neural Information Processing Systems 17
- Bowling, M.¹

4
- 0002672918
- Iterative solution of games by fictitious play
- John Wiley and Sons, New York
- George Brown. Iterative solution of games by fictitious play. In Activity Analysis of Production and Allocation. John Wiley and Sons, New York, 1951.
- (1951) Activity Analysis of Production and Allocation
- Brown, G.¹

5
- 84898960502
- Playing is believing: The role of beliefs in multi-agent learning
- Yu-Han Chang and Leslie Pack Kaelbling. Playing is believing: The role of beliefs in multi-agent learning. In Advances in Neural Information Processing Systems 14, pages 1483-1490, 2002.
- (2002) Advances in Neural Information Processing Systems 14 , pp. 1483-1490
- Chang, Y.-H.¹ Kaelbling, L.P.²

6
- 0026998041
- Reinforcement learning with perceptual aliasing: The perceptual distinctions approach
- Lonnie Chrisman. Reinforcement learning with perceptual aliasing: The perceptual distinctions approach. In Proceedings of the Tenth National Conference on Artificial Intelligence, pages 183-188, 1992.
- (1992) Proceedings of the Tenth National Conference on Artificial Intelligence , pp. 183-188
- Chrisman, L.¹

7
- 0031630561
- The dynamics of reinforcement learning in cooperative multiagent systems
- Caroline Claus and Craig Boutilier. The dynamics of reinforcement learning in cooperative multiagent systems. In Proceedings of the Fifteenth National Conference on Artificial Intelligence, pages 746-752, 1998.
- (1998) Proceedings of the Fifteenth National Conference on Artificial Intelligence , pp. 746-752
- Claus, C.¹ Boutilier, C.²

8
- 1942421183
- Awesome: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents
- Vincent Conitzer and Tuomas Sandholm. Awesome: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents. In Proceedings of the 20th International Conference on Machine Learning, pages 83-90, 2003.
- (2003) Proceedings of the 20th International Conference on Machine Learning , pp. 83-90
- Conitzer, V.¹ Sandholm, T.²

9
- 33845304828
- How to combine expert (or novice) advice when actions impact the environment
- Daniela Pucci de Farias and Nimrod Megiddo. How to combine expert (or novice) advice when actions impact the environment. In Advances in Neural Information Processing Systems 16, 2004.
- (2004) Advances in Neural Information Processing Systems 16
- Pucci De Farias, D.¹ Megiddo, N.²

10
- 0000668347
- Universal consistency and cautious fictitious play
- Drew Fudenberg and David Levine. Universal consistency and cautious fictitious play. Journal of Economic Dynamics and Control, 19:1065-1089, 1995.
- (1995) Journal of Economic Dynamics and Control , vol.19 , pp. 1065-1089
- Fudenberg, D.¹ Levine, D.²

11
- 0008547696
- Conditional universal consistency
- Drew Fudenberg and David Levine. Conditional universal consistency. Games and Economic Behavior, 29:104-130, 1999.
- (1999) Games and Economic Behavior , vol.29 , pp. 104-130
- Fudenberg, D.¹ Levine, D.²

12
- 0000908510
- A simple adaptive procedure leading to correlated equilibrium
- Sergiu Hart and Andreu Mas-Colell. A simple adaptive procedure leading to correlated equilibrium. Econometrica, 68:1127-1150, 2000.
- (2000) Econometrica , vol.68 , pp. 1127-1150
- Hart, S.¹ Mas-Colell, A.²

13
- 0001069505
- On the distribution of the number of successes in independent trials
- Wassily Hoeffding. On the distribution of the number of successes in independent trials. Annals of Mathematical Statistics, 27:713-721, 1956.
- (1956) Annals of Mathematical Statistics , vol.27 , pp. 713-721
- Hoeffding, W.¹

14
- 9444236608
- On no-regret learning, fictitious play, and nash equilibrium
- Amir Jafari, Amy Greenwald, David Gondek, and Gunes Ercal. On no-regret learning, fictitious play, and nash equilibrium. In Proceedings of the Eighteenth International Conference on Machine Learning, pages 226-223, 2001.
- (2001) Proceedings of the Eighteenth International Conference on Machine Learning , pp. 226-1223
- Jafari, A.¹ Greenwald, A.² Gondek, D.³ Ercal, G.⁴

15
- 0000221289
- Rational learning leads to nash equilibrium
- Ehud Kalai and Ehud Lehrer. Rational learning leads to nash equilibrium. Econometrica, 61(5):1019-1045, 1993.
- (1993) Econometrica , vol.61 , Issue.5 , pp. 1019-1045
- Kalai, E.¹ Lehrer, E.²

16
- 84880774150
- A portfolio approach to algorithm selection
- Kevin Leyton-Brown, Eugene Nudelman, Galen Andrew, Jim McFadden, and Yoav Shoham. A portfolio approach to algorithm selection. In Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence, 2003.
- Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence, 2003
- Leyton-Brown, K.¹ Nudelman, E.² Andrew, G.³ McFadden, J.⁴ Shoham, Y.⁵

17
- 80053136974
- Implicit negotiation in repeated games
- Michael Littman and Peter Stone. Implicit negotiation in repeated games. In Proceedings of The Eighth International Workshop on Agent Theories, Architectures, and Languages, pages 393-404, 2001.
- (2001) Proceedings of the Eighth International Workshop on Agent Theories, Architectures, and Languages , pp. 393-404
- Littman, M.¹ Stone, P.²

18
- 85149834820
- Markov games as a framework for multi-agent reinforcement learning
- Michael L. Littman. Markov games as a framework for multi-agent reinforcement learning. In Proceedings of the 11th International Conference on Machine Learning, pages 157-163, 1994.
- (1994) Proceedings of the 11th International Conference on Machine Learning , pp. 157-163
- Littman, M.L.¹

19
- 0000614213
- Bounded complexity justifies cooperation in finitely repeated prisoner's dilemma
- Abraham Neyman. Bounded complexity justifies cooperation in finitely repeated prisoner's dilemma. Economic Letters, pages 227-229, 1985.
- (1985) Economic Letters , pp. 227-229
- Neyman, A.¹

20
- 33645654883
- Learning probabilistic models for decision-theoretic navigation of mobile robots
- Daniel Nikovski and Illah Nourbakhsh. Learning probabilistic models for decision-theoretic navigation of mobile robots. In Proceedings of the International Conference on Machine Learning, pages 266-274, 2000.
- (2000) Proceedings of the International Conference on Machine Learning , pp. 266-274
- Nikovski, D.¹ Nourbakhsh, I.²

21
- 4544335718
- Run the gamut: A comprehensive approach to evaluating game-theorectic algorithms
- Eugene Nudelman, Jenn Wortman, Kevin Leyton-Brown, and Yoav Shoham. Run the gamut: A comprehensive approach to evaluating game-theorectic algorithms. AAMAS, 2004.
- (2004) AAMAS
- Nudelman, E.¹ Wortman, J.² Leyton-Brown, K.³ Shoham, Y.⁴

22
- 0027928808
- On complexity as bounded rationality
- Christos H. Papadimitriou and Mihalis Yannakakis. On complexity as bounded rationality. In STOC-94, pages 726-733, 1994.
- (1994) STOC-94 , pp. 726-733
- Papadimitriou, C.H.¹ Yannakakis, M.²

23
- 84898936075
- New criteria and a new algorithm for learning in multiagent systems
- MIT Press
- Rob Powers and Yoav Shoham. New criteria and a new algorithm for learning in multiagent systems. In Advances in Neural Information Processing Systems 17. MIT Press, 2005.
- (2005) Advances in Neural Information Processing Systems 17
- Powers, R.¹ Shoham, Y.²

24
- 84898941549
- Extending q-learning to general adaptive multi-agent systems
- Gerald Tesauro. Extending q-learning to general adaptive multi-agent systems. In Advances in Neural Information Processing Systems, volume 16, 2004.
- (2004) Advances in Neural Information Processing Systems , vol.16
- Tesauro, G.¹

25
- 34249833101
- Technical note: Q-learning
- May
- Chris Watkins and Peter Dayan. Technical note: Q-learning. Machine Learning, 8(3/4):279-292, May 1992.
- (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 279-292
- Watkins, C.¹ Dayan, P.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.