SCOPUS Information Retrieval Platform

Artificial Intelligence

Volume 171, Issue 7, 2007, Pages 365-377

If multi-agent learning is the answer, what is the question?

(3) Shoham, Yoav a; Powers, Rob a; Grenager, Trond a

a Stanford University (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; GAME THEORY; LEARNING SYSTEMS; FOUNDATIONAL QUESTIONS; MULTI-AGENT LEARNING; MULTI AGENT SYSTEMS

EID: 34147161536 PISSN: 00043702 EISSN: None Source Type: Journal
DOI: 10.1016/j.artint.2006.02.006 Document Type: Article

Times cited : (345)

References (55)

1
- 0000428680
- Rationality of self and others in an economic system
- Arrow K. Rationality of self and others in an economic system. Journal of Business 59 4 (1986)
- (1986) Journal of Business , vol.59 , Issue.4
- Arrow, K.¹

2
- 34249056307
- B. Banerjee, J. Peng, Efficient no-regret multiagent learning, in: AAAI, 2005

3
- 85012688561
- Princeton University Press
- Bellman R. Dynamic Programming (1957), Princeton University Press
- (1957) Dynamic Programming
- Bellman, R.¹

4
- 84880840280
- D. Billings, N. Burch, A. Davidson, R. Holte, J. Schaeffer, T. Schauenberg, D. Szafron, Approximating game-theoretic optimal strategies for full-scale poker, in: The Eighteenth International Joint Conference on Artificial Intelligence, 2003

5
- 0013371249
- Controlled random walks
- North-Holland, Amsterdam
- Blackwell D. Controlled random walks. Proceedings of the International Congress of Mathematicians vol. 3 (1956), North-Holland, Amsterdam 336-338
- (1956) Proceedings of the International Congress of Mathematicians , vol.3 , pp. 336-338
- Blackwell, D.¹

6
- 84899027977
- Convergence and no-regret in multiagent learning
- MIT Press, Cambridge, MA
- Bowling M. Convergence and no-regret in multiagent learning. Advances in Neural Information Processing Systems vol. 17 (2005), MIT Press, Cambridge, MA
- (2005) Advances in Neural Information Processing Systems , vol.17
- Bowling, M.¹

7
- 84880865940
- M. Bowling, M. Veloso, Rational and convergent learning in stochastic games, in: Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence, 2001

8
- 0041965975
- R-max, a general polynomial time algorithm for near-optimal reinforcement learning
- Brafman R., and Tennenholtz M. R-max, a general polynomial time algorithm for near-optimal reinforcement learning. Journal of Machine Learning Research 3 (2002) 213-231
- (2002) Journal of Machine Learning Research , vol.3 , pp. 213-231
- Brafman, R.¹ Tennenholtz, M.²

9
- 4544271516
- Efficient learning equilibrium
- Brafman R., and Tennenholtz M. Efficient learning equilibrium. Artificial Intelligence 159 1-2 (2004) 27-47
- (2004) Artificial Intelligence , vol.159 , Issue.1-2 , pp. 27-47
- Brafman, R.¹ Tennenholtz, M.²

10
- 0002672918
- Iterative solution of games by fictitious play
- John Wiley and Sons, New York
- Brown G. Iterative solution of games by fictitious play. Activity Analysis of Production and Allocation (1951), John Wiley and Sons, New York
- (1951) Activity Analysis of Production and Allocation
- Brown, G.¹

11
- 0036268277
- Sophisticated EWA learning and strategic teaching in repeated games
- Camerer C., Ho T., and Chong J. Sophisticated EWA learning and strategic teaching in repeated games. Journal of Economic Theory 104 (2002) 137-188
- (2002) Journal of Economic Theory , vol.104 , pp. 137-188
- Camerer, C.¹ Ho, T.² Chong, J.³

12
- 4544279432
- Y.-H. Chang, T. Ho, L.P. Kaelbling, Mobilized ad-hoc networks: A reinforcement learning approach, in: 1st International Conference on Autonomic Computing (ICAC 2004), 2004, pp. 240-247

13
- 10444238075
- Walverine: A Walrasian trading agent
- Cheng S.-F., Leung E., Lochner K.M., O'Malley K., Reeves D.M., Schvartzman L.J., and Wellman M.P. Walverine: A Walrasian trading agent. Decision Support Systems 39 (2005) 169-184
- (2005) Decision Support Systems , vol.39 , pp. 169-184
- Cheng, S.-F.¹ Leung, E.² Lochner, K.M.³ O'Malley, K.⁴ Reeves, D.M.⁵ Schvartzman, L.J.⁶ Wellman, M.P.⁷

14
- 0031630561
- C. Claus, C. Boutilier, The dynamics of reinforcement learning in cooperative multiagent systems, in: Proceedings of the Fifteenth National Conference on Artificial Intelligence, 1998, pp. 746-752

15
- 0038829878
- Predicting how people play games: reinforcement learning in experimental games with unique, mixed strategy equilibria
- Erev I., and Roth A.E. Predicting how people play games: reinforcement learning in experimental games with unique, mixed strategy equilibria. The American Economic Review 88 4 (1998) 848-881
- (1998) The American Economic Review , vol.88 , Issue.4 , pp. 848-881
- Erev, I.¹ Roth, A.E.²

16
- 0002476325
- Regret in the on-line decision problem
- Foster D., and Vohra R. Regret in the on-line decision problem. Games and Economic Behavior 29 (1999) 7-36
- (1999) Games and Economic Behavior , vol.29 , pp. 7-36
- Foster, D.¹ Vohra, R.²

17
- 84983110889
- A decision-theoretic generalization of on-line learning and an application to boosting
- Springer-Verlag, Berlin
- Freund Y., and Schapire R.E. A decision-theoretic generalization of on-line learning and an application to boosting. Computational Learning Theory: Proceedings of the Second European Conference (1995), Springer-Verlag, Berlin 23-37
- (1995) Computational Learning Theory: Proceedings of the Second European Conference , pp. 23-37
- Freund, Y.¹ Schapire, R.E.²

18
- 0000466473
- Learning mixed equilibria
- Fudenberg D., and Kreps D. Learning mixed equilibria. Games and Economic Behavior 5 (1993) 320-367
- (1993) Games and Economic Behavior , vol.5 , pp. 320-367
- Fudenberg, D.¹ Kreps, D.²

19
- 0000668347
- Universal consistency and cautious fictitious play
- Fudenberg D., and Levine D. Universal consistency and cautious fictitious play. Journal of Economic Dynamics and Control 19 (1995) 1065-1089
- (1995) Journal of Economic Dynamics and Control , vol.19 , pp. 1065-1089
- Fudenberg, D.¹ Levine, D.²

20
- 0004247096
- MIT Press, Cambridge, MA
- Fudenberg D., and Levine D.K. The Theory of Learning in Games (1998), MIT Press, Cambridge, MA
- (1998) The Theory of Learning in Games
- Fudenberg, D.¹ Levine, D.K.²

21
- 1942517280
- A. Greenwald, K. Hall, Correlated Q-learning, in: Proceedings of the Twentieth International Conference on Machine Learning, 2003, pp. 242-249

22
- 34249052651
- C. Guestrin, D. Koller, R. Parr, Multiagent planning with factored MDPs, in: Advances in Neural Information Processing Systems (NIPS-14), 2001

23
- 0001976283
- Approximation to Bayes risk in repeated plays
- Hannan J.F. Approximation to Bayes risk in repeated plays. Contributions to the Theory of Games 3 (1957) 97-139
- (1957) Contributions to the Theory of Games , vol.3 , pp. 97-139
- Hannan, J.F.¹

24
- 0000908510
- A simple adaptive procedure leading to correlated equilibrium
- Hart S., and Mas-Colell A. A simple adaptive procedure leading to correlated equilibrium. Econometrica 68 (2000) 1127-1150
- (2000) Econometrica , vol.68 , pp. 1127-1150
- Hart, S.¹ Mas-Colell, A.²

25
- 4644369748
- Nash Q-learning for general-sum stochastic games
- Hu J., and Wellman M. Nash Q-learning for general-sum stochastic games. Journal of Machine Learning Research 4 (2003) 1039-1069
- (2003) Journal of Machine Learning Research , vol.4 , pp. 1039-1069
- Hu, J.¹ Wellman, M.²

26
- 34249074572
- J. Hu, M.P. Wellman, Multiagent reinforcement learning: Theoretical framework and an algorithm, in: Proceedings of the Fifteenth International Conference on Machine Learning, 1998, pp. 242-250

27
- 34249071716
- A. Jafari, A. Greenwald, D. Gondek, G. Ercal, On no-regret learning, fictitious play, and Nash equilibrium, in: Proceedings of the Eighteenth International Conference on Machine Learning, 2001

28
- 1142305713
- Learning to play games in extensive form by valuation
- Jehiel P., and Samet D. Learning to play games in extensive form by valuation. NAJ Economics 3 (2001)
- (2001) NAJ Economics , vol.3
- Jehiel, P.¹ Samet, D.²

29
- 0029679044
- Reinforcement learning: A survey
- Kaelbling L.P., Littman M.L., and Moore A.W. Reinforcement learning: A survey. Journal of Artificial Intelligence Research 4 (1996) 237-285
- (1996) Journal of Artificial Intelligence Research , vol.4 , pp. 237-285
- Kaelbling, L.P.¹ Littman, M.L.² Moore, A.W.³

30
- 0000221289
- Rational learning leads to Nash equilibrium
- Kalai E., and Lehrer E. Rational learning leads to Nash equilibrium. Econometrica 61 5 (1993) 1019-1045
- (1993) Econometrica , vol.61 , Issue.5 , pp. 1019-1045
- Kalai, E.¹ Lehrer, E.²

31
- 4544251885
- S. Kapetanakis, D. Kudenko, Reinforcement learning of coordination in heterogeneous cooperative multi-agent systems, in: Proceedings of the Third Autonomous Agents and Multi-Agent Systems Conference, 2004

32
- 34249043965
- M. Kearns, S. Singh, Near-optimal reinforcement learning in polynomial time, in: Proceedings of the Fifteenth International Conference on Machine Learning, 1998, pp. 260-268

33
- 0031192989
- Representations and solutions for game-theoretic problems
- Koller D., and Pfeffer A. Representations and solutions for game-theoretic problems. Artificial Intelligence 94 1 (1997) 167-215
- (1997) Artificial Intelligence , vol.94 , Issue.1 , pp. 167-215
- Koller, D.¹ Pfeffer, A.²

34
- 0012286079
- An algorithm for distributed reinforcement learning in cooperative multi-agent systems
- Morgan Kaufmann
- Lauer M., and Riedmiller M. An algorithm for distributed reinforcement learning in cooperative multi-agent systems. Proceedings of the 17th International Conference on Machine Learning (2000), Morgan Kaufmann 535-542
- (2000) Proceedings of the 17th International Conference on Machine Learning , pp. 535-542
- Lauer, M.¹ Riedmiller, M.²

35
- 84880839504
- K. Leyton-Brown, M. Tennenholtz, Local-effect games, in: Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence, 2003, pp. 772-780

36
- 34249095889
- M.L. Littman, Markov games as a framework for multi-agent reinforcement learning, in: Proceedings of the 11th International Conference on Machine Learning, 1994, pp. 157-163

37
- 34249110426
- M.L. Littman, Friend-or-foe Q-learning in general-sum games, in: Proceedings of the Eighteenth International Conference on Machine Learning, 2001

38
- 34249011566
- M.L. Littman, C. Szepesvari, A generalized reinforcement-learning model: Convergence and applications, in: Proceedings of the 13th International Conference on Machine Learning, 1996, pp. 310-318

39
- 0038386340
- The empirical Bayes envelope and regret minimization in competitive Markov decision processes
- Mannor S., and Shimkin N. The empirical Bayes envelope and regret minimization in competitive Markov decision processes. Mathematics of Operations Research 28 2 (2003) 327-345
- (2003) Mathematics of Operations Research , vol.28 , Issue.2 , pp. 327-345
- Mannor, S.¹ Shimkin, N.²

40
- 0004255908
- McGraw Hill
- Mitchell T. Machine Learning (1997), McGraw Hill
- (1997) Machine Learning
- Mitchell, T.¹

41
- 0003646168
- On the convergence of learning processes in a 2 × 2 non-zero-sum two-person game
- Miyasawa K. On the convergence of learning processes in a 2 × 2 non-zero-sum two-person game. Research Memo 33 (1961)
- (1961) Research Memo , vol.33
- Miyasawa, K.¹

42
- 0002714588
- Evolutionary selection dynamics in games: Convergence and limit properties
- Nachbar J. Evolutionary selection dynamics in games: Convergence and limit properties. International Journal of Game Theory 19 (1990) 59-89
- (1990) International Journal of Game Theory , vol.19 , pp. 59-89
- Nachbar, J.¹

43
- 4544335718
- E. Nudelman, J. Wortman, K. Leyton-Brown, Y. Shoham, Run the GAMUT: A comprehensive approach to evaluating game-theoretic algorithms, in: AAMAS, 2004

44
- 33745609272
- R. Powers, Y. Shoham, Learning against opponents with bounded memory, in: Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, 2005

45
- 84898936075
- New criteria and a new algorithm for learning in multi-agent systems
- MIT Press, Cambridge, MA
- Powers R., and Shoham Y. New criteria and a new algorithm for learning in multi-agent systems. Advances in Neural Information Processing Systems vol. 17 (2005), MIT Press, Cambridge, MA
- (2005) Advances in Neural Information Processing Systems , vol.17
- Powers, R.¹ Shoham, Y.²

46
- 0001402950
- An iterative method of solving a game
- Robinson J. An iterative method of solving a game. Annals of Mathematics 54 (1951) 298-301
- (1951) Annals of Mathematics , vol.54 , pp. 298-301
- Robinson, J.¹

47
- 0001190058
- Replicator dynamics
- Schuster P., and Sigmund K. Replicator dynamics. Journal of Theoretical Biology 100 (1983) 533-538
- (1983) Journal of Theoretical Biology , vol.100 , pp. 533-538
- Schuster, P.¹ Sigmund, K.²

48
- 0028555752
- S. Sen, M. Sekaran, J. Hale, Learning to coordinate without sharing information, in: Proceedings of the Twelfth National Conference on Artificial Intelligence, Seattle, WA, 1994, pp. 426-431

49
- 0004018184
- Cambridge University Press
- Smith J.M. Evolution and the Theory of Games (1982), Cambridge University Press
- (1982) Evolution and the Theory of Games
- Smith, J.M.¹

50
- 0004102479
- MIT Press, Cambridge, MA
- Sutton R.S., and Barto A.G. Reinforcement Learning: An Introduction (1998), MIT Press, Cambridge, MA
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

51
- 34247179640
- T. Vu, R. Powers, Y. Shoham, Learning against multiple opponents, in: Proceedings of the Fifth International Joint Conference on Autonomous Agents and Multi Agent Systems, 2006

52
- 34249105930
- X. Wang, T. Sandholm, Reinforcement learning to play an optimal Nash equilibrium in team Markov games, in: Advances in Neural Information Processing Systems, vol. 15, 2002

53
- 34249833101
- Technical note: Q-learning
- Watkins C., and Dayan P. Technical note: Q-learning. Machine Learning 8 3/4 (1992) 279-292
- (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 279-292
- Watkins, C.¹ Dayan, P.²

54
- 84920627764
- Oxford University Press, Oxford
- Young H.P. Strategic Learning and Its Limits (2004), Oxford University Press, Oxford
- (2004) Strategic Learning and Its Limits
- Young, H.P.¹

55
- 1942484421
- M. Zinkevich, Online convex programming and generalized infinitesimal gradient ascent, in: ICML, 2003

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.