SCOPUS 정보 검색 플랫폼

AAAI Fall Symposium - Technical Report

Volumn FS-04-02, Issue , 2004, Pages 89-95

On the agenda(s) of research on multi-agent learning

(3) Shoham, Yoav a Powers, Rob a Grenager, Trond a

a Stanford University (United States)

Author keywords

[No Author keywords available]

Indexed keywords

MULTI-AGENT LEARNING; REINFORCEMENT LEARNING; STOCHASTIC GAMES; WELL-DEFINED PROBLEMS;

ARTIFICIAL INTELLIGENCE; GAME THEORY; MULTI AGENT SYSTEMS; PROBLEM SOLVING; RESEARCH AND DEVELOPMENT MANAGEMENT; STOCHASTIC CONTROL SYSTEMS;

LEARNING SYSTEMS;

EID: 26444543263 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (9)

References (32)

1
- 0003787146
- Princeton University Press
- Bellman, R. 1957. Dynamic Programming. Princeton University Press.
- (1957) Dynamic Programming
- Bellman, R.¹

2
- 84880865940
- Rational and convergent learning in stochastic games
- Bowling, M., and Veloso, M. 2001. Rational and convergent learning in stochastic games. In Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence.
- (2001) Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence
- Bowling, M.¹ Veloso, M.²

3
- 0003091684
- Convergence problems of general-sum multiagent reinforcement learning
- Bowling, M. 2000. Convergence problems of general-sum multiagent reinforcement learning. In Proceedings of the Seventeenth International Conference on Machine Learning, 89-94.
- (2000) Proceedings of the Seventeenth International Conference on Machine Learning , pp. 89-94
- Bowling, M.¹

4
- 0002672918
- Iterative solution of games by fictitious play
- New York: John Wiley and Sons
- Brown, G. 1951. Iterative solution of games by fictitious play. In Activity Analysis of Production and Allocation. New York: John Wiley and Sons.
- (1951) Activity Analysis of Production and Allocation
- Brown, G.¹

5
- 0036268277
- Sophisticated EWA learning and strategic teaching in repeated games
- Camerer, C.; Ho, T.; and Chong, J. 2002. Sophisticated EWA learning and strategic teaching in repeated games. Journal of Economic Theory 104:137-188.
- (2002) Journal of Economic Theory , vol.104 , pp. 137-188
- Camerer, C.¹ Ho, T.² Chong, J.³

6
- 27944508225
- Playing is believing: The role of beliefs in multi-agent learning
- Chang, Y.-H., and Kaelbling, L. P. 2001. Playing is believing: The role of beliefs in multi-agent learning. In Proceedings of NIPS.
- (2001) Proceedings of NIPS
- Chang, Y.-H.¹ Kaelbling, L.P.²

7
- 0031630561
- The dynamics of reinforcement learning in cooperative multiagent systems
- Claus, C., and Boutilier, C. 1998. The dynamics of reinforcement learning in cooperative multiagent systems. In Proceedings of the Fifteenth National Conference on Artificial Intelligence, 746-752.
- (1998) Proceedings of the Fifteenth National Conference on Artificial Intelligence , pp. 746-752
- Claus, C.¹ Boutilier, C.²

8
- 0038829878
- Predicting how people play games: Reinforcement leaning in experimental games with unique, mixed strategy equilibria
- Erev, I., and Roth, A. E. 1998. Predicting how people play games: reinforcement leaning in experimental games with unique, mixed strategy equilibria. The American Economic Review 88(4):848-881.
- (1998) The American Economic Review , vol.88 , Issue.4 , pp. 848-881
- Erev, I.¹ Roth, A.E.²

9
- 0000466473
- Learning mixed equilibria
- Fudenberg, D., and Kreps, D. 1993. Learning mixed equilibria. Games and Economic Behavior 5:320-367.
- (1993) Games and Economic Behavior , vol.5 , pp. 320-367
- Fudenberg, D.¹ Kreps, D.²

10
- 0002428783
- A decision-theoretic approach to coordinating multiagent interactions
- Gmytrasiewicz, P.; Durfee, E.; and Wehe, D. 1991. A decision-theoretic approach to coordinating multiagent interactions. In Proceedings of the Twelfth International Joint Conference on Artificial Intelligence, 62-68.
- (1991) Proceedings of the Twelfth International Joint Conference on Artificial Intelligence , pp. 62-68
- Gmytrasiewicz, P.¹ Durfee, E.² Wehe, D.³

11
- 22844438585
- Correlated-Q learning
- Greenwald, A.; Hall, K.; and Serrano, R. 2002. Correlated-Q learning. In NIPS Workshop on Multiagent Learning.
- (2002) NIPS Workshop on Multiagent Learning
- Greenwald, A.¹ Hall, K.² Serrano, R.³

12
- 0001976283
- Approximation to bayes risk in repeated plays
- Hannan, J. F. 1959. Approximation to bayes risk in repeated plays. Contributions to the Theory of Games 3:97-139.
- (1959) Contributions to the Theory of Games , vol.3 , pp. 97-139
- Hannan, J.F.¹

13
- 0000929496
- Multiagent reinforcement learning: Theoretical framework and an algorithm
- Hu, J., and Wellman, P. 1998. Multiagent reinforcement learning: Theoretical framework and an algorithm. In Proceedings of the Fifteenth International Conference on Machine Learning, 242-250.
- (1998) Proceedings of the Fifteenth International Conference on Machine Learning , pp. 242-250
- Hu, J.¹ Wellman, P.²

14
- 0002550841
- Learning about other agents in a dynamic multiagent system
- Hu, J., and Wellman, M. 2001. Learning about other agents in a dynamic multiagent system. Journal of Cognitive Systems Research 2:67-69.
- (2001) Journal of Cognitive Systems Research , vol.2 , pp. 67-69
- Hu, J.¹ Wellman, M.²

15
- 9444286839
- Multiagent Q-learning
- Hu, J., and Wellman, M. 2002. Multiagent Q-learning. Journal of Machine Learning.
- (2002) Journal of Machine Learning
- Hu, J.¹ Wellman, M.²

16
- 1142305713
- Learning to play games in extensive form by valuation
- Jehiel, P., and Samet, D. 2001. Learning to play games in extensive form by valuation. NAJ Economics 3.
- (2001) NAJ Economics , vol.3
- Jehiel, P.¹ Samet, D.²

17
- 0000221289
- Rational learning leads to nash equilibrium
- Kalai, E., and Lehrer, E. 1993. Rational learning leads to nash equilibrium. Econometrica 61(5): 1019-1045.
- (1993) Econometrica , vol.61 , Issue.5 , pp. 1019-1045
- Kalai, E.¹ Lehrer, E.²

18
- 84880839504
- Localeffect games
- Leyton-Brown, K., and Tennenholtz, M. 2003. Localeffect games. In Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence, 772-780.
- (2003) Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence , pp. 772-780
- Leyton-Brown, K.¹ Tennenholtz, M.²

19
- 0001961616
- A generalized reinforcement-learning model: Convergence and applications
- Littman, M. L., and Szepesvari, C. 1996. A generalized reinforcement-learning model: Convergence and applications. In Proceedings of the 13th International Conference on Machine Learning, 310-318.
- (1996) Proceedings of the 13th International Conference on Machine Learning , pp. 310-318
- Littman, M.L.¹ Szepesvari, C.²

20
- 85149834820
- Markov games as a framework for multi-agent reinforcement learning
- Littman, M. L. 1994. Markov games as a framework for multi-agent reinforcement learning. In Proceedings of the 11th International Conference on Machine Learning, 157-163.
- (1994) Proceedings of the 11th International Conference on Machine Learning , pp. 157-163
- Littman, M.L.¹

21
- 0242466944
- Friend-or-foe Q-learning in generalsum games
- Littman, M. L. 2001. Friend-or-foe Q-learning in generalsum games. In Proceedings of the Eighteenth International Conference on Machine Learning.
- (2001) Proceedings of the Eighteenth International Conference on Machine Learning
- Littman, M.L.¹

22
- 33845300407
- Formulation of bayesian analysis for games with incomplete information
- Mertens, J.-F., and Zamir, S. 1985. Formulation of bayesian analysis for games with incomplete information. International Journal of Game Theory 14:1-29.
- (1985) International Journal of Game Theory , vol.14 , pp. 1-29
- Mertens, J.-F.¹ Zamir, S.²

23
- 0030306234
- Non-computable strategies and discounted repeated games
- Nachbar, J. H., and Zame, W. R. 1996. Non-computable strategies and discounted repeated games. Economic Theory 8:103-122.
- (1996) Economic Theory , vol.8 , pp. 103-122
- Nachbar, J.H.¹ Zame, W.R.²

24
- 0000614213
- Bounded complexity justifies cooperation in finitely repeated prisoner's dilemma
- Neyman, A. 1985. Bounded complexity justifies cooperation in finitely repeated prisoner's dilemma. Economic Letters 227-229.
- (1985) Economic Letters , pp. 227-229
- Neyman, A.¹

25
- 0027928808
- On complexity as bounded rationality
- Papadimitriou, C., and Yannakakis, M. 1994. On complexity as bounded rationality. In STOC-94, 726-733.
- (1994) STOC-94 , pp. 726-733
- Papadimitriou, C.¹ Yannakakis, M.²

26
- 84898936075
- New criteria and a new algorithm for learning in multi-agent systems
- Forthcoming
- Powers, R., and Shoham, Y. 2005. New criteria and a new algorithm for learning in multi-agent systems. In Advances in Neural Information Processing Systems. Forthcoming.
- (2005) Advances in Neural Information Processing Systems
- Powers, R.¹ Shoham, Y.²

27
- 0003687484
- MIT Press
- Rubinstein, A. 1998. Modeling Bounded Rationality. MIT Press.
- (1998) Modeling Bounded Rationality
- Rubinstein, A.¹

28
- 0028555752
- Learning to coordinate without sharing information
- Sen, S.; Sekaran, M.; and Hale, J. 1994. Learning to coordinate without sharing information. In Proceedings of the Twelfth National Conference on Artificial Intelligence, 426-431.
- (1994) Proceedings of the Twelfth National Conference on Artificial Intelligence , pp. 426-431
- Sen, S.¹ Sekaran, M.² Hale, J.³

29
- 0242635251
- Implicit negotiation in repeated games
- Meyer, J.-J., and Tambe, M., eds.
- Stone, P., and Littman, M. L. 2001. Implicit negotiation in repeated games. In Meyer, J.-J., and Tambe, M., eds., Pre-proceedings of the Eighth International Workshop on Agent Theories, Architectures, and Languages (ATAL-2001), 96-105.
- (2001) Pre-proceedings of the Eighth International Workshop on Agent Theories, Architectures, and Languages (ATAL-2001) , pp. 96-105
- Stone, P.¹ Littman, M.L.²

30
- 34247193577
- Efficient learning equilibrium
- Cambridge, Mass.: MIT Press
- Tennenholtz, M. 2002. Efficient learning equilibrium. In Advances in Neural Information Processing Systems, volume 15. Cambridge, Mass.: MIT Press.
- (2002) Advances in Neural Information Processing Systems , vol.15
- Tennenholtz, M.¹

31
- 34249833101
- Technical note: Q-learning
- Watkins, C. J. C. H., and Dayan, P. 1992. Technical note: Q-learning. Machine Learning 8(3/4):279-292.
- (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.²

32
- 0005058631
- A trading agent competition for the research community
- Wellman, M. P., and Wurman, P. R. 1999. A trading agent competition for the research community. In IJCAI-99 Workshop on Agent-Mediated Electronic Trading.
- (1999) IJCAI-99 Workshop on Agent-mediated Electronic Trading
- Wellman, M.P.¹ Wurman, P.R.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.