SCOPUS 정보 검색 플랫폼

Volumn , Issue , 2005, Pages

New criteria and a new algorithm for learning in multi-agent systems

Author keywords

[No Author keywords available]

Indexed keywords

LEARNING ALGORITHMS;

AVERAGE REWARD; BEST RESPONSE; GAME-THEORETIC; MAXIMIN; NOVEL ALGORITHM; REPEATED GAMES; SECURITY LEVEL; SELF-PLAY;

MULTI AGENT SYSTEMS;

EID: 84898936075 PISSN: 10495258 EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (60)

References (19)

1
- 0036531878
- Multiagent learning using a variable learning rate
- Bowling, M. & Veloso, M. (2002). Multiagent learning using a variable learning rate. In Artificial Intelligence, 136, pp. 215-250.
- (2002) Artificial Intelligence , vol.136 , pp. 215-250
- Bowling, M.¹ Veloso, M.²

2
- 34247193577
- Efficient learning equilibrium
- Brafman, R. & Tennenholtz, M. (2002). Efficient Learning Equilibrium. In Advances in Neural Information Processing Systems 15.
- (2002) Advances in Neural Information Processing Systems , vol.15
- Brafman, R.¹ Tennenholtz, M.²

4
- 0031630561
- The dynamics of reinforcement learning in cooperative multiagent systems
- Claus, C. & Boutilier, C. (1998). The dynamics of reinforcement learning in cooperative multiagent systems. In Proceedings of the National Conference on Artificial Intelligence, pp. 746-752.
- (1998) Proceedings of the National Conference on Artificial Intelligence , pp. 746-752
- Claus, C.¹ Boutilier, C.²

6
- 0002476325
- Regret in the on-line decision problem
- Foster, D. & Vohra, R. (1999). Regret in the on-line decision problem. "Games and Economic Behavior" 29:7-36.
- (1999) Games and Economic Behavior , vol.29 , pp. 7-36
- Foster, D.¹ Vohra, R.²

7
- 0000668347
- Universal, consistency and cautious fictitious play
- Fudenberg, D. & Levine, D. (1995) Universal consistency and cautious fictitious play. Journal of Economics Dynamics and Control 19:1065-1089.
- (1995) Journal of Economics Dynamics and Control , vol.19 , pp. 1065-1089
- Fudenberg, D.¹ Levine, D.²

8
- 0004247096
- MIT Press
- Fudenberg, D. & Levine, D. (1998). The theory of learning in games. MIT Press.
- (1998) The Theory of Learning in Games
- Fudenberg, D.¹ Levine, D.²

9
- 0001976283
- Approximation to bayes risk in repeated plays
- Hannan, J. (1957) Approximation to Bayes risk in repeated plays. Contributions to the Theory of Games 3:97-139.
- (1957) Contributions to the Theory Of, Games , vol.3 , pp. 97-139
- Hannan, J.¹

10
- 0000908510
- A simple adaptive procedure leading to correlated equilibrium
- Hart, S. & Mas-Colell, A. (2000). A simple adaptive procedure leading to correlated equilibrium. In Econometrica, Vol. 68, No. 5, pages 1127-1150.
- (2000) Econometrica , vol.68 , Issue.5 , pp. 1127-1150
- Hart, S.¹ Mas-Colell, A.²

11
- 0001069505
- On the distribution of the number of successes in independent trials
- Hoeffding, W. (1956). On the distribution of the number of successes in independent trials. Annals of Mathematical Statistics 27:713-721.
- (1956) Annals of Mathematical Statistics , vol.27 , pp. 713-721
- Hoeffding, W.¹

12
- 80053136974
- Implicit negotiation in repeated games
- Littman, M. & Stone, P. (2001). Implicit Negotiation in Repeated Games. In Proceedings of the Eighth International Workshop on Agent Theories, Architectures, and Languages, pp. 393-404.
- (2001) Proceedings of the Eighth International Workshop on Agent Theories, Architectures, and Languages , pp. 393-404
- Littman, M.¹ Stone, P.²

13
- 84898955951
- AAMAS-2004. To Appear
- Nudelman, E., Wortman, J., Leyton-Brown, K., & Shoham, Y. (2004). Run the GAMUT: A Comprehensive Approach to Evaluating Game-Theoretic Algorithms. AAMAS-2004. To Appear.
- (2004) Run the GAMUT: A Comprehensive Approach to Evaluating Game-Theoretic Algorithms
- Nudelman, E.¹ Wortman, J.² Leyton-Brown, K.³ Shoham, Y.⁴

15
- 4544279348
- Technical Report
- Shoham, Y., Powers, R., & Grenager, T. (2003). Multi-Agent Reinforcement Learning: A critical survey. Technical Report.
- (2003) Multi-Agent Reinforcement Learning: A Critical Survey
- Shoham, Y.¹ Powers, R.² Grenager, T.³

17
- 0034205975
- Multiagent systems: A survey from a machine learning perspective
- Stone, P. & Veloso, M. (2000). Multiagent systems: A survey from a machine learning perspective. Autonomous Robots, 8(3).
- (2000) Autonomous, Robots , vol.8 , Issue.3
- Stone, P.¹ Veloso, M.²

18
- 84898941549
- Extending q-learning to general adaptive multi-agent systems
- Tesauro, G. (2004). Extending Q-Learning to General Adaptive Multi-Agent Systems. In Advances in Neural Information Processing Systems 16.
- (2004) Advances in Neural Information Processing Systems , pp. 16
- Tesauro, G.¹

19
- 34249833101
- Technical note: Q-learning
- Watkins, C. & Dayan, P. (1992). Technical note: Q-learning. Machine Learning, 8(3):279-292.
- (1992) Machine Learning , vol.8 , Issue.3 , pp. 279-292
- Watkins, C.¹ Dayan, P.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.