SCOPUS Information Search Platform

Volume , Issue , 2010, Pages 191-198

Convergence, targeted optimality, and safety in multiagent learning

Author keywords

[No Author keywords available]

Indexed keywords

EMPIRICAL RESULTS; EXPLORATION AND EXPLOITATION; MODEL LEARNING; MULTI-AGENT LEARNING; MULTIAGENT LEARNING ALGORITHM; OPTIMALITY; REPEATED GAMES;

LEARNING ALGORITHMS; LEARNING SYSTEMS;

CONVERGENCE OF NUMERICAL METHODS;

EID: 77956517473 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited: 18

References (9)

1
- 9444299000
- Performance bounded reinforcement learning in strategic interactions
- Banerjee, Bikramjit and Peng, Jing. Performance bounded reinforcement learning in strategic interactions. In AAAI, pp. 2-7, 2004.
- (2004) AAAI , pp. 2-7
- Banerjee, B.; Peng, J.

2
- 36348967415
- Convergence of gradient dynamics with a variable learning rate
- Bowling, Michael and Veloso, Manuela. Convergence of gradient dynamics with a variable learning rate. In ICML, pp. 27-34, 2001.
- (2001) ICML , pp. 27-34
- Bowling, M.; Veloso, M.

3
- 0041965975
- R-max - A general polynomial time algorithm for near-optimal reinforcement learning
- Brafman, Ronen I. and Tennenholtz, Moshe. R-max - a general polynomial time algorithm for near-optimal reinforcement learning. J. Mach. Learn. Res., pp. 213-231, 2003.
- (2003) J. Mach. Learn. Res. , pp. 213-231
- Brafman, R.I.; Tennenholtz, M.

4
- 40949147745
- A comprehensive survey of multi-agent reinforcement learning
- Buşoniu, L., Babuška, R., and De Schutter, B. A comprehensive survey of multi-agent reinforcement learning. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews, pp. 156-172, 2008.
- (2008) IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews , pp. 156-172
- Buşoniu, L.; Babuška, R.; De Schutter, B.

5
- 56049086673
- Online multiagent learning against memory bounded adversaries
- Chakraborty, Doran and Stone, Peter. Online multiagent learning against memory bounded adversaries. In ECML, pp. 211-226, 2008.
- (2008) ECML , pp. 211-226
- Chakraborty, D.; Stone, P.

6
- 34147159616
- Awesome: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents
- Conitzer, Vincent and Sandholm, Tuomas. Awesome: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents. In J. Mach. Learn. Res., pp. 23-43, 2006.
- (2006) J. Mach. Learn. Res. , pp. 23-43
- Conitzer, V.; Sandholm, T.

7
- 33745609272
- Learning against opponents with bounded memory
- Powers, Rob and Shoham, Yoav. Learning against opponents with bounded memory. In IJCAI, pp. 817-822, 2005.
- (2005) IJCAI , pp. 817-822
- Powers, R.; Shoham, Y.

8
- 34147097403
- A general criterion and an algorithmic framework for learning in multi-agent systems
- Powers, Rob, Shoham, Yoav, and Vu, Thuc. A general criterion and an algorithmic framework for learning in multi-agent systems. Mach. Learn., pp. 45-76, 2007.
- (2007) Mach. Learn. , pp. 45-76
- Powers, R.; Shoham, Y.; Vu, T.

9
- 0004007508
- Reinforcement Learning
- Sutton, Richard S. and Barto, Andrew G. Reinforcement Learning. MIT Press, 1998.
- (1998) Reinforcement Learning
- Sutton, R.S.; Barto, A.G.

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.