SCOPUS 정보 검색 플랫폼 - 논문 보기

메뉴 건너뛰기

Annals of Applied Probability

Volumn 13, Issue 4, 2003, Pages 1231-1251

Convergent multiple-timescales reinforcement learning algorithms in normal form games

(2) Leslie, David S a Collins, E J a

a UNIVERSITY OF BRISTOL (United Kingdom)

Author keywords

Best response dynamics; Reinforcement learning; Repeated normal form games; Stochastic approximation

Indexed keywords

EID: 0346913265 PISSN: 10505164 EISSN: None Source Type: Journal
DOI: 10.1214/aoap/1069786497 Document Type: Article

Times cited : (86)

References (20)

1
- 0001793657
- Dynamics of stochastic approximation algorithms
- Springer, Berlin
- BENAÏM, M. (1999). Dynamics of stochastic approximation algorithms. Le Séminaire de Probabilités XXXIII. Lecture Notes in Math. 1709 1-68. Springer, Berlin.
- (1999) Le Séminaire de Probabilités XXXIII. Lecture Notes in Math. , vol.1709 , pp. 1-68
- Benaïm, M.¹

2
- 0002277539
- Mixed equilibria and dynamical systems arising from fictitious play in perturbed games
- BENAÏM, M. and HIRSCH, M. W. (1999). Mixed equilibria and dynamical systems arising from fictitious play in perturbed games, Games Econom. Behav. 29 36-72.
- (1999) Games Econom. Behav. , vol.29 , pp. 36-72
- Benaïm, M.¹ Hirsch, M.W.²

3
- 0003487482
- Athena Scientific, Belmont, MA
- BERTSEKAS, D. P. and TSITSIKLIS, J. N. (1996). Neuro-Dynamic Programming. Athena Scientific, Belmont, MA.
- (1996) Neuro-dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

4
- 0031076413
- Stochastic approximation with two timescales
- BORKAR, V. S. (1997). Stochastic approximation with two timescales. Systems Control Lett. 29 291-294.
- (1997) Systems Control Lett. , vol.29 , pp. 291-294
- Borkar, V.S.¹

5
- 33645016930
- BORKAR, V. S. (2002). Reinforcement learning in Markovian evolutionary games. Available at www.tcs.tifr.res.in/̃borkar/games.ps.
- (2002) Reinforcement Learning in Markovian Evolutionary Games
- Borkar, V.S.¹

6
- 0007336816
- Ph.D. dissertation, Univ. California, Berkeley
- COWAN, S. (1992), Dynamical systems arising from game theory. Ph.D. dissertation, Univ. California, Berkeley.
- (1992) Dynamical Systems Arising from Game Theory
- Cowan, S.¹

7
- 0000466473
- Learning mixed equilibria
- FUDENBERG, D. and KREPS, D. M. (1993). Learning mixed equilibria. Games Econom. Behav. 5 320-367.
- (1993) Games Econom. Behav. , vol.5 , pp. 320-367
- Fudenberg, D.¹ Kreps, D.M.²

8
- 0004247096
- MIT Press, Cambridge, MA
- FUDENBERG, D. and LEVINE, D. K. (1998). The Theory of Learning in Games. MIT Press, Cambridge, MA.
- (1998) The Theory of Learning in Games
- Fudenberg, D.¹ Levine, D.K.²

9
- 0003161771
- Games with randomly disturbed payoffs: A new rationale for mixed-strategy equilibrium points
- HARSANYI, J. (1973). Games with randomly disturbed payoffs: A new rationale for mixed-strategy equilibrium points. Internat. J. Game Theory 2 1-23.
- (1973) Internat. J. Game Theory , vol.2 , pp. 1-23
- Harsanyi, J.¹

10
- 0011651132
- HOFBAUER, J. and HOPKINS, E. (2002). Learning in perturbed asymmetric games. Available at www.econ.ed.ac.uk/pdf/perturb.pdf.
- (2002) Learning in Perturbed Asymmetric Games
- Hofbauer, J.¹ Hopkins, E.²

11
- 0000978264
- A note on best response dynamics
- HOPKINS, E. (1999). A note on best response dynamics. Games Econom. Behav. 29 138-150.
- (1999) Games Econom. Behav. , vol.29 , pp. 138-150
- Hopkins, E.¹

12
- 0002316532
- Geometric singular perturbation theory
- Springer, Berlin
- JONES, C. K. R. T. (1995). Geometric singular perturbation theory. Dynamical Systems. Lecture Notes in Math. 1609 44-118. Springer, Berlin.
- (1995) Dynamical Systems. Lecture Notes in Math. , vol.1609 , pp. 44-118
- Jones, C.K.R.T.¹

13
- 0000415605
- Three problems in learning mixed strategy equilibria
- JORDAN, J. S. (1993). Three problems in learning mixed strategy equilibria. Games Econom. Behav. 5 368-386.
- (1993) Games Econom. Behav. , vol.5 , pp. 368-386
- Jordan, J.S.¹

14
- 0343893613
- Actor-critic-type learning algorithms for Markov decision process
- KONDA, V. R. and BORKAR, V. S. (2000). Actor-critic-type learning algorithms for Markov decision process. SIAM J. Control Opt. 38 94-123.
- (2000) SIAM J. Control Opt. , vol.38 , pp. 94-123
- Konda, V.R.¹ Borkar, V.S.²

15
- 0003452601
- Springer, New York
- KUSHNER, H. J. and CLARK, D. S. (1978). Stochastic Approximation Methods for Constrained and Unconstrained Systems. Springer, New York.
- (1978) Stochastic Approximation Methods for Constrained and Unconstrained Systems
- Kushner, H.J.¹ Clark, D.S.²

16
- 80053136974
- Implicit negotiation in repeated games
- Springer, Berlin
- LITTMAN, M. and STONE, P. (2001). Implicit negotiation in repeated games. Intelligent agents VIII: Agent Theories, Architectures and Languages. Lecture Notes in Comput. Sci. 2333 393-404. Springer, Berlin.
- (2001) Intelligent Agents VIII: Agent Theories, Architectures and Languages. Lecture Notes in Comput. Sci. , vol.2333 , pp. 393-404
- Littman, M.¹ Stone, P.²

17
- 0001730497
- Non-cooperative games
- NASH, J. (1951). Non-cooperative games. Ann. Math. 54 286-295.
- (1951) Ann. Math. , vol.54 , pp. 286-295
- Nash, J.¹

18
- 0001000786
- Nonconvergence to unstable points in urn models and stochastic approximations
- PEMANTLE, R. (1990). Nonconvergence to unstable points in urn models and stochastic approximations. Ann. Probab. 18 698-712.
- (1990) Ann. Probab. , vol.18 , pp. 698-712
- Pemantle, R.¹

19
- 0002623794
- Some topics in two person games
- (M. Dresher, L. S. Shapley and A. W. Tucker, eds.). Princeton Univ. Press
- SHAPLEY, L. S. (1964). Some topics in two person games. In Advances in Game Theory (M. Dresher, L. S. Shapley and A. W. Tucker, eds.) 1-28. Princeton Univ. Press.
- (1964) Advances in Game Theory , pp. 1-28
- Shapley, L.S.¹

20
- 0004102479
- MIT Press
- SUTTON, R. S. and BARTO, A. G. (1998). Reinforcement Learning: An Introduction. MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.