메뉴 건너뛰기




Volumn , Issue , 2010, Pages 219-224

Heterogeneous learning in zero-sum stochastic games with incomplete information

Author keywords

[No Author keywords available]

Indexed keywords

DYNAMICS; GAME THEORY; ORDINARY DIFFERENTIAL EQUATIONS; STOCHASTIC CONTROL SYSTEMS; STOCHASTIC SYSTEMS;

EID: 79953139643     PISSN: 07431546     EISSN: 25762370     Source Type: Conference Proceeding    
DOI: 10.1109/CDC.2010.5718053     Document Type: Conference Paper
Times cited : (59)

References (12)
  • 1
    • 0001784118 scopus 로고
    • On designing economic agents that behave like human agents
    • W. B. Arthur, On designing economic agents that behave like human agents. J. Evolutionary Econ. Vol. 3, 1993, pp. 1-22.
    • (1993) J. Evolutionary Econ. , vol.3 , pp. 1-22
    • Arthur, W.B.1
  • 3
    • 0001793657 scopus 로고    scopus 로고
    • Dynamics of Stochastic Approximations. Le Seminaire de Probabilites
    • M. Benaïm, "Dynamics of Stochastic Approximations. Le Seminaire de Probabilites". Lectures Notes in Mathematics, Vol. 1709, pp. 1-68, 1999.
    • (1999) Lectures Notes in Mathematics , vol.1709 , pp. 1-68
    • Benaïm, M.1
  • 4
    • 16244381482 scopus 로고    scopus 로고
    • Dynamic Fictitious Play, Dynamic Gradient Play, and Distributed Convergence to Nash Equilibria
    • March
    • J. S. Shamma and G. Arslan, "Dynamic Fictitious Play, Dynamic Gradient Play, and Distributed Convergence to Nash Equilibria," IEEE Trans. Automatic Control, Vol. 50, Issue 3, March 2005, pp. 312-327.
    • (2005) IEEE Trans. Automatic Control , vol.50 , Issue.3 , pp. 312-327
    • Shamma, J.S.1    Arslan, G.2
  • 5
    • 0031281590 scopus 로고    scopus 로고
    • Learning Through Reinforcement and Replicator Dynamics, UCSD, Economics Working Paper Series 93-47, 1993
    • Appeared in November
    • T. Borgers and R. Sarin, Learning Through Reinforcement and Replicator Dynamics, UCSD, Economics Working Paper Series 93-47, 1993. Appeared in Journal of Economic Theory, Vol. 77, Issue 1, November 1997, pp. 1-14.
    • (1997) Journal of Economic Theory , vol.77 , Issue.1 , pp. 1-14
    • Borgers, T.1    Sarin, R.2
  • 6
    • 0031076413 scopus 로고    scopus 로고
    • Stochastic approximation with two time scales
    • V. S. Borkar, "Stochastic approximation with two time scales", Systems Control Letters, Vol. 29, Issue 5, 1997, pp. 291-294.
    • (1997) Systems Control Letters , vol.29 , Issue.5 , pp. 291-294
    • Borkar, V.S.1
  • 8
    • 70049094993 scopus 로고    scopus 로고
    • Time averages, recurrence and transience in the stochastic replicator dynamics
    • Aug.
    • J. Hofbauer and L. Imhof, Time averages, recurrence and transience in the stochastic replicator dynamics. Annals of Applied Probability, Vol. 19, Aug. 2009, 1347-1368.
    • (2009) Annals of Applied Probability , vol.19 , pp. 1347-1368
    • Hofbauer, J.1    Imhof, L.2
  • 10
    • 0346913265 scopus 로고    scopus 로고
    • Convergent multiple timescales reinforcement learning algorithms in normal form games
    • D. S. Leslie and E. J. Collins, Convergent multiple timescales reinforcement learning algorithms in normal form games, The Annals of Applied Probability, Vol. 13, No. 4, 2003, pp. 1231-1251.
    • (2003) The Annals of Applied Probability , vol.13 , Issue.4 , pp. 1231-1251
    • Leslie, D.S.1    Collins, E.J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.