메뉴 건너뛰기




Volumn 85, Issue 4, 2012, Pages

Dynamics of Boltzmann Q learning in two-player two-action games

Author keywords

[No Author keywords available]

Indexed keywords

ASYMPTOTIC BEHAVIORS; BOLTZMANN; NASH EQULIBRIA; POINT STRUCTURE; Q-LEARNING;

EID: 84860692863     PISSN: 15393755     EISSN: 15502376     Source Type: Journal    
DOI: 10.1103/PhysRevE.85.041145     Document Type: Article
Times cited : (97)

References (28)
  • 6
    • 0038732510 scopus 로고    scopus 로고
    • PLEEE8 1539-3755 10.1103/PhysRevE.67.015206
    • Y. Sato and J. P. Crutchfield, Phys. Rev. E PLEEE8 1539-3755 10.1103/PhysRevE.67.015206 67, 015206 (R) (2003).
    • (2003) Phys. Rev. e , vol.67 , pp. 015206
    • Sato, Y.1    Crutchfield, J.P.2
  • 7
    • 24644517087 scopus 로고    scopus 로고
    • Stability and diversity in collective adaptation
    • DOI 10.1016/j.physd.2005.06.031, PII S0167278905002708
    • Y. Sato, E. Akiyama, and J. P. Crutchfield, Physica D: Nonlinear Phenomena PDNPDT 0167-2789 10.1016/j.physd.2005.06.031 210, 21 (2005). (Pubitemid 41277023)
    • (2005) Physica D: Nonlinear Phenomena , vol.210 , Issue.1-2 , pp. 21-57
    • Sato, Y.1    Akiyama, E.2    Crutchfield, J.P.3
  • 11
    • 9244231144 scopus 로고    scopus 로고
    • Reinforcement learning and decision making in monkeys during a competitive game
    • DOI 10.1016/j.cogbrainres.2004.07.007, PII S0926641004001971
    • D. Lee, M. L. Conroy, B. P. McGreevy, and D. J. Barraclough, Cognit. Brain Res. CBRREZ 0926-6410 10.1016/j.cogbrainres.2004.07.007 22, 45 (2004). (Pubitemid 39550751)
    • (2004) Cognitive Brain Research , vol.22 , Issue.1 , pp. 45-58
    • Lee, D.1    Conroy, M.L.2    McGreevy, B.P.3    Barraclough, D.J.4
  • 12
    • 66749182947 scopus 로고    scopus 로고
    • NNETEB 0893-6080 10.1016/j.neunet.2009.03.010
    • S. Kim, J. Hwang, H. Seo, and D. Lee, Neural Networks NNETEB 0893-6080 10.1016/j.neunet.2009.03.010 22, 294 (2009).
    • (2009) Neural Networks , vol.22 , pp. 294
    • Kim, S.1    Hwang, J.2    Seo, H.3    Lee, D.4
  • 14
  • 15
    • 0036434064 scopus 로고    scopus 로고
    • 0012-9682 10.1111/1468-0262.00372
    • E. Hopkins, Econometrica 0012-9682 10.1111/1468-0262.00372 70, 2141 (2002).
    • (2002) Econometrica , vol.70 , pp. 2141
    • Hopkins, E.1
  • 16
    • 20344390000 scopus 로고    scopus 로고
    • Learning in perturbed asymmetric games
    • DOI 10.1016/j.geb.2004.06.006, PII S0899825604000971
    • J. Hofbauer and E. Hopkins, Games and Economic Behavior GEBEEF 0899-8256 10.1016/j.geb.2004.06.006 52, 133 (2005). (Pubitemid 40792429)
    • (2005) Games and Economic Behavior , vol.52 , Issue.1 , pp. 133-152
    • Hofbauer, J.1    Hopkins, E.2
  • 17
    • 34249833101 scopus 로고
    • 0885-6125 10.1023/A:1022676722315
    • C. J. C. H. Watkins and P. Dayan, Mach. Learn. 0885-6125 10.1023/A:1022676722315 8, 279 (1992).
    • (1992) Mach. Learn. , vol.8 , pp. 279
    • Watkins, C.J.C.H.1    Dayan, P.2
  • 18
    • 33645029191 scopus 로고    scopus 로고
    • SJCODC 0363-0129 10.1137/S0363012903437976
    • D. S. Leslie and E. J. Collins, SIAM J. Control Optim. SJCODC 0363-0129 10.1137/S0363012903437976 44, 495 (2006).
    • (2006) SIAM J. Control Optim. , vol.44 , pp. 495
    • Leslie, D.S.1    Collins, E.J.2
  • 21
    • 0029704370 scopus 로고    scopus 로고
    • JMBLAJ 0303-6812 10.1007/BF02409754
    • J. Hofbauer, J. Math. Biol. JMBLAJ 0303-6812 10.1007/BF02409754 34, 675 (1996).
    • (1996) J. Math. Biol. , vol.34 , pp. 675
    • Hofbauer, J.1
  • 22
    • 0031281590 scopus 로고    scopus 로고
    • Learning through reinforcement and replicator dynamics
    • DOI 10.1006/jeth.1997.2319, PII S002205319792319X
    • T. Borgers and R. Sarin, Journal of Economic Theory JECTAQ 0022-0531 10.1006/jeth.1997.2319 77, 1 (1997). (Pubitemid 127172745)
    • (1997) Journal of Economic Theory , vol.77 , Issue.1 , pp. 1-14
    • Borgers, T.1    Sarin, R.2
  • 24
    • 70449857987 scopus 로고    scopus 로고
    • PRLTAO 0031-9007 10.1103/PhysRevLett.103.198702
    • Tobias Galla, Phys. Rev. Lett. PRLTAO 0031-9007 10.1103/PhysRevLett.103. 198702 103, 198702 (2009).
    • (2009) Phys. Rev. Lett. , vol.103 , pp. 198702
    • Galla, T.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.