



Volume , Issue , 2011, Pages 2453-2458

Perturbed learning automata in potential games

Author keywords

[No Author keywords available]

Indexed keywords

REINFORCEMENT LEARNING;

EID: 84860653554     PISSN: 07431546     EISSN: 25762370     Source Type: Conference Proceeding    
DOI: 10.1109/CDC.2011.6161294     Document Type: Conference Paper
Times cited: 15

References (16)
  • 2
    • 0028423534
    • Decentralized learning of Nash equilibria in multi-person stochastic games with incomplete information
    • P. Sastry, V. Phansalkar, and M. Thathachar, "Decentralized learning of Nash equilibria in multi-person stochastic games with incomplete information," IEEE Transactions on Systems, Man, and Cybernetics, vol. 24, no. 5, pp. 769-777, 1994.
    • (1994) IEEE Transactions on Systems, Man, and Cybernetics , vol.24 , Issue.5 , pp. 769-777
    • Sastry, P.1    Phansalkar, V.2    Thathachar, M.3
  • 3
    • 26844467703
    • Attainability of boundary points under reinforcement learning
    • E. Hopkins and M. Posch, "Attainability of boundary points under reinforcement learning," Games and Economic Behavior, vol. 53, pp. 110-125, 2005.
    • (2005) Games and Economic Behavior , vol.53 , pp. 110-125
    • Hopkins, E.1    Posch, M.2
  • 4
    • 0001000786
    • Nonconvergence to unstable points in urn models and stochastic approximations
    • R. Pemantle, "Nonconvergence to unstable points in urn models and stochastic approximations," The Annals of Probability, vol. 18, no. 2, pp. 698-712, 1990.
    • (1990) The Annals of Probability , vol.18 , Issue.2 , pp. 698-712
    • Pemantle, R.1
  • 5
    • 0030374074
    • Evolution with state-dependent mutations
    • J. Bergin and B. L. Lipman, "Evolution with state-dependent mutations," Econometrica, vol. 64, no. 4, pp. 943-956, 1996.
    • (1996) Econometrica , vol.64 , Issue.4 , pp. 943-956
    • Bergin, J.1    Lipman, B.L.2
  • 6
    • 84860662348
    • Distributed dynamic reinforcement of efficient outcomes in multiagent coordination and network formation
    • Atlanta, GA, Discussion Paper
    • G. Chasparis and J. Shamma, "Distributed dynamic reinforcement of efficient outcomes in multiagent coordination and network formation," Georgia Institute of Technology, Atlanta, GA, Discussion Paper, 2009.
    • (2009) Georgia Institute of Technology
    • Chasparis, G.1    Shamma, J.2
  • 9
    • 24144477156
    • On linear models with two absorbing states
    • M. F. Norman, "On linear models with two absorbing states," Journal of Mathematical Psychology, vol. 5, pp. 225-241, 1968.
    • (1968) Journal of Mathematical Psychology , vol.5 , pp. 225-241
    • Norman, M.F.1
  • 10
    • 0014580386
    • Use of stochastic automata for parameter self-organization with multi-modal performance criteria
    • I. J. Shapiro and K. S. Narendra, "Use of stochastic automata for parameter self-organization with multi-modal performance criteria," IEEE Transactions on Systems Science and Cybernetics, vol. 5, pp. 352-360, 1969.
    • (1969) IEEE Transactions on Systems Science and Cybernetics , vol.5 , pp. 352-360
    • Shapiro, I.J.1    Narendra, K.S.2
  • 11
    • 0001784118
    • On designing economic agents that behave like human agents
    • W. B. Arthur, "On designing economic agents that behave like human agents," Journal of Evolutionary Economics, vol. 3, pp. 1-22, 1993.
    • (1993) Journal of Evolutionary Economics , vol.3 , pp. 1-22
    • Arthur, W.B.1
  • 15
    • 0031287487
    • Cycling in a stochastic learning algorithm for normal form games
    • M. Posch, "Cycling in a stochastic learning algorithm for normal form games," Journal of Evolutionary Economics, vol. 7, pp. 193-207, 1997.
    • (1997) Journal of Evolutionary Economics , vol.7 , pp. 193-207
    • Posch, M.1


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS DB.