메뉴 건너뛰기




Volumn 32, Issue 6, 2002, Pages 759-771

Learning through reinforcement for N-person repeated constrained games

Author keywords

Adaptive strategies; Learning automata (LA); Reinforcement learning; Repeated game

Indexed keywords

ADAPTIVE SYSTEMS; AUTOMATA THEORY; COMPUTER SIMULATION; LAGRANGE MULTIPLIERS; LEARNING ALGORITHMS; LEARNING SYSTEMS; RANDOM PROCESSES;

EID: 0036894103     PISSN: 10834419     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSMCB.2002.1049610     Document Type: Article
Times cited : (22)

References (41)
  • 2
    • 0000235370 scopus 로고
    • Adaptive control of constrained Markov chains: Criteria and policies
    • E. Altman and A. Shwartz, "Adaptive control of constrained Markov chains: Criteria and policies," Ann. Oper. Res., vol. 28, pp. 101-134, 1991.
    • (1991) Ann. Oper. Res. , vol.28 , pp. 101-134
    • Altman, E.1    Shwartz, A.2
  • 3
    • 43949152639 scopus 로고
    • Genetic algorithm and the Cobweb model
    • J. Arifovic, "Genetic algorithm and the Cobweb model," J. Econ. Dynam. Contr, vol. 18, pp. 2-28, 1994.
    • (1994) J. Econ. Dynam. Contr , vol.18 , pp. 2-28
    • Arifovic, J.1
  • 4
    • 0011893938 scopus 로고
    • Constraint qualifications in maximization problems
    • K.J. Arrow, L. Hurwicz, and H. Uzawa, "Constraint qualifications in maximization problems," Nav. Res. Logist. Q., vol. 8, pp. 175-191, 1961.
    • (1961) Nav. Res. Logist. Q. , vol.8 , pp. 175-191
    • Arrow, K.J.1    Hurwicz, L.2    Uzawa, H.3
  • 15
    • 0031281590 scopus 로고    scopus 로고
    • Learning through reinforcement and replicator dynamics
    • T. Börgers and R. Sarin, "Learning through reinforcement and replicator dynamics," J. Econ. Theory, vol. 77, pp. 1-14, 1997.
    • (1997) J. Econ. Theory , vol.77 , pp. 1-14
    • Börgers, T.1    Sarin, R.2
  • 16
    • 0011807465 scopus 로고    scopus 로고
    • Nonparametric adaptive learning with feed-back
    • X. Chen anal H. White, "Nonparametric adaptive learning with feed-back," J. Econ. Theory, vol. 82, pp. 190-222, 1998.
    • (1998) J. Econ. Theory , vol.82 , pp. 190-222
    • Chen, X.1    White, H.2
  • 18
    • 0030215342 scopus 로고    scopus 로고
    • Learning by observation within the firm
    • J. Dutta and K. Prasad, "Learning by observation within the firm," J. Econ. Dynam. Contr., vol. 20, pp. 1395-1425, 1996.
    • (1996) J. Econ. Dynam. Contr. , vol.20 , pp. 1395-1425
    • Dutta, J.1    Prasad, K.2
  • 20
    • 0038829878 scopus 로고    scopus 로고
    • Predicting how people play games: Reinforcement learning in experimental games with unique, mixed strategy equilibria
    • I. Erev and A.E. Roth, "Predicting how people play games: Reinforcement learning in experimental games with unique, mixed strategy equilibria," Amer. Econ. Rev., vol. 88, pp. 848-881, 1998.
    • (1998) Amer. Econ. Rev. , vol.88 , pp. 848-881
    • Erev, I.1    Roth, A.E.2
  • 21
    • 0032218188 scopus 로고    scopus 로고
    • Evolutionary economics goes mainstream: A review of the theory of learning in games
    • D. Friedman, "Evolutionary economics goes mainstream: A review of the theory of learning in games," J. Evol. Econ., vol. 8, pp. 423-432, 1998.
    • (1998) J. Evol. Econ. , vol.8 , pp. 423-432
    • Friedman, D.1
  • 22
    • 0000221289 scopus 로고
    • Rational learning leads to Nash equilibrium
    • E. Kalai and E. Lehrer, "Rational learning leads to Nash equilibrium," Econometrica, vol. 61, pp. 1019-1045, 1993.
    • (1993) Econometrica , vol.61 , pp. 1019-1045
    • Kalai, E.1    Lehrer, E.2
  • 23
    • 0000199420 scopus 로고
    • Adaptive learning with nonlinear dynamics driven by dependent processes
    • C.M. Kuan and H. White, "Adaptive learning with nonlinear dynamics driven by dependent processes," Econometrica, vol. 62, pp. 1087-1114, 1994.
    • (1994) Econometrica , vol.62 , pp. 1087-1114
    • Kuan, C.M.1    White, H.2
  • 26
    • 0030212622 scopus 로고    scopus 로고
    • Multimodal searching technique based on learning automata with continuous input and changing number of actions
    • Aug.
    • _, "Multimodal searching technique based on learning automata with continuous input and changing number of actions," IEEE Trans. Syst., Man, Cybern. B, vol. 26, pp. 666-673, Aug. 1996.
    • (1996) IEEE Trans. Syst., Man, Cybern. B , vol.26 , pp. 666-673
  • 27
    • 0035400337 scopus 로고    scopus 로고
    • Adaptive policy for two finite Markov chains zero-sum stochastic games with unknown transition matrices and average payoffs
    • K. Najim, A.S. Poznyak, and E. Gomez, "Adaptive policy for two finite Markov chains zero-sum stochastic games with unknown transition matrices and average payoffs," IFAC Automatica, vol. 37, pp. 1008-1018, 2001.
    • (2001) IFAC Automatica , vol.37 , pp. 1008-1018
    • Najim, K.1    Poznyak, A.S.2    Gomez, E.3
  • 29
    • 0002021736 scopus 로고
    • Equilibrium points in n-person games
    • J. Nash, "Equilibrium points in n-person games," Proc. Nat. Acad. USA, vol. 36, pp. 48-49, 1950.
    • (1950) Proc. Nat. Acad. USA , vol.36 , pp. 48-49
    • Nash, J.1
  • 30
    • 0011881447 scopus 로고
    • Matrix N-person game with incomplete information
    • A.V. Nazin and A.S. Poznyak, "Matrix N-person game with incomplete information," Econ. Math. Meth., vol. 14, pp. 958-968, 1978.
    • (1978) Econ. Math. Meth. , vol.14 , pp. 958-968
    • Nazin, A.V.1    Poznyak, A.S.2
  • 33
    • 0001100221 scopus 로고    scopus 로고
    • Learning the optimum as a Nash equilibrium
    • S. Özyildirim and N.M. Alenmdar, "Learning the optimum as a Nash equilibrium," J. Econ. Dynam. Contr., vol. 24, pp. 483-499, 2000.
    • (2000) J. Econ. Dynam. Contr. , vol.24 , pp. 483-499
    • Özyildirim, S.1    Alenmdar, N.M.2
  • 36
    • 0035496949 scopus 로고    scopus 로고
    • Bush-Mosteller learning for zero-sum repeated game with random pay-offs
    • A. Poznyak and K. Najim, "Bush-Mosteller learning for zero-sum repeated game with random pay-offs," Int. J. Syst. Sci., vol. 32, no. 10, pp. 1251-1260, 2001.
    • (2001) Int. J. Syst. Sci. , vol.32 , Issue.10 , pp. 1251-1260
    • Poznyak, A.1    Najim, K.2
  • 37
    • 58149324992 scopus 로고
    • Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term
    • A.E. Roth and I. Erev, "Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term," Games Econ. Beh., vol. 8, pp. 164-212, 1995.
    • (1995) Games Econ. Beh. , vol.8 , pp. 164-212
    • Roth, A.E.1    Erev, I.2
  • 38
    • 0002686402 scopus 로고
    • A convergence theorem for nonnegative almost supermartingales and some applications
    • J.S. Rustagi, Ed. New York: Academic
    • H. Robbins and D. Siegmund, "A convergence theorem for nonnegative almost supermartingales and some applications," in Optimizing Methods in Statistics, J.S. Rustagi, Ed. New York: Academic, 1971.
    • (1971) Optimizing Methods in Statistics
    • Robbins, H.1    Siegmund, D.2
  • 39
    • 0018922522 scopus 로고
    • Existence and uniqueness of equilibrium points for concave N-persons games
    • J.B. Rosen, "Existence and uniqueness of equilibrium points for concave N-persons games," Econometrica, vol. 33, pp. 520-534, 1965.
    • (1965) Econometrica , vol.33 , pp. 520-534
    • Rosen, J.B.1
  • 40
    • 0028423534 scopus 로고
    • Decentralized learning of Nash equilibria in multiperson stochastic game with incomplete information
    • May
    • P.S. Sastry, V.V. Phansalkar, and M.A.L. Thathachar, "Decentralized learning of Nash equilibria in multiperson stochastic game with incomplete information," IEEE Trans. Syst., Man, Cybern., vol. 24, pp. 769-777, May 1994.
    • (1994) IEEE Trans. Syst., Man, Cybern. , vol.24 , pp. 769-777
    • Sastry, P.S.1    Phansalkar, V.V.2    Thathachar, M.A.L.3
  • 41
    • 0011853562 scopus 로고
    • Stochastic games with average cost constraints
    • Annals of the International Society of Dynamic Games, T. Basar and H. Haurie, Eds. Cambridge, MA: Birkhaüser
    • N. Shimkin, "Stochastic games with average cost constraints," in Annals of the International Society of Dynamic Games, T. Basar and H. Haurie, Eds. Cambridge, MA: Birkhaüser, 1994, vol. 1, Advances in Dynamic Games and Applications.
    • (1994) Advances in Dynamic Games and Applications , vol.1
    • Shimkin, N.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.