SCOPUS Information Search Platform

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics

Volume 32, Issue 6, 2002, Pages 759-771

Learning through reinforcement for N-person repeated constrained games

Authors (2): Poznyak, Alexander S. (a); Najim, Kaddour (b)

a DEPARTAMENTO DE FÍSICA (Mexico)

b UNIVERSITÉ DE TOULOUSE (France)

Author keywords

Adaptive strategies; Learning automata (LA); Reinforcement learning; Repeated game

Indexed keywords

ADAPTIVE SYSTEMS; AUTOMATA THEORY; COMPUTER SIMULATION; LAGRANGE MULTIPLIERS; LEARNING ALGORITHMS; LEARNING SYSTEMS; RANDOM PROCESSES; REINFORCEMENT LEARNINGS; GAME THEORY

EID: 0036894103 PISSN: 10834419 EISSN: None Source Type: Journal
DOI: 10.1109/TSMCB.2002.1049610 Document Type: Article

Times cited : (22)

References (41)

1
- 0003989208
- London, U.K.: Chapman & Hall
- E. Altman, Controlled Markov Decision Processes. London, U.K.: Chapman & Hall, 1999.
- (1999) Controlled Markov Decision Processes
- Altman, E.¹

2
- 0000235370
- Adaptive control of constrained Markov chains: Criteria and policies
- E. Altman and A. Shwartz, "Adaptive control of constrained Markov chains: Criteria and policies," Ann. Oper. Res., vol. 28, pp. 101-134, 1991.
- (1991) Ann. Oper. Res. , vol.28 , pp. 101-134
- Altman, E.¹ Shwartz, A.²

3
- 43949152639
- Genetic algorithm and the Cobweb model
- J. Arifovic, "Genetic algorithm and the Cobweb model," J. Econ. Dynam. Contr., vol. 18, pp. 2-28, 1994.
- (1994) J. Econ. Dynam. Contr. , vol.18 , pp. 2-28
- Arifovic, J.¹

4
- 0011893938
- Constraint qualifications in maximization problems
- K.J. Arrow, L. Hurwicz, and H. Uzawa, "Constraint qualifications in maximization problems," Nav. Res. Logist. Q., vol. 8, pp. 175-191, 1961.
- (1961) Nav. Res. Logist. Q. , vol.8 , pp. 175-191
- Arrow, K.J.¹ Hurwicz, L.² Uzawa, H.³

5
- 0003603884
- New York: Springer-Verlag
- N. Baba, New Topics in Learning Automata: Theory and Applications. New York: Springer-Verlag, 1984.
- (1984) New Topics in Learning Automata: Theory and Applications
- Baba, N.¹

6
- 0002310119
- Stochastic games
- J.F. Mertens and A. Neyman, "Stochastic games," Int. J. Game Theory, vol. 10, pp. 53-66, 1981.
- (1981) Int. J. Game Theory , vol.10 , pp. 53-66
- Mertens, J.F.¹ Neyman, A.²

7
- 0000066148
- Algorithms for stochastic games - A survey
- T.E.S. Raghavan and J.A. Filar, "Algorithms for stochastic games-A survey," ZOR-Methods Models Oper. Res., vol. 35, pp. 437-472, 1991.
- (1991) ZOR-Methods Models Oper. Res. , vol.35 , pp. 437-472
- Raghavan, T.E.S.¹ Filar, J.A.²

8
- 0003441446
- Norwell, MA: Kluwer
- T.E.S. Raghavan, T.S. Ferguson, T. Parthasarathy, and O.J. Vrieze, Stochastic Games and Related Topics. Norwell, MA: Kluwer, 1991.
- (1991) Stochastic Games and Related Topics
- Raghavan, T.E.S.¹ Ferguson, T.S.² Parthasarathy, T.³ Vrieze, O.J.⁴

9
- 0003989209
- New York: Springer-Verlag
- J. Filar and K. Vrieze, Competitive Markov Decision Processes. New York: Springer-Verlag, 1997.
- (1997) Competitive Markov Decision Processes
- Filar, J.¹ Vrieze, K.²

10
- 0004247096
- Cambridge, MA: MIT Press
- D. Fudenberg and D.K. Levine, The Theory of Learning in Games. Cambridge, MA: MIT Press, 1998.
- (1998) The Theory of Learning in Games
- Fudenberg, D.¹ Levine, D.K.²

11
- 0003830866
- Amsterdam, The Netherlands: North Holland
- J.P. Aubin, Mathematical Methods of Game and Economic Theory. Amsterdam, The Netherlands: North Holland, 1979.
- (1979) Mathematical Methods of Game and Economic Theory
- Aubin, J.P.¹

12
- 4243274437
- Dép. d'Econ., Fac. Sci. Econ. Sociales, Univ. Genève, Genève, Switzerland, Rep. 94.03
- Y. Balasko and D. Royer, "Stability of competitive equilibrium with respect to recursive and learning processes," Dép. d'Econ., Fac. Sci. Econ. Sociales, Univ. Genève, Genève, Switzerland, Rep. 94.03, 1994.
- (1994) Stability of competitive equilibrium with respect to recursive and learning processes
- Balasko, Y.¹ Royer, D.²

13
- 0003981511
- Philadelphia, PA: SIAM
- T. Basar and G.J. Olsder, Dynamic Noncooperative Game Theory, 2nd ed. Philadelphia, PA: SIAM, 1998.
- (1998) Dynamic Noncooperative Game Theory, 2nd ed.
- Basar, T.¹ Olsder, G.J.²

14
- 0003487482
- Belmont, MA: Athena Scientific
- D. Bertsekas and J.N. Tsitsiklis, Neuro-Dynamic Programming. Belmont, MA: Athena Scientific, 1996.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.¹ Tsitsiklis, J.N.²

15
- 0031281590
- Learning through reinforcement and replicator dynamics
- T. Börgers and R. Sarin, "Learning through reinforcement and replicator dynamics," J. Econ. Theory, vol. 77, pp. 1-14, 1997.
- (1997) J. Econ. Theory , vol.77 , pp. 1-14
- Börgers, T.¹ Sarin, R.²

16
- 0011807465
- Nonparametric adaptive learning with feed-back
- X. Chen and H. White, "Nonparametric adaptive learning with feed-back," J. Econ. Theory, vol. 82, pp. 190-222, 1998.
- (1998) J. Econ. Theory , vol.82 , pp. 190-222
- Chen, X.¹ White, H.²

17
- 0004169430
- New York: Springer-Verlag
- M. Duflo, Random Iterative Models. New York: Springer-Verlag, 1997.
- (1997) Random Iterative Models
- Duflo, M.¹

18
- 0030215342
- Learning by observation within the firm
- J. Dutta and K. Prasad, "Learning by observation within the firm," J. Econ. Dynam. Contr., vol. 20, pp. 1395-1425, 1996.
- (1996) J. Econ. Dynam. Contr. , vol.20 , pp. 1395-1425
- Dutta, J.¹ Prasad, K.²

19
- 0011857881
- New York: Springer-Verlag
- Y.M. El-Fattah and C. Foulard, Learning Systems: Decision, Simulation, and Control. New York: Springer-Verlag, 1978.
- (1978) Learning Systems: Decision, Simulation, and Control
- El-Fattah, Y.M.¹ Foulard, C.²

20
- 0038829878
- Predicting how people play games: Reinforcement learning in experimental games with unique, mixed strategy equilibria
- I. Erev and A.E. Roth, "Predicting how people play games: Reinforcement learning in experimental games with unique, mixed strategy equilibria," Amer. Econ. Rev., vol. 88, pp. 848-881, 1998.
- (1998) Amer. Econ. Rev. , vol.88 , pp. 848-881
- Erev, I.¹ Roth, A.E.²

21
- 0032218188
- Evolutionary economics goes mainstream: A review of the theory of learning in games
- D. Friedman, "Evolutionary economics goes mainstream: A review of the theory of learning in games," J. Evol. Econ., vol. 8, pp. 423-432, 1998.
- (1998) J. Evol. Econ. , vol.8 , pp. 423-432
- Friedman, D.¹

22
- 0000221289
- Rational learning leads to Nash equilibrium
- E. Kalai and E. Lehrer, "Rational learning leads to Nash equilibrium," Econometrica, vol. 61, pp. 1019-1045, 1993.
- (1993) Econometrica , vol.61 , pp. 1019-1045
- Kalai, E.¹ Lehrer, E.²

23
- 0000199420
- Adaptive learning with nonlinear dynamics driven by dependent processes
- C.M. Kuan and H. White, "Adaptive learning with nonlinear dynamics driven by dependent processes," Econometrica, vol. 62, pp. 1087-1114, 1994.
- (1994) Econometrica , vol.62 , pp. 1087-1114
- Kuan, C.M.¹ White, H.²

24
- 0003650765
- New York: Springer-Verlag
- S. Lakshmivarahan, Learning Algorithms Theory and Applications. New York: Springer-Verlag, 1981.
- (1981) Learning Algorithms Theory and Applications
- Lakshmivarahan, S.¹

25
- 0003988124
- New York: Pergamon
- K. Najim and A.S. Poznyak, Learning Automata: Theory and Applications. New York: Pergamon, 1994.
- (1994) Learning Automata: Theory and Applications
- Najim, K.¹ Poznyak, A.S.²

26
- 0030212622
- Multimodal searching technique based on learning automata with continuous input and changing number of actions
- Aug.
- _, "Multimodal searching technique based on learning automata with continuous input and changing number of actions," IEEE Trans. Syst., Man, Cybern. B, vol. 26, pp. 666-673, Aug. 1996.
- (1996) IEEE Trans. Syst., Man, Cybern. B , vol.26 , pp. 666-673

27
- 0035400337
- Adaptive policy for two finite Markov chains zero-sum stochastic games with unknown transition matrices and average payoffs
- K. Najim, A.S. Poznyak, and E. Gomez, "Adaptive policy for two finite Markov chains zero-sum stochastic games with unknown transition matrices and average payoffs," IFAC Automatica, vol. 37, pp. 1008-1018, 2001.
- (2001) IFAC Automatica , vol.37 , pp. 1008-1018
- Najim, K.¹ Poznyak, A.S.² Gomez, E.³

28
- 0003891507
- Englewood Cliffs, NJ: Prentice-Hall
- K.S. Narendra and M.A.L. Thathachar, Learning Automata: An Introduction. Englewood Cliffs, NJ: Prentice-Hall, 1989.
- (1989) Learning Automata: An Introduction
- Narendra, K.S.¹ Thathachar, M.A.L.²

29
- 0002021736
- Equilibrium points in n-person games
- J. Nash, "Equilibrium points in n-person games," Proc. Nat. Acad. Sci. USA, vol. 36, pp. 48-49, 1950.
- (1950) Proc. Nat. Acad. Sci. USA , vol.36 , pp. 48-49
- Nash, J.¹

30
- 0011881447
- Matrix N-person game with incomplete information
- A.V. Nazin and A.S. Poznyak, "Matrix N-person game with incomplete information," Econ. Math. Meth., vol. 14, pp. 958-968, 1978.
- (1978) Econ. Math. Meth. , vol.14 , pp. 958-968
- Nazin, A.V.¹ Poznyak, A.S.²

31
- 0004187109
- Moscow: Nauka
- _, Adaptive Choice of Variants (in Russian). Moscow: Nauka, 1986.
- (1986) Adaptive Choice of Variants (in Russian)

32
- 0003722746
- New York: Academic
- M.F. Norman, Markov Processes and Learning Models. New York: Academic, 1972.
- (1972) Markov Processes and Learning Models
- Norman, M.F.¹

33
- 0001100221
- Learning the optimum as a Nash equilibrium
- S. Özyildirim and N.M. Alemdar, "Learning the optimum as a Nash equilibrium," J. Econ. Dynam. Contr., vol. 24, pp. 483-499, 2000.
- (2000) J. Econ. Dynam. Contr. , vol.24 , pp. 483-499
- Özyildirim, S.¹ Alemdar, N.M.²

34
- 0003730403
- New York: Springer-Verlag
- A.S. Poznyak and K. Najim, Learning Automata and Stochastic Optimization. New York: Springer-Verlag, 1997.
- (1997) Learning Automata and Stochastic Optimization
- Poznyak, A.S.¹ Najim, K.²

35
- 0004139304
- New York: Marcel Dekker
- A.S. Poznyak, K. Najim, and E. Gomez, Self-Learning Control for Finite Markov Chains. New York: Marcel Dekker, 2000.
- (2000) Self-Learning Control for Finite Markov Chains
- Poznyak, A.S.¹ Najim, K.² Gomez, E.³

36
- 0035496949
- Bush-Mosteller learning for zero-sum repeated game with random pay-offs
- A. Poznyak and K. Najim, "Bush-Mosteller learning for zero-sum repeated game with random pay-offs," Int. J. Syst. Sci., vol. 32, no. 10, pp. 1251-1260, 2001.
- (2001) Int. J. Syst. Sci. , vol.32 , Issue.10 , pp. 1251-1260
- Poznyak, A.¹ Najim, K.²

37
- 58149324992
- Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term
- A.E. Roth and I. Erev, "Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term," Games Econ. Beh., vol. 8, pp. 164-212, 1995.
- (1995) Games Econ. Beh. , vol.8 , pp. 164-212
- Roth, A.E.¹ Erev, I.²

38
- 0002686402
- A convergence theorem for nonnegative almost supermartingales and some applications
- J.S. Rustagi, Ed. New York: Academic
- H. Robbins and D. Siegmund, "A convergence theorem for nonnegative almost supermartingales and some applications," in Optimizing Methods in Statistics, J.S. Rustagi, Ed. New York: Academic, 1971.
- (1971) Optimizing Methods in Statistics
- Robbins, H.¹ Siegmund, D.²

39
- 0018922522
- Existence and uniqueness of equilibrium points for concave N-persons games
- J.B. Rosen, "Existence and uniqueness of equilibrium points for concave N-persons games," Econometrica, vol. 33, pp. 520-534, 1965.
- (1965) Econometrica , vol.33 , pp. 520-534
- Rosen, J.B.¹

40
- 0028423534
- Decentralized learning of Nash equilibria in multiperson stochastic game with incomplete information
- May
- P.S. Sastry, V.V. Phansalkar, and M.A.L. Thathachar, "Decentralized learning of Nash equilibria in multiperson stochastic game with incomplete information," IEEE Trans. Syst., Man, Cybern., vol. 24, pp. 769-777, May 1994.
- (1994) IEEE Trans. Syst., Man, Cybern. , vol.24 , pp. 769-777
- Sastry, P.S.¹ Phansalkar, V.V.² Thathachar, M.A.L.³

41
- 0011853562
- Stochastic games with average cost constraints
- Annals of the International Society of Dynamic Games, T. Basar and H. Haurie, Eds. Cambridge, MA: Birkhäuser
- N. Shimkin, "Stochastic games with average cost constraints," in Annals of the International Society of Dynamic Games, T. Basar and H. Haurie, Eds. Cambridge, MA: Birkhäuser, 1994, vol. 1, Advances in Dynamic Games and Applications.
- (1994) Advances in Dynamic Games and Applications , vol.1
- Shimkin, N.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.