SCOPUS 정보 검색 플랫폼

IIE Transactions (Institute of Industrial Engineers)

Volumn 41, Issue 2, 2009, Pages 158-167

A reinforcement learning algorithm for obtaining the Nash equilibrium of multi-player matrix games

(2) Nanduri, Vishnu a Das, Tapas K a

a University of South Florida (United States)

Author keywords

Matrix games; Reinforcement Learning; Restructured electricity markets; Vash equilibrium

Indexed keywords

ALGORITHMS; APPROXIMATION ALGORITHMS; COMMERCE; COMPETITION; ELECTRICITY; ELECTRONIC COMMERCE; GAME THEORY; LEARNING SYSTEMS; REINFORCEMENT; REINFORCEMENT LEARNING; SOLUTIONS; STOCHASTIC MODELS; TELECOMMUNICATION NETWORKS;

ACTION SPACES; AVERAGE REWARDS; BROADER IMPACTS; COMPARATIVE ANALYSIS; COMPUTATIONAL ALGORITHMS; E COMMERCES; ELECTRIC POWER MARKETS; MARKET SCENARIOS; MARKOV DECISION PROCESSES; MARKOV DECISIONS; MATRIX GAMES; NASH EQUILIBRIUMS; REINFORCEMENT LEARNING ALGORITHMS; RESTRUCTURED ELECTRICITY MARKETS; SOLUTION APPROACHES; STOCHASTIC GAMES; STRATEGIC BIDDINGS; VASH EQUILIBRIUM;

LEARNING ALGORITHMS;

EID: 56749179073 PISSN: 0740817X EISSN: 15458830 Source Type: Journal
DOI: 10.1080/07408170802369417 Document Type: Article

Times cited : (18)

References (37)

1
- 0033501158
- Understanding how market power can arise in network competition: A game theoretic approach
- Berry, C. A., Hobbs, B. F., Meroney, W. A., O'Neill, R. P. and Stewart Jr., W. R. (1999) Understanding how market power can arise in network competition: A game theoretic approach. Utilities Policy, 8, pp. 139-158.
- (1999) Utilities Policy , vol.8 , pp. 139-158
- Berry, C.A.¹ Hobbs, B.F.² Meroney, W.A.³ O'Neill, R.P.⁴ Stewart Jr., W.R.⁵

2
- 0003863106
- Computer Science Department, Carnegie Mellon University, Pittsburgh, PA
- Bowling, M. and Veloso, M. M. (2000) An analysis of stochastic game theory for multiagent reinforcement learning, Computer Science Department, Carnegie Mellon University, Pittsburgh, PA
- (2000) An Analysis of Stochastic Game Theory for Multiagent Reinforcement Learning
- Bowling, M.¹ Veloso, M.M.²

3
- 18744371204
- Reinforcement learning in Markovian evolutionary games
- Borkar, V. S. (2002) Reinforcement learning in Markovian evolutionary games. Advances in Complex Systems, 5:1, pp. 55-72.
- (2002) Advances in Complex Systems , vol.5 , Issue.1 , pp. 55-72
- Borkar, V.S.¹

4
- 0031091534
- Market power and strategic interaction in electricity networks
- Cardell, J. B., Hitt, C. C. and Hogan, W. W. (1997) Market power and strategic interaction in electricity networks. Resource and Energy Economics, 19, pp. 109-137.
- (1997) Resource and Energy Economics , vol.19 , pp. 109-137
- Cardell, J.B.¹ Hitt, C.C.² Hogan, W.W.³

5
- 0036474461
- An empirical study of applied game theory: Transmission constrained Cournot behavior
- Cunningham, L. B., Baldick, R. and Baughman, M. L. (2002) An empirical study of applied game theory: Transmission constrained Cournot behavior. IEEE Transactions on Power Systems, 17:1, pp. 166-172.
- (2002) IEEE Transactions on Power Systems , vol.17 , Issue.1 , pp. 166-172
- Cunningham, L.B.¹ Baldick, R.² Baughman, M.L.³

6
- 84996565038
- Learning rate schedules for faster stochastic gradient search
- IEEE Press, Piscataway, NJ
- Darken, C., Chang, J. and Moody, J. (1992) Learning rate schedules for faster stochastic gradient search. Neural Networks for Signal Processing 2-Proceedings of the 1992 IEEE Workshop IEEE Press, Piscataway, NJ
- (1992) Neural Networks for Signal Processing 2-Proceedings of the 1992 IEEE Workshop
- Darken, C.¹ Chang, J.² Moody, J.³

7
- 0032643313
- Solving semi-Markov decision problems using average reward reinforcement learning
- Das, T. K., Gosavi, A., Mahadevan, S. and Marchalleck, N. (1999) Solving semi-Markov decision problems using average reward reinforcement learning. Management Science, 45:4, pp. 560-574.
- (1999) Management Science , vol.45 , Issue.4 , pp. 560-574
- Das, T.K.¹ Gosavi, A.² Mahadevan, S.³ Marchalleck, N.⁴

8
- 0031200040
- Transaction analysis in deregulated power systems using game theory
- Ferrero, R. W., Shahidehpour, M. and Ramesh, V. C. (1999) Transaction analysis in deregulated power systems using game theory. IEEE Transactions on Power Systems, 12:3, pp. 1340-1347.
- (1999) IEEE Transactions on Power Systems , vol.12 , Issue.3 , pp. 1340-1347
- Ferrero, R.W.¹ Shahidehpour, M.² Ramesh, V.C.³

9
- 0003989209
- Springer Verlag, New York, NY
- Filar, J. and Vrieze, K. (1997) Competitive Markov Decision Processes, Springer Verlag, New York, NY
- (1997) Competitive Markov Decision Processes
- Filar, J.¹ Vrieze, K.²

10
- 0742319170
- Reinforcement learning for long-run average cost
- Gosavi, A. (2004) Reinforcement learning for long-run average cost. European Journal of Operational Research, 155, pp. 654-674.
- (2004) European Journal of Operational Research , vol.155 , pp. 654-674
- Gosavi, A.¹

11
- 0036722536
- A reinforcement learning approach to airline seat allocation for multiple fare classes with overbooking
- Gosavi, A., Bandla, N. and Das, T. K. (2002) A reinforcement learning approach to airline seat allocation for multiple fare classes with overbooking. IIE Transactions, 34:9, pp. 729-742.
- (2002) IIE Transactions , vol.34 , Issue.9 , pp. 729-742
- Gosavi, A.¹ Bandla, N.² Das, T.K.³

12
- 0344704362
- Finite dimensional variational inequality and nonlinear complementarity problems: A survey of theory, algorithms and applications
- Harker, P. T. and Pang, J. S. (1990) Finite dimensional variational inequality and nonlinear complementarity problems: A survey of theory, algorithms and applications. Mathematical Programming, 48, pp. 161-220.
- (1990) Mathematical Programming , vol.48 , pp. 161-220
- Harker, P.T.¹ Pang, J.S.²

13
- 0035340070
- Linear complementarity models of Nash-Cournot competition in bilateral and POOLCO power markets
- Hobbs, B. F. (2001) Linear complementarity models of Nash-Cournot competition in bilateral and POOLCO power markets. IEEE Transactions on Power Systems, 16:2, pp. 194-202.
- (2001) IEEE Transactions on Power Systems , vol.16 , Issue.2 , pp. 194-202
- Hobbs, B.F.¹

14
- 0034186850
- Strategic gaming analysis for electric power systems: An MPEC approach
- Hobbs, B. F., Carolyn, B. M. and Pang, J. S. (2006) Strategic gaming analysis for electric power systems: An MPEC approach. IEEE Transactions on Power Systems, 15:2, pp. 638-645.
- (2006) IEEE Transactions on Power Systems , vol.15 , Issue.2 , pp. 638-645
- Hobbs, B.F.¹ Carolyn, B.M.² Pang, J.S.³

15
- 0000929496
- Multiagent reinforcement learning: Theoretical framework and an algorithm
- Presented at the Madison, WI
- Hu, J. and Wellman, M. P. (1998) Multiagent reinforcement learning: theoretical framework and an algorithm. Presented at the 15th International Conference on Machine Learning Madison, WI
- (1998) 15th International Conference on Machine Learning
- Hu, J.¹ Wellman, M.P.²

16
- 4644369748
- Nash Q-learning for general-sum stochastic games
- Hu, J. and Wellman, M. P. (2003) Nash Q-learning for general-sum stochastic games. Journal of Machine Learning Research, 4, pp. 1039-1069.
- (2003) Journal of Machine Learning Research , vol.4 , pp. 1039-1069
- Hu, J.¹ Wellman, M.P.²

17
- 0029679044
- Reinforcement learning: A survey
- Kaelbling, L. P., Littman, M. L. and Moore, A. W. (1996) Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4, pp. 237-285.
- (1996) Journal of Artificial Intelligence Research , vol.4 , pp. 237-285
- Kaelbling, L.P.¹ Littman, M.L.² Moore, A.W.³

18
- 0344827154
- Solving three-player games by the matrix approach with an application to an electric power market
- Lee, K. H. and Baldick, R. (2003) Solving three-player games by the matrix approach with an application to an electric power market. IEEE Transactions on Power Systems, 18:4, pp. 166-172.
- (2003) IEEE Transactions on Power Systems , vol.18 , Issue.4 , pp. 166-172
- Lee, K.H.¹ Baldick, R.²

19
- 0000176346
- Equilibrium points of bimatrix games
- Lemke, C. E. and Howson Jr., J. T. (1964) Equilibrium points of bimatrix games. SIAM Journal of Applied Mathematics, 12, pp. 413-423.
- (1964) SIAM Journal of Applied Mathematics , vol.12 , pp. 413-423
- Lemke, C.E.¹ Howson Jr., J.T.²

20
- 56749086441
- A reinforcement learning (Nash-r) algorithm for average reward irreducible stochastic games
- Li, J., Ramachandran, K. and Das, T. K. (2007) A reinforcement learning (Nash-r) algorithm for average reward irreducible stochastic games. Journal of Machine Learning Research
- (2007) Journal of Machine Learning Research
- Li, J.¹ Ramachandran, K.² Das, T.K.³

21
- 85149834820
- Markov games as a framework for multi-agent reinforcement learning
- Littman, M. L. (1994) Markov games as a framework for multi-agent reinforcement learning. Proceedings of the 11th International Conference on Machine Learning Morgan Kaufman, pp. 157-163.
- (1994) Proceedings of the 11th International Conference on Machine Learning Morgan Kaufman , pp. 157-163
- Littman, M.L.¹

22
- 0003999118
- Cambridge University Press, Boston, MA
- Luo, Z., Pang, J. S. and Ralph, D. (1996) Mathematical Programs with Equilibrium Constraints, Cambridge University Press, Boston, MA
- (1996) Mathematical Programs With Equilibrium Constraints
- Luo, Z.¹ Pang, J.S.² Ralph, D.³

23
- 0022149015
- Computational experience in solving equilibrium models by a sequence of linear complementarity problems
- Mathiesen, L. (1985) Computational experience in solving equilibrium models by a sequence of linear complementarity problems. Operations Research, 33:6, pp. 1225-1250.
- (1985) Operations Research , vol.33 , Issue.6 , pp. 1225-1250
- Mathiesen, L.¹

24
- 70350102387
- Computation of equilibria in finite games
- Elsevier, The Netherlands
- McKelvey, R. and McLennan, A. (1996) Computation of equilibria in finite games. Handbook of Computational Economics, pp. 87-142. Elsevier, The Netherlands
- (1996) Handbook of Computational Economics , pp. 87-142
- McKelvey, R.¹ McLennan, A.²

25
- 0742310651
- McKelvey, R. D., McLennan, A. M. and Turocy, T. L. (2007) Gambit: software tools for game theory, version 0.2007.01.30
- (2007) Gambit: Software Tools for Game Theory, Version 0.2007.01.30
- McKelvey, R.D.¹ McLennan, A.M.² Turocy, T.L.³

26
- 0001730497
- Non-cooperative games
- Nash, J. (1951) Non-cooperative games. The Annals of Mathematics, 54:2, pp. 286-295.
- (1951) The Annals of Mathematics , vol.54 , Issue.2 , pp. 286-295
- Nash, J.¹

27
- 0031231882
- Using co-evolutionary programming to simulate strategic behavior in markets
- Price, T. C. (1997) Using co-evolutionary programming to simulate strategic behavior in markets. Journal of Evolutionary Economics, 7, pp. 219-254.
- (1997) Journal of Evolutionary Economics , vol.7 , pp. 219-254
- Price, T.C.¹

28
- 0003998452
- Wiley, New York, NY
- Puterman, M. L. (1994) Markov Decision Processes, Wiley, New York, NY
- (1994) Markov Decision Processes
- Puterman, M.L.¹

29
- 79960569526
- Epecs as models for electricity markets
- Presented at the Atlanta, GA
- Ralph, D. and Smeers, Y. (2006) Epecs as models for electricity markets. Presented at the Power Systems Conference and Exposition Atlanta, GA
- (2006) Power Systems Conference and Exposition
- Ralph, D.¹ Smeers, Y.²

30
- 0002958728
- On the generalization of the Lemke-Howson algorithm to noncooperative n-person games
- Rosenmuller, J. (1971) On the generalization of the Lemke-Howson algorithm to noncooperative n-person games. SIAM Journal on Applied Mathematics, 21:1, pp. 73-79.
- (1971) SIAM Journal on Applied Mathematics , vol.21 , Issue.1 , pp. 73-79
- Rosenmuller, J.¹

31
- 56749166432
- Stochastic games
- Princeton University Press, Princeton, NJ
- Shapley, L. S. (1953) Stochastic games. Classics in Game Theory, Princeton University Press, Princeton, NJ
- (1953) Classics in Game Theory
- Shapley, L.S.¹

32
- 0008056524
- Using game theory to study market power in simple networks
- IEEE, Piscataway, NJ
- Stoft, S. (1999) Using game theory to study market power in simple networks. IEEE Tutorial on Game Theory in Electric Power Markets, pp. 33-40. IEEE, Piscataway, NJ
- (1999) IEEE Tutorial on Game Theory in Electric Power Markets , pp. 33-40
- Stoft, S.¹

33
- 0004019390
- IEEE Press, Piscataway, NJ
- Stoft, S. (2002) Power System Economics, IEEE Press, Piscataway, NJ
- (2002) Power System Economics
- Stoft, S.¹

34
- 38549135253
- Equilibrium problems with equilibrium constraints: Stationarities, algorithms, and applications
- Ph.D. Dissertation. Department of Management Sciences, Stanford University, Stanford, CA
- Su, C. L. (2005) Equilibrium problems with equilibrium constraints: stationarities, algorithms, and applications, Ph.D. Dissertation. Department of Management Sciences, Stanford University, Stanford, CA
- (2005)
- Su, C.L.¹

35
- 0004007508
- The MIT Press, Cambridge, MA
- Sutton, R. S. and Barto, A. G. (1998) Reinforcement Learning, The MIT Press, Cambridge, MA
- (1998) Reinforcement Learning
- Sutton, R.S.¹ Barto, A.G.²

36
- 0002958732
- Computing equilibria of n-person games
- Wilson, R. (1971) Computing equilibria of n-person games. SIAM Journal of Applied Mathematics, 21, pp. 80-87.
- (1971) SIAM Journal of Applied Mathematics , vol.21 , pp. 80-87
- Wilson, R.¹

37
- 12344318209
- Computing Cournot equilibria in two settlement electricity markets with transmission constraints
- IEEE Computer Society, Los Alamitos, CA
- Yao, J., Oren, S. and Adler, I. (2004) Computing Cournot equilibria in two settlement electricity markets with transmission constraints. Proceedings of the 37th Hawaii International Conference on System Sciences, vol. 2 IEEE Computer Society, p. 20051b. Los Alamitos, CA
- (2004) Proceedings of the 37th Hawaii International Conference on System Sciences , vol.2
- Yao, J.¹ Oren, S.² Adler, I.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.