메뉴 건너뛰기




Volumn 95, Issue 3, 2003, Pages 513-532

A policy-improvement type algorithm for solving zero-sum two-person stochastic games of perfect information

Author keywords

MDP; Perfect information; Policy iteration; Stochastic games

Indexed keywords

MDP; PERFECT INFORMATION; POLICY ITERATION; STOCHASTIC GAMES;

EID: 33746343278     PISSN: 00255610     EISSN: None     Source Type: Journal    
DOI: 10.1007/s10107-002-0312-3     Document Type: Article
Times cited : (20)

References (24)
  • 2
    • 0026820657 scopus 로고
    • The complexity of stochastic games
    • Condon, A. [1992]: The Complexity of Stochastic Games. Information and Computing 96. 203-224.
    • (1992) Information and Computing , vol.96 , pp. 203-224
    • Condon, A.1
  • 4
    • 0002887378 scopus 로고
    • Recursive games
    • Dresher, M., A.W. Tucker & P. Wolfe (eds.). Contribution to the theory of games, Vol. III, Princeton Univ. Press, Princeton
    • Everett, H. [1957]: Recursive Games, In: Dresher, M., A.W. Tucker & P. Wolfe (eds.), pp 47-78. Contribution to the theory of games, Vol. III, Ann. of Math. Stud. 39, Princeton Univ. Press, Princeton.
    • (1957) Ann. of Math. Stud. , vol.39 , pp. 47-78
    • Everett, H.1
  • 6
    • 0000703817 scopus 로고
    • Stochastic games with zero stop probabilities
    • Dresher, M., A.W. Tucker & P. Wolfe (eds.). Contribution to the theory of games, Vol. III, Princeton Univ. Press, Princeton
    • Gillette, D. [1957]: Stochastic games with zero stop probabilities, In: Dresher, M., A.W. Tucker & P. Wolfe (eds.), pp 179-188. Contribution to the theory of games, Vol. III, Ann. of Math. Stud. 39, Princeton Univ. Press, Princeton.
    • (1957) Ann. of Math. Stud. , vol.39 , pp. 179-188
    • Gillette, D.1
  • 8
    • 0012251278 scopus 로고
    • On non-terminating stochastic games
    • Hoffman, A.J. and Karp, R.M. [1966]: On Non-Terminating Stochastic Games. Management Science, 12, 359-370.
    • (1966) Management Science , vol.12 , pp. 359-370
    • Hoffman, A.J.1    Karp, R.M.2
  • 9
    • 0018455841 scopus 로고
    • Linear programming and Markovian decision chains
    • Hordijk, A. and Kallenberg, L.C.M. [1979]: Linear Programming and Markovian Decision Chains. Management Science 25, 352-362.
    • (1979) Management Science , vol.25 , pp. 352-362
    • Hordijk, A.1    Kallenberg, L.C.M.2
  • 11
    • 0002775664 scopus 로고
    • Linear Programming and Finite Markovian Control Problems
    • Centre for Mathematics and Computer Science, Amsterdam
    • Kallenberg, L.C.M. [1983]: Linear Programming and Finite Markovian Control Problems. Mathematical Centre Tract 148, Centre for Mathematics and Computer Science, Amsterdam.
    • (1983) Mathematical Centre Tract , vol.148
    • Kallenberg, L.C.M.1
  • 12
    • 0000564361 scopus 로고
    • A polynomial algorithm in linear programming
    • Kachian L.G. [1979]: A polynomial algorithm in linear programming. Soviet Math. Doklady, 20, 191-194.
    • (1979) Soviet Math. Doklady , vol.20 , pp. 191-194
    • Kachian, L.G.1
  • 13
    • 0014584045 scopus 로고
    • Stochastic games with perfect information and time average payoff
    • Liggett, T.M. and S. A. Lippman [1969]: Stochastic Games with Perfect Information and Time Average Payoff. SIAM Review 11, 604-607.
    • (1969) SIAM Review , vol.11 , pp. 604-607
    • Liggett, T.M.1    Lippman, S.A.2
  • 14
    • 0013455073 scopus 로고
    • A subexponential randomized algorithm for the simple stochastic game problem
    • Ludwig, W. [1995]: A subexponential Randomized Algorithm for the Simple Stochastic Game Problem. Information and Computation 117, 151-155.
    • (1995) Information and Computation , vol.117 , pp. 151-155
    • Ludwig, W.1
  • 15
    • 0012315622 scopus 로고
    • On the complexity of the policy improvement algorithm for Markov decision processes
    • Melekopoglou, M. and A. Condon [1994]: On the Complexity of the Policy Improvement Algorithm for Markov Decision Processes. ORSA Journal on Computing 6, 188-192.
    • (1994) ORSA Journal on Computing , vol.6 , pp. 188-192
    • Melekopoglou, M.1    Condon, A.2
  • 16
    • 0010813516 scopus 로고
    • Discrete dynamic programming with a small interest rate
    • Miller, B and A. Veinott Jr. [1969]: Discrete Dynamic Programming with a Small Interest Rate Ann. Math. Statistics 40, 366-370.
    • (1969) Ann. Math. Statistics , vol.40 , pp. 366-370
    • Miller, B.1    Veinott Jr., A.2
  • 17
    • 0019537014 scopus 로고
    • An order field property for stochastic games when one player controls transition probabilities
    • Parthasarathy, T. and T.E.S. Raghavan [1981]: An order field property for stochastic games when one player controls transition probabilities, J. Optimization Theory and Appl. 33:375-392.
    • (1981) J. Optimization Theory and Appl. , vol.33 , pp. 375-392
    • Parthasarathy, T.1    Raghavan, T.E.S.2
  • 18
    • 30244521809 scopus 로고
    • Algorithms for stochastic games with geometrical interpretation
    • Pollatschek and Avi-Itzhak [1969]: Algorithms for Stochastic Games with Geometrical Interpretation Management Science 15, 399-415.
    • (1969) Management Science , vol.15 , pp. 399-415
    • Pollatschek1    Avi-Itzhak2
  • 21
    • 30244444308 scopus 로고
    • Discounted Markov games: Successive approximations and stopping times
    • Van der Waal, J. [1977]: Discounted Markov Games: Successive Approximations and Stopping Times International J. Game Theory 6, 11-22.
    • (1977) International J. Game Theory , vol.6 , pp. 11-22
    • Van Der Waal, J.1
  • 24
    • 0030143640 scopus 로고    scopus 로고
    • The complexity of mean payoff games on graphs
    • Zwick, U and M.S. Paterson [1996]: The complexity of mean payoff games on graphs. Theoretical Computer Science, 158, 343-359.
    • (1996) Theoretical Computer Science , vol.158 , pp. 343-359
    • Zwick, U.1    Paterson, M.S.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.