메뉴 건너뛰기




Volumn 22, Issue 3, 1997, Pages 588-618

Contraction conditions for average and α-discount optimality in countable state Markov games with unbounded rewards

Author keywords

Birth death control; Equilibrium policies; Noncooperative Markov games; Value iteration; geometric recurrence

Indexed keywords

ALGORITHMS; COMPUTATIONAL COMPLEXITY; CONVERGENCE OF NUMERICAL METHODS; ITERATIVE METHODS; MARKOV PROCESSES; MATHEMATICAL MODELS; OPTIMIZATION; PROBLEM SOLVING; QUEUEING THEORY; STATE SPACE METHODS;

EID: 0031210649     PISSN: 0364765X     EISSN: None     Source Type: Journal    
DOI: 10.1287/moor.22.3.588     Document Type: Article
Times cited : (45)

References (45)
  • 1
    • 0013297856 scopus 로고
    • Monotonicity of optimal policies in a zero sum game: A flow control model
    • Altman, E. (1994a). Monotonicity of optimal policies in a zero sum game: A flow control model. Advances of Dynamic Games and App. 1 269-286.
    • (1994) Advances of Dynamic Games and App. , vol.1 , pp. 269-286
    • Altman, E.1
  • 2
    • 0028416504 scopus 로고
    • Flow control using the theory of zero-sum Markov games
    • _ (1994b). Flow control using the theory of zero-sum Markov games. IEEE Trans. Automat. Control, 39 814-818.
    • (1994) IEEE Trans. Automat. Control , vol.39 , pp. 814-818
  • 4
    • 0000611954 scopus 로고
    • Zero-sum Markov games and worst case optimal control of queueing systems
    • _, A. Hordijk (1995). Zero-sum Markov games and worst case optimal control of queueing systems. Queueing Systems 21 415-447.
    • (1995) Queueing Systems , vol.21 , pp. 415-447
    • Hordijk, A.1
  • 6
    • 0016988713 scopus 로고
    • The asymptotic theory of stochastic games
    • Bewley, T., E. Kohlberg (1976). The asymptotic theory of stochastic games. Math. Oper. Res. 1 197-208.
    • (1976) Math. Oper. Res. , vol.1 , pp. 197-208
    • Bewley, T.1    Kohlberg, E.2
  • 7
    • 0027555501 scopus 로고
    • Denumerable state stochastic games with limiting average payoff
    • Borkar, V. S., M. K. Ghosh (1993). Denumerable state stochastic games with limiting average payoff. J. Optim. Theory Appl. 539-560.
    • (1993) J. Optim. Theory Appl. , pp. 539-560
    • Borkar, V.S.1    Ghosh, M.K.2
  • 8
    • 0007472843 scopus 로고
    • Recent results on conditions for the existence of average optimal stationary policies
    • Cavazos-Cadena, R. (1991). Recent results on conditions for the existence of average optimal stationary policies. Ann. Oper. Res. 28 3-28.
    • (1991) Ann. Oper. Res. , vol.28 , pp. 3-28
    • Cavazos-Cadena, R.1
  • 10
    • 0012793161 scopus 로고
    • Average, sensitive and Blackwell optimality in denumcrablc state Markov decision chains with unbounded rewards
    • Dekker, R., A. Hordijk (1988). Average, sensitive and Blackwell optimality in denumcrablc state Markov decision chains with unbounded rewards. Math. Oper. Res. 13 395-421.
    • (1988) Math. Oper. Res. , vol.13 , pp. 395-421
    • Dekker, R.1    Hordijk, A.2
  • 11
    • 0001128053 scopus 로고
    • Recurrence conditions for average and Blackwell optimality in denumerable state Markov decision chains
    • _, _ (1992). Recurrence conditions for average and Blackwell optimality in denumerable state Markov decision chains. Math Oper. Res. 17 271-290.
    • (1992) Math Oper. Res. , vol.17 , pp. 271-290
  • 12
    • 0012218373 scopus 로고
    • On the relation between recurrence and ergodicity properties in denumerable Markov decision chains
    • Technical report, Leiden University, Technical report no. TW-91-02, Dept. of Math, and Comp. Sci., Univ. of Leiden (extended version), shortened version
    • _, _, F. M. Spieksma (1994). On the relation between recurrence and ergodicity properties in denumerable Markov decision chains. Technical report, Leiden University, Technical report no. TW-91-02, Dept. of Math, and Comp. Sci., Univ. of Leiden (extended version), Math. Oper. Res. 19 539-559 (shortened version).
    • (1994) Math. Oper. Res. , vol.19 , pp. 539-559
    • Spieksma, F.M.1
  • 13
    • 0000883929 scopus 로고
    • A note on memoryless rules for controlling sequential control processes
    • Derman, C., R. Strauch (1966). A note on memoryless rules for controlling sequential control processes. Ann. Math. Statist. 37 276-278.
    • (1966) Ann. Math. Statist. , vol.37 , pp. 276-278
    • Derman, C.1    Strauch, R.2
  • 15
    • 0001381152 scopus 로고
    • On N-person stochastic games with denumerable state space
    • Federgruen, A. (1978). On N-person stochastic games with denumerable state space. Adv. Appl. Probab. 10 452-471.
    • (1978) Adv. Appl. Probab. , vol.10 , pp. 452-471
    • Federgruen, A.1
  • 16
    • 6244297705 scopus 로고
    • On the functional equations in undiscounted and sensitive discounted stochastic games
    • _ (1980a). On the functional equations in undiscounted and sensitive discounted stochastic games. Z. Oper. Res. 24 243-262.
    • (1980) Z. Oper. Res. , vol.24 , pp. 243-262
  • 17
    • 84925924593 scopus 로고
    • Successive approximation methods in undiscounted stochastic games
    • _ (1980b). Successive approximation methods in undiscounted stochastic games. Oper. Res. 28 794-810.
    • (1980) Oper. Res. , vol.28 , pp. 794-810
  • 18
    • 84968504254 scopus 로고
    • A further generalization of the Kakutani fixed point theorem, with application to Nash equilibrium points
    • Glicksberg, L. L. (1952). A further generalization of the Kakutani fixed point theorem, with application to Nash equilibrium points. Proc. Amer. Math. Soc. 3 170-174.
    • (1952) Proc. Amer. Math. Soc. , vol.3 , pp. 170-174
    • Glicksberg, L.L.1
  • 20
    • 0001511265 scopus 로고
    • Average optimal policies in Markov decision processes with applications to a queueing and a replacement model
    • _, F. A. van der Duyn Schouten (1983). Average optimal policies in Markov decision processes with applications to a queueing and a replacement model. Adv. Appl. Probab. 15 274-303.
    • (1983) Adv. Appl. Probab. , vol.15 , pp. 274-303
    • Van Der Duyn Schouten, F.A.1
  • 21
    • 0016521921 scopus 로고
    • The asymptotic behaviour of the minimal total expected cost for the denumerable Markov decision model
    • _, P. J. Schweitzer, H. C. Tijms (1975). The asymptotic behaviour of the minimal total expected cost for the denumerable Markov decision model. J. Appl. Probab. 12 298-305.
    • (1975) J. Appl. Probab. , vol.12 , pp. 298-305
    • Schweitzer, P.J.1    Tijms, H.C.2
  • 22
    • 0001144425 scopus 로고
    • On ergodicity and recurrence properties of a Markov chain with an application to an open Jackson network
    • _, F. M. Spieksma (1992). On ergodicity and recurrence properties of a Markov chain with an application to an open Jackson network. Adv. Appl. Probab. 24 343-376.
    • (1992) Adv. Appl. Probab. , vol.24 , pp. 343-376
    • Spieksma, F.M.1
  • 23
  • 24
    • 0016533472 scopus 로고
    • Applying a new device in the optimization of exponential queueing systems
    • Lippman, S. A, (1975). Applying a new device in the optimization of exponential queueing systems. Oper. Res. 23 687-710.
    • (1975) Oper. Res. , vol.23 , pp. 687-710
    • Lippman, S.A.1
  • 25
    • 0012270789 scopus 로고
    • Borel stochastic games with limsup payoff
    • Maitra, A., W. Sudderth (1993). Borel stochastic games with limsup payoff. Ann. Probab. 21 861-885.
    • (1993) Ann. Probab. , vol.21 , pp. 861-885
    • Maitra, A.1    Sudderth, W.2
  • 26
    • 0003353344 scopus 로고
    • Symmetric stochastic games of resource extraction: The existence of nonrandomized stationary equilibrium
    • Kluwer Academic Publishers, Dordrecht, The Netherlands
    • Majumdar, M., R. Sundaran (1991). Symmetric stochastic games of resource extraction: The existence of nonrandomized stationary equilibrium. Stochastic Games and Related Topics, Kluwer Academic Publishers, Dordrecht, The Netherlands, 175-190.
    • (1991) Stochastic Games and Related Topics , pp. 175-190
    • Majumdar, M.1    Sundaran, R.2
  • 28
    • 0004005693 scopus 로고
    • Equilibria for discounted stochastic games
    • Université Catholique de Louvain
    • _, T. Parthasarathy (1987). Equilibria for discounted stochastic games. CORE Discussion Paper No. 8750, Université Catholique de Louvain.
    • (1987) CORE Discussion Paper No. 8750
    • Parthasarathy, T.1
  • 29
    • 0001327766 scopus 로고
    • Stationary equilibria for nonzero-sum average payoff ergodic stochastic games with general state space
    • Basar and Haurie, Eds., Annals of the Intern. Soc. of Dynamic Games, Birkhauser
    • Nowak, A. S. (1994). Stationary equilibria for nonzero-sum average payoff ergodic stochastic games with general state space. Basar and Haurie, Eds., Advances in dynamic games and applications, Annals of the Intern. Soc. of Dynamic Games, Vol. 1, Birkhauser, 231-246.
    • (1994) Advances in Dynamic Games and Applications , vol.1 , pp. 231-246
    • Nowak, A.S.1
  • 30
    • 0012328504 scopus 로고
    • Existence of stationary correlated equilibria with symmetric information for discounted stochastic games
    • _, T. E. S. Raghavan (1992). Existence of stationary correlated equilibria with symmetric information for discounted stochastic games. Math. Oper. Res. 17 519-526.
    • (1992) Math. Oper. Res. , vol.17 , pp. 519-526
    • Raghavan, T.E.S.1
  • 31
    • 0012218231 scopus 로고
    • Existence of stationary equilibrium strategies in nonzero-sum discounted games with uncountable state space and state independent transitions, Internat
    • T. Parthasarathy, S. Sinha (1989). Existence of stationary equilibrium strategies in nonzero-sum discounted games with uncountable state space and state independent transitions, Internat. J. Game Theory 18 189-194.
    • (1989) J. Game Theory , vol.18 , pp. 189-194
    • Parthasarathy, T.1    Sinha, S.2
  • 32
    • 0002282886 scopus 로고
    • Markov games - A survey
    • Roxin, Liu and Sternberg, Eds.
    • _, M. Stern (1977). Markov games - A survey. Roxin, Liu and Sternberg, Eds., Differential Games and Control Theory II, 1-46.
    • (1977) Differential Games and Control Theory , vol.2 , pp. 1-46
    • Stern, M.1
  • 33
    • 0000066148 scopus 로고
    • Algorithms for stochastic games-A survey
    • T. E. S. Raghavan, J. A. Filar (1991). Algorithms for stochastic games-A survey. Z. Oper. Res. 35 437-472.
    • (1991) Z. Oper. Res. , vol.35 , pp. 437-472
    • Raghavan, T.E.S.1    Filar, J.A.2
  • 34
    • 0001938903 scopus 로고
    • Equilibrium plans for nonzero-sum Markov games
    • Moeschlin and Pallaschke, Eds., North-Holland, Amsterdam
    • Rieder, U. (1979). Equilibrium plans for nonzero-sum Markov games. Moeschlin and Pallaschke, Eds., Game Theory and Related Topics, North-Holland, Amsterdam, 91-102.
    • (1979) Game Theory and Related Topics , pp. 91-102
    • Rieder, U.1
  • 35
    • 0003655416 scopus 로고
    • Macmillan Publishing Company, New York
    • Royden, H. L. (1988). Real Analysis, 3rd ed., Macmillan Publishing Company, New York.
    • (1988) Real Analysis, 3rd Ed.
    • Royden, H.L.1
  • 36
    • 0141601275 scopus 로고
    • On the second optimality equation for semi-Markov decision models
    • Schal, M. (1992). On the second optimality equation for semi-Markov decision models. Math. Oper. Res. 17 470-486.
    • (1992) Math. Oper. Res. , vol.17 , pp. 470-486
    • Schal, M.1
  • 37
    • 0015080430 scopus 로고
    • Iterative solution to the functional equations of undiscounted Markov renewal programming
    • Schweitzer, P. J. (1971). Iterative solution to the functional equations of undiscounted Markov renewal programming. J. Math. Anal. Appl. 34 495-501.
    • (1971) J. Math. Anal. Appl. , vol.34 , pp. 495-501
    • Schweitzer, P.J.1
  • 38
    • 0012219343 scopus 로고
    • Zero-sum stochastic games with unbounded costs: Discounted and average cost cases
    • Sennott, L. I. (1994). Zero-sum stochastic games with unbounded costs: Discounted and average cost cases. Z. Oper. Res. 40 145-162.
    • (1994) Z. Oper. Res. , vol.40 , pp. 145-162
    • Sennott, L.I.1
  • 39
    • 0001111183 scopus 로고
    • An equivalence between continuous and discrete time Markov decision processes
    • Serfozo, R. F. (1978). An equivalence between continuous and discrete time Markov decision processes. Oper. Res. 27 616-620.
    • (1978) Oper. Res. , vol.27 , pp. 616-620
    • Serfozo, R.F.1
  • 42
    • 0039500125 scopus 로고
    • Strengthening ergodicity to geometric ergodicity of Markov chains
    • _, R. L. Tweedie (1994). Strengthening ergodicity to geometric ergodicity of Markov chains. Stoch. Mod. 10 45-75.
    • (1994) Stoch. Mod. , vol.10 , pp. 45-75
    • Tweedie, R.L.1
  • 44
    • 0347518293 scopus 로고
    • Successive approximations for average reward Markov games
    • _ (1980b). Successive approximations for average reward Markov games. Internat. J. Game Theory 9 13-24.
    • (1980) Internat. J. Game Theory , vol.9 , pp. 13-24
  • 45
    • 0141716097 scopus 로고
    • Markov games with unbounded rewards
    • Memorandum COSOR 76-05; M. Schäl, Ed., Bonner Math. Schriften Bonn.
    • Wessels, J. (1977). Markov games with unbounded rewards. Memorandum COSOR 76-05; M. Schäl, Ed., Dynamische Optimierung, Bonner Math. Schriften 98, Bonn.
    • (1977) Dynamische Optimierung , vol.98
    • Wessels, J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.