SCOPUS Information Retrieval Platform

Mathematics of Operations Research

Volume 22, Issue 3, 1997, Pages 588-618

Contraction conditions for average and α-discount optimality in countable state Markov games with unbounded rewards

(3) Altman, E. a; Hordijk, A. b; Spieksma, F. M. b

a INRIA (France)

b LEIDEN UNIVERSITY (Netherlands)

Author keywords

Birth death control; Equilibrium policies; Noncooperative Markov games; Value iteration; geometric recurrence

Indexed keywords

ALGORITHMS; COMPUTATIONAL COMPLEXITY; CONVERGENCE OF NUMERICAL METHODS; ITERATIVE METHODS; MARKOV PROCESSES; MATHEMATICAL MODELS; OPTIMIZATION; PROBLEM SOLVING; QUEUEING THEORY; STATE SPACE METHODS;

COMPACT ACTION SPACES; STATIONARY EQUILIBRIUM POLICY; ZERO SUM MARKOV GAMES;

GAME THEORY;

EID: 0031210649 PISSN: 0364765X EISSN: None Source Type: Journal
DOI: 10.1287/moor.22.3.588 Document Type: Article

Times cited : (46)

References (45)

1
- 0013297856
- Monotonicity of optimal policies in a zero sum game: A flow control model
- Altman, E. (1994a). Monotonicity of optimal policies in a zero sum game: A flow control model. Advances of Dynamic Games and App. 1 269-286.
- (1994) Advances of Dynamic Games and App. , vol.1 , pp. 269-286
- Altman, E.¹

2
- 0028416504
- Flow control using the theory of zero-sum Markov games
- _ (1994b). Flow control using the theory of zero-sum Markov games. IEEE Trans. Automat. Control, 39 814-818.
- (1994) IEEE Trans. Automat. Control , vol.39 , pp. 814-818

3
- 0003989208
- INRIA report No. 2574
- _ (1995). Constrained Markov decision processes. INRIA report No. 2574.
- (1995) Constrained Markov Decision Processes

4
- 0000611954
- Zero-sum Markov games and worst case optimal control of queueing systems
- _, A. Hordijk (1995). Zero-sum Markov games and worst case optimal control of queueing systems. Queueing Systems 21 415-447.
- (1995) Queueing Systems , vol.21 , pp. 415-447
- Hordijk, A.¹

5
- 6244277373
- Weighted discounted zero-sum stochastic games with perfect information
- December 16-18, 1996 Shonan Village Center, Kanagawa, Japan, (to appear)
- _, E. A. Feinberg, A. Shwartz (1996). Weighted discounted zero-sum stochastic games with perfect information, Proceedings of the Seventh International Symposium on Dynamic Games and Applications, December 16-18, 1996 Shonan Village Center, Kanagawa, Japan, (to appear).
- (1996) Proceedings of the Seventh International Symposium on Dynamic Games and Applications
- Feinberg, E.A.¹ Shwartz, A.²

6
- 0016988713
- The asymptotic theory of stochastic games
- Bewley, T., E. Kohlberg (1976). The asymptotic theory of stochastic games. Math. Oper. Res. 1 197-208.
- (1976) Math. Oper. Res. , vol.1 , pp. 197-208
- Bewley, T.¹ Kohlberg, E.²

7
- 0027555501
- Denumerable state stochastic games with limiting average payoff
- Borkar, V. S., M. K. Ghosh (1993). Denumerable state stochastic games with limiting average payoff. J. Optim. Theory Appl. 539-560.
- (1993) J. Optim. Theory Appl. , pp. 539-560
- Borkar, V.S.¹ Ghosh, M.K.²

8
- 0007472843
- Recent results on conditions for the existence of average optimal stationary policies
- Cavazos-Cadena, R. (1991). Recent results on conditions for the existence of average optimal stationary policies. Ann. Oper. Res. 28 3-28.
- (1991) Ann. Oper. Res. , vol.28 , pp. 3-28
- Cavazos-Cadena, R.¹

9
- 6244285802
- Springer Verlag, Berlin
- Chung, K. L. (1967). Markov Chains with Stationary Transition Probabilities, 2nd. ed., Springer Verlag, Berlin.
- (1967) Markov Chains with Stationary Transition Probabilities, 2nd. Ed.
- Chung, K.L.¹

10
- 0012793161
- Average, sensitive and Blackwell optimality in denumerable state Markov decision chains with unbounded rewards
- Dekker, R., A. Hordijk (1988). Average, sensitive and Blackwell optimality in denumerable state Markov decision chains with unbounded rewards. Math. Oper. Res. 13 395-421.
- (1988) Math. Oper. Res. , vol.13 , pp. 395-421
- Dekker, R.¹ Hordijk, A.²

11
- 0001128053
- Recurrence conditions for average and Blackwell optimality in denumerable state Markov decision chains
- _, _ (1992). Recurrence conditions for average and Blackwell optimality in denumerable state Markov decision chains. Math. Oper. Res. 17 271-290.
- (1992) Math. Oper. Res. , vol.17 , pp. 271-290

12
- 0012218373
- On the relation between recurrence and ergodicity properties in denumerable Markov decision chains
- Technical report, Leiden University, Technical report no. TW-91-02, Dept. of Math. and Comp. Sci., Univ. of Leiden (extended version), shortened version
- _, _, F. M. Spieksma (1994). On the relation between recurrence and ergodicity properties in denumerable Markov decision chains. Technical report, Leiden University, Technical report no. TW-91-02, Dept. of Math. and Comp. Sci., Univ. of Leiden (extended version), Math. Oper. Res. 19 539-559 (shortened version).
- (1994) Math. Oper. Res. , vol.19 , pp. 539-559
- Spieksma, F.M.¹

13
- 0000883929
- A note on memoryless rules for controlling sequential control processes
- Derman, C., R. Strauch (1966). A note on memoryless rules for controlling sequential control processes. Ann. Math. Statist. 37 276-278.
- (1966) Ann. Math. Statist. , vol.37 , pp. 276-278
- Derman, C.¹ Strauch, R.²

14
- 0342507670
- Ph.D. Thesis, Leiden University
- van der Duyn Schouten, F. A. (1979). Markov decision drift Processes with continuous time parameter, Ph.D. Thesis, Leiden University.
- (1979) Markov Decision Drift Processes with Continuous Time Parameter
- Van Der Duyn Schouten, F.A.¹

15
- 0001381152
- On N-person stochastic games with denumerable state space
- Federgruen, A. (1978). On N-person stochastic games with denumerable state space. Adv. Appl. Probab. 10 452-471.
- (1978) Adv. Appl. Probab. , vol.10 , pp. 452-471
- Federgruen, A.¹

16
- 6244297705
- On the functional equations in undiscounted and sensitive discounted stochastic games
- _ (1980a). On the functional equations in undiscounted and sensitive discounted stochastic games. Z. Oper. Res. 24 243-262.
- (1980) Z. Oper. Res. , vol.24 , pp. 243-262

17
- 84925924593
- Successive approximation methods in undiscounted stochastic games
- _ (1980b). Successive approximation methods in undiscounted stochastic games. Oper. Res. 28 794-810.
- (1980) Oper. Res. , vol.28 , pp. 794-810

18
- 84968504254
- A further generalization of the Kakutani fixed point theorem, with application to Nash equilibrium points
- Glicksberg, L. L. (1952). A further generalization of the Kakutani fixed point theorem, with application to Nash equilibrium points. Proc. Amer. Math. Soc. 3 170-174.
- (1952) Proc. Amer. Math. Soc. , vol.3 , pp. 170-174
- Glicksberg, L.L.¹

19
- 0004211484
- Mathematical Centre Tract 51, C.W.I., Amsterdam
- Hordijk, A. (1974). Dynamic Programming and Markov Potential Theory. Mathematical Centre Tract 51, C.W.I., Amsterdam.
- (1974) Dynamic Programming and Markov Potential Theory
- Hordijk, A.¹

20
- 0001511265
- Average optimal policies in Markov decision processes with applications to a queueing and a replacement model
- _, F. A. van der Duyn Schouten (1983). Average optimal policies in Markov decision processes with applications to a queueing and a replacement model. Adv. Appl. Probab. 15 274-303.
- (1983) Adv. Appl. Probab. , vol.15 , pp. 274-303
- Van Der Duyn Schouten, F.A.¹

21
- 0016521921
- The asymptotic behaviour of the minimal total expected cost for the denumerable Markov decision model
- _, P. J. Schweitzer, H. C. Tijms (1975). The asymptotic behaviour of the minimal total expected cost for the denumerable Markov decision model. J. Appl. Probab. 12 298-305.
- (1975) J. Appl. Probab. , vol.12 , pp. 298-305
- Schweitzer, P.J.¹ Tijms, H.C.²

22
- 0001144425
- On ergodicity and recurrence properties of a Markov chain with an application to an open Jackson network
- _, F. M. Spieksma (1992). On ergodicity and recurrence properties of a Markov chain with an application to an open Jackson network. Adv. Appl. Probab. 24 343-376.
- (1992) Adv. Appl. Probab. , vol.24 , pp. 343-376
- Spieksma, F.M.¹

23
- 0003373577
- Semi-Markov strategies in stochastic games
- _, O. J. Vrieze, G. L. Wanrooij (1983). Semi-Markov strategies in stochastic games. Internat. J. Game Theory 12 81-89.
- (1983) Internat. J. Game Theory , vol.12 , pp. 81-89
- Vrieze, O.J.¹ Wanrooij, G.L.²

24
- 0016533472
- Applying a new device in the optimization of exponential queueing systems
- Lippman, S. A. (1975). Applying a new device in the optimization of exponential queueing systems. Oper. Res. 23 687-710.
- (1975) Oper. Res. , vol.23 , pp. 687-710
- Lippman, S.A.¹

25
- 0012270789
- Borel stochastic games with limsup payoff
- Maitra, A., W. Sudderth (1993). Borel stochastic games with limsup payoff. Ann. Probab. 21 861-885.
- (1993) Ann. Probab. , vol.21 , pp. 861-885
- Maitra, A.¹ Sudderth, W.²

26
- 0003353344
- Symmetric stochastic games of resource extraction: The existence of nonrandomized stationary equilibrium
- Kluwer Academic Publishers, Dordrecht, The Netherlands
- Majumdar, M., R. Sundaram (1991). Symmetric stochastic games of resource extraction: The existence of nonrandomized stationary equilibrium. Stochastic Games and Related Topics, Kluwer Academic Publishers, Dordrecht, The Netherlands, 175-190.
- (1991) Stochastic Games and Related Topics , pp. 175-190
- Majumdar, M.¹ Sundaram, R.²

27
- 0002310119
- Stochastic Games
- Mertens, J.-F., A. Neyman (1981). Stochastic Games. Internat. J. Game Theory 10 53-66.
- (1981) Internat. J. Game Theory , vol.10 , pp. 53-66
- Mertens, J.-F.¹ Neyman, A.²

28
- 0004005693
- Equilibria for discounted stochastic games
- Université Catholique de Louvain
- _, T. Parthasarathy (1987). Equilibria for discounted stochastic games. CORE Discussion Paper No. 8750, Université Catholique de Louvain.
- (1987) CORE Discussion Paper No. 8750
- Parthasarathy, T.¹

29
- 0001327766
- Stationary equilibria for nonzero-sum average payoff ergodic stochastic games with general state space
- Basar and Haurie, Eds., Annals of the Intern. Soc. of Dynamic Games, Birkhauser
- Nowak, A. S. (1994). Stationary equilibria for nonzero-sum average payoff ergodic stochastic games with general state space. Basar and Haurie, Eds., Advances in dynamic games and applications, Annals of the Intern. Soc. of Dynamic Games, Vol. 1, Birkhauser, 231-246.
- (1994) Advances in Dynamic Games and Applications , vol.1 , pp. 231-246
- Nowak, A.S.¹

30
- 0012328504
- Existence of stationary correlated equilibria with symmetric information for discounted stochastic games
- _, T. E. S. Raghavan (1992). Existence of stationary correlated equilibria with symmetric information for discounted stochastic games. Math. Oper. Res. 17 519-526.
- (1992) Math. Oper. Res. , vol.17 , pp. 519-526
- Raghavan, T.E.S.¹

31
- 0012218231
- Existence of stationary equilibrium strategies in nonzero-sum discounted games with uncountable state space and state independent transitions
- T. Parthasarathy, S. Sinha (1989). Existence of stationary equilibrium strategies in nonzero-sum discounted games with uncountable state space and state independent transitions. Internat. J. Game Theory 18 189-194.
- (1989) Internat. J. Game Theory , vol.18 , pp. 189-194
- Parthasarathy, T.¹ Sinha, S.²

32
- 0002282886
- Markov games - A survey
- Roxin, Liu and Sternberg, Eds.
- _, M. Stern (1977). Markov games - A survey. Roxin, Liu and Sternberg, Eds., Differential Games and Control Theory II, 1-46.
- (1977) Differential Games and Control Theory , vol.2 , pp. 1-46
- Stern, M.¹

33
- 0000066148
- Algorithms for stochastic games-A survey
- T. E. S. Raghavan, J. A. Filar (1991). Algorithms for stochastic games-A survey. Z. Oper. Res. 35 437-472.
- (1991) Z. Oper. Res. , vol.35 , pp. 437-472
- Raghavan, T.E.S.¹ Filar, J.A.²

34
- 0001938903
- Equilibrium plans for nonzero-sum Markov games
- Moeschlin and Pallaschke, Eds., North-Holland, Amsterdam
- Rieder, U. (1979). Equilibrium plans for nonzero-sum Markov games. Moeschlin and Pallaschke, Eds., Game Theory and Related Topics, North-Holland, Amsterdam, 91-102.
- (1979) Game Theory and Related Topics , pp. 91-102
- Rieder, U.¹

35
- 0003655416
- Macmillan Publishing Company, New York
- Royden, H. L. (1988). Real Analysis, 3rd ed., Macmillan Publishing Company, New York.
- (1988) Real Analysis, 3rd Ed.
- Royden, H.L.¹

36
- 0141601275
- On the second optimality equation for semi-Markov decision models
- Schäl, M. (1992). On the second optimality equation for semi-Markov decision models. Math. Oper. Res. 17 470-486.
- (1992) Math. Oper. Res. , vol.17 , pp. 470-486
- Schäl, M.¹

37
- 0015080430
- Iterative solution to the functional equations of undiscounted Markov renewal programming
- Schweitzer, P. J. (1971). Iterative solution to the functional equations of undiscounted Markov renewal programming. J. Math. Anal. Appl. 34 495-501.
- (1971) J. Math. Anal. Appl. , vol.34 , pp. 495-501
- Schweitzer, P.J.¹

38
- 0012219343
- Zero-sum stochastic games with unbounded costs: Discounted and average cost cases
- Sennott, L. I. (1994). Zero-sum stochastic games with unbounded costs: Discounted and average cost cases. Z. Oper. Res. 40 145-162.
- (1994) Z. Oper. Res. , vol.40 , pp. 145-162
- Sennott, L.I.¹

39
- 0001111183
- An equivalence between continuous and discrete time Markov decision processes
- Serfozo, R. F. (1978). An equivalence between continuous and discrete time Markov decision processes. Oper. Res. 27 616-620.
- (1978) Oper. Res. , vol.27 , pp. 616-620
- Serfozo, R.F.¹

40
- 0000392613
- Stochastic games
- Shapley, L. S. (1953). Stochastic games. Proc. Nat. Acad. Sci. USA 39 1095-1100.
- (1953) Proc. Nat. Acad. Sci. USA , vol.39 , pp. 1095-1100
- Shapley, L.S.¹

41
- 0012219344
- Ph.D. Thesis, Leiden University (available on request from the author)
- Spieksma, F. M. (1990). Geometrically ergodic Markov chains and the optimal control of queues, Ph.D. Thesis, Leiden University (available on request from the author).
- (1990) Geometrically Ergodic Markov Chains and the Optimal Control of Queues
- Spieksma, F.M.¹

42
- 0039500125
- Strengthening ergodicity to geometric ergodicity of Markov chains
- _, R. L. Tweedie (1994). Strengthening ergodicity to geometric ergodicity of Markov chains. Stoch. Mod. 10 45-75.
- (1994) Stoch. Mod. , vol.10 , pp. 45-75
- Tweedie, R.L.¹

43
- 0141824325
- Ph.D. Thesis, Eindhoven
- van der Wal (1980a). Stochastic Dynamic Programming: successive approximations and nearly optimal strategies for Markov decision processes and Markov games. Ph.D. Thesis, Eindhoven.
- (1980) Stochastic Dynamic Programming: Successive Approximations and Nearly Optimal Strategies for Markov Decision Processes and Markov Games
- Van der Wal, J.¹

44
- 0347518293
- Successive approximations for average reward Markov games
- _ (1980b). Successive approximations for average reward Markov games. Internat. J. Game Theory 9 13-24.
- (1980) Internat. J. Game Theory , vol.9 , pp. 13-24

45
- 0141716097
- Markov games with unbounded rewards
- Memorandum COSOR 76-05; M. Schäl, Ed., Bonner Math. Schriften Bonn.
- Wessels, J. (1977). Markov games with unbounded rewards. Memorandum COSOR 76-05; M. Schäl, Ed., Dynamische Optimierung, Bonner Math. Schriften 98, Bonn.
- (1977) Dynamische Optimierung , vol.98
- Wessels, J.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.