SCOPUS 정보 검색 플랫폼

Mathematical Methods of Operations Research

Volumn 48, Issue 3, 1998, Pages 387-417

Constrained Markov decision processes with total cost criteria: Lagrangian approach and dual linear program

(1) Altman, Eitan a

a INRIA (France)

Author keywords

Countable state space; Extra constraints; Markov decision process; Total cost

Indexed keywords

EID: 1942424978 PISSN: 14322994 EISSN: None Source Type: Journal
DOI: 10.1007/s001860050035 Document Type: Article

Times cited : (93)

References (77)

1
- 0342888467
- Asymptotic properties of constrained Markov decision processes
- Altman E (1993) Asymptotic properties of constrained Markov decision processes. ZOR 37:151-170
- (1993) ZOR , vol.37 , pp. 151-170
- Altman, E.¹

2
- 0001061406
- Denumerable constrained Markov Decision Processes and finite approximations
- Altman E (1994) Denumerable constrained Markov Decision Processes and finite approximations. Math. of Operations Research 19:169-191
- (1994) Math. of Operations Research , vol.19 , pp. 169-191
- Altman, E.¹

3
- 0009459044
- Constrained Markov decision processes with total cost criteria: Occupation measures and primal LP
- Altman E (1996) Constrained Markov decision processes with total cost criteria: Occupation measures and primal LP. ZOR 43:45-72
- (1996) ZOR , vol.43 , pp. 45-72
- Altman, E.¹

4
- 1942501834
- Constrained Markov decision processes
- Altman E (1995) Constrained Markov decision processes. Book in preparation
- (1995) Book in Preparation
- Altman, E.¹

5
- 0026188613
- Markov decision problems and state-action frequencies
- Altman E, Shwartz A (1991a) Markov decision problems and state-action frequencies. SIAM J. Control and Optimization 29:786-809
- (1991) SIAM J. Control and Optimization , vol.29 , pp. 786-809
- Altman, E.¹ Shwartz, A.²

6
- 0027699625
- Time-sharing policies for controlled Markov chains
- Altman E, Shwartz A (1993) Time-sharing policies for controlled Markov chains. Operations Research 41:1116-1124
- (1993) Operations Research , vol.41 , pp. 1116-1124
- Altman, E.¹ Shwartz, A.²

7
- 0344238495
- Constrained Markov games: Nash equilibria
- submitted
- Altman E, Shwartz A (1995) Constrained Markov games: Nash equilibria. Annals of dynamic games, submitted
- (1995) Annals of Dynamic Games
- Altman, E.¹ Shwartz, A.²

8
- 0342839405
- The linear program approach in Markov decision problems revisited
- Altman E, Spieksma F (1995) The linear program approach in Markov decision problems revisited. ZOR 42:169-188
- (1995) ZOR , vol.42 , pp. 169-188
- Altman, E.¹ Spieksma, F.²

9
- 1942469570
- Rate of convergence of empirical measures and costs in controlled Markov chains and transient optimality
- Altman E, Zeitouni O (1994) Rate of convergence of empirical measures and costs in controlled Markov chains and transient optimality. Math, of Operations Research 19:955-974
- (1994) Math, of Operations Research , vol.19 , pp. 955-974
- Altman, E.¹ Zeitouni, O.²

10
- 0004002106
- Wiley, England
- Anderson J, Nash P (1987) Linear programming in infinite-dimentional spaces. Wiley, England
- (1987) Linear Programming in Infinite-dimentional Spaces
- Anderson, J.¹ Nash, P.²

11
- 0027557742
- Discrete-time controlled Markov processes with average cost criterion: A survey
- Arapostathis A, Borkar VS, Fernández-Gaucherand E, Ghosh MK, Marcus SI (1993) Discrete-time controlled Markov processes with average cost criterion: A survey. SIAM J. Control and Optimization 31:282-344
- (1993) SIAM J. Control and Optimization , vol.31 , pp. 282-344
- Arapostathis, A.¹ Borkar, V.S.² Fernández-Gaucherand, E.³ Ghosh, M.K.⁴ Marcus, S.I.⁵

12
- 0004099213
- Springer-Verlag
- Aubin JP (1993) Optima and equilibria, An introduction to nonlinear analysis. Springer-Verlag
- (1993) Optima and Equilibria, an Introduction to Nonlinear Analysis
- Aubin, J.P.¹

13
- 0001572612
- Variability sensitive Markov decision processes
- Bayal-Gursoy M, Ross KW (1992) Variability sensitive Markov decision processes. Math, of Operations Research 17:558-571
- (1992) Math, of Operations Research , vol.17 , pp. 558-571
- Bayal-Gursoy, M.¹ Ross, K.W.²

14
- 0003407041
- Wiley, New-York
- Billingsley P (1968) Convergence of probability measures. Wiley, New-York
- (1968) Convergence of Probability Measures
- Billingsley, P.¹

15
- 0022151359
- Optimal policies for controlled Markov chains with a constraint
- Beutler FJ, Ross KW (1985) Optimal policies for controlled Markov chains with a constraint. J. Mathematical Analysis and Applications 112:236-252
- (1985) J. Mathematical Analysis and Applications , vol.112 , pp. 236-252
- Beutler, F.J.¹ Ross, K.W.²

16
- 0008860655
- Time-average optimal constrained semi-Markov decision processes
- Beutler FJ, Ross KW (1986) Time-average optimal constrained semi-Markov decision processes. Advances of Applied Probability 18:341-359
- (1986) Advances of Applied Probability , vol.18 , pp. 341-359
- Beutler, F.J.¹ Ross, K.W.²

17
- 0343709784
- A convex analytic approach to Markov decision processes
- Borkar VS (1988) A convex analytic approach to Markov decision processes. Prob. Th. Rel. Fields 78:583-602
- (1988) Prob. Th. Rel. Fields , vol.78 , pp. 583-602
- Borkar, V.S.¹

18
- 0003448964
- Topics in controlled Markov chains
- Borkar VS (1990) Topics in controlled Markov chains. Longman Scientific & Technical
- (1990) Longman Scientific & Technical
- Borkar, V.S.¹

19
- 0028336218
- Ergodic control of Markov Chains with constraints - The general case
- Borkar VS (1994) Ergodic control of Markov Chains with constraints - the general case. SIAM J. Control and Optimization 32:176-186
- (1994) SIAM J. Control and Optimization , vol.32 , pp. 176-186
- Borkar, V.S.¹

20
- 0002885049
- Equivalence of Lyapunov stability criteria in a class of Markov decision processes
- Cavazos-Cadena R, Hernaández-Lerma O (1992) Equivalence of Lyapunov stability criteria in a class of Markov decision processes. Appl. Math. Optim. 26:113-137
- (1992) Appl. Math. Optim. , vol.26 , pp. 113-137
- Cavazos-Cadena, R.¹ Hernaández-Lerma, O.²

21
- 0004893920
- Les problèmes de décisions séquentielles
- De Ghellinck GT (1960) Les problèmes de décisions séquentielles. Cahiers du Centre de Recherche Opérationelle 2:161-179
- (1960) Cahiers du Centre de Recherche Opérationelle , vol.2 , pp. 161-179
- De Ghellinck, G.T.¹

22
- 0001554538
- On linear programming in a Markov decision problem
- Denardo EV (1970a) On linear programming in a Markov decision problem. Management Science 16:281-288
- (1970) Management Science , vol.16 , pp. 281-288
- Denardo, E.V.¹

23
- 0038956073
- Multichain Markov renewal programs
- Denardo EV, Fox BL (1968) Multichain Markov renewal programs. SIAM J. of Applied Math. 16:468-487
- (1968) SIAM J. of Applied Math. , vol.16 , pp. 468-487
- Denardo, E.V.¹ Fox, B.L.²

24
- 0343861205
- Sur un problème de production et de stockage dans l'aléatoire
- D'Epenoux F (1960) Sur un problème de production et de stockage dans l'aléatoire. Revue Française de Recherche Opérationelle 14:3-16
- (1960) Revue Française de Recherche Opérationelle , vol.14 , pp. 3-16
- D'Epenoux, F.¹

25
- 0006464452
- A probabilistic production and inventory problem
- D'Epenoux F (1963) A probabilistic production and inventory problem. Management Science 10:98-108
- (1963) Management Science , vol.10 , pp. 98-108
- D'Epenoux, F.¹

26
- 0003421685
- Academic Press
- Derman C (1970) Finite state Markovian decision processes. Academic Press
- (1970) Finite State Markovian Decision Processes
- Derman, C.¹

27
- 0009990403
- Some remarks on finite horizon Markovian decision models
- Derman C, Klein M (1965) Some remarks on finite horizon Markovian decision models. Operations research 13:272-278
- (1965) Operations Research , vol.13 , pp. 272-278
- Derman, C.¹ Klein, M.²

28
- 0000883929
- On memoryless rules for controlling sequential control processes
- Derman C, Strauch RE (1966) On memoryless rules for controlling sequential control processes. Ann. Math. Stat 37:276-278
- (1966) Ann. Math. Stat , vol.37 , pp. 276-278
- Derman, C.¹ Strauch, R.E.²

29
- 0347702026
- Constrained Markov decision chains
- Derman C, Veinott AF Jr (1972) Constrained Markov decision chains. Management Science 19:389-390
- (1972) Management Science , vol.19 , pp. 389-390
- Derman, C.¹ Veinott Jr., A.F.²

30
- 0004179052
- Springer-Verlag
- Doob JL (1994). Measure theory, Springer-Verlag
- (1994) Measure Theory
- Doob, J.L.¹

31
- 0003634432
- Springer-Verlag, Berlin
- Dynkin E, Yushkevich A (1979) Controlled Markov processes. Springer-Verlag, Berlin
- (1979) Controlled Markov Processes
- Dynkin, E.¹ Yushkevich, A.²

32
- 1942528886
- Geometric convergence of value-iteration in multichain Markov decision problems
- Federgruen A (1979) Geometric convergence of value-iteration in multichain Markov decision problems. Adv. Appl. Prob. 11:188-217
- (1979) Adv. Appl. Prob. , vol.11 , pp. 188-217
- Federgruen, A.¹

33
- 0342988500
- Sufficient classes of strategies in discrete dynamic programming I: Decomposition of randomized strategies and embedded models
- Feinberg EA (1986) Sufficient classes of strategies in discrete dynamic programming I: Decomposition of randomized strategies and embedded models. SIAM Theory Probab. Appl., 31:658-668
- (1986) SIAM Theory Probab. Appl. , vol.31 , pp. 658-668
- Feinberg, E.A.¹

34
- 0000049211
- Constrained semi-Markov decision processes with average rewards
- Feinberg EA (1995) Constrained semi-Markov decision processes with average rewards. ZOR 39:257-288
- (1995) ZOR , vol.39 , pp. 257-288
- Feinberg, E.A.¹

35
- 84974177891
- Optimality of randomized trunk reservation
- Feinberg EA, Reiman MI (1994) Optimality of randomized trunk reservation. Probability in the Engineering and Informational Sciences 8:463-489
- (1994) Probability in the Engineering and Informational Sciences , vol.8 , pp. 463-489
- Feinberg, E.A.¹ Reiman, M.I.²

36
- 0343424031
- Stationary and Markov policies in countable state dynamic programming
- Feinberg EA, Sonin I (1983) Stationary and Markov policies in countable state dynamic programming. Lecture notes in Mathematics 1021:111-129
- (1983) Lecture Notes in Mathematics , vol.1021 , pp. 111-129
- Feinberg, E.A.¹ Sonin, I.²

37
- 85034467566
- Unpublished Draft
- Feinberg EA, Sonin I (1993) The existence of an equivalent stationary strategy in the case of discount factor equal one. Unpublished Draft
- (1993) The Existence of an Equivalent Stationary Strategy in the Case of Discount Factor Equal One.
- Feinberg, E.A.¹ Sonin, I.²

38
- 0030357631
- Notes on equivalent stationary policies in Markov decision processes with total rewards
- Feinberg EA, Sonin I (1995) Notes on equivalent stationary policies in Markov decision processes with total rewards. ZOR 44:205-221
- (1995) ZOR , vol.44 , pp. 205-221
- Feinberg, E.A.¹ Sonin, I.²

39
- 0000184142
- Constrained discounted dynamic programming
- Feinberg EA, Shwartz A (1995) Constrained discounted dynamic programming. Math. of Operations Research 20:302-320
- (1995) Math. of Operations Research , vol.20 , pp. 302-320
- Feinberg, E.A.¹ Shwartz, A.²

40
- 0000217603
- Varinace-penalized Markov decision processes
- Filar JA, Kallenberg LCM, Lee HM (1989) Varinace-penalized Markov decision processes. Math. of Operations Research, 14:147-161
- (1989) Math. of Operations Research , vol.14 , pp. 147-161
- Filar, J.A.¹ Kallenberg, L.C.M.² Lee, H.M.³

41
- 0022305705
- Gain/variability tradeoffs in undiscounted Markov decision processes
- Filar JA, Lee HM (1985) Gain/variability tradeoffs in undiscounted Markov decision processes. Proceedings of 24th Conference on Decision and Control IEEE, pp. 1106-1112
- (1985) Proceedings of 24th Conference on Decision and Control IEEE , pp. 1106-1112
- Filar, J.A.¹ Lee, H.M.²

42
- 0347701478
- An example in denumerable decision processes
- Fisher L, Ross SM (1968) An example in denumerable decision processes. Ann. Math. Stat. 39:674-675
- (1968) Ann. Math. Stat. , vol.39 , pp. 674-675
- Fisher, L.¹ Ross, S.M.²

43
- 1942469561
- Generalized linear programming in Markovian decision problems
- Heilmann WR (1977) Generalized linear programming in Markovian decision problems. Bonner Math. Schriften 98:33-39
- (1977) Bonner Math. Schriften , vol.98 , pp. 33-39
- Heilmann, W.R.¹

44
- 0017933941
- Solving stochastic dynamic programming problems by linear programming - An annotated bibliography
- Heilmann WR (1978) Solving stochastic dynamic programming problems by linear programming - an annotated bibliography. Z. Oper. Res. 22:43-53
- (1978) Z. Oper. Res. , vol.22 , pp. 43-53
- Heilmann, W.R.¹

45
- 0345809499
- Discounted cost Markov decision processes on Borel spaces: The linear programming formulation
- Hernández-Lerma O, Hernández-Hernández D (1994) Discounted cost Markov decision processes on Borel spaces: The linear programming formulation. J. of Math. Anal. and Appl. 183:335-351
- (1994) J. of Math. Anal. and Appl. , vol.183 , pp. 335-351
- Hernández-Lerma, O.¹ Hernández-Hernández, D.²

46
- 0028397545
- Linear programming and average optimality on Borel spaces-unbounded costs
- Hernández-Lerma O, Lasserre JB (1994) Linear programming and average optimality on Borel spaces-unbounded costs. SIAM J. Control and Optimization 32:480-500
- (1994) SIAM J. Control and Optimization , vol.32 , pp. 480-500
- Hernández-Lerma, O.¹ Lasserre, J.B.²

47
- 0003434766
- Lecture Notes in Operations Research and Mathematical Systems, Springer-Verlag, Berlin
- Hinderer K (1970) Foundation of non-stationary dynamic programming with discrete time parameter. Vol. 33, Lecture Notes in Operations Research and Mathematical Systems, Springer-Verlag, Berlin
- (1970) Foundation of Non-stationary Dynamic Programming with Discrete Time Parameter , vol.33
- Hinderer, K.¹

48
- 0008627283
- Dynamic programming and Markov potential theory
- Mathematisch Centrum, Amsterdam
- Hordijk A (1977) Dynamic programming and Markov potential theory. Second Edition, Mathematical Centre Tracts 51, Mathematisch Centrum, Amsterdam
- (1977) Second Edition, Mathematical Centre Tracts , vol.51
- Hordijk, A.¹

49
- 0018455841
- Linear programming and Markov decision chains
- Hordijk A, Kallenberg LCM (1979) Linear programming and Markov decision chains. Management Science 25:352-362
- (1979) Management Science , vol.25 , pp. 352-362
- Hordijk, A.¹ Kallenberg, L.C.M.²

50
- 0021425220
- Constrained undiscounted stochastic dynamic programming
- Hordijk A, Kallenberg LCM (1984) Constrained undiscounted stochastic dynamic programming. Mathematics of Operations Research 9:276-289
- (1984) Mathematics of Operations Research , vol.9 , pp. 276-289
- Hordijk, A.¹ Kallenberg, L.C.M.²

51
- 0344325087
- Linear programming formulation of MDPs in countable state space: The multichain case
- Hordijk A, Lasserre JB (1994) Linear programming formulation of MDPs in countable state space: The multichain case. ZOR Research 40:91-108
- (1994) ZOR Research , vol.40 , pp. 91-108
- Hordijk, A.¹ Lasserre, J.B.²

52
- 0001324526
- On finding optimal policies for Markov decision chains: A unifying framework for mean-variance tradeoffs
- Huang Y, Kallenberg LCM (1994) On finding optimal policies for Markov decision chains: A unifying framework for mean-variance tradeoffs. Math. of Operations Research 19:434-448
- (1994) Math. of Operations Research , vol.19 , pp. 434-448
- Huang, Y.¹ Kallenberg, L.C.M.²

53
- 0002775664
- Linear programming and finite Markovian control problems
- Amsterdam
- Kallenberg LCM (1983) Linear programming and finite Markovian control problems. Mathematical Centre Tracts 148, Amsterdam
- (1983) Mathematical Centre Tracts , vol.148
- Kallenberg, L.C.M.¹

54
- 0141754345
- Survey of linear programming for standard and nonstandard Markovian control problems, Part I: Theory
- Kallenberg LCM (1994) Survey of linear programming for standard and nonstandard Markovian control problems, Part I: Theory. ZOR 40:1-42
- (1994) ZOR , vol.40 , pp. 1-42
- Kallenberg, L.C.M.¹

55
- 0023384370
- A variance minimization problem for a Markov decision process
- Kawai H (1987) A variance minimization problem for a Markov decision process. European Journal of operations Research 31:140-145
- (1987) European Journal of Operations Research , vol.31 , pp. 140-145
- Kawai, H.¹

56
- 0013260916
- Once more about the connection between elliptic operators and Ito's stochastic equations
- Krylov N et al. (eds.) New York
- Krylov N (1985) Once more about the connection between elliptic operators and Ito's stochastic equations. In: Krylov N et al. (eds.) Statistics and control of stochastic processes, Steklov Seminar 1984 Optimization Software, New York, pp. 69-101
- (1985) Statistics and Control of Stochastic Processes, Steklov Seminar 1984 Optimization Software , pp. 69-101
- Krylov, N.¹

57
- 1942437630
- Mathematical programming and the control of Markov chains
- Kushner H, Kleinman J (1971) Mathematical programming and the control of Markov chains. Internat. J. Control 13:801-820
- (1971) Internat. J. Control , vol.13 , pp. 801-820
- Kushner, H.¹ Kleinman, J.²

58
- 38149145972
- Average optimal stationary policies and Linear programming in countable state Markov decision processes
- Lasserre JB (1994) Average optimal stationary policies and Linear programming in countable state Markov decision processes. J. Math. Anal. Appl. 183:233-249
- (1994) J. Math. Anal. Appl. , vol.183 , pp. 233-249
- Lasserre, J.B.¹

59
- 0001257766
- Linear programming and sequential decisions
- Manne AS (1960) Linear programming and sequential decisions. Management Science 6:259-267
- (1960) Management Science , vol.6 , pp. 259-267
- Manne, A.S.¹

60
- 0003503424
- Optimal Control of Random Sequences in Problems with Constraints
- Kluwer Academic Publishers
- Piunovskiy AB (1997) Optimal Control of Random Sequences in Problems with Constraints. Mathematics and its Applications, Kluwer Academic Publishers
- (1997) Mathematics and Its Applications
- Piunovskiy, A.B.¹

61
- 0003998452
- John Wiley & Sons, New York
- Puterman M (1994), Markov decision processes. John Wiley & Sons, New York
- (1994) Markov Decision Processes
- Puterman, M.¹

62
- 0000066148
- Algorithms for stochastic games - A survey
- Raghavan TES, Filar JA (1991) Algorithms for stochastic games - a survey. ZOR 35:437-472
- (1991) ZOR , vol.35 , pp. 437-472
- Raghavan, T.E.S.¹ Filar, J.A.²

63
- 0003450909
- Society for Industrial and Applied Mathematics, 2nd printing, Philadelphia
- Rockafellar RT (1989) Conjugate duality and optimization. Society for Industrial and Applied Mathematics, 2nd printing, Philadelphia
- (1989) Conjugate Duality and Optimization.
- Rockafellar, R.T.¹

64
- 0024664332
- Randomized and past-dependent policies for Markov decision processes with multiple constraints
- Ross KW (1989) Randomized and past-dependent policies for Markov decision processes with multiple constraints. Operations Research 37:474-477
- (1989) Operations Research , vol.37 , pp. 474-477
- Ross, K.W.¹

65
- 0024737381
- Markov decision processes with sample path constraints: The communicating case
- Ross K, Varadarajan R (1989) Markov decision processes with sample path constraints: The communicating case. Operations Research 37:780-790
- (1989) Operations Research , vol.37 , pp. 780-790
- Ross, K.¹ Varadarajan, R.²

66
- 0001172487
- Multichain Markov decision processes with a sample path constraint: A decomposition approach
- Ross K, Varadarajan R (1991) Multichain Markov decision processes with a sample path constraint: A decomposition approach. Math. of Operations Research 16:195-207
- (1991) Math. of Operations Research , vol.16 , pp. 195-207
- Ross, K.¹ Varadarajan, R.²

67
- 0003655416
- Macmillan publishing Company, New York
- Royden HL (1988) Real analysis. 3rd Edition, Macmillan publishing Company, New York
- (1988) Real Analysis. 3rd Edition
- Royden, H.L.¹

68
- 0000392613
- Stochastic games
- Shapley LS (1953) Stochastic games. Proceedings Nat. Acad. of Science USA 39:1095-1100
- (1953) Proceedings Nat. Acad. of Science USA , vol.39 , pp. 1095-1100
- Shapley, L.S.¹

69
- 84974288946
- Constrained discounted Markov decision chains
- Sennott LI (1991) Constrained discounted Markov decision chains. Probability in the Engineering and Informational Sciences 5:463-475
- (1991) Probability in the Engineering and Informational Sciences , vol.5 , pp. 463-475
- Sennott, L.I.¹

70
- 0001839287
- Constrained average cost Markov decision chains
- Sennott LI (1993) Constrained average cost Markov decision chains. Probability in the Engineering and Informational Sciences 7:69-83
- (1993) Probability in the Engineering and Informational Sciences , vol.7 , pp. 69-83
- Sennott, L.I.¹

71
- 0022189863
- Maximal mean/standard deviation ratio in undiscounted MDP
- Sobel MJ (1985) Maximal mean/standard deviation ratio in undiscounted MDP. OR Letters 4:157-159
- (1985) OR Letters , vol.4 , pp. 157-159
- Sobel, M.J.¹

72
- 0009598143
- Mean-variance tradeoffs in an undiscounted MDP
- Sobel MJ (1994) Mean-variance tradeoffs in an undiscounted MDP. Operations Research 42:175-188
- (1994) Operations Research , vol.42 , pp. 175-188
- Sobel, M.J.¹

73
- 0012219344
- Ph.D. thesis, University of Leiden
- Spieksma FM (1990) Geometrically ergodic Markov chains and the optimal control of queues. Ph.D. thesis, University of Leiden
- (1990) Geometrically Ergodic Markov Chains and the Optimal Control of Queues
- Spieksma, F.M.¹

74
- 1942533658
- Continuity of optimal values and solutions of convex optimization, and constrained control of Markov chains
- submitted
- Tidball M, Altmian A (1996) Continuity of optimal values and solutions of convex optimization, and constrained control of Markov chains. SIAM J. Control and Optimization, submitted
- (1996) SIAM J. Control and Optimization
- Tidball, M.¹ Altmian, A.²

75
- 0003319773
- Stochastic dynamic programming
- Mathematisch Centrum, Amsterdam
- Van Der Wal J (1981) Stochastic dynamic programming. Mathematical Centre Tract 139, Mathematisch Centrum, Amsterdam
- (1981) Mathematical Centre Tract , vol.139
- Van Der Wal, J.¹

76
- 1942501821
- A mathematical programming approach to a problem in variance penalised Markov decision processes
- White DJ (1994) A mathematical programming approach to a problem in variance penalised Markov decision processes. OR Spectrum 15:225-230
- (1994) OR Spectrum , vol.15 , pp. 225-230
- White, D.J.¹

77
- 0004228766
- Cambridge University Press
- Williams D (1992) Probability and martingales. Cambridge University Press
- (1992) Probability and Martingales
- Williams, D.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.