메뉴 건너뛰기




Volumn 48, Issue 3, 1998, Pages 387-417

Constrained Markov decision processes with total cost criteria: Lagrangian approach and dual linear program

Author keywords

Countable state space; Extra constraints; Markov decision process; Total cost

Indexed keywords


EID: 1942424978     PISSN: 14322994     EISSN: None     Source Type: Journal    
DOI: 10.1007/s001860050035     Document Type: Article
Times cited : (93)

References (77)
  • 1
    • 0342888467 scopus 로고
    • Asymptotic properties of constrained Markov decision processes
    • Altman E (1993) Asymptotic properties of constrained Markov decision processes. ZOR 37:151-170
    • (1993) ZOR , vol.37 , pp. 151-170
    • Altman, E.1
  • 2
    • 0001061406 scopus 로고
    • Denumerable constrained Markov Decision Processes and finite approximations
    • Altman E (1994) Denumerable constrained Markov Decision Processes and finite approximations. Math. of Operations Research 19:169-191
    • (1994) Math. of Operations Research , vol.19 , pp. 169-191
    • Altman, E.1
  • 3
    • 0009459044 scopus 로고    scopus 로고
    • Constrained Markov decision processes with total cost criteria: Occupation measures and primal LP
    • Altman E (1996) Constrained Markov decision processes with total cost criteria: Occupation measures and primal LP. ZOR 43:45-72
    • (1996) ZOR , vol.43 , pp. 45-72
    • Altman, E.1
  • 4
    • 1942501834 scopus 로고
    • Constrained Markov decision processes
    • Altman E (1995) Constrained Markov decision processes. Book in preparation
    • (1995) Book in Preparation
    • Altman, E.1
  • 5
    • 0026188613 scopus 로고
    • Markov decision problems and state-action frequencies
    • Altman E, Shwartz A (1991a) Markov decision problems and state-action frequencies. SIAM J. Control and Optimization 29:786-809
    • (1991) SIAM J. Control and Optimization , vol.29 , pp. 786-809
    • Altman, E.1    Shwartz, A.2
  • 6
    • 0027699625 scopus 로고
    • Time-sharing policies for controlled Markov chains
    • Altman E, Shwartz A (1993) Time-sharing policies for controlled Markov chains. Operations Research 41:1116-1124
    • (1993) Operations Research , vol.41 , pp. 1116-1124
    • Altman, E.1    Shwartz, A.2
  • 8
    • 0342839405 scopus 로고
    • The linear program approach in Markov decision problems revisited
    • Altman E, Spieksma F (1995) The linear program approach in Markov decision problems revisited. ZOR 42:169-188
    • (1995) ZOR , vol.42 , pp. 169-188
    • Altman, E.1    Spieksma, F.2
  • 9
    • 1942469570 scopus 로고
    • Rate of convergence of empirical measures and costs in controlled Markov chains and transient optimality
    • Altman E, Zeitouni O (1994) Rate of convergence of empirical measures and costs in controlled Markov chains and transient optimality. Math, of Operations Research 19:955-974
    • (1994) Math, of Operations Research , vol.19 , pp. 955-974
    • Altman, E.1    Zeitouni, O.2
  • 16
    • 0008860655 scopus 로고
    • Time-average optimal constrained semi-Markov decision processes
    • Beutler FJ, Ross KW (1986) Time-average optimal constrained semi-Markov decision processes. Advances of Applied Probability 18:341-359
    • (1986) Advances of Applied Probability , vol.18 , pp. 341-359
    • Beutler, F.J.1    Ross, K.W.2
  • 17
    • 0343709784 scopus 로고
    • A convex analytic approach to Markov decision processes
    • Borkar VS (1988) A convex analytic approach to Markov decision processes. Prob. Th. Rel. Fields 78:583-602
    • (1988) Prob. Th. Rel. Fields , vol.78 , pp. 583-602
    • Borkar, V.S.1
  • 19
    • 0028336218 scopus 로고
    • Ergodic control of Markov Chains with constraints - The general case
    • Borkar VS (1994) Ergodic control of Markov Chains with constraints - the general case. SIAM J. Control and Optimization 32:176-186
    • (1994) SIAM J. Control and Optimization , vol.32 , pp. 176-186
    • Borkar, V.S.1
  • 20
    • 0002885049 scopus 로고
    • Equivalence of Lyapunov stability criteria in a class of Markov decision processes
    • Cavazos-Cadena R, Hernaández-Lerma O (1992) Equivalence of Lyapunov stability criteria in a class of Markov decision processes. Appl. Math. Optim. 26:113-137
    • (1992) Appl. Math. Optim. , vol.26 , pp. 113-137
    • Cavazos-Cadena, R.1    Hernaández-Lerma, O.2
  • 22
    • 0001554538 scopus 로고
    • On linear programming in a Markov decision problem
    • Denardo EV (1970a) On linear programming in a Markov decision problem. Management Science 16:281-288
    • (1970) Management Science , vol.16 , pp. 281-288
    • Denardo, E.V.1
  • 24
    • 0343861205 scopus 로고
    • Sur un problème de production et de stockage dans l'aléatoire
    • D'Epenoux F (1960) Sur un problème de production et de stockage dans l'aléatoire. Revue Française de Recherche Opérationelle 14:3-16
    • (1960) Revue Française de Recherche Opérationelle , vol.14 , pp. 3-16
    • D'Epenoux, F.1
  • 25
    • 0006464452 scopus 로고
    • A probabilistic production and inventory problem
    • D'Epenoux F (1963) A probabilistic production and inventory problem. Management Science 10:98-108
    • (1963) Management Science , vol.10 , pp. 98-108
    • D'Epenoux, F.1
  • 27
    • 0009990403 scopus 로고
    • Some remarks on finite horizon Markovian decision models
    • Derman C, Klein M (1965) Some remarks on finite horizon Markovian decision models. Operations research 13:272-278
    • (1965) Operations Research , vol.13 , pp. 272-278
    • Derman, C.1    Klein, M.2
  • 28
    • 0000883929 scopus 로고
    • On memoryless rules for controlling sequential control processes
    • Derman C, Strauch RE (1966) On memoryless rules for controlling sequential control processes. Ann. Math. Stat 37:276-278
    • (1966) Ann. Math. Stat , vol.37 , pp. 276-278
    • Derman, C.1    Strauch, R.E.2
  • 32
    • 1942528886 scopus 로고
    • Geometric convergence of value-iteration in multichain Markov decision problems
    • Federgruen A (1979) Geometric convergence of value-iteration in multichain Markov decision problems. Adv. Appl. Prob. 11:188-217
    • (1979) Adv. Appl. Prob. , vol.11 , pp. 188-217
    • Federgruen, A.1
  • 33
    • 0342988500 scopus 로고
    • Sufficient classes of strategies in discrete dynamic programming I: Decomposition of randomized strategies and embedded models
    • Feinberg EA (1986) Sufficient classes of strategies in discrete dynamic programming I: Decomposition of randomized strategies and embedded models. SIAM Theory Probab. Appl., 31:658-668
    • (1986) SIAM Theory Probab. Appl. , vol.31 , pp. 658-668
    • Feinberg, E.A.1
  • 34
    • 0000049211 scopus 로고
    • Constrained semi-Markov decision processes with average rewards
    • Feinberg EA (1995) Constrained semi-Markov decision processes with average rewards. ZOR 39:257-288
    • (1995) ZOR , vol.39 , pp. 257-288
    • Feinberg, E.A.1
  • 36
    • 0343424031 scopus 로고
    • Stationary and Markov policies in countable state dynamic programming
    • Feinberg EA, Sonin I (1983) Stationary and Markov policies in countable state dynamic programming. Lecture notes in Mathematics 1021:111-129
    • (1983) Lecture Notes in Mathematics , vol.1021 , pp. 111-129
    • Feinberg, E.A.1    Sonin, I.2
  • 38
    • 0030357631 scopus 로고
    • Notes on equivalent stationary policies in Markov decision processes with total rewards
    • Feinberg EA, Sonin I (1995) Notes on equivalent stationary policies in Markov decision processes with total rewards. ZOR 44:205-221
    • (1995) ZOR , vol.44 , pp. 205-221
    • Feinberg, E.A.1    Sonin, I.2
  • 42
    • 0347701478 scopus 로고
    • An example in denumerable decision processes
    • Fisher L, Ross SM (1968) An example in denumerable decision processes. Ann. Math. Stat. 39:674-675
    • (1968) Ann. Math. Stat. , vol.39 , pp. 674-675
    • Fisher, L.1    Ross, S.M.2
  • 43
    • 1942469561 scopus 로고
    • Generalized linear programming in Markovian decision problems
    • Heilmann WR (1977) Generalized linear programming in Markovian decision problems. Bonner Math. Schriften 98:33-39
    • (1977) Bonner Math. Schriften , vol.98 , pp. 33-39
    • Heilmann, W.R.1
  • 44
    • 0017933941 scopus 로고
    • Solving stochastic dynamic programming problems by linear programming - An annotated bibliography
    • Heilmann WR (1978) Solving stochastic dynamic programming problems by linear programming - an annotated bibliography. Z. Oper. Res. 22:43-53
    • (1978) Z. Oper. Res. , vol.22 , pp. 43-53
    • Heilmann, W.R.1
  • 45
    • 0345809499 scopus 로고
    • Discounted cost Markov decision processes on Borel spaces: The linear programming formulation
    • Hernández-Lerma O, Hernández-Hernández D (1994) Discounted cost Markov decision processes on Borel spaces: The linear programming formulation. J. of Math. Anal. and Appl. 183:335-351
    • (1994) J. of Math. Anal. and Appl. , vol.183 , pp. 335-351
    • Hernández-Lerma, O.1    Hernández-Hernández, D.2
  • 46
    • 0028397545 scopus 로고
    • Linear programming and average optimality on Borel spaces-unbounded costs
    • Hernández-Lerma O, Lasserre JB (1994) Linear programming and average optimality on Borel spaces-unbounded costs. SIAM J. Control and Optimization 32:480-500
    • (1994) SIAM J. Control and Optimization , vol.32 , pp. 480-500
    • Hernández-Lerma, O.1    Lasserre, J.B.2
  • 48
    • 0008627283 scopus 로고
    • Dynamic programming and Markov potential theory
    • Mathematisch Centrum, Amsterdam
    • Hordijk A (1977) Dynamic programming and Markov potential theory. Second Edition, Mathematical Centre Tracts 51, Mathematisch Centrum, Amsterdam
    • (1977) Second Edition, Mathematical Centre Tracts , vol.51
    • Hordijk, A.1
  • 49
    • 0018455841 scopus 로고
    • Linear programming and Markov decision chains
    • Hordijk A, Kallenberg LCM (1979) Linear programming and Markov decision chains. Management Science 25:352-362
    • (1979) Management Science , vol.25 , pp. 352-362
    • Hordijk, A.1    Kallenberg, L.C.M.2
  • 51
    • 0344325087 scopus 로고
    • Linear programming formulation of MDPs in countable state space: The multichain case
    • Hordijk A, Lasserre JB (1994) Linear programming formulation of MDPs in countable state space: The multichain case. ZOR Research 40:91-108
    • (1994) ZOR Research , vol.40 , pp. 91-108
    • Hordijk, A.1    Lasserre, J.B.2
  • 52
    • 0001324526 scopus 로고
    • On finding optimal policies for Markov decision chains: A unifying framework for mean-variance tradeoffs
    • Huang Y, Kallenberg LCM (1994) On finding optimal policies for Markov decision chains: A unifying framework for mean-variance tradeoffs. Math. of Operations Research 19:434-448
    • (1994) Math. of Operations Research , vol.19 , pp. 434-448
    • Huang, Y.1    Kallenberg, L.C.M.2
  • 53
    • 0002775664 scopus 로고
    • Linear programming and finite Markovian control problems
    • Amsterdam
    • Kallenberg LCM (1983) Linear programming and finite Markovian control problems. Mathematical Centre Tracts 148, Amsterdam
    • (1983) Mathematical Centre Tracts , vol.148
    • Kallenberg, L.C.M.1
  • 54
    • 0141754345 scopus 로고
    • Survey of linear programming for standard and nonstandard Markovian control problems, Part I: Theory
    • Kallenberg LCM (1994) Survey of linear programming for standard and nonstandard Markovian control problems, Part I: Theory. ZOR 40:1-42
    • (1994) ZOR , vol.40 , pp. 1-42
    • Kallenberg, L.C.M.1
  • 55
    • 0023384370 scopus 로고
    • A variance minimization problem for a Markov decision process
    • Kawai H (1987) A variance minimization problem for a Markov decision process. European Journal of operations Research 31:140-145
    • (1987) European Journal of Operations Research , vol.31 , pp. 140-145
    • Kawai, H.1
  • 57
    • 1942437630 scopus 로고
    • Mathematical programming and the control of Markov chains
    • Kushner H, Kleinman J (1971) Mathematical programming and the control of Markov chains. Internat. J. Control 13:801-820
    • (1971) Internat. J. Control , vol.13 , pp. 801-820
    • Kushner, H.1    Kleinman, J.2
  • 58
    • 38149145972 scopus 로고
    • Average optimal stationary policies and Linear programming in countable state Markov decision processes
    • Lasserre JB (1994) Average optimal stationary policies and Linear programming in countable state Markov decision processes. J. Math. Anal. Appl. 183:233-249
    • (1994) J. Math. Anal. Appl. , vol.183 , pp. 233-249
    • Lasserre, J.B.1
  • 59
    • 0001257766 scopus 로고
    • Linear programming and sequential decisions
    • Manne AS (1960) Linear programming and sequential decisions. Management Science 6:259-267
    • (1960) Management Science , vol.6 , pp. 259-267
    • Manne, A.S.1
  • 60
    • 0003503424 scopus 로고    scopus 로고
    • Optimal Control of Random Sequences in Problems with Constraints
    • Kluwer Academic Publishers
    • Piunovskiy AB (1997) Optimal Control of Random Sequences in Problems with Constraints. Mathematics and its Applications, Kluwer Academic Publishers
    • (1997) Mathematics and Its Applications
    • Piunovskiy, A.B.1
  • 62
    • 0000066148 scopus 로고
    • Algorithms for stochastic games - A survey
    • Raghavan TES, Filar JA (1991) Algorithms for stochastic games - a survey. ZOR 35:437-472
    • (1991) ZOR , vol.35 , pp. 437-472
    • Raghavan, T.E.S.1    Filar, J.A.2
  • 63
    • 0003450909 scopus 로고
    • Society for Industrial and Applied Mathematics, 2nd printing, Philadelphia
    • Rockafellar RT (1989) Conjugate duality and optimization. Society for Industrial and Applied Mathematics, 2nd printing, Philadelphia
    • (1989) Conjugate Duality and Optimization.
    • Rockafellar, R.T.1
  • 64
    • 0024664332 scopus 로고
    • Randomized and past-dependent policies for Markov decision processes with multiple constraints
    • Ross KW (1989) Randomized and past-dependent policies for Markov decision processes with multiple constraints. Operations Research 37:474-477
    • (1989) Operations Research , vol.37 , pp. 474-477
    • Ross, K.W.1
  • 65
    • 0024737381 scopus 로고
    • Markov decision processes with sample path constraints: The communicating case
    • Ross K, Varadarajan R (1989) Markov decision processes with sample path constraints: The communicating case. Operations Research 37:780-790
    • (1989) Operations Research , vol.37 , pp. 780-790
    • Ross, K.1    Varadarajan, R.2
  • 66
    • 0001172487 scopus 로고
    • Multichain Markov decision processes with a sample path constraint: A decomposition approach
    • Ross K, Varadarajan R (1991) Multichain Markov decision processes with a sample path constraint: A decomposition approach. Math. of Operations Research 16:195-207
    • (1991) Math. of Operations Research , vol.16 , pp. 195-207
    • Ross, K.1    Varadarajan, R.2
  • 71
    • 0022189863 scopus 로고
    • Maximal mean/standard deviation ratio in undiscounted MDP
    • Sobel MJ (1985) Maximal mean/standard deviation ratio in undiscounted MDP. OR Letters 4:157-159
    • (1985) OR Letters , vol.4 , pp. 157-159
    • Sobel, M.J.1
  • 72
    • 0009598143 scopus 로고
    • Mean-variance tradeoffs in an undiscounted MDP
    • Sobel MJ (1994) Mean-variance tradeoffs in an undiscounted MDP. Operations Research 42:175-188
    • (1994) Operations Research , vol.42 , pp. 175-188
    • Sobel, M.J.1
  • 74
    • 1942533658 scopus 로고    scopus 로고
    • Continuity of optimal values and solutions of convex optimization, and constrained control of Markov chains
    • submitted
    • Tidball M, Altmian A (1996) Continuity of optimal values and solutions of convex optimization, and constrained control of Markov chains. SIAM J. Control and Optimization, submitted
    • (1996) SIAM J. Control and Optimization
    • Tidball, M.1    Altmian, A.2
  • 75
    • 0003319773 scopus 로고
    • Stochastic dynamic programming
    • Mathematisch Centrum, Amsterdam
    • Van Der Wal J (1981) Stochastic dynamic programming. Mathematical Centre Tract 139, Mathematisch Centrum, Amsterdam
    • (1981) Mathematical Centre Tract , vol.139
    • Van Der Wal, J.1
  • 76
    • 1942501821 scopus 로고
    • A mathematical programming approach to a problem in variance penalised Markov decision processes
    • White DJ (1994) A mathematical programming approach to a problem in variance penalised Markov decision processes. OR Spectrum 15:225-230
    • (1994) OR Spectrum , vol.15 , pp. 225-230
    • White, D.J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.