메뉴 건너뛰기




Volumn , Issue , 2021, Pages 1-257

Constrained Markov Decision Processes

(1)  Altman, Eitan a  

a INRIA   (France)

Author keywords

[No Author keywords available]

Indexed keywords


EID: 85126423720     PISSN: None     EISSN: None     Source Type: Book    
DOI: 10.1201/9781315140223     Document Type: Book
Times cited : (1126)

References (173)
  • 1
    • 0342888467 scopus 로고
    • Asymptotic properties of constrained Markov decision processes
    • E. Altman (1993), 'Asymptotic properties of constrained Markov decision processes', ZOR-Methods and Models in Operations Research, 37, Issue 2, pp. 151-170.
    • (1993) ZOR-Methods and Models in Operations Research , vol.37 , Issue.2 , pp. 151-170
    • Altman, E.1
  • 2
    • 0001061406 scopus 로고
    • Denumerable constrained Markov decision processes and finite approximations
    • E. Altman (1994), 'Denumerable constrained Markov decision processes and finite approximations', Math, of Operations Research, 19, pp. 169-191.
    • (1994) Math, of Operations Research , vol.19 , pp. 169-191
    • Altman, E.1
  • 3
    • 0009459044 scopus 로고    scopus 로고
    • Constrained Markov decision processes with total cost criteria: Occupation measures and primal LP
    • E. Altman (1996), 'Constrained Markov decision processes with total cost criteria: occupation measures and primal LP', ZOR-Mathematical Methods in Operations Research, 43, Issue 1, pp. 45-72.
    • (1996) ZOR-Mathematical Methods in Operations Research , vol.43 , Issue.1 , pp. 45-72
    • Altman, E.1
  • 4
    • 1942424978 scopus 로고    scopus 로고
    • Constrained Markov decision processes with total cost criteria: Lagrange approach and dual LP
    • E. Altman (1998), 'Constrained Markov decision processes with total cost criteria: Lagrange approach and dual LP, ZOR-Mathematical Methods in Operations Research, 48, pp. 387-417, 1998.
    • (1998) ZOR-Mathematical Methods in Operations Research , vol.48 , pp. 387-417
    • Altman, E.1
  • 5
    • 0027607552 scopus 로고
    • Stability and singular perturbations in constrained Markov decision problems
    • E. Altman and V. A. Gaitsgory (1993), 'Stability and singular perturbations in constrained Markov decision problems', IEEE Transactions on Automatic Control, 38, pp. 971-975.
    • (1993) IEEE Transactions on Automatic Control , vol.38 , pp. 971-975
    • Altman, E.1    Gaitsgory, V.A.2
  • 8
    • 0031210649 scopus 로고    scopus 로고
    • Contraction conditions for average and α-discount optimality in countable state Markov games with unbounded rewards
    • E. Altman, A. Hordijk and F. M. Spieksma (1997), 'Contraction conditions for average and α-discount optimality in countable state Markov games with unbounded rewards', MOR, 22 No. 3, pp. 588-618.
    • (1997) MOR , vol.22 , Issue.3 , pp. 588-618
    • Altman, E.1    Hordijk, A.2    Spieksma, F.M.3
  • 9
    • 0024173282 scopus 로고
    • Markov optimization problems: State-action frequencies revisited
    • Austin, Texas, December 1988 (invited paper)
    • E. Altman and A. Shwartz (1988), 'Markov optimization problems: state-action frequencies revisited', 27th IEEE Conference on Decision and Control, Austin, Texas, December 1988 (invited paper).
    • (1988) 27th IEEE Conference on Decision and Control
    • Altman, E.1    Shwartz, A.2
  • 10
  • 11
    • 0026188613 scopus 로고
    • tMarkov decision problems and state-action frequencies'
    • E. Altman and A. Shwartz (1991a), tMarkov decision problems and state-action frequencies', SIAM J, Control and Optimization, 29, pp. 786-809.
    • (1991) SIAM J, Control and Optimization , vol.29 , pp. 786-809
    • Altman, E.1    Shwartz, A.2
  • 13
    • 0001843403 scopus 로고
    • Sensitivity of constrained Markov Decision Problems
    • E. Altman and A. Shwartz (1991c), 'Sensitivity of constrained Markov Decision Problems', Annals of Operations Research, 32, pp. 1-22.
    • (1991) Annals of Operations Research , vol.32 , pp. 1-22
    • Altman, E.1    Shwartz, A.2
  • 14
    • 0000235370 scopus 로고
    • Adaptive control of constrained Markov chains: Criteria and policies
    • 28, special issue on 'Markov Decision Processes', Eds. O. HemAndez-Lerma and J. B. Lasserre
    • E. Altman and A. Shwartz (1991d), 'Adaptive control of constrained Markov chains: criteria and policies', Annals of Operations Research 28, special issue on 'Markov Decision Processes', Eds. O. HemAndez-Lerma and J. B. Lasserre, pp. 101-134.
    • (1991) Annals of Operations Research , pp. 101-134
    • Altman, E.1    Shwartz, A.2
  • 15
    • 0027699625 scopus 로고
    • Time-sharing policies for controlled Markov chains
    • E. Altman and A. Shwartz (1993), 'Time-sharing policies for controlled Markov chains', Operations Research, 41, pp. 1116-1124.
    • (1993) Operations Research , vol.41 , pp. 1116-1124
    • Altman, E.1    Shwartz, A.2
  • 16
  • 17
    • 0342839405 scopus 로고
    • The Linear Program approach in Markov decision problems revisited
    • E. Altman and F. Spieksma (1995), 'The Linear Program approach in Markov decision problems revisited', ZOR-Methods and Models in Operations Research, 42, Issue 2, pp. 169-188.
    • (1995) ZOR-Methods and Models in Operations Research , vol.42 , Issue.2 , pp. 169-188
    • Altman, E.1    Spieksma, F.2
  • 18
    • 1942469570 scopus 로고
    • Rate of convergence of empirical measures and costs in controlled Markov chains and transient optimality
    • E. Altman and O. Zeitouni (1994), 'Rate of convergence of empirical measures and costs in controlled Markov chains and transient optimality', Math, of Operations Research, 19, pp. 955-974.
    • (1994) Math, of Operations Research , vol.19 , pp. 955-974
    • Altman, E.1    Zeitouni, O.2
  • 21
    • 0004099213 scopus 로고
    • Springer-Verlag, Berlin, Heidelberg, New York, London, Paris, Tokyo, Hong Kong, Barcelona, Budapest
    • J. P. Aubin (1993), Optima and Equilibria, An Introduction to Nonlinear Analysis, Springer-Verlag, Berlin, Heidelberg, New York, London, Paris, Tokyo, Hong Kong, Barcelona, Budapest.
    • (1993) Optima and Equilibria, An Introduction to Nonlinear Analysis
    • Aubin, J.P.1
  • 22
    • 0002829118 scopus 로고
    • Mixed and behavior strategies in infinite extensive games
    • Advances in Game Theory
    • R. J. Aumann (1964), 'Mixed and behavior strategies in infinite extensive games', Advances in Game Theory, Ann. Math. Study, 52, pp. 627-650.
    • (1964) Ann. Math. Study , vol.52 , pp. 627-650
    • Aumann, R.J.1
  • 23
    • 0015743763 scopus 로고
    • Optimal decision procedures for finite Markov chains. Part II: Communicating systems
    • J. Bather (1973), 'Optimal decision procedures for finite Markov chains. Part II: Communicating systems', Advances in Applied Probability, 5, pp. 521-540.
    • (1973) Advances in Applied Probability , vol.5 , pp. 521-540
    • Bather, J.1
  • 24
    • 0001572612 scopus 로고
    • Variability sensitive Markov decision processes
    • M. Bayal-Gursoy and K. W. Ross (1992), 'Variability sensitive Markov decision processes', Math, of Operations Research, 17, pp. 558-571.
    • (1992) Math, of Operations Research , vol.17 , pp. 558-571
    • Bayal-Gursoy, M.1    Ross, K.W.2
  • 25
    • 0026682237 scopus 로고
    • Information and strategies in dynamic games
    • P. Bernhard (1992), 'Information and strategies in dynamic games', SIAM J. Cont. and Opt., 30, pp. 212-228.
    • (1992) SIAM J. Cont. and Opt , vol.30 , pp. 212-228
    • Bernhard, P.1
  • 26
    • 0022151359 scopus 로고
    • Optimal policies for controlled Markov chains with a constraint
    • F. J. Beutler and K. W. Ross (1985), 'Optimal policies for controlled Markov chains with a constraint', J. Mathematical Analysis and Applications, 112, 236-252.
    • (1985) J. Mathematical Analysis and Applications , vol.112 , pp. 236-252
    • Beutler, F.J.1    Ross, K.W.2
  • 27
    • 0008860655 scopus 로고
    • Time-average optimal constrained Semi-Markov decision processes
    • F. J. Beutler and K. W. Ross (1986), 'Time-average optimal constrained Semi-Markov decision processes', Advances of Applied Probability, 18, pp. 341-359.
    • (1986) Advances of Applied Probability , vol.18 , pp. 341-359
    • Beutler, F.J.1    Ross, K.W.2
  • 29
    • 0022698417 scopus 로고
    • Designing approximating schemes for stochastic optimization problems
    • J. R. Birge and R. J. Wets (1986), 'Designing approximating schemes for stochastic optimization problems', Math. Programm. Study, 27, pp. 54-102.
    • (1986) Math. Programm. Study , vol.27 , pp. 54-102
    • Birge, J.R.1    Wets, R.J.2
  • 30
    • 0021520222 scopus 로고
    • On minimum cost per unit time control of Markov chains
    • V. S. Borkar (1983), 'On minimum cost per unit time control of Markov chains', SIAM J. Control Optim., 22, pp. 965-978.
    • (1983) SIAM J. Control Optim , vol.22 , pp. 965-978
    • Borkar, V.S.1
  • 31
    • 0343709784 scopus 로고
    • A convex analytic approach to Markov decision processes
    • V. S. Borkar (1988), 'A convex analytic approach to Markov decision processes', Prob. Th. Rel. Fields, 78, pp. 583-602.
    • (1988) Prob. Th. Rel. Fields , vol.78 , pp. 583-602
    • Borkar, V.S.1
  • 33
    • 38248999449 scopus 로고
    • Controlled diffusions with constraints
    • V. S. Borkar (1993), 'Controlled diffusions with constraints, IΓ, Journal of Math. Analysis and Appli., 176, No. 2, pp. 310-321.
    • (1993) Journal of Math. Analysis and Appli. , vol.176 , Issue.2 , pp. 310-321
    • Borkar, V.S.1
  • 34
    • 0028336218 scopus 로고
    • Ergodic control of Markov Chains with constraints-the general case
    • V. S. Borkar (1994), 'Ergodic control of Markov Chains with constraints-the general case', SIAM J. Control and Optimization, 32, pp. 176-186.
    • (1994) SIAM J. Control and Optimization , vol.32 , pp. 176-186
    • Borkar, V.S.1
  • 36
    • 1642535247 scopus 로고
    • The effect of delayed feedback information on network performance
    • A. D. Bovopoulos and A. A. Lazar (1991), 'The effect of delayed feedback information on network performance', Annals of Operations Res. 36, pp. 581-588.
    • (1991) Annals of Operations Res , vol.36 , pp. 581-588
    • Bovopoulos, A.D.1    Lazar, A.A.2
  • 38
    • 85126421284 scopus 로고
    • Finite-state approximations for denumerable state discounted Markov decision processes
    • R. Cavazos-Cadena (1986), 'Finite-state approximations for denumerable state discounted Markov decision processes', J. Applied Mathematics and Optimization, 14, pp. 27-47.
    • (1986) J. Applied Mathematics and Optimization , vol.14 , pp. 27-47
    • Cavazos-Cadena, R.1
  • 39
    • 0039451036 scopus 로고
    • tWeak conditions for the existence of optimal stationary policies in average cost Markov decision chains with unbounded cost
    • R. Cavazos-Cadena (1989), tWeak conditions for the existence of optimal stationary policies in average cost Markov decision chains with unbounded cost', Kybemetika, 25, 145-156.
    • (1989) Kybemetika , vol.25 , pp. 145-156
    • Cavazos-Cadena, R.1
  • 40
    • 34249833261 scopus 로고
    • Existence of optimal stationary policies in average Markov decision processes with a recurrent state
    • R. Cavazos-Cadena (1992), 'Existence of optimal stationary policies in average Markov decision processes with a recurrent state', Appl. Math. Optim., 26, pp. 171-194.
    • (1992) Appl. Math. Optim , vol.26 , pp. 171-194
    • Cavazos-Cadena, R.1
  • 41
    • 0002885049 scopus 로고
    • Equivalence of Lyapunov stability criteria in a class of Markov decision processes
    • R. Cavazos-Cadena and O. HernAndez-Lerma (1992), 'Equivalence of Lyapunov stability criteria in a class of Markov decision processes', Appl. Math. Optim., 26, pp. 113-137.
    • (1992) Appl. Math. Optim , vol.26 , pp. 113-137
    • Cavazos-Cadena, R.1    HernAndez-Lerma, O.2
  • 42
    • 0026811487 scopus 로고
    • Comparing recent assumptions for the existence of average optimal stationary policies
    • R. Cavazos-Cadena and L. I. Sennott (1992), 'Comparing recent assumptions for the existence of average optimal stationary policies', Operations Research Letters, 11, pp. 33-37.
    • (1992) Operations Research Letters , vol.11 , pp. 33-37
    • Cavazos-Cadena, R.1    Sennott, L.I.2
  • 45
    • 0001727463 scopus 로고
    • On the continuity of the minimum set of a continuous function
    • G. B. Dantzig, J. Folkman and N. Shapiro (1967), 'On the continuity of the minimum set of a continuous function', J. Math. Anal, and Applications, 17, pp. 519-548.
    • (1967) J. Math. Anal, and Applications , vol.17 , pp. 519-548
    • Dantzig, G.B.1    Folkman, J.2    Shapiro, N.3
  • 47
    • 0012793161 scopus 로고
    • Average, sensitive and Blackwell optimal policies in denumerable Markov decision chains with unbounded rewards
    • R. Dekker and A. Hordijk (1988), 'Average, sensitive and Blackwell optimal policies in denumerable Markov decision chains with unbounded rewards', Mathematics of Operations Research, 13, pp. 395-421.
    • (1988) Mathematics of Operations Research , vol.13 , pp. 395-421
    • Dekker, R.1    Hordijk, A.2
  • 48
    • 0012218373 scopus 로고
    • On the relation between recurrence and ergodicity properties in denumerable Markov decision chains
    • R. Dekker, A. Hordijk and F. M. Spieksma (1994), 'On the relation between recurrence and ergodicity properties in denumerable Markov decision chains', Math. Operat. Res., 19, pp. 539-559.
    • (1994) Math. Operat. Res , vol.19 , pp. 539-559
    • Dekker, R.1    Hordijk, A.2    Spieksma, F.M.3
  • 49
    • 0001554538 scopus 로고
    • On linear programming in a Markov decision problem
    • E. V. Denardo (1970), 'On linear programming in a Markov decision problem', Management Science, 16, pp. 281-288.
    • (1970) Management Science , vol.16 , pp. 281-288
    • Denardo, E.V.1
  • 50
    • 0038956073 scopus 로고
    • Multichain Markov renewal programs
    • E. V. Denardo and B. L. Fox (1968), 'Multichain Markov renewal programs', SIAM J. of Applied Math., 16, pp. 468-487.
    • (1968) SIAM J. of Applied Math , vol.16 , pp. 468-487
    • Denardo, E.V.1    Fox, B.L.2
  • 51
    • 0343861205 scopus 로고
    • Sur un problème de production et de stockage dans l'aléatoire
    • F. D'Epenoux (1960), 'Sur un problème de production et de stockage dans l'aléatoire', Revue Française de Recherche Opérationelle, 14, pp. 3-16.
    • (1960) Revue Française de Recherche Opérationelle , vol.14 , pp. 3-16
    • D'Epenoux, F.1
  • 52
    • 0006464452 scopus 로고
    • A probabilistic production and inventory problem
    • F. D'Epenoux (1963), 'A probabilistic production and inventory problem', Management Science, 10, 98-108.
    • (1963) Management Science , vol.10 , pp. 98-108
    • D'Epenoux, F.1
  • 54
    • 0009990403 scopus 로고
    • Some remarks on finite horizon Markovian decision models
    • C. Derman and M. Klein (1965), 'Some remarks on finite horizon Markovian decision models', Operations Research, 13, pp. 272-278.
    • (1965) Operations Research , vol.13 , pp. 272-278
    • Derman, C.1    Klein, M.2
  • 55
    • 0000883929 scopus 로고
    • On memoryless rules for controlling sequential control processes
    • C. Derman and R. E. Strauch (1966), 'On memoryless rules for controlling sequential control processes', Ann. Math. Stat., 37, pp. 276-278.
    • (1966) Ann. Math. Stat , vol.37 , pp. 276-278
    • Derman, C.1    Strauch, R.E.2
  • 56
    • 0347702026 scopus 로고
    • Constrained Markov decision chains
    • C. Derman and A. F. Veinott, Jr. (1972), 'Constrained Markov decision chains', Management Science, 19, pp. 389-390.
    • (1972) Management Science , vol.19 , pp. 389-390
    • Derman, C.1    Veinott, A.F.2
  • 57
    • 0004179052 scopus 로고
    • Springer-Verlag, New York, Berlin, Heidelberg, London, Paris, Tokyo, Hong Kong, Barcelona, Budapest
    • J. L. Doob (1994), Measure Theory, Springer-Verlag, New York, Berlin, Heidelberg, London, Paris, Tokyo, Hong Kong, Barcelona, Budapest.
    • (1994) Measure Theory
    • Doob, J.L.1
  • 58
    • 0003405647 scopus 로고
    • part I, John Wiley & Sons, New York, Chichester, Brisbane, Toronto, Singapore
    • N. Dunford and J. T. Schwartz (1988), Linear operators, part I, John Wiley & Sons, New York, Chichester, Brisbane, Toronto, Singapore.
    • (1988) Linear operators
    • Dunford, N.1    Schwartz, J.T.2
  • 60
    • 1942528886 scopus 로고
    • Geometric convergence of value-iteration in multichain Markov decision problems
    • A. Federgruen (1979), 'Geometric convergence of value-iteration in multichain Markov decision problems', Adv. Appl. Prob., 11, pp. 188-217.
    • (1979) Adv. Appl. Prob , vol.11 , pp. 188-217
    • Federgruen, A.1
  • 61
    • 0002610493 scopus 로고
    • Non-randomized Markov and semi-Markov strategies in dynamic programming
    • E. A. Feinberg (1982), 'Non-randomized Markov and semi-Markov strategies in dynamic programming', Theor. Probab. and its Applications, 27, pp. 116-126.
    • (1982) Theor. Probab. and its Applications , vol.27 , pp. 116-126
    • Feinberg, E.A.1
  • 62
    • 0342988500 scopus 로고
    • Sufficient classes of strategies in discrete dynamic programming I: Decomposition of randomized strategies and embedded models
    • E. A. Feinberg (1986), 'Sufficient classes of strategies in discrete dynamic programming I: decomposition of randomized strategies and embedded models', SIAM Theory Probab. Appl., 31, pp. 658-668.
    • (1986) SIAM Theory Probab. Appl , vol.31 , pp. 658-668
    • Feinberg, E.A.1
  • 63
    • 0342988500 scopus 로고
    • Sufficient classes of strategies in discrete dynamic programming
    • E. A. Feinberg (1986), 'Sufficient classes of strategies in discrete dynamic programming', Theory Probability Appl., 31, pp. 658-668.
    • (1986) Theory Probability Appl. , vol.31 , pp. 658-668
    • Feinberg, E.A.1
  • 64
    • 0008766469 scopus 로고
    • Non-randomized strategies in stochastic decision processes
    • E. A. Feinberg (1991), 'Non-randomized strategies in stochastic decision processes', Annals of Operations Research, 29, pp. 315-332.
    • (1991) Annals of Operations Research , vol.29 , pp. 315-332
    • Feinberg, E.A.1
  • 65
    • 0000049211 scopus 로고
    • Constrained semi-Markov decision processes with average rewards
    • E. A. Feinberg (1995), 'Constrained semi-Markov decision processes with average rewards', ZOR-Methods and Models in Operations Research, 39, pp. 257-288.
    • (1995) ZOR-Methods and Models in Operations Research , vol.39 , pp. 257-288
    • Feinberg, E.A.1
  • 66
    • 0030556310 scopus 로고    scopus 로고
    • Bicriterion optimization of an M/G/l queue with a removable server
    • E. A. Feinberg and D. J. Kim (1996), 'Bicriterion optimization of an M/G/l queue with a removable server', Probab. in the Eng. and Inf. Sciences, 10, pp. 57-73.
    • (1996) Probab. in the Eng. and Inf. Sciences , vol.10 , pp. 57-73
    • Feinberg, E.A.1    Kim, D.J.2
  • 68
    • 0343424031 scopus 로고
    • Stationary and Markov policies in countable state dynamic programming
    • E. A. Feinberg and I. Sonin (1983), 'Stationary and Markov policies in countable state dynamic programming', Lecture Notes in Mathematics, 1021, pp. 111-129.
    • (1983) Lecture Notes in Mathematics , vol.1021 , pp. 111-129
    • Feinberg, E.A.1    Sonin, I.2
  • 70
    • 0030357631 scopus 로고
    • Notes on equivalent stationary policies in Markov decision processes with total rewards
    • E. A. Feinberg and I. Sonin (1995), 'Notes on equivalent stationary policies in Markov decision processes with total rewards', ZOR-Methods and Models in Operations Research, 44, pp. 205-221.
    • (1995) ZOR-Methods and Models in Operations Research , vol.44 , pp. 205-221
    • Feinberg, E.A.1    Sonin, I.2
  • 71
    • 0000184142 scopus 로고
    • Constrained Markov decision models with weighted discounted rewards
    • E. A. Feinberg E. and A. Shwartz (1995), 'Constrained Markov decision models with weighted discounted rewards', Math, of Operations Research, 20, pp. 302-320.
    • (1995) Math, of Operations Research , vol.20 , pp. 302-320
    • Feinberg, E.A.E.1    Shwartz, A.2
  • 73
    • 0015970360 scopus 로고
    • Convergence properties of local solutions of convex optimization problems
    • A. V. Fiacco (1974), 'Convergence properties of local solutions of convex optimization problems', J. Optim. Theory Appl., 13, pp. 1-12.
    • (1974) J. Optim. Theory Appl , vol.13 , pp. 1-12
    • Fiacco, A.V.1
  • 76
    • 0012175283 scopus 로고
    • On recurrent denumerable decision processes
    • L. Fisher (1968), 'On recurrent denumerable decision processes', Ann. Math. Stat., 39, pp. 424-434.
    • (1968) Ann. Math. Stat , vol.39 , pp. 424-434
    • Fisher, L.1
  • 77
    • 0347701478 scopus 로고
    • An example in denumerable decision processes
    • L. Fisher and S. M. Ross (1968), 'An example in denumerable decision processes', Ann. Math. Stat., 39, pp. 674-675.
    • (1968) Ann. Math. Stat , vol.39 , pp. 674-675
    • Fisher, L.1    Ross, S.M.2
  • 78
    • 0022740027 scopus 로고
    • Perturbation theory for mathematical programming problems
    • V. A. Gaitsgory and A. A. Pervozvanskii (1986), 'Perturbation theory for mathematical programming problems', JOTA, pp. 389-410.
    • (1986) JOTA , pp. 389-410
    • Gaitsgory, V.A.1    Pervozvanskii, A.A.2
  • 79
    • 0002539999 scopus 로고
    • A statewide Pavement Management System
    • K. Golabi, R. B. Kulkarni and G. B. Way (1982), 'A statewide Pavement Management System', Interfaces, 12, pp. 5-21.
    • (1982) Interfaces , vol.12 , pp. 5-21
    • Golabi, K.1    Kulkarni, R.B.2    Way, G.B.3
  • 80
    • 0009421908 scopus 로고
    • On constrained Markov decision processes
    • M. Haviv (1995), 'On constrained Markov decision processes', OR Letters, 19, Issue 1, pp. 25-28.
    • (1995) OR Letters , vol.19 , Issue.1 , pp. 25-28
    • Haviv, M.1
  • 81
    • 1942469561 scopus 로고
    • Generalized linear programming in Markovian decision problems
    • W. R. Heilmann (1977), 'Generalized linear programming in Markovian decision problems', Bonner Math. Schriften, 98, pp. 33-39.
    • (1977) Bonner Math. Schriften , vol.98 , pp. 33-39
    • Heilmann, W.R.1
  • 82
    • 0017933941 scopus 로고
    • Solving stochastic dynamic programming prob-lems by linear programming-an annotated bibliography
    • W. R. Heilmann (1978), 'Solving stochastic dynamic programming prob-lems by linear programming-an annotated bibliography', Z. Oper. Res., 22, pp. 43-53.
    • (1978) Z. Oper. Res , vol.22 , pp. 43-53
    • Heilmann, W.R.1
  • 83
    • 0022662192 scopus 로고
    • Finite state approximations for denumerable multidimensional-state discounted Markov decision processes
    • O. Hernández-Lerma (1986), 'Finite state approximations for denumerable multidimensional-state discounted Markov decision processes', J. Mathematical Analysis and Applications, 113, pp. 382-389.
    • (1986) J. Mathematical Analysis and Applications , vol.113 , pp. 382-389
    • Hernández-Lerma, O.1
  • 85
    • 0345809499 scopus 로고
    • Discounted cost Markov decision processes on Borel spaces: The linear programming formulation
    • O. Hernández-Lerma and D. Hernández-Hernández (1994), 'Discounted cost Markov decision processes on Borel spaces: the linear programming formulation', J. of Math. Anal, and Appl., 183, pp. 335-351.
    • (1994) J. of Math. Anal, and Appl , vol.183 , pp. 335-351
    • Hernández-Lerma, O.1    Hernández-Hernández, D.2
  • 86
    • 0028397545 scopus 로고
    • Linear programming and average optimality on Borel spaces-unbounded costs
    • O. Hernández-Lerma and J. B. Lasserre (1994), 'Linear programming and average optimality on Borel spaces-unbounded costs', SIAM J. Control and Optimization, 32, pp. 480-500.
    • (1994) SIAM J. Control and Optimization , vol.32 , pp. 480-500
    • Hernández-Lerma, O.1    Lasserre, J.B.2
  • 89
    • 0004211484 scopus 로고
    • second edition, Mathematical Centre Tracts 51, Mathematisch Centrum, Amsterdam
    • A. Hordijk (1977), Dynamic Programming and Markov Potential Theory, second edition, Mathematical Centre Tracts 51, Mathematisch Centrum, Amsterdam.
    • (1977) Dynamic Programming and Markov Potential Theory
    • Hordijk, A.1
  • 90
    • 0018455841 scopus 로고
    • Linear programming and Markov decision chains
    • A. Hordijk and L. C. M. Kallenberg (1979), 'Linear programming and Markov decision chains', Management Science, 25, pp. 352-362.
    • (1979) Management Science , vol.25 , pp. 352-362
    • Hordijk, A.1    Kallenberg, L.C.M.2
  • 92
    • 0344325087 scopus 로고
    • Linear programming formulation of MDPs in countable state space: The multichain case
    • A. Hordijk and J. B. Lasserre (1994), 'Linear programming formulation of MDPs in countable state space: the multichain case', ZOR-Methods and Models in Operations Research, 40, pp. 91-108.
    • (1994) ZOR-Methods and Models in Operations Research , vol.40 , pp. 91-108
    • Hordijk, A.1    Lasserre, J.B.2
  • 93
    • 0001300801 scopus 로고
    • Constrained admission control to a queuing system
    • A. Hordijk and F. Spieksma (1989), 'Constrained admission control to a queuing system', Advances of Applied Probability, 21, pp. 409-431.
    • (1989) Advances of Applied Probability , vol.21 , pp. 409-431
    • Hordijk, A.1    Spieksma, F.2
  • 95
    • 0026260718 scopus 로고
    • Optimal decentralized flow control of Markovian queueing networks with multiple controllers
    • M. T. Hsiao and A. A. Lazar (1991), 'Optimal decentralized flow control of Markovian queueing networks with multiple controllers', Performance Evaluation, 13, pp. 181-204.
    • (1991) Performance Evaluation , vol.13 , pp. 181-204
    • Hsiao, M.T.1    Lazar, A.A.2
  • 96
    • 0001324526 scopus 로고
    • On finding optimal policies for Markov decision chains: A unifying framework for mean-variance tradeoffs
    • Y. Huang and L. C. M. Kallenberg (1994), 'On finding optimal policies for Markov decision chains: A unifying framework for mean-variance tradeoffs', Math, of Operations Research, 19, pp. 434-448.
    • (1994) Math, of Operations Research , vol.19 , pp. 434-448
    • Huang, Y.1    Kallenberg, L.C.M.2
  • 97
    • 84931901113 scopus 로고
    • On randomized policies and mixtures of deterministic policies in dynamic programming
    • D. Kadelka (1983), 'On randomized policies and mixtures of deterministic policies in dynamic programming', Methods of Operations Research, 46, pp. 67-75.
    • (1983) Methods of Operations Research , vol.46 , pp. 67-75
    • Kadelka, D.1
  • 99
    • 0141754345 scopus 로고
    • Survey of linear programming for standard and nonstandard Markovian control problems, Part I: Theory
    • L. C. M. Kallenberg (1994), 'Survey of linear programming for standard and nonstandard Markovian control problems, Part I: Theory', ZOR-Methods and Models in Operations Research, 40, pp. 1-42.
    • (1994) ZOR-Methods and Models in Operations Research , vol.40 , pp. 1-42
    • Kallenberg, L.C.M.1
  • 100
    • 0020828785 scopus 로고
    • Uniform convergence of convex optimization problems
    • P. Kanniappan and S. M. A. Sastry (1974), 'Uniform convergence of convex optimization problems', J. Math. Anal. Appli., 96, pp. 1-12.
    • (1974) J. Math. Anal. Appli , vol.96 , pp. 1-12
    • Kanniappan, P.1    Sastry, S.M.A.2
  • 101
    • 0023384370 scopus 로고
    • A variance minimization problem for a Markov decision process
    • H. Kawai (1987), 'A variance minimization problem for a Markov decision process', European Journal of Operations Research, 31, pp. 140-145.
    • (1987) European Journal of Operations Research , vol.31 , pp. 140-145
    • Kawai, H.1
  • 103
  • 104
    • 0014736910 scopus 로고
    • A Markovian model for hospital admission and scheduling
    • P. Kolesar (1970), 'A Markovian model for hospital admission and scheduling', Management Science, 16, pp. 384-396.
    • (1970) Management Science , vol.16 , pp. 384-396
    • Kolesar, P.1
  • 105
    • 0348129472 scopus 로고
    • (translation: Stochastic dynamic programming with additional constraints) Master thesis, Leiden University, The Netherlands
    • G. M. Koole (1988), Stochastische Dynamische Programmering met Bi-jvoorwaarden (translation: Stochastic dynamic programming with additional constraints) Master thesis, Leiden University, The Netherlands.
    • (1988) Stochastische Dynamische Programmering met Bi-jvoorwaarden
    • Koole, G.M.1
  • 106
    • 84976754886 scopus 로고
    • On the existence of equilibria in noncooperative optimal flow control
    • Y. A. Korilis and A. Lazar (1995a), 'On the existence of equilibria in noncooperative optimal flow control', J. of the Association for Computing Machinery, 42, No. 3, pp. 584-613.
    • (1995) J. of the Association for Computing Machinery , vol.42 , Issue.3 , pp. 584-613
    • Korilis, Y.A.1    Lazar, A.2
  • 109
    • 0010704427 scopus 로고
    • On extreme points of regularly convex sets
    • M. Krein and D. Milman (1940), 'On extreme points of regularly convex sets', Studia Math., 9, pp. 133-138.
    • (1940) Studia Math , vol.9 , pp. 133-138
    • Krein, M.1    Milman, D.2
  • 110
    • 0013260916 scopus 로고
    • Once more about the connection between elliptic operators and Ito's stochastic equations
    • Steklov Seminar 1984 (Krylov N. et al., Eds.), Optimization Software, New York
    • N. Krylov (1985), 'Once more about the connection between elliptic operators and Ito's stochastic equations', Statistics and Control of Stochastic Processes, Steklov Seminar 1984 (Krylov N. et al., Eds.), Optimization Software, New York, 69-101.
    • (1985) Statistics and Control of Stochastic Processes , pp. 69-101
    • Krylov, N.1
  • 111
    • 0000619048 scopus 로고
    • Extensive games and the problem of information
    • H. W. Kuhn (1953), 'Extensive games and the problem of information', Ann. Math. Stud., 28, pp. 193-216.
    • (1953) Ann. Math. Stud , vol.28 , pp. 193-216
    • Kuhn, H.W.1
  • 112
    • 1942437630 scopus 로고
    • Mathematical programming and the control of Markov chains
    • H. Kushner and J. Kleinman (1971), 'Mathematical programming and the control of Markov chains', Internat. J. Control, 13, pp. 801-820.
    • (1971) Internat. J. Control , vol.13 , pp. 801-820
    • Kushner, H.1    Kleinman, J.2
  • 113
    • 38149145972 scopus 로고
    • Average optimal stationary policies and linear programming in ∞untable state Markov decision processes
    • J. B. Lasserre (1994), 'Average optimal stationary policies and linear programming in ∞untable state Markov decision processes', J. Math. Anal. Appl., 183, pp. 233-249.
    • (1994) J. Math. Anal. Appl , vol.183 , pp. 233-249
    • Lasserre, J.B.1
  • 114
    • 0020847468 scopus 로고
    • Optimal flow control of a class of queuing networks in equilibrium
    • A. Lazar (1983), 'Optimal flow control of a class of queuing networks in equilibrium', IEEE Transactions on Automatic Control, 28, pp. 1001-1007.
    • (1983) IEEE Transactions on Automatic Control , vol.28 , pp. 1001-1007
    • Lazar, A.1
  • 115
    • 0000157908 scopus 로고
    • Convergences of marginal functions with dependent constraints
    • M. B. Lignota and J. Morgan (1992), 'Convergences of marginal functions with dependent constraints', Optimization, 23, pp. 189-213.
    • (1992) Optimization , vol.23 , pp. 189-213
    • Lignota, M.B.1    Morgan, J.2
  • 116
    • 0038546908 scopus 로고
    • Convergence of minima of integral functionals, with applications to optimal control and stochastic optimization
    • R. Lucchetti and R. J. B Wets (1993), 'Convergence of minima of integral functionals, with applications to optimal control and stochastic optimization', Statistics and Decisions, 11, pp. 69-84.
    • (1993) Statistics and Decisions , vol.11 , pp. 69-84
    • Lucchetti, R.1    Wets, R.J.B.2
  • 117
    • 0024174033 scopus 로고
    • A class of steering policies under a recurrence condition
    • Austin, TX, December
    • D.-J. Ma and A. M. Makowski (1988), 'A class of steering policies under a recurrence condition', 27th IEEE Conference on Decision and Control, Austin, TX, December, pp. 1192-1197.
    • (1988) 27th IEEE Conference on Decision and Control , pp. 1192-1197
    • Ma, D.-J.1    Makowski, A.M.2
  • 118
    • 0009403054 scopus 로고
    • A class of two-dimensional stochastic approximations and steering policies for Markov decision processes
    • Tucson, Arizona
    • D.-J. Ma and A. M. Makowski (1992), 'A class of two-dimensional stochastic approximations and steering policies for Markov decision processes', 31st IEEE Conference on Decision and Control, Tucson, Arizona, pp. 3344-3349.
    • (1992) 31st IEEE Conference on Decision and Control , pp. 3344-3349
    • Ma, D.-J.1    Makowski, A.M.2
  • 120
    • 0020098295 scopus 로고
    • Optimal fixed frame multiplexing in integrated line-and packet-switched Commrmication networks
    • B. Maglaris and M. Schwartz (1982), 'Optimal fixed frame multiplexing in integrated line-and packet-switched Commrmication networks', IEEE Transactions on Information Theory, IT-28, pp. 263-273.
    • (1982) IEEE Transactions on Information Theory , pp. 263-273
    • Maglaris, B.1    Schwartz, M.2
  • 122
    • 0026953213 scopus 로고
    • Stochastic approximations and adaptive control of a discrete-time single server network with random routing
    • A. M. Makowski and A. Shwartz (1992), 'Stochastic approximations and adaptive control of a discrete-time single server network with random routing', SIAM J. Control and Optimization, 30, pp. 1476-1506.
    • (1992) SIAM J. Control and Optimization , vol.30 , pp. 1476-1506
    • Makowski, A.M.1    Shwartz, A.2
  • 123
    • 0001257766 scopus 로고
    • Linear programming and sequential decisions
    • A. S. Manne (1960), 'Linear programming and sequential decisions', Management Science, 6, pp. 259-267.
    • (1960) Management Science , vol.6 , pp. 259-267
    • Manne, A.S.1
  • 125
    • 0242291491 scopus 로고
    • Optimal priority assignment with hard constraint
    • P. Nain and K. W. Ross (1986), 'Optimal priority assignment with hard constraint', Transactions on Automatic Control, 31, pp. 883-888.
    • (1986) Transactions on Automatic Control , vol.31 , pp. 883-888
    • Nain, P.1    Ross, K.W.2
  • 126
    • 0022045624 scopus 로고
    • Existence of equilibrium stationary strategies in discounted noncooperative stochastic games with uncountable state space
    • A. S. Nowak (1985), 'Existence of equilibrium stationary strategies in discounted noncooperative stochastic games with uncountable state space', JOTA, 45, pp. 592-602.
    • (1985) JOTA , vol.45 , pp. 592-602
    • Nowak, A.S.1
  • 128
    • 0039420966 scopus 로고
    • Control of random sequences in problems with constraints
    • translated from Russian
    • A. B. Piunovskiy (1993), 'Control of random sequences in problems with constraints', Theory Probab. Appl., 38, No. 4, translated from Russian.
    • (1993) Theory Probab. Appl , vol.38 , Issue.4
    • Piunovskiy, A.B.1
  • 129
    • 0028419924 scopus 로고
    • Control of jump-like processes in constrained problems
    • Translated into English in Automation and Remote Control, 55, No. 4, 1994
    • A. B. Piunovskiy (1994), 'Control of jump-like processes in constrained problems', Avtomatika i Telemekhanika, 4, pp. 75-89. Translated into English in Automation and Remote Control, 55, No. 4, 1994.
    • (1994) Avtomatika i Telemekhanika , vol.4 , pp. 75-89
    • Piunovskiy, A.B.1
  • 130
    • 84929075517 scopus 로고
    • Multicriteria control problems for stochastic jump processes
    • Rome, Italy, September
    • A. B. Piunovskiy (1995), 'Multicriteria control problems for stochastic jump processes', Proceedings of 3rd European Control Conference, Rome, Italy, September, pp. 492-495.
    • (1995) Proceedings of 3rd European Control Conference , pp. 492-495
    • Piunovskiy, A.B.1
  • 131
    • 85126416597 scopus 로고    scopus 로고
    • A multicriteria model of optimal control of a stochastic linear system
    • A. B. Piunovskiy (1996), 'A multicriteria model of optimal control of a stochastic linear system', Automation and Remote Control, 57, No. 6, Part 1, pp. 831-842.
    • (1996) Automation and Remote Control , vol.57 , Issue.6 , pp. 831-842
    • Piunovskiy, A.B.1
  • 133
    • 0031488520 scopus 로고    scopus 로고
    • Optimal control of stochastic sequences in sequences with constraints
    • A. B. Piunovskiy (1997b), 'Optimal control of stochastic sequences in sequences with constraints', Stochastic Analysis and Applications, No. 2.
    • (1997) Stochastic Analysis and Applications , Issue.2
    • Piunovskiy, A.B.1
  • 136
    • 0003737306 scopus 로고
    • North-Holland, Amsterdam, The Netherlands
    • D. Revuz (1975), Markov Chains, North-Holland, Amsterdam, The Netherlands.
    • (1975) Markov Chains
    • Revuz, D.1
  • 137
    • 0003450909 scopus 로고
    • Society for Industrial and Applied Mathematics, 2nd printing, Philadelphia
    • R. T. Rockafellar (1989), Conjugate Duality and Optimization, Society for Industrial and Applied Mathematics, 2nd printing, Philadelphia.
    • (1989) Conjugate Duality and Optimization
    • Rockafellar, R.T.1
  • 138
    • 0024664332 scopus 로고
    • Randomized and past-dependent policies for Markov decision processes with multiple constraints
    • K. W. Ross (1989), 'Randomized and past-dependent policies for Markov decision processes with multiple constraints', Operations Research, 37, pp. 474-477.
    • (1989) Operations Research , vol.37 , pp. 474-477
    • Ross, K.W.1
  • 139
    • 0023981745 scopus 로고
    • Optimal scheduling of interactive and non-interactive traffic in telecommunication systems
    • K. W. Ross and B. Chen (1988), 'Optimal scheduling of interactive and non-interactive traffic in telecommunication systems', IEEE Transactions on Automatic Control, 33, pp. 261-267.
    • (1988) IEEE Transactions on Automatic Control , vol.33 , pp. 261-267
    • Ross, K.W.1    Chen, B.2
  • 140
    • 0024737381 scopus 로고
    • Markov decision processes with sample path constraints: The communicating case
    • K. Ross and R. Varadarajan (1989), 'Markov decision processes with sample path constraints: the communicating case', Operations Research, 37, pp. 780-790.
    • (1989) Operations Research , vol.37 , pp. 780-790
    • Ross, K.1    Varadarajan, R.2
  • 141
    • 0001172487 scopus 로고
    • Multichain Markov decision processes with a sample path constraint: A decomposition approach
    • K. Ross and R. Varadarajan (1991), 'Multichain Markov decision processes with a sample path constraint: a decomposition approach', Math, of Operations Research, 16, pp. 195-207.
    • (1991) Math, of Operations Research , vol.16 , pp. 195-207
    • Ross, K.1    Varadarajan, R.2
  • 142
    • 0003655416 scopus 로고
    • 3rd edition, Macmillan Publishing Company, New York
    • H. L. Royden (1988), Real Analysis, 3rd edition, Macmillan Publishing Company, New York.
    • (1988) Real Analysis
    • Royden, H.L.1
  • 143
    • 0001295069 scopus 로고
    • Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal
    • M. Schäl (1975), 'Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal', Z. Wahrscheinlichkeitstheorie und verw. Geb., 32, pp. 179-196.
    • (1975) Z. Wahrscheinlichkeitstheorie und verw. Geb , vol.32 , pp. 179-196
    • Schäl, M.1
  • 144
    • 0001821168 scopus 로고
    • Estimation and control in discounted dynamic programming
    • M. Schäl (1987), 'Estimation and control in discounted dynamic programming', Stochastics, 20, pp. 51-71.
    • (1987) Stochastics , vol.20 , pp. 51-71
    • Schäl, M.1
  • 145
    • 0003336976 scopus 로고
    • Pointwise versions of the maximum theorem with applications to optimization
    • I. E. Schochetman (1990), 'Pointwise versions of the maximum theorem with applications to optimization', Appl. Math. Lett., 3, pp. 89-92.
    • (1990) Appl. Math. Lett , vol.3 , pp. 89-92
    • Schochetman, I.E.1
  • 146
    • 0001971284 scopus 로고
    • Convergence of selections with applications in optimization
    • 278-242
    • L E. Schochetman and R. L. Smith (1991), 'Convergence of selections with applications in optimization', J. Math. Anal. Appl., 155, pp. 278-242.
    • (1991) J. Math. Anal. Appl , vol.155
    • Schochetman, L.E.1    Smith, R.L.2
  • 147
    • 0024702152 scopus 로고
    • Average cost optimal stationary policies in average cost Markov decision processes
    • L. I. Sennott (1989), 'Average cost optimal stationary policies in average cost Markov decision processes', Operations Research, 37, pp. 626-633.
    • (1989) Operations Research , vol.37 , pp. 626-633
    • Sennott, L.I.1
  • 150
    • 5244249297 scopus 로고    scopus 로고
    • On computing average optimal policies with appli-cation to routing to parallel queues
    • L. I. Sennott (1997), 'On computing average optimal policies with appli-cation to routing to parallel queues', Mathematical Methods of Operations Research, 45, pp. 45-62.
    • (1997) Mathematical Methods of Operations Research , vol.45 , pp. 45-62
    • Sennott, L.I.1
  • 153
    • 0022189863 scopus 로고
    • Maximal mean/standard deviation ratio in undiscounted MDP
    • M. J. Sobel (1985), 'Maximal mean/standard deviation ratio in undiscounted MDP', OR Letters, 4, pp. 157-159.
    • (1985) OR Letters , vol.4 , pp. 157-159
    • Sobel, M.J.1
  • 154
    • 0009598143 scopus 로고
    • Mean-variance tradeoffs in an undiscounted MDP
    • M. J. Sobel (1994), 'Mean-variance tradeoffs in an undiscounted MDP, Operations Research, 42, pp. 175-188.
    • (1994) Operations Research , vol.42 , pp. 175-188
    • Sobel, M.J.1
  • 156
    • 0026930388 scopus 로고
    • Some comments on a theorem of Hardy and Littlewood
    • R. Sznadjer and J. A. Filar (1992), 'Some comments on a theorem of Hardy and Littlewood', J. Optim. Theory Appl., 75, pp. 210-218.
    • (1992) J. Optim. Theory Appl , vol.75 , pp. 210-218
    • Sznadjer, R.1    Filar, J.A.2
  • 157
    • 3142650709 scopus 로고
    • Finite state approximation algorithms for average cost denumerable state Markov decision processes
    • L. C. Thomas and D. Stengos (1985), 'Finite state approximation algorithms for average cost denumerable state Markov decision processes', OR Spectrum, 7, pp. 27-37.
    • (1985) OR Spectrum , vol.7 , pp. 27-37
    • Thomas, L.C.1    Stengos, D.2
  • 158
    • 0029771680 scopus 로고    scopus 로고
    • Approximations in dynamic zerosum games, I
    • M. Tidball and E. Altman (1996a), 'Approximations in dynamic zerosum games, I', SIAM J. Control and Optimization, 34, No. 1, pp. 311-328.
    • (1996) SIAM J. Control and Optimization , vol.34 , Issue.1 , pp. 311-328
    • Tidball, M.1    Altman, E.2
  • 159
    • 1942533658 scopus 로고    scopus 로고
    • Continuity of optimal values and solutions of convex optimization, and constrained control of Markov chains
    • M. Tidball, O. Pourtallier and E. Altman (1996b), 'Continuity of optimal values and solutions of convex optimization, and constrained control of Markov chains', submitted to SIAM J. Control and Optimization.
    • (1996) SIAM J. Control and Optimization
    • Tidball, M.1    Pourtallier, O.2    Altman, E.3
  • 161
    • 0023092939 scopus 로고
    • Flow control protocols for integrated networks with partially observed voice traffic
    • F. Vakil and A. A. Lazar (1987), 'Flow control protocols for integrated networks with partially observed voice traffic', IEEE Transactions on Automatic Control, AC-32, pp. 2-14.
    • (1987) IEEE Transactions on Automatic Control , pp. 2-14
    • Vakil, F.1    Lazar, A.A.2
  • 162
    • 0141824325 scopus 로고
    • Mathematical Centre Tract 139, Mathematisch Centrum, Amsterdam
    • J. Van Der Wal (1981a), Stochastic Dynamic Programming, Mathematical Centre Tract 139, Mathematisch Centrum, Amsterdam.
    • (1981) Stochastic Dynamic Programming
    • van der Wal, J.1
  • 164
    • 0141716097 scopus 로고
    • Markov Games with unbounded rewards
    • M. Schäl (Editor) Bonner Mathematische Schriften, Nr. 98, Bonn
    • J. Wessels (1977), 'Markov Games with unbounded rewards', Dynamische Optimierung, M. Schäl (Editor) Bonner Mathematische Schriften, Nr. 98, Bonn.
    • (1977) Dynamische Optimierung
    • Wessels, J.1
  • 165
    • 4243079533 scopus 로고
    • Finite state approximations for denumerable state infinite horizon discounted Markov decision Processes
    • D. J. White (1980), 'Finite state approximations for denumerable state infinite horizon discounted Markov decision Processes', J. Mathematical Analysis and Applications, 74, pp. 292-295.
    • (1980) J. Mathematical Analysis and Applications , vol.74 , pp. 292-295
    • White, D.J.1
  • 166
    • 4243171133 scopus 로고
    • Finite state approximations for denumerable state infinite horizon discounted Markov decision processes with unbounded rewards
    • D. J. White (1982), 'Finite state approximations for denumerable state infinite horizon discounted Markov decision processes with unbounded rewards', J. Mathematical Analysis and Applications, 86, pp. 292-306.
    • (1982) J. Mathematical Analysis and Applications , vol.86 , pp. 292-306
    • White, D.J.1
  • 167
    • 0000594684 scopus 로고
    • Utility, probabilistic constraints, mean variance of discounted rewards in Markov decision processes
    • D. J. White (1987), 'Utility, probabilistic constraints, mean variance of discounted rewards in Markov decision processes', OR Spectrum, 9, pp. 13-22.
    • (1987) OR Spectrum , vol.9 , pp. 13-22
    • White, D.J.1
  • 168
    • 1942501821 scopus 로고
    • A mathematical programming approach to a problem in variance penalized Markov decision processes
    • D. J. White (1994), 'A mathematical programming approach to a problem in variance penalized Markov decision processes', OR Spectrum, 15, pp. 225-230.
    • (1994) OR Spectrum , vol.15 , pp. 225-230
    • White, D.J.1
  • 169
    • 0017997986 scopus 로고
    • Approximations of dynamic programs
    • W. Whitt (1978), 'Approximations of dynamic programs, Γ, Mathematics of Operations Research, 3, No. 3, pp. 231-243.
    • (1978) Mathematics of Operations Research , vol.3 , Issue.3 , pp. 231-243
    • Whitt, W.1
  • 170
    • 0018925119 scopus 로고
    • Representation and approximation of noncooperative sequential games
    • W. Whitt (1980), 'Representation and approximation of noncooperative sequential games', SIAM J. Control and Opt., 18, No. 1, pp. 33-43.
    • (1980) SIAM J. Control and Opt , vol.18 , Issue.1 , pp. 33-43
    • Whitt, W.1
  • 173
    • 0004906569 scopus 로고
    • On a class of strategies in general Markov decision models
    • A. A. Yushkevich (1973), 'On a class of strategies in general Markov decision models', Theory Probab. Appl., 18, pp. 777-779.
    • (1973) Theory Probab. Appl , vol.18 , pp. 777-779
    • Yushkevich, A.A.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.