Volume 42, Issue 2, 1996, Pages 203-222

Value iteration in average cost Markov control processes on Borel spaces

Author keywords

Average cost; Markov control (or decision) processes; Value iteration (or successive approximations)


EID: 0346144338     PISSN: 01678019     EISSN: None     Source Type: Journal    
DOI: 10.1007/BF00047169     Document Type: Article
Times cited: 10

References (42)
  • 4
    • 0007375022
    • Probabilistic properties of the general nonlinear Markovian process of order one and applications to time series modelling
    • Laboratoire de Statistique Théorique et Appliquée, CNRS-URA 1321, Université Paris VI
    • Diebolt, J. and Guégan, D.: Probabilistic properties of the general nonlinear Markovian process of order one and applications to time series modelling, Rapport Technique No. 125, Laboratoire de Statistique Théorique et Appliquée, CNRS-URA 1321, Université Paris VI, 1990.
    • (1990) Rapport Technique No. 125
    • Diebolt, J.1    Guégan, D.2
  • 5
    • 0003535772
    • Allyn and Bacon, Boston
    • Dugundji, J.: Topology, Allyn and Bacon, Boston, 1966.
    • (1966) Topology
    • Dugundji, J.1
  • 8
    • 84983602577
    • On optimality criteria for dynamic programs with long finite horizons
    • Flynn, J.: On optimality criteria for dynamic programs with long finite horizons, J. Math. Anal. Appl. 76 (1980), 202-208.
    • (1980) J. Math. Anal. Appl. , vol.76 , pp. 202-208
    • Flynn, J.1
  • 9
    • 24944525736
    • Contrôle de chaînes de Markov sur des espaces arbitraires
    • Georgin, J.-P.: Contrôle de chaînes de Markov sur des espaces arbitraires, Ann. Inst. Henri Poincaré, Sect. B 23 (1978), 255-277.
    • (1978) Ann. Inst. Henri Poincaré, Sect. B , vol.23 , pp. 255-277
    • Georgin, J.-P.1
  • 10
    • 0001677238
    • Average cost Markov control processes with weighted norms: Existence of canonical policies
    • Reporte Interno # 157, Departamento de Matemáticas, CINVESTAV-IPN, to appear
    • Gordienko, E. and Hernández-Lerma, O.: Average cost Markov control processes with weighted norms: existence of canonical policies, Reporte Interno # 157, Departamento de Matemáticas, CINVESTAV-IPN, 1994, to appear in Applic. Math.
    • (1994) Applic. Math.
    • Gordienko, E.1    Hernández-Lerma, O.2
  • 11
    • 0001677238
    • Average cost Markov control processes with weighted norms: Value iteration
    • Reporte Interno # 158, Departamento de Matemáticas, CINVESTAV-IPN, to appear
    • Gordienko, E. and Hernández-Lerma, O.: Average cost Markov control processes with weighted norms: value iteration; Reporte Interno # 158, Departamento de Matemáticas, CINVESTAV-IPN, to appear in Applic. Math. 23 (1995).
    • (1995) Applic. Math. , vol.23
    • Gordienko, E.1    Hernández-Lerma, O.2
  • 13
    • 0026220873
    • Average optimality in dynamic programming on Borel spaces - Unbounded costs and controls
    • Hernández-Lerma, O.: Average optimality in dynamic programming on Borel spaces - unbounded costs and controls, Systems Control Lett. 17 (1991), 237-242.
    • (1991) Systems Control Lett. , vol.17 , pp. 237-242
    • Hernández-Lerma, O.1
  • 14
    • 2342594962
    • Existence of average optimal policies in Markov control processes with strictly unbounded costs
    • Hernández-Lerma, O.: Existence of average optimal policies in Markov control processes with strictly unbounded costs, Kybernetika (Prague) 29 (1993), 1-17.
    • (1993) Kybernetika (Prague) , vol.29 , pp. 1-17
    • Hernández-Lerma, O.1
  • 15
    • 0346361230
    • Density estimation and adaptive control of Markov processes: Average and discounted criteria
    • Hernández-Lerma, O. and Cavazos-Cadena, R.: Density estimation and adaptive control of Markov processes: average and discounted criteria, Acta Appl. Math. 20 (1990), 285-307.
    • (1990) Acta Appl. Math. , vol.20 , pp. 285-307
    • Hernández-Lerma, O.1    Cavazos-Cadena, R.2
  • 16
    • 0009258232
    • Average cost Markov decision processes: Optimality conditions
    • Hernández-Lerma, O., Hennet, J. C., and Lasserre, J. B.: Average cost Markov decision processes: optimality conditions, J. Math. Anal. Appl. 158 (1991), 396-406.
    • (1991) J. Math. Anal. Appl. , vol.158 , pp. 396-406
    • Hernández-Lerma, O.1    Hennet, J.C.2    Lasserre, J.B.3
  • 17
    • 0002471077
    • Value iteration and rolling plans for Markov control processes with unbounded rewards
    • Hernández-Lerma, O. and Lasserre, J. B.: Value iteration and rolling plans for Markov control processes with unbounded rewards, J. Math. Anal. Appl. 177 (1993), 38-55.
    • (1993) J. Math. Anal. Appl. , vol.177 , pp. 38-55
    • Hernández-Lerma, O.1    Lasserre, J.B.2
  • 18
    • 0028397545
    • Linear programming and average optimality of Markov control processes on Borel spaces - Unbounded costs
    • Hernández-Lerma, O. and Lasserre, J. B.: Linear programming and average optimality of Markov control processes on Borel spaces - unbounded costs, SIAM J. Control Optim. 32 (1994), 480-500.
    • (1994) SIAM J. Control Optim. , vol.32 , pp. 480-500
    • Hernández-Lerma, O.1    Lasserre, J.B.2
  • 19
    • 24944498800
    • Invariant probabilities for Feller-Markov chains
    • Reporte Interno No. 122, CINVESTAV-IPN, México, to appear
    • Hernández-Lerma, O. and Lasserre, J. B.: Invariant probabilities for Feller-Markov chains, Reporte Interno No. 122, CINVESTAV-IPN, México, 1993, to appear in J. Appl. Math. Stoch. Anal.
    • (1993) J. Appl. Math. Stoch. Anal.
    • Hernández-Lerma, O.1    Lasserre, J.B.2
  • 20
    • 38249023041
    • Discretization procedures for adaptive Markov control processes
    • Hernández-Lerma, O. and Marcus, S. I.: Discretization procedures for adaptive Markov control processes, J. Math. Anal. Appl. 137 (1989), 485-514.
    • (1989) J. Math. Anal. Appl. , vol.137 , pp. 485-514
    • Hernández-Lerma, O.1    Marcus, S.I.2
  • 21
    • 0006238280
    • Recurrence conditions for Markov decision processes with Borel state space: A survey
    • Hernández-Lerma, O., Montes-de-Oca, R., and Cavazos-Cadena, R.: Recurrence conditions for Markov decision processes with Borel state space: a survey, Ann. Oper. Res. 28 (1991), 29-46.
    • (1991) Ann. Oper. Res. , vol.28 , pp. 29-46
    • Hernández-Lerma, O.1    Montes-de-Oca, R.2    Cavazos-Cadena, R.3
  • 22
    • 0008620275
    • Discrete-time Markov control processes with discounted unbounded costs: Optimality criteria
    • Hernández-Lerma, O. and Muñoz de Ozak, M.: Discrete-time Markov control processes with discounted unbounded costs: optimality criteria, Kybernetika (Prague) 28 (1992), 191-212.
    • (1992) Kybernetika (Prague) , vol.28 , pp. 191-212
    • Hernández-Lerma, O.1    Muñoz De Ozak, M.2
  • 24
    • 0003434766
    • Foundations of Non-Stationary Dynamic Programming with Discrete Time Parameter
    • Springer-Verlag, New York
    • Hinderer, K.: Foundations of Non-Stationary Dynamic Programming with Discrete Time Parameter, Lecture Notes Oper. Res. 33, Springer-Verlag, New York, 1970.
    • (1970) Lecture Notes Oper. Res. 33
    • Hinderer, K.1
  • 25
    • 84986786266
    • Sur un modèle autorégressif non linéaire: Ergodicité et ergodicité géométrique
    • Mokkadem, A.: Sur un modèle autorégressif non linéaire: ergodicité et ergodicité géométrique, J. Time Series Anal. 8 (1987), 195-205.
    • (1987) J. Time Series Anal. , vol.8 , pp. 195-205
    • Mokkadem, A.1
  • 26
    • 0028439286
    • The average cost optimality equation for Markov control processes on Borel spaces
    • Montes-de-Oca, R.: The average cost optimality equation for Markov control processes on Borel spaces, Systems Control Lett. 22 (1994), 351-357.
    • (1994) Systems Control Lett. , vol.22 , pp. 351-357
    • Montes-de-Oca, R.1
  • 27
    • 24944524878
    • Conditions for average optimality in Markov control processes with unbounded costs and controls
    • Montes-de-Oca, R. and Hernández-Lerma, O.: Conditions for average optimality in Markov control processes with unbounded costs and controls, J. Math. Systems, Estimation Control 4 (1994), 1-19.
    • (1994) J. Math. Systems, Estimation Control , vol.4 , pp. 1-19
    • Montes-de-Oca, R.1    Hernández-Lerma, O.2
  • 31
    • 24944580792
    • Measurable selection theorems for optimization problems
    • Rieder, U.: Measurable selection theorems for optimization problems, Manuscripta Math. 24 (1978), 507-518.
    • (1978) Manuscripta Math. , vol.24 , pp. 507-518
    • Rieder, U.1
  • 34
    • 0001295069
    • Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal
    • Schäl, M.: Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal, Z. Wahrsch. verw. Geb. 32 (1975), 179-196.
    • (1975) Z. Wahrsch. Verw. Geb. , vol.32 , pp. 179-196
    • Schäl, M.1
  • 35
    • 0007153543
    • Average optimality in dynamic programming with general state space
    • Schäl, M.: Average optimality in dynamic programming with general state space, Math. Oper. Res. 18 (1993), 163-172.
    • (1993) Math. Oper. Res. , vol.18 , pp. 163-172
    • Schäl, M.1
  • 36
    • 0040504235
    • Value iteration in countable state average cost Markov decision processes with unbounded costs
    • Sennott, L. I.: Value iteration in countable state average cost Markov decision processes with unbounded costs, Ann. Oper. Res. 29 (1991), 261-271.
    • (1991) Ann. Oper. Res. , vol.29 , pp. 261-271
    • Sennott, L.I.1
  • 37
    • 84975961356
    • The average cost optimality equation and critical number policies
    • Sennott, L. I.: The average cost optimality equation and critical number policies, Prob. Eng. Inform. Sci. 7 (1993), 47-67.
    • (1993) Prob. Eng. Inform. Sci. , vol.7 , pp. 47-67
    • Sennott, L.I.1
  • 39
    • 0003361166
    • Invariant measures for Markov chains with no irreducibility assumptions
    • Tweedie, R. L.: Invariant measures for Markov chains with no irreducibility assumptions, J. Appl. Prob. 25A (1988), 275-285.
    • (1988) J. Appl. Prob. , vol.25A , pp. 275-285
    • Tweedie, R.L.1
  • 40
    • 0003122592
    • Dynamic programming, Markov chains, and the method of successive approximations
    • White, D. J.: Dynamic programming, Markov chains, and the method of successive approximations, J. Math. Anal. Appl. 6 (1963), 373-376.
    • (1963) J. Math. Anal. Appl. , vol.6 , pp. 373-376
    • White, D.J.1
  • 41
    • 24944520826
    • Existence of average optimal strategies in Markovian decision problems with strictly unbounded costs
    • M. L. Puterman (ed.), Academic Press, New York
    • Wijngaard, J.: Existence of average optimal strategies in Markovian decision problems with strictly unbounded costs, in: M. L. Puterman (ed.), Dynamic Programming and Its Applications, Academic Press, New York, 1978, pp. 369-386.
    • (1978) Dynamic Programming and Its Applications , pp. 369-386
    • Wijngaard, J.1
  • 42
    • 0004906569
    • On a class of strategies in general Markov decision models
    • Yushkevich, A. A.: On a class of strategies in general Markov decision models, Theory Probab. Appl. 18 (1973), 777-779.
    • (1973) Theory Probab. Appl. , vol.18 , pp. 777-779
    • Yushkevich, A.A.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.