SCOPUS 정보 검색 플랫폼

Acta Applicandae Mathematicae

Volumn 42, Issue 2, 1996, Pages 203-222

Value iteration in average cost Markov control processes on Borel spaces

(2) Montes de Oca, Raúl a Hernández Lerma, Onésimo b

a DEPARTAMENTO DE QUÍMICA (Mexico)

b DEPARTAMENTO DE FÍSICA (Mexico)

Author keywords

Average cost; Markov control (or decision) processes; Value iteration (or successive approximations)

Indexed keywords

EID: 0346144338 PISSN: 01678019 EISSN: None Source Type: Journal
DOI: 10.1007/BF00047169 Document Type: Article

Times cited : (10)

References (42)

1
- 0027557742
- Controlled Markov processes with an average cost criterion: A survey
- Arapostathis, A., Borkar, V., Fernández-Gaucherand, E., Ghosh, M. K., and Marcus, S. I.: Controlled Markov processes with an average cost criterion: a survey, SIAM J. Control Optim. 31 (1993), 282-344.
- (1993) SIAM J. Control Optim. , vol.31 , pp. 282-344
- Arapostathis, A.¹ Borkar, V.² Fernández-Gaucherand, E.³ Ghosh, M.K.⁴ Marcus, S.I.⁵

2
- 0003944893
- Academic Press, New York
- Ash, R. B.: Real Analysis and Probability, Academic Press, New York, 1972.
- (1972) Real Analysis and Probability
- Ash, R.B.¹

3
- 0003565779
- Prentice-Hall, Englewood Cliffs, N.J.
- Bertsekas, D. P.: Dynamic Programming: Deterministic and Stochastic Models, Prentice-Hall, Englewood Cliffs, N.J., 1987.
- (1987) Dynamic Programming: Deterministic and Stochastic Models
- Bertsekas, D.P.¹

4
- 0007375022
- Probabilistic properties of the general nonlinear Markovian process of order one and applications to time series modelling
- Laboratoire de Statistique Théorique et Appliquée, CNRS-URA 1321, Université Paris VI
- Diebolt, J. and Guégan, D.: Probabilistic properties of the general nonlinear Markovian process of order one and applications to time series modelling, Rapport Technique No. 125, Laboratoire de Statistique Théorique et Appliquée, CNRS-URA 1321, Université Paris VI, 1990.
- (1990) Rapport Technique No. 125
- Diebolt, J.¹ Guégan, D.²

5
- 0003535772
- Allyn and Bacon, Boston
- Dugundji, J.: Topology, Allyn and Bacon, Boston, 1966.
- (1966) Topology
- Dugundji, J.¹

6
- 0003634432
- Springer-Verlag, Berlin
- Dynkin, E. B. and Yushkevich, A. A.: Controlled Markov Processes, Springer-Verlag, Berlin, 1979.
- (1979) Controlled Markov Processes
- Dynkin, E.B.¹ Yushkevich, A.A.²

7
- 12144281448
- Convex stochastic control problems
- Tucson, AZ, Dec.
- Fernández-Gaucherand, E., Marcus, S. I., and Arapostathis, A.: Convex stochastic control problems, Proc. 31st IEEE-CDC, Tucson, AZ, Dec. 1992.
- (1992) Proc. 31st IEEE-CDC
- Fernández-Gaucherand, E.¹ Marcus, S.I.² Arapostathis, A.³

8
- 84983602577
- On optimality criteria for dynamic programs with long finite horizons
- Flynn, J.: On optimality criteria for dynamic programs with long finite horizons, J. Math. Anal. Appl. 76 (1980), 202-208.
- (1980) J. Math. Anal. Appl. , vol.76 , pp. 202-208
- Flynn, J.¹

9
- 24944525736
- Contrôle de chaînes de Markov sur des espaces arbitraires
- Georgin, J.-P.: Contrôle de chaînes de Markov sur des espaces arbitraires, Ann. Inst. Henri Poincaré, Sect. B 23 (1978), 255-277.
- (1978) Ann. Inst. Henri Poincaré, Sect. B , vol.23 , pp. 255-277
- Georgin, J.-P.¹

10
- 0001677238
- Average cost Markov control processes with weighted norms: Existence of canonical policies
- Reporte Interno # 157, Departamento de Matemáticas, CINVESTAV-IPN, to appear
- Gordienko, E. and Hernández-Lerma, O.: Average cost Markov control processes with weighted norms: existence of canonical policies, Reporte Interno # 157, Departamento de Matemáticas, CINVESTAV-IPN, 1994, to appear in Applic. Math.
- (1994) Applic. Math.
- Gordienko, E.¹ Hernández-Lerma, O.²

11
- 0001677238
- Average cost Markov control processes with weighted norms: Value iteration
- Reporte Interno # 158, Departamento de Matemáticas, CINVESTAV-IPN, to appear
- Gordienko, E. and Hernández-Lerma, O.: Average cost Markov control processes with weighted norms: value iteration; Reporte Interno # 158, Departamento de Matemáticas, CINVESTAV-IPN, to appear in Applic. Math. 23 (1995).
- (1995) Applic. Math. , vol.23
- Gordienko, E.¹ Hernández-Lerma, O.²

12
- 0003952172
- Springer-Verlag, New York
- Hernández-Lerma, O.: Adaptive Markov Control Processes, Springer-Verlag, New York, 1989.
- (1989) Adaptive Markov Control Processes
- Hernández-Lerma, O.¹

13
- 0026220873
- Average optimality in dynamic programming on Borel spaces - Unbounded costs and controls
- Hernández-Lerma, O.: Average optimality in dynamic programming on Borel spaces - unbounded costs and controls, Systems Control Lett. 17 (1991), 237-242.
- (1991) Systems Control Lett. , vol.17 , pp. 237-242
- Hernández-Lerma, O.¹

14
- 2342594962
- Existence of average optimal policies in Markov control processes with strictly unbounded costs
- Hernández-Lerma, O.: Existence of average optimal policies in Markov control processes with strictly unbounded costs, Kybernetika (Prague) 29 (1993), 1-17.
- (1993) Kybernetika (Prague) , vol.29 , pp. 1-17
- Hernández-Lerma, O.¹

15
- 0346361230
- Density estimation and adaptive control of Markov processes: Average and discounted criteria
- Hernández-Lerma, O. and Cavazos-Cadena, R.: Density estimation and adaptive control of Markov processes: average and discounted criteria, Acta Appl. Math. 20 (1990), 285-307.
- (1990) Acta Appl. Math. , vol.20 , pp. 285-307
- Hernández-Lerma, O.¹ Cavazos-Cadena, R.²

16
- 0009258232
- Average cost Markov decision processes: Optimality conditions
- Hernández-Lerma, O., Hennet, J. C., and Lasserre, J. B.: Average cost Markov decision processes: optimality conditions, J. Math. Anal. Appl. 158 (1991), 396-406.
- (1991) J. Math. Anal. Appl. , vol.158 , pp. 396-406
- Hernández-Lerma, O.¹ Hennet, J.C.² Lasserre, J.B.³

17
- 0002471077
- Value iteration and rolling plans for Markov control processes with unbounded rewards
- Hernández-Lerma, O. and Lasserre, J. B.: Value iteration and rolling plans for Markov control processes with unbounded rewards, J. Math. Anal. Appl. 177 (1993), 38-55.
- (1993) J. Math. Anal. Appl. , vol.177 , pp. 38-55
- Hernández-Lerma, O.¹ Lasserre, J.B.²

18
- 0028397545
- Linear programming and average optimality of Markov control processes on Borel spaces - Unbounded costs
- Hernández-Lerma, O. and Lasserre, J. B.: Linear programming and average optimality of Markov control processes on Borel spaces - unbounded costs, SIAM J. Control Optim. 32 (1994), 480-500.
- (1994) SIAM J. Control Optim. , vol.32 , pp. 480-500
- Hernández-Lerma, O.¹ Lasserre, J.B.²

19
- 24944498800
- Invariant probabilities for Feller-Markov chains
- Reporte Interno No. 122, CINVESTAV-IPN, México, to appear
- Hernández-Lerma, O. and Lasserre, J. B.: Invariant probabilities for Feller-Markov chains, Reporte Interno No. 122, CINVESTAV-IPN, México, 1993, to appear in J. Appl. Math. Stoch. Anal.
- (1993) J. Appl. Math. Stoch. Anal.
- Hernández-Lerma, O.¹ Lasserre, J.B.²

20
- 38249023041
- Discretization procedures for adaptive Markov control processes
- Hernández-Lerma, O. and Marcus, S. I.: Discretization procedures for adaptive Markov control processes, J. Math. Anal. Appl. 137 (1989), 485-514.
- (1989) J. Math. Anal. Appl. , vol.137 , pp. 485-514
- Hernández-Lerma, O.¹ Marcus, S.I.²

21
- 0006238280
- Recurrence conditions for Markov decision processes with Borel state space: A survey
- Hernández-Lerma, O., Montes-de-Oca, R., and Cavazos-Cadena, R.: Recurrence conditions for Markov decision processes with Borel state space: a survey, Ann. Oper. Res. 28 (1991), 29-46.
- (1991) Ann. Oper. Res. , vol.28 , pp. 29-46
- Hernández-Lerma, O.¹ Montes-de-Oca, R.² Cavazos-Cadena, R.³

22
- 0008620275
- Discrete-time Markov control processes with discounted unbounded costs: Optimality criteria
- Hernández-Lerma, O. and Muñoz de Ozak, M.: Discrete-time Markov control processes with discounted unbounded costs: optimality criteria, Kybernetika (Prague) 28 (1992), 191-212.
- (1992) Kybernetika (Prague) , vol.28 , pp. 191-212
- Hernández-Lerma, O.¹ Muñoz De Ozak, M.²

23
- 0001930797
- Monotone approximations for convex stochastic control problems
- Hernández-Lerma, O. and Runggaldier, J. W.: Monotone approximations for convex stochastic control problems, J. Math. Systems, Estimation Control 4 (1994), 99-140.
- (1994) J. Math. Systems, Estimation Control , vol.4 , pp. 99-140
- Hernández-Lerma, O.¹ Runggaldier, J.W.²

24
- 0003434766
- Foundations of Non-Stationary Dynamic Programming with Discrete Time Parameter
- Springer-Verlag, New York
- Hinderer, K.: Foundations of Non-Stationary Dynamic Programming with Discrete Time Parameter, Lecture Notes Oper. Res. 33, Springer-Verlag, New York, 1970.
- (1970) Lecture Notes Oper. Res. 33
- Hinderer, K.¹

25
- 84986786266
- Sur un modèle autorégressif nonlinéare: Ergodicité et ergodicité géométrique
- Mokkadem, A.: Sur un modèle autorégressif nonlinéare: ergodicité et ergodicité géométrique, J. Time Series Anal. 8 (1987), 195-205.
- (1987) J. Time Series Anal. , vol.8 , pp. 195-205
- Mokkadem, A.¹

26
- 0028439286
- The average cost optimality equation for Markov control processes on Borel spaces
- Montes-de-Oca, R.: The average cost optimality equation for Markov control processes on Borel spaces, Systems Control Lett. 22 (1994), 351-357.
- (1994) Systems Control Lett. , vol.22 , pp. 351-357
- Montes-de-Oca, R.¹

27
- 24944524878
- Conditions for average optimality in Markov control processes with unbounded costs and controls
- Montes-de-Oca, R. and Hernández-Lerma, O.: Conditions for average optimality in Markov control processes with unbounded costs and controls, J. Math. Systems, Estimation Control 4 (1994), 1-19.
- (1994) J. Math. Systems, Estimation Control , vol.4 , pp. 1-19
- Montes-de-Oca, R.¹ Hernández-Lerma, O.²

28
- 1942524845
- Conditions for average optimality in Markov control processes on Borel spaces
- Montes-de-Oca, R., Minjárez-Sosa, J. A., and Hernández-Lerma, O.: Conditions for average optimality in Markov control processes on Borel spaces, Boletín Soc. Mat. Mexicana 39 (1994), 39-50.
- (1994) Boletín Soc. Mat. Mexicana , vol.39 , pp. 39-50
- Montes-de-Oca, R.¹ Minjárez-Sosa, J.A.² Hernández-Lerma, O.³

29
- 0003677479
- Van Nostrand-Reinhold, London
- Orey, S.: Limit Theorems for Markov Chains Transition Probabilities, Van Nostrand-Reinhold, London, 1971.
- (1971) Limit Theorems for Markov Chains Transition Probabilities
- Orey, S.¹

30
- 0003998452
- Wiley, New York
- Puterman, M. L.: Markov Decision Processes, Wiley, New York, 1994.
- (1994) Markov Decision Processes
- Puterman, M.L.¹

31
- 24944580792
- Measurable selection theorems for optimization problems
- Rieder, U.: Measurable selection theorems for optimization problems, Manuscripta Math. 24 (1978), 507-518.
- (1978) Manuscripta Math. , vol.24 , pp. 507-518
- Rieder, U.¹

32
- 24944527417
- Preprint, University of Ulm
- Rieder, U.: On non-discounted dynamic programming with arbitrary state space, Preprint, University of Ulm, 1979.
- (1979) On Non-discounted Dynamic Programming with Arbitrary State Space
- Rieder, U.¹

33
- 0003655416
- Macmillan, New York
- Royden, H. L.: Real Analysis, 3rd. edn, Macmillan, New York, 1968.
- (1968) Real Analysis, 3rd. Edn
- Royden, H.L.¹

34
- 0001295069
- Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal
- Schäl, M.: Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal, Z. Wahrsch. verw. Geb. 32 (1975), 179-196.
- (1975) Z. Wahrsch. Verw. Geb. , vol.32 , pp. 179-196
- Schäl, M.¹

35
- 0007153543
- Average optimality in dynamic programming with general state space
- Schäl, M.: Average optimality in dynamic programming with general state space, Math. Oper. Res. 18 (1993), 163-172.
- (1993) Math. Oper. Res. , vol.18 , pp. 163-172
- Schäl, M.¹

36
- 0040504235
- Value iteration in countable state average cost Markov decision processes with unbounded costs
- Sennott, L. I.: Value iteration in countable state average cost Markov decision processes with unbounded costs, Ann. Oper. Res. 29 (1991), 261-271.
- (1991) Ann. Oper. Res. , vol.29 , pp. 261-271
- Sennott, L.I.¹

37
- 84975961356
- The average cost optimality equation and critical number policies
- Sennott, L. I.: The average cost optimality equation and critical number policies, Prob. Eng. Inform. Sci. 7 (1993), 47-67.
- (1993) Prob. Eng. Inform. Sci. , vol.7 , pp. 47-67
- Sennott, L.I.¹

38
- 24944554849
- Report BW 55/75, Mathematisch Centrum, Amsterdam
- Tijms, H. C.: On dynamic programming with arbitrary state space, compact action space and the average reward as criterion, Report BW 55/75, Mathematisch Centrum, Amsterdam, 1975.
- (1975) On Dynamic Programming with Arbitrary State Space, Compact Action Space and the Average Reward as Criterion
- Tijms, H.C.¹

39
- 0003361166
- Invariant measures for Markov chains with no irreducibility assumptions
- Tweedie, R. L.: Invariant measures for Markov chains with no irreducibility assumptions, J. Appl. Prob. 25A (1988), 275-285.
- (1988) J. Appl. Prob. , vol.25 A , pp. 275-285
- Tweedie, R.L.¹

40
- 0003122592
- Dynamic programming, Markov chains, and the method of successive approximations
- White, D. J.: Dynamic programming, Markov chains, and the method of successive approximations, J. Math. Anal. Appl. 6 (1963), 373-376.
- (1963) J. Math. Anal. Appl. , vol.6 , pp. 373-376
- White, D.J.¹

41
- 24944520826
- Existence of average optimal strategies in markovian decision problems with strictly unbounded costs
- M. L. Puterman (ed.), Academic Press, New York
- Wijngaard, J.: Existence of average optimal strategies in markovian decision problems with strictly unbounded costs, in: M. L. Puterman (ed.), Dynamic Programming and Its Applications, Academic Press, New York, 1978, pp. 369-386.
- (1978) Dynamic Programming and Its Applications , pp. 369-386
- Wijngaard, J.¹

42
- 0004906569
- On a class of strategies in general Markov decision models
- Yushkevich, A. A.: On a class of strategies in general Markov decision models, Theory Probab. Appl. 18 (1973), 777-779.
- (1973) Theory Probab. Appl. , vol.18 , pp. 777-779
- Yushkevich, A.A.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.