SCOPUS Information Retrieval Platform

SIAM Journal on Control and Optimization

Volume 34, Issue 6, 1996, Pages 1848-1873

Value iteration in a class of communicating Markov decision chains with the average cost criterion

a UNIVERSIDAD AUTÓNOMA AGRARIA ANTONIO NARRO (Mexico)

Author keywords

Almost monotone cost function; Average cost criterion; Markov decision chains; Pointwise convergence; Value iteration scheme

Indexed keywords

CONTROL SYSTEM ANALYSIS; CONTROL THEORY; FUNCTIONS; ITERATIVE METHODS; MARKOV PROCESSES; STATE SPACE METHODS; AVERAGE COST CRITERION; POINTWISE CONVERGENCE; CONTROL

EID: 0030290475; PISSN: 03630129; EISSN: None; Source Type: Journal
DOI: 10.1137/S1064827590192863; Document Type: Article

Times cited: (9)

References (30)

1
- 0020780366
- Controlled Markov chains and stochastic networks
- V. S. BORKAR, Controlled Markov chains and stochastic networks, SIAM J. Control Optim., 21 (1983), pp. 652-666.
- (1983) SIAM J. Control Optim. , vol.21 , pp. 652-666
- Borkar, V.S.¹

2
- 0021520222
- On minimum cost per unit of time control of Markov chains
- _, On minimum cost per unit of time control of Markov chains, SIAM J. Control Optim., 22 (1984), pp. 965-978.
- (1984) SIAM J. Control Optim. , vol.22 , pp. 965-978

3
- 0039451036
- Weak conditions for the existence of average optimal stationary policies in average Markov decision chains with unbounded costs
- R. CAVAZOS-CADENA, Weak conditions for the existence of average optimal stationary policies in average Markov decision chains with unbounded costs, Kybernetika, 25 (1989), pp. 145-156.
- (1989) Kybernetika , vol.25 , pp. 145-156
- Cavazos-Cadena, R.¹

4
- 84975985090
- Solution to the optimality equation in a class of Markov decision chains with the average cost criterion
- _, Solution to the optimality equation in a class of Markov decision chains with the average cost criterion, Kybernetika, 27 (1991), pp. 26-37.
- (1991) Kybernetika , vol.27 , pp. 26-37

5
- 0007472843
- Recent results on conditions for the existence of average optimal stationary policies
- _, Recent results on conditions for the existence of average optimal stationary policies, Ann. Oper. Res., 28 (1991), pp. 3-28.
- (1991) Ann. Oper. Res. , vol.28 , pp. 3-28

6
- 0026811487
- Comparing recent assumptions for the existence of optimal stationary policies
- R. CAVAZOS-CADENA AND L. I. SENNOTT, Comparing recent assumptions for the existence of optimal stationary policies, Oper. Res. Lett., 11 (1992), pp. 33-37.
- (1992) Oper. Res. Lett. , vol.11 , pp. 33-37
- Cavazos-Cadena, R.¹ Sennott, L.I.²

7
- 0029771207
- Undiscounted value iteration in stable Markov decision chains with bounded rewards
- R. CAVAZOS-CADENA, Undiscounted value iteration in stable Markov decision chains with bounded rewards, J. Math. Systems Estim. Control, 6 (1994), pp. 243-246.
- (1994) J. Math. Systems Estim. Control , vol.6 , pp. 243-246
- Cavazos-Cadena, R.¹

8
- 6244255673
- Cesàro convergence of the undiscounted value iteration method in Markov decision processes under the Lyapunov stability condition
- _, Cesàro convergence of the undiscounted value iteration method in Markov decision processes under the Lyapunov stability condition, Bol. Soc. Mat. Mexicana (2), 38 (1993), pp. 33-46.
- (1993) Bol. Soc. Mat. Mexicana , vol.38 , Issue.2 , pp. 33-46

9
- 85033028046
- Report 03-91-DEC, Universidad Autónoma Agraria Antonio Narro, Saltillo Coah, México
- _, Value Iteration in Controlled Markov Chains with Penalized Costs: Cesàro Convergence Results, Report 03-91-DEC, Universidad Autónoma Agraria Antonio Narro, Saltillo Coah, México, 1991.
- (1991) Value Iteration in Controlled Markov Chains with Penalized Costs: Cesàro Convergence Results

10
- 0040898816
- Value iteration in stable Markov decision chains with unbounded costs: Necessary and sufficient conditions for pointwise convergence
- to appear
- R. CAVAZOS-CADENA AND E. FERNÁNDEZ-GAUCHERAND, Value iteration in stable Markov decision chains with unbounded costs: Necessary and sufficient conditions for pointwise convergence, J. Appl. Probab., (1996), to appear.
- (1996) J. Appl. Probab.
- Cavazos-Cadena, R.¹ Fernández-Gaucherand, E.²

11
- 0003535772
- Allyn and Bacon, Boston
- J. DUGUNDJI, Topology, Allyn and Bacon, Boston, 1966.
- (1966) Topology
- Dugundji, J.¹

12
- 0040504241
- A survey of asymptotic value iteration for undiscounted Markovian decision processes
- R. Hartley, L. C. Thomas, and D. J. White, eds., Academic Press, New York
- A. FEDERGRUEN AND P. J. SCHWEITZER, A survey of asymptotic value iteration for undiscounted Markovian decision processes, in Recent Developments in Markov Decision Processes, R. Hartley, L. C. Thomas, and D. J. White, eds., Academic Press, New York, 1980.
- (1980) Recent Developments in Markov Decision Processes
- Federgruen, A.¹ Schweitzer, P.J.²

13
- 38249014767
- On strong average optimality of Markov decision processes with unbounded costs
- M. K. GHOSH AND S. I. MARCUS, On strong average optimality of Markov decision processes with unbounded costs, Oper. Res. Lett., 11 (1992), pp. 99-104.
- (1992) Oper. Res. Lett. , vol.11 , pp. 99-104
- Ghosh, M.K.¹ Marcus, S.I.²

14
- 0003952172
- Springer-Verlag, New York
- O. HERNÁNDEZ-LERMA, Adaptive Markov Control Processes, Springer-Verlag, New York, 1989.
- (1989) Adaptive Markov Control Processes
- Hernández-Lerma, O.¹

15
- 0003265935
- Foundations of Non-Stationary Dynamic Programming with Discrete Time Parameter
- Springer-Verlag, New York
- K. HINDERER, Foundations of Non-Stationary Dynamic Programming with Discrete Time Parameter, Lecture Notes in Oper. Res. 33, Springer-Verlag, New York, 1970.
- (1970) Lecture Notes in Oper. Res. , vol.33
- Hinderer, K.¹

16
- 0004211484
- Mathematical Centre Tract 51, Mathematisch Centrum, Amsterdam
- A. HORDIJK, Dynamic Programming and Markov Potential Theory, Mathematical Centre Tract 51, Mathematisch Centrum, Amsterdam, 1974.
- (1974) Dynamic Programming and Markov Potential Theory
- Hordijk, A.¹

17
- 0016521921
- The asymptotic behavior of the minimal total expected cost for the denumerable state Markov decision model
- A. HORDIJK, P. J. SCHWEITZER, AND H. C. TIJMS, The asymptotic behavior of the minimal total expected cost for the denumerable state Markov decision model, J. Appl. Probab., 12 (1975), pp. 298-305.
- (1975) J. Appl. Probab. , vol.12 , pp. 298-305
- Hordijk, A.¹ Schweitzer, P.J.² Tijms, H.C.³

18
- 0003981320
- Springer-Verlag, New York
- M. LOÈVE, Probability Theory I, Springer-Verlag, New York, 1978.
- (1978) Probability Theory , vol.1
- Loève, M.¹

19
- 0346144338
- Value iteration in average cost Markov control processes on Borel spaces
- R. MONTES-DE-OCA AND O. HERNÁNDEZ-LERMA, Value iteration in average cost Markov control processes on Borel spaces, Acta Appl. Math., 42 (1996), pp. 203-221.
- (1996) Acta Appl. Math. , vol.42 , pp. 203-221
- Montes-De-Oca, R.¹ Hernández-Lerma, O.²

20
- 0003998452
- Wiley, New York
- M. L. PUTERMAN, Markov Decision Processes, Wiley, New York, 1994.
- (1994) Markov Decision Processes
- Puterman, M.L.¹

21
- 0003655416
- Macmillan, New York
- H. L. ROYDEN, Real Analysis, 2nd ed., Macmillan, New York, 1968.
- (1968) Real Analysis, 2nd Ed.
- Royden, H.L.¹

22
- 0003644137
- Holden-Day, San Francisco
- S. M. ROSS, Applied Probability Models with Optimization Applications, Holden-Day, San Francisco, 1970.
- (1970) Applied Probability Models with Optimization Applications
- Ross, S.M.¹

23
- 0024702152
- Average cost optimal stationary policies in infinite state Markov decision processes with unbounded costs
- L. I. SENNOTT, Average cost optimal stationary policies in infinite state Markov decision processes with unbounded costs, Oper. Res., 37 (1989), pp. 626-633.
- (1989) Oper. Res. , vol.37 , pp. 626-633
- Sennott, L.I.¹

24
- 0040504235
- Value iteration in countable state Markov decision processes with unbounded costs
- _, Value iteration in countable state Markov decision processes with unbounded costs, Ann. Oper. Res., 28 (1991), pp. 261-272.
- (1991) Ann. Oper. Res. , vol.28 , pp. 261-272

25
- 6244232295
- submitted
- _, The convergence of value iteration in average cost Markov decision chains, (1994), submitted.
- (1994) The Convergence of Value Iteration in Average Cost Markov Decision Chains

26
- 0011109733
- The asymptotic behavior of undiscounted value iteration in Markov decision processes
- P. J. SCHWEITZER AND A. FEDERGRUEN, The asymptotic behavior of undiscounted value iteration in Markov decision processes, Math. Oper. Res., 2 (1977), pp. 360-381.
- (1977) Math. Oper. Res. , vol.2 , pp. 360-381
- Schweitzer, P.J.¹ Federgruen, A.²

27
- 0015080430
- Iterative solution of the functional equations for undiscounted Markov renewal programming
- P. J. SCHWEITZER, Iterative solution of the functional equations for undiscounted Markov renewal programming, J. Math. Anal. Appl., 34 (1971), pp. 495-501.
- (1971) J. Math. Anal. Appl. , vol.34 , pp. 495-501
- Schweitzer, P.J.¹

28
- 0003286366
- Connectedness conditions for denumerable state Markov decision processes
- R. Hartley, L. C. Thomas, and D. J. White, eds., Academic Press, New York
- L. C. THOMAS, Connectedness conditions for denumerable state Markov decision processes, in Recent Developments in Markov Decision Processes, R. Hartley, L. C. Thomas, and D. J. White, eds., Academic Press, New York, 1980.
- (1980) Recent Developments in Markov Decision Processes
- Thomas, L.C.¹

29
- 0000048705
- Optimal control of service rates in networks of queues
- R. R. WEBER AND S. STIDHAM, Optimal control of service rates in networks of queues, Advances in Applied Probability, 19 (1987), pp. 202-218.
- (1987) Advances in Applied Probability , vol.19 , pp. 202-218
- Weber, R.R.¹ Stidham, S.²

30
- 0003122592
- Dynamic programming, Markov chains, and the method of successive approximations
- D. J. WHITE, Dynamic programming, Markov chains, and the method of successive approximations, J. Math. Anal. Appl., (1963), pp. 373-376.
- (1963) J. Math. Anal. Appl. , pp. 373-376
- White, D.J.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.