메뉴 건너뛰기




Volumn 34, Issue 6, 1996, Pages 1848-1873

Value iteration in a class of communicating Markov decision chains with the average cost criterion

Author keywords

Almost monotone cost function; Average cost criterion; Markov decision chains; Pointwise convergence; Value iteration scheme

Indexed keywords

CONTROL SYSTEM ANALYSIS; CONTROL THEORY; FUNCTIONS; ITERATIVE METHODS; MARKOV PROCESSES; STATE SPACE METHODS;

EID: 0030290475     PISSN: 03630129     EISSN: None     Source Type: Journal    
DOI: 10.1137/S1064827590192863     Document Type: Article
Times cited : (9)

References (30)
  • 1
    • 0020780366 scopus 로고
    • Controlled Markov chains and stochastic networks
    • V. S. BORKAR, Controlled Markov chains and stochastic networks, SIAM J. Control Optim., 21 (1983), pp. 652-666.
    • (1983) SIAM J. Control Optim. , vol.21 , pp. 652-666
    • Borkar, V.S.1
  • 2
    • 0021520222 scopus 로고
    • On minimum cost per unit of time control of Markov chains
    • _, On minimum cost per unit of time control of Markov chains, SIAM J. Control Optim., 22 (1984), pp. 965-978.
    • (1984) SIAM J. Control Optim. , vol.22 , pp. 965-978
  • 3
    • 0039451036 scopus 로고
    • Weak conditions for the existence of average optimal stationary policies in average Markov decision chains with unbounded costs
    • R. CAVAZOS-CADENA, Weak conditions for the existence of average optimal stationary policies in average Markov decision chains with unbounded costs, Kybernetika, 25 (1989), pp. 145-156.
    • (1989) Kybernetika , vol.25 , pp. 145-156
    • Cavazos-Cadena, R.1
  • 4
    • 84975985090 scopus 로고
    • Solution to the optimality equation in a class of Markov decision chains with the average cost criterion
    • _, Solution to the optimality equation in a class of Markov decision chains with the average cost criterion, Kybernetika, 27 (1991), pp. 26-37.
    • (1991) Kybernetika , vol.27 , pp. 26-37
  • 5
    • 0007472843 scopus 로고
    • Recent results on conditions for the existence of average optimal stationary policies
    • _, Recent results on conditions for the existence of average optimal stationary policies, Ann. Oper. Res., 28 (1991), pp. 3-28.
    • (1991) Ann. Oper. Res. , vol.28 , pp. 3-28
  • 6
    • 0026811487 scopus 로고
    • Comparing recent assumptions for the existence of optimal stationary policies
    • R. CAVAZOS-CADENA AND L. I. SENNOTT, Comparing recent assumptions for the existence of optimal stationary policies, Oper. Res. Lett., 11 (1992), pp. 33-37.
    • (1992) Oper. Res. Lett. , vol.11 , pp. 33-37
    • Cavazos-Cadena, R.1    Sennott, L.I.2
  • 7
    • 0029771207 scopus 로고
    • Undiscounted value iteration in stable Markov decision chains with bounded rewards
    • R. CAVAZOS-CADENA, Undiscounted value iteration in stable Markov decision chains with bounded rewards, J. Math. Systems Estim. Control, 6 (1994), pp. 243-246.
    • (1994) J. Math. Systems Estim. Control , vol.6 , pp. 243-246
    • Cavazos-Cadena, R.1
  • 8
    • 6244255673 scopus 로고
    • Cesàro convergence of the undiscounted value iteration method in Markov decision processes under the Lyapunov stability condition
    • _, Cesàro convergence of the undiscounted value iteration method in Markov decision processes under the Lyapunov stability condition. Bol. Soc. Mat. Mexicana (2), 38 (1993), pp. 33-46.
    • (1993) Bol. Soc. Mat. Mexicana , vol.38 , Issue.2 , pp. 33-46
  • 10
    • 0040898816 scopus 로고    scopus 로고
    • Value iteration in stable Markov decision chains with unbounded costs: Necessary and sufficient conditions for pointwise convergence
    • to appear
    • R. CAVAZOS-CADENA AND E. FERNÁNDEZ-GAUCHERAND, Value iteration in stable Markov decision chains with unbounded costs: Necessary and sufficient conditions for pointwise convergence, J. Appl. Probab., (1996), to appear.
    • (1996) J. Appl. Probab.
    • Cavazos-Cadena, R.1    Fernández-Gaucherand, E.2
  • 11
    • 0003535772 scopus 로고
    • Allyn and Bacon, Boston
    • J. DUGUNDJI, Topology, Allyn and Bacon, Boston, 1966.
    • (1966) Topology
    • Dugundji, J.1
  • 12
    • 0040504241 scopus 로고
    • A survey of asymptotic value iteration for undiscounted Markovian decision processes
    • R. Hartley, L. C. Thomas, and D. J. White, eds., Academic Press, New York
    • A. FEDERGRUEN AND P. J. SCHWEITZER, A survey of asymptotic value iteration for undiscounted Markovian decision processes, in Recent Developments in Markov Decision Processes, R. Hartley, L. C. Thomas, and D. J. White, eds., Academic Press, New York, 1980.
    • (1980) Recent Developments in Markov Decision Processes
    • Federgruen, A.1    Schweitzer, P.J.2
  • 13
    • 38249014767 scopus 로고
    • On strong average optimality of Markov decision processes with unbounded costs
    • M. K. GOSH AND S. I. MARCUS, On strong average optimality of Markov decision processes with unbounded costs, Oper. Res. Lett., 11 (1992), pp. 99-104.
    • (1992) Oper. Res. Lett. , vol.11 , pp. 99-104
    • Gosh, M.K.1    Marcus, S.I.2
  • 15
    • 0003265935 scopus 로고
    • Foundations of Non-Stationary Dynamic Programming with Discrete Time Parameter
    • Springer-Verlag, New York
    • K. HINDERER, Foundations of Non-Stationary Dynamic Programming with Discrete Time Parameter, Lecture Notes in Oper. Res. 33, Springer-Verlag, New York, 1970.
    • (1970) Lecture Notes in Oper. Res. , vol.33
    • Hinderer, K.1
  • 17
    • 0016521921 scopus 로고
    • The asymptotic behavior of the minimal total expected cost for the denumerable state Markov decision model
    • A. HORDIJK, P. J. SCHWEITZER, AND H. C. TIJMS, The asymptotic behavior of the minimal total expected cost for the denumerable state Markov decision model, J. Appl. Probab., 12 (1975), pp. 298-305.
    • (1975) J. Appl. Probab. , vol.12 , pp. 298-305
    • Hordijk, A.1    Schweitzer, P.J.2    Tijms, H.C.3
  • 18
    • 0003981320 scopus 로고
    • Springer-Verlag, New York
    • M. LOÈVE, Probability Theory I, Springer-Verlag, New York, 1978.
    • (1978) Probability Theory , vol.1
    • Loève, M.1
  • 19
    • 0346144338 scopus 로고    scopus 로고
    • Value iteration in average cost Markov control processes on Borel spaces
    • R. MONTES-DE-OCA AND O. HERNÁNDEZ-LERMA, Value iteration in average cost Markov control processes on Borel spaces, Acta Appl. Math., 42 (1996), pp. 203-221.
    • (1996) Acta Appl. Math. , vol.42 , pp. 203-221
    • Montes-De-Oca, R.1    Hernández-Lerma, O.2
  • 23
    • 0024702152 scopus 로고
    • Average cost optimal stationary policies in infinite state Markov decision processes with unbounded costs
    • L. I. SENNOTT, Average cost optimal stationary policies in infinite state Markov decision processes with unbounded costs, Oper. Res., 37 (1989), pp. 626-633.
    • (1989) Oper. Res. , vol.37 , pp. 626-633
    • Sennott, L.I.1
  • 24
    • 0040504235 scopus 로고
    • Value iteration in countable state Markov decision processes with unbounded costs
    • _, Value iteration in countable state Markov decision processes with unbounded costs, Ann. Oper. Res., 28 (1991), pp. 261-272.
    • (1991) Ann. Oper. Res. , vol.28 , pp. 261-272
  • 26
    • 0011109733 scopus 로고
    • The asymptotic behavior of undiscounted value iteration in Markov decision proceses
    • P. J. SCHWEITZER AND A. FEDERGRUEN, The asymptotic behavior of undiscounted value iteration in Markov decision proceses, Math. Oper. Res., 2 (1977), pp. 360-381.
    • (1977) Math. Oper. Res. , vol.2 , pp. 360-381
    • Schweitzer, P.J.1    Federgruen, A.2
  • 27
    • 0015080430 scopus 로고
    • Iterative solution of the functional equations for undiscounted Markov renewal programming
    • P. J. SCHWEITZER, Iterative solution of the functional equations for undiscounted Markov renewal programming, J. Math. Anal. Appl., 34 (1971), pp. 495-501.
    • (1971) J. Math. Anal. Appl. , vol.34 , pp. 495-501
    • Schweitzer, P.J.1
  • 28
    • 0003286366 scopus 로고
    • Connectedness conditions for denumerable state Markov decision processes
    • R. Hartley, L. C. Thomas, and D. J. White, eds., Academic Press, New York
    • L. C. THOMAS, Connectedness conditions for denumerable state Markov decision processes, in Recent Developments in Markov Decision Processes, R. Hartley, L. C. Thomas, and D. J. White, eds., Academic Press, New York, 1980.
    • (1980) Recent Developments in Markov Decision Processes
    • Thomas, L.C.1
  • 29
    • 0000048705 scopus 로고
    • Optimal control of service rates in networks of queues
    • R. R. WEBER AND S. STIDHAM, Optimal control of service rates in networks of queues, Advances in Applied Probability, 19 (1987), pp. 202-218.
    • (1987) Advances in Applied Probability , vol.19 , pp. 202-218
    • Weber, R.R.1    Stidham, S.2
  • 30
    • 0003122592 scopus 로고
    • Dynamic programming, Markov chains, and the method of succesive approximations
    • D. J. WHITE, Dynamic programming, Markov chains, and the method of succesive approximations, J. Math. Anal. Appl., (1963), pp. 373-376.
    • (1963) J. Math. Anal. Appl. , pp. 373-376
    • White, D.J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.