Volume 38, Issue 1, 1999, Pages 79-93

Sample-path optimality and variance-minimization of average cost Markov control processes

Author keywords

[No Author keywords available]

Indexed keywords

MARKOV PROCESSES; OPTIMAL CONTROL SYSTEMS; PROCESS CONTROL;

EID: 18544392871     PISSN: 03630129     EISSN: None     Source Type: Journal    
DOI: 10.1137/S0363012998340673     Document Type: Article
Times cited: 53

References (37)
  • 3
    • 0042613487
    • Control of Markov chains with long-run average cost criterion
    • Stochastic Differential Systems, Stochastic Control Theory and Applications, W. Fleming and P.L. Lions, eds., Springer-Verlag, Berlin
    • V.S. BORKAR, Control of Markov chains with long-run average cost criterion, in Stochastic Differential Systems, Stochastic Control Theory and Applications, W. Fleming and P.L. Lions, eds., IMA Vol. Math. Appl. 10, Springer-Verlag, Berlin, 1988, pp. 57-77.
    • (1988) IMA Vol. Math. Appl. , vol.10 , pp. 57-77
    • Borkar, V.S.1
  • 4
    • 0003448964
    • Topics in controlled Markov processes
    • Longman, Harlow, UK
    • V.S. BORKAR, Topics in Controlled Markov Processes, Pitman Res. Notes Math. Ser. 240, Longman, Harlow, UK, 1991.
    • (1991) Pitman Res. Notes Math. Ser. , vol.240
    • Borkar, V.S.1
  • 5
    • 0004864592
    • Denumerable controlled Markov chains with average reward criterion: Sample path optimality
    • R. CAVAZOS-CADENA AND E. FERNÁNDEZ-GAUCHERAND, Denumerable controlled Markov chains with average reward criterion: Sample path optimality, Math. Methods Oper. Res., 41 (1995), pp. 89-108.
    • (1995) Math. Methods Oper. Res. , vol.41 , pp. 89-108
    • Cavazos-Cadena, R.1    Fernández-Gaucherand, E.2
  • 6
    • 0001128053
    • Recurrence conditions for average and Blackwell optimality in denumerable state Markov decision chains
    • R. DEKKER AND A. HORDIJK, Recurrence conditions for average and Blackwell optimality in denumerable state Markov decision chains, Math. Oper. Res., 17 (1992), pp. 271-289.
    • (1992) Math. Oper. Res. , vol.17 , pp. 271-289
    • Dekker, R.1    Hordijk, A.2
  • 9
    • 0030522182
    • A Lyapunov bound for solutions of the Poisson equation
    • P.W. GLYNN AND S.P. MEYN, A Lyapunov bound for solutions of the Poisson equation, Ann. Probab., 24 (1996), pp. 916-931.
    • (1996) Ann. Probab. , vol.24 , pp. 916-931
    • Glynn, P.W.1    Meyn, S.P.2
  • 10
    • 0345073172
    • Envelopes of sets of measures, tightness, and Markov control processes
    • J. GONZÁLEZ-HERNÁNDEZ AND O. HERNÁNDEZ-LERMA, Envelopes of sets of measures, tightness, and Markov control processes, Appl. Math. Optim., 40 (1999), pp. 377-392.
    • (1999) Appl. Math. Optim. , vol.40 , pp. 377-392
    • González-Hernández, J.1    Hernández-Lerma, O.2
  • 12
    • 0001259333
    • Average cost Markov control processes with weighted norms: Existence of canonical policies
    • E. GORDIENKO AND O. HERNÁNDEZ-LERMA, Average cost Markov control processes with weighted norms: Existence of canonical policies, Appl. Math. (Warsaw), 23 (1995), pp. 199-218.
    • (1995) Appl. Math. (Warsaw) , vol.23 , pp. 199-218
    • Gordienko, E.1    Hernández-Lerma, O.2
  • 13
    • 0001677238
    • Average cost Markov control processes with weighted norms: Value iteration
    • E. GORDIENKO AND O. HERNÁNDEZ-LERMA, Average cost Markov control processes with weighted norms: Value iteration, Appl. Math. (Warsaw), 23 (1995), pp. 219-237.
    • (1995) Appl. Math. (Warsaw) , vol.23 , pp. 219-237
    • Gordienko, E.1    Hernández-Lerma, O.2
  • 17
    • 0008657932
    • Policy iteration for average cost Markov control processes on Borel spaces
    • O. HERNÁNDEZ-LERMA AND J.B. LASSERRE, Policy iteration for average cost Markov control processes on Borel spaces, Acta Appl. Math., 47 (1997), pp. 125-154.
    • (1997) Acta Appl. Math. , vol.47 , pp. 125-154
    • Hernández-Lerma, O.1    Lasserre, J.B.2
  • 18
    • 0000563004
    • Infinite-horizon Markov control processes with undiscounted cost criteria: From average to overtaking optimality
    • O. HERNÁNDEZ-LERMA AND O. VEGA-AMAYA, Infinite-horizon Markov control processes with undiscounted cost criteria: From average to overtaking optimality, Appl. Math. (Warsaw), 25 (1998), pp. 153-178.
    • (1998) Appl. Math. (Warsaw) , vol.25 , pp. 153-178
    • Hernández-Lerma, O.1    Vega-Amaya, O.2
  • 20
    • 0040888873
    • Blackwell optimality in the class of stationary policies in Markov decision chains with a Borel state space and unbounded rewards
    • A. HORDIJK AND A.A. YUSHKEVICH, Blackwell optimality in the class of stationary policies in Markov decision chains with a Borel state space and unbounded rewards, Math. Methods Oper. Res., 49 (1999), pp. 1-39.
    • (1999) Math. Methods Oper. Res. , vol.49 , pp. 1-39
    • Hordijk, A.1    Yushkevich, A.A.2
  • 21
    • 0001324526
    • On finding optimal policies for Markov decision chains: A unifying framework for mean-variance tradeoffs
    • Y. HUANG AND L.C.M. KALLENBERG, On finding optimal policies for Markov decision chains: A unifying framework for mean-variance tradeoffs, Math. Oper. Res., 19 (1994), pp. 434-448.
    • (1994) Math. Oper. Res. , vol.19 , pp. 434-448
    • Huang, Y.1    Kallenberg, L.C.M.2
  • 24
    • 45949117079
    • Markov decision processes with a minimum-variance criterion
    • M. KURANO, Markov decision processes with a minimum-variance criterion, J. Math. Anal. Appl., 123 (1987), pp. 572-583.
    • (1987) J. Math. Anal. Appl. , vol.123 , pp. 572-583
    • Kurano, M.1
  • 26
    • 0033326571
    • Sample-path average optimality for Markov control processes
    • J.B. LASSERRE, Sample-path average optimality for Markov control processes, IEEE Trans. Automat. Control, 44 (1999), pp. 1966-1971.
    • (1999) IEEE Trans. Automat. Control , vol.44 , pp. 1966-1971
    • Lasserre, J.B.1
  • 28
    • 0002029988
    • On the variance in controlled Markov chains
    • P. MANDL, On the variance in controlled Markov chains, Kybernetika (Prague), 7 (1971), pp. 1-12.
    • (1971) Kybernetika (Prague) , vol.7 , pp. 1-12
    • Mandl, P.1
  • 29
    • 0000977770
    • A connection between controlled Markov chains and martingales
    • P. MANDL, A connection between controlled Markov chains and martingales, Kybernetika (Prague), 9 (1973), pp. 237-241.
    • (1973) Kybernetika (Prague) , vol.9 , pp. 237-241
    • Mandl, P.1
  • 30
    • 0002807389
    • Estimation and control in Markov chains
    • P. MANDL, Estimation and control in Markov chains, Adv. Appl. Probab., 6 (1974), pp. 40-60.
    • (1974) Adv. Appl. Probab. , vol.6 , pp. 40-60
    • Mandl, P.1
  • 35
    • 0004899732
    • Sample-path average optimality of Markov control processes with strictly unbounded cost
    • to appear
    • O. VEGA-AMAYA, Sample-path average optimality of Markov control processes with strictly unbounded cost, Appl. Math. (Warsaw), to appear.
    • Appl. Math. (Warsaw)
    • Vega-Amaya, O.1
  • 37
    • 0004906569
    • On a class of strategies in general Markov decision models
    • A.A. YUSHKEVICH, On a class of strategies in general Markov decision models, Theory Probab. Appl., 18 (1973), pp. 777-779.
    • (1973) Theory Probab. Appl. , vol.18 , pp. 777-779
    • Yushkevich, A.A.1


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.