



Volume 50, Issue 3, 1999, Pages 421-448

Blackwell optimality in the class of all policies in Markov decision chains with a Borel state space and unbounded rewards

Author keywords

Blackwell optimality; Drift and recurrence conditions; Markov decision chains


EID: 0000296818     PISSN: 14322994     EISSN: None     Source Type: Journal    
DOI: 10.1007/s001860050079     Document Type: Article
Times cited: 25

References (17)
  • 1
    • 0012793161
    • Average, sensitive and Blackwell optimal policies in denumerable Markov decision chains with unbounded rewards
    • Dekker R, Hordijk A (1988) Average, sensitive and Blackwell optimal policies in denumerable Markov decision chains with unbounded rewards. Math Oper Res 13:395-420
    • (1988) Math Oper Res , vol.13 , pp. 395-420
    • Dekker, R.1    Hordijk, A.2
  • 2
    • 0001128053
    • Recurrence conditions for average and Blackwell optimality in denumerable state Markov decision chains
    • Dekker R, Hordijk A (1992) Recurrence conditions for average and Blackwell optimality in denumerable state Markov decision chains. Math Oper Res 17:271-289
    • (1992) Math Oper Res , vol.17 , pp. 271-289
    • Dekker, R.1    Hordijk, A.2
  • 3
    • 0012218373
    • On the relation between recurrence and ergodicity properties in denumerable Markov decision chains
    • Dekker R, Hordijk A, Spieksma FM (1994) On the relation between recurrence and ergodicity properties in denumerable Markov decision chains. Math Oper Res 19:1-21
    • (1994) Math Oper Res , vol.19 , pp. 1-21
    • Dekker, R.1    Hordijk, A.2    Spieksma, F.M.3
  • 4
    • 0039340205
    • Time discretization for controlled Markov processes Part II: A jump and diffusion application
    • Van Dijk NM, Hordijk A (1996) Time discretization for controlled Markov processes Part II: A jump and diffusion application. Kybernetika (Prague) 32:139-158
    • (1996) Kybernetika , vol.32 , pp. 139-158
    • Van Dijk, N.M.1    Hordijk, A.2
  • 6
    • 0006238280
    • Recurrence conditions for Markov decision processes with Borel state space: A survey
    • Hernández-Lerma O, Montes-de-Oca R, Cavazos-Cadena R (1991) Recurrence conditions for Markov decision processes with Borel state space: a survey. Ann Oper Res 28:29-46
    • (1991) Ann Oper Res , vol.28 , pp. 29-46
    • Hernández-Lerma, O.1    Montes-de-Oca, R.2    Cavazos-Cadena, R.3
  • 7
    • 0346799579
    • Sensitive optimality criteria in countable state dynamic programming
    • Hordijk A, Sladký K (1977) Sensitive optimality criteria in countable state dynamic programming. Math Oper Res 2:1-14
    • (1977) Math Oper Res , vol.2 , pp. 1-14
    • Hordijk, A.1    Sladký, K.2
  • 8
    • 0001144425
    • On ergodicity and recurrence properties of a Markov chain with an application to an open Jackson network
    • Hordijk A, Spieksma FM (1992) On ergodicity and recurrence properties of a Markov chain with an application to an open Jackson network. Adv Appl Prob 24:343-376
    • (1992) Adv Appl Prob , vol.24 , pp. 343-376
    • Hordijk, A.1    Spieksma, F.M.2
  • 10
    • 0040888873
    • Blackwell optimality in the class of stationary policies in Markov decision chains with a Borel state space and unbounded rewards
    • Hordijk A, Yushkevich AA (1999) Blackwell optimality in the class of stationary policies in Markov decision chains with a Borel state space and unbounded rewards. Math Meth Oper Res 49:1-39
    • (1999) Math Meth Oper Res , vol.49 , pp. 1-39
    • Hordijk, A.1    Yushkevich, A.A.2
  • 12
    • 0000006155
    • The policy improvement algorithm for Markov decision processes with general state space
    • Meyn SP (1997) The policy improvement algorithm for Markov decision processes with general state space. Transactions on Automatic Control AC-42:191-196
    • (1997) Transactions on Automatic Control , vol.AC-42 , pp. 191-196
    • Meyn, S.P.1
  • 14
    • 0015985471
    • On the set of optimal controls for Markov chains with rewards
    • Sladký K (1974) On the set of optimal controls for Markov chains with rewards. Kybernetika (Prague) 10:350-367
    • (1974) Kybernetika , vol.10 , pp. 350-367
    • Sladký, K.1
  • 15
    • 0000576225
    • Negative dynamic programming
    • Strauch RE (1966) Negative dynamic programming. Ann Math Stat 37:871-890
    • (1966) Ann Math Stat , vol.37 , pp. 871-890
    • Strauch, R.E.1
  • 16
    • 0010878540
    • Blackwell optimal policies in a Markov decision process with a Borel state space
    • Yushkevich AA (1994) Blackwell optimal policies in a Markov decision process with a Borel state space. Z Oper Res 40:253-288
    • (1994) Z Oper Res , vol.40 , pp. 253-288
    • Yushkevich, A.A.1
  • 17
    • 0031268177
    • Blackwell optimality in Borelian continuous in action Markov decision processes
    • Yushkevich AA (1997) Blackwell optimality in Borelian continuous in action Markov decision processes. SIAM J Control 35:2157-2182
    • (1997) SIAM J Control , vol.35 , pp. 2157-2182
    • Yushkevich, A.A.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.