메뉴 건너뛰기




Volumn 27, Issue 3, 2002, Pages 545-566

Achieving target state-action frequencies in multichain average-reward Markov decision processes

Author keywords

Average reward criterion; Constrained Markov decision processes; Markov decision processes; Markov decision processes with nonstandard reward criteria; State action frequencies

Indexed keywords

ALGORITHMS; CONFORMAL MAPPING; CONSTRAINT THEORY; DECISION THEORY; LINEAR PROGRAMMING; PROBLEM SOLVING; SET THEORY; VECTORS;

EID: 0036672648     PISSN: 0364765X     EISSN: None     Source Type: Journal    
DOI: 10.1287/moor.27.3.545.316     Document Type: Article
Times cited : (11)

References (26)
  • 12
    • 0005352809 scopus 로고
    • Constrained undiscounted stochastic dynamic programming
    • (1984) Math. Oper. Res. , vol.2 , pp. 159-217
  • 20
    • 0024664332 scopus 로고
    • Randomized and past-dependent policies in Markov decision processes with multiple constraints
    • (1989) Oper. Res. , vol.37 , pp. 474-477
    • Ross, K.W.1
  • 21
    • 0024737381 scopus 로고
    • Markov decision processes with sample-path constraints: The communicating case
    • (1989) Oper. Res. , vol.37 , pp. 780-790
    • Varadarajan, R.1
  • 22
    • 0001172487 scopus 로고
    • Multichain Markov decision processes with a sample-path constraints: The decomposition approach
    • (1991) Math. Oper. Res. , vol.16 , pp. 195-207
  • 24
    • 0000209621 scopus 로고
    • Dynamic programming and probabilistic constraints
    • (1974) Oper. Res. , vol.22 , pp. 654-664
    • White, D.J.1
  • 25
    • 0023842256 scopus 로고
    • Mean, variance and probabilistic criteria in finite Markov decision processes
    • (1988) J. Optim. Theory Appl. , vol.56 , pp. 1-29


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.