SCOPUS Information Retrieval Platform

Volume 4539 LNAI, 2007, Pages 263-277

Bounded parameter Markov decision processes with average reward criterion

Author keywords

[No Author keywords available]

Indexed keywords

BRANCH AND BOUND METHOD; CONVERGENCE OF NUMERICAL METHODS; DECISION SUPPORT SYSTEMS; FUNCTION EVALUATION; OPTIMAL SYSTEMS; PARAMETER ESTIMATION;

BLACKWELL OPTIMAL POLICIES; MARKOV DECISION PROCESS (MDP); OPTIMAL VALUE FUNCTIONS; BOUNDED PARAMETER MARKOV DECISION PROCESSES (BMDP);

MARKOV PROCESSES;

EID: 38049021455 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-540-72927-3_20 Document Type: Conference Paper

Times cited: 36

References (8)

1
- 0034272032
- Bounded-parameter Markov decision processes
- Givan, R., Leach, S., Dean, T.: Bounded-parameter Markov decision processes. Artificial Intelligence 122, 71-109 (2000)
- (2000) Artificial Intelligence , vol.122 , pp. 71-109
- Givan, R.; Leach, S.; Dean, T.

4
- 0041965975
- R-MAX - a general polynomial time algorithm for near-optimal reinforcement learning
- Brafman, R.I., Tennenholtz, M.: R-MAX - a general polynomial time algorithm for near-optimal reinforcement learning. Journal of Machine Learning Research 3, 213-231 (2002)
- (2002) Journal of Machine Learning Research , vol.3 , pp. 213-231
- Brafman, R.I.; Tennenholtz, M.

6
- 14344250395
- Robust control of Markov decision processes with uncertain transition matrices
- Nilim, A., El Ghaoui, L.: Robust control of Markov decision processes with uncertain transition matrices. Operations Research 53, 780-798 (2005)
- (2005) Operations Research , vol.53 , pp. 780-798
- Nilim, A.; El Ghaoui, L.

7
- 0003565783
- Dynamic Programming and Optimal Control
- Bertsekas, D.P.: Dynamic Programming and Optimal Control. Vol. 2. Athena Scientific, Belmont, MA (1995)
- (1995) Dynamic Programming and Optimal Control , vol.2
- Bertsekas, D.P.

8
- 0031070051
- Optimal adaptive policies for Markov decision processes
- Burnetas, A.N., Katehakis, M.N.: Optimal adaptive policies for Markov decision processes. Mathematics of Operations Research 22, 222-255 (1997)
- (1997) Mathematics of Operations Research , vol.22 , pp. 222-255
- Burnetas, A.N.; Katehakis, M.N.

* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.