메뉴 건너뛰기




Volumn 4539 LNAI, Issue , 2007, Pages 263-277

Bounded parameter Markov decision processes with average reward criterion

Author keywords

[No Author keywords available]

Indexed keywords

BRANCH AND BOUND METHOD; CONVERGENCE OF NUMERICAL METHODS; DECISION SUPPORT SYSTEMS; FUNCTION EVALUATION; OPTIMAL SYSTEMS; PARAMETER ESTIMATION;

EID: 38049021455     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-540-72927-3_20     Document Type: Conference Paper
Times cited : (36)

References (8)
  • 1
    • 0034272032 scopus 로고    scopus 로고
    • Bounded-parameter Markov decision processes
    • Givan, R., Leach, S., Dean, T.: Bounded-parameter Markov decision processes. Artificial Intelligence 122, 71-109 (2000)
    • (2000) Artificial Intelligence , vol.122 , pp. 71-109
    • Givan, R.1    Leach, S.2    Dean, T.3
  • 3
    • 56449090814 scopus 로고    scopus 로고
    • Logarithmic online regret bounds for undiscounted reinforcement learning
    • MIT Press, Cambridge , to appear
    • Auer, P., Ortner, R.: Logarithmic online regret bounds for undiscounted reinforcement learning. In: dvances in Neural Information Processing Systems 19, MIT Press, Cambridge (2007) (to appear)
    • (2007) dvances in Neural Information Processing Systems , vol.19
    • Auer, P.1    Ortner, R.2
  • 4
    • 0041965975 scopus 로고    scopus 로고
    • R-MAX - a general polynomial time algorithm for near-optimal reinforcement learning
    • Brafman, R.I., Tennenholtz, M.: R-MAX - a general polynomial time algorithm for near-optimal reinforcement learning. Journal of Machine Learning Research 3, 213-231 (2002)
    • (2002) Journal of Machine Learning Research , vol.3 , pp. 213-231
    • Brafman, R.I.1    Tennenholtz, M.2
  • 5
    • 38049054483 scopus 로고    scopus 로고
    • Convergence of optimistic and incremental Q-learning
    • MIT Press, Cambridge
    • Even-Dar, E., Mansour, Y.: Convergence of optimistic and incremental Q-learning. In: Advances in Neural Information Processing Systems 14, pp. 1499-1506. MIT Press, Cambridge (2001)
    • (2001) Advances in Neural Information Processing Systems , vol.14 , pp. 1499-1506
    • Even-Dar, E.1    Mansour, Y.2
  • 6
    • 14344250395 scopus 로고    scopus 로고
    • Robust control of Markov decision processes with uncertain transition matrices
    • Nilim, A., El Ghaoui, L.: Robust control of Markov decision processes with uncertain transition matrices. Operations Research 53, 780-798 (2005)
    • (2005) Operations Research , vol.53 , pp. 780-798
    • Nilim, A.1    El Ghaoui, L.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.