메뉴 건너뛰기




Volumn , Issue , 2009, Pages 198-206

Learning to explore and exploit in POMDPs

Author keywords

[No Author keywords available]

Indexed keywords

ACTIVE LEARNING; AGENT BEHAVIOR; ASSOCIATED COSTS; EXPLORATION AND EXPLOITATION; EXPLORATION/EXPLOITATION; OPTIMAL ACTIONS; PARTIALLY OBSERVABLE ENVIRONMENTS; REINFORCEMENT LEARNINGS; SPECIFIC PROBLEMS; THEORETICAL GUARANTEES;

EID: 80055053199     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (24)

References (10)
  • 2
    • 0041965975 scopus 로고    scopus 로고
    • R-max - A general polynomial time algorithm for near-optimal reinforcement learning
    • OCT
    • R. I. Brafman and M. Tennenholtz. R-max - a general polynomial time algorithm for near-optimal reinforcement learning. Journal of Machine Learning Research, 3(OCT):213-231, 2002.
    • (2002) Journal of Machine Learning Research , vol.3 , pp. 213-231
    • Brafman, R.I.1    Tennenholtz, M.2
  • 5
    • 0012257655 scopus 로고    scopus 로고
    • Near-optimal performance for reinforcement learning in polynomial time
    • M. Kearns and S. P. Singh. Near-optimal performance for reinforcement learning in polynomial time. In Proc. ICML, pages 260-268, 1998.
    • (1998) Proc. ICML , pp. 260-268
    • Kearns, M.1    Singh, S.P.2
  • 6
    • 66849131425 scopus 로고    scopus 로고
    • Multi-task reinforcement learning in partially observable stochastic environments
    • H. Li, X. Liao, and L. Carin. Multi-task reinforcement learning in partially observable stochastic environments. Journal of Machine Learning Research, 10:1131-1186, 2009.
    • (2009) Journal of Machine Learning Research , vol.10 , pp. 1131-1186
    • Li, H.1    Liao, X.2    Carin, L.3
  • 7
    • 85138579181 scopus 로고
    • Learning policies for partially observable environments: Scaling up
    • M.L. Littman, A.R. Cassandra, and L.P. Kaelbling. Learning policies for partially observable environments: scaling up. In ICML, 1995.
    • (1995) ICML
    • Littman, M.L.1    Cassandra, A.R.2    Kaelbling, L.P.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.