Volume , Issue , 2011, Pages 1-10

Graphical models for bandit problems

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; GRAPH ALGORITHMS; PROBABILITY

EID: 80053151768     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (12)

References (13)
  • 4
    • 0344118814
    • Reinforcement learning with immediate rewards and linear hypotheses
    • Naoki Abe, Alan W. Biermann, and Philip M. Long. Reinforcement learning with immediate rewards and linear hypotheses. Algorithmica, 37:263-293, 2003.
    • (2003) Algorithmica , vol.37 , pp. 263-293
    • Abe, N.1    Biermann, A.W.2    Long, P.M.3
  • 5
    • 0041966002
    • Using confidence bounds for exploitation-exploration trade-offs
    • Peter Auer. Using confidence bounds for exploitation-exploration trade-offs. Journal of Machine Learning Research, 3:397-422, 2002.
    • (2002) Journal of Machine Learning Research , vol.3 , pp. 397-422
    • Auer, P.1
  • 11
    • 85162058047
    • Online linear regression and its application to model-based reinforcement learning
    • Alexander Strehl and Michael L. Littman. Online linear regression and its application to model-based reinforcement learning. In Advances in Neural Information Processing Systems 20, 2007.
    • (2007) Advances in Neural Information Processing Systems , vol.20
    • Strehl, A.1    Littman, M.L.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.