메뉴 건너뛰기




Volumn , Issue , 2010, Pages 311-318

Temporal difference Bayesian model averaging: A Bayesian perspective on adapting lambda

Author keywords

[No Author keywords available]

Indexed keywords

BAYESIAN APPROACHES; BAYESIAN MODEL AVERAGING; BAYESIAN PERSPECTIVE; EXPECTED RETURN; FAST CONVERGENCE; FIXED PARAMETERS; OPTIMAL CONVERGENCE; SAMPLED DATA; TEMPORAL DIFFERENCES;

EID: 77956543059     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (31)

References (11)
  • 3
    • 22944449970 scopus 로고    scopus 로고
    • Model based bayesian exploration
    • Dearden, Richard, Friedman, Nir, and Andre, David. Model based bayesian exploration. In UAI, 1999.
    • (1999) UAI
    • Dearden, R.1    Friedman, N.2    Andre, D.3
  • 4
    • 31844451013 scopus 로고    scopus 로고
    • Reinforcement learning with Gaussian processes
    • New York, NY, USA
    • Engel, Yaakov, Mannor, Shie and Meir, Ron. Reinforcement learning with Gaussian processes. In ICML-05, pp. 201-208, New York, NY, USA, 2005.
    • (2005) ICML-05 , pp. 201-208
    • Engel, Y.1    Mannor, S.2    Meir, R.3
  • 6
    • 0003294665 scopus 로고    scopus 로고
    • The art of computer programming
    • Addison-Wesley, Boston
    • Knuth, Donald E. The Art of Computer Programming, Volume 2: Seminumerical Algorithms. Addison-Wesley, Boston, 1998.
    • (1998) Seminumerical Algorithms , vol.2
    • Knuth, D.E.1
  • 7
    • 33749251297 scopus 로고    scopus 로고
    • An analytic solution to discrete bayesian reinforcement learning
    • New York, NY, USA, ACM
    • Poupart, Pascal, Vlassis, Nikos, Hoey, Jesse and Regan, Kevin. An analytic solution to discrete bayesian reinforcement learning. In ICML '06, pp. 697-704, New York, NY, USA, 2006. ACM.
    • (2006) ICML '06 , pp. 697-704
    • Poupart, P.1    Vlassis, N.2    Hoey, J.3    Regan, K.4
  • 11
    • 31844436266 scopus 로고    scopus 로고
    • Bayesian sparse sampling for online reward optimization
    • Wang, Tao, Lizotte, Daniel, Bowling, Michael and Schuurmans, Dale. Bayesian sparse sampling for online reward optimization. In ICML-05, 2005.
    • (2005) ICML-05
    • Wang, T.1    Lizotte, D.2    Bowling, M.3    Schuurmans, D.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.