SCOPUS 정보 검색 플랫폼

ICML 2010 - Proceedings, 27th International Conference on Machine Learning

Volumn , Issue , 2010, Pages 311-318

Temporal difference Bayesian model averaging: A Bayesian perspective on adapting lambda

(2) Downey, Carlton a Sanner, Scott b

a VICTORIA UNIVERSITY OF WELLINGTON (New Zealand)

b AUSTRALIAN NATIONAL UNIVERSITY (Australia)

Author keywords

[No Author keywords available]

Indexed keywords

BAYESIAN APPROACHES; BAYESIAN MODEL AVERAGING; BAYESIAN PERSPECTIVE; EXPECTED RETURN; FAST CONVERGENCE; FIXED PARAMETERS; OPTIMAL CONVERGENCE; SAMPLED DATA; TEMPORAL DIFFERENCES;

BAYESIAN NETWORKS; CONVERGENCE OF NUMERICAL METHODS; REINFORCEMENT LEARNING;

LEARNING ALGORITHMS;

EID: 77956543059 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (31)

References (11)

1
- 0002700781
- Learning to act using real-time dynamic programming
- Barto, Andrew G., Bradtke, Steven J. and Singh, Satinder P. Learning to act using real-time dynamic programming. Technical Report UM-CS-1993-002, U. Mass. Amherst, 1993.
- (1993) Technical Report UM-CS-1993-002, U. Mass. Amherst
- Barto, A.G.¹ Bradtke, S.J.² Singh, S.P.³

2
- 0031619316
- Bayesian q-learning
- Dearden, Richard, Friedman, Nir and Russell, Stuart J. Bayesian q-learning. In AAAI/IAAI, pp. 761-768, 1998.
- (1998) AAAI/IAAI , pp. 761-768
- Dearden, R.¹ Friedman, N.² Russell, S.J.³

3
- 22944449970
- Model based bayesian exploration
- Dearden, Richard, Friedman, Nir, and Andre, David. Model based bayesian exploration. In UAI, 1999.
- (1999) UAI
- Dearden, R.¹ Friedman, N.² Andre, D.³

4
- 31844451013
- Reinforcement learning with Gaussian processes
- New York, NY, USA
- Engel, Yaakov, Mannor, Shie and Meir, Ron. Reinforcement learning with Gaussian processes. In ICML-05, pp. 201-208, New York, NY, USA, 2005.
- (2005) ICML-05 , pp. 201-208
- Engel, Y.¹ Mannor, S.² Meir, R.³

5
- 23244466805
- PhD thesis, University College London, London, UK, March
- Kakade, Sham. On the Sample Complexity of Reinforcement Learning. PhD thesis, University College London, London, UK, March 2003.
- (2003) On the Sample Complexity of Reinforcement Learning
- Kakade, S.¹

6
- 0003294665
- The art of computer programming
- Addison-Wesley, Boston
- Knuth, Donald E. The Art of Computer Programming, Volume 2: Seminumerical Algorithms. Addison-Wesley, Boston, 1998.
- (1998) Seminumerical Algorithms , vol.2
- Knuth, D.E.¹

7
- 33749251297
- An analytic solution to discrete bayesian reinforcement learning
- New York, NY, USA, ACM
- Poupart, Pascal, Vlassis, Nikos, Hoey, Jesse and Regan, Kevin. An analytic solution to discrete bayesian reinforcement learning. In ICML '06, pp. 697-704, New York, NY, USA, 2006. ACM.
- (2006) ICML '06 , pp. 697-704
- Poupart, P.¹ Vlassis, N.² Hoey, J.³ Regan, K.⁴

8
- 85102627959
- Wiley, New York
- Puterman, Martin L. Markov Decision Processes: Discrete Stochastic Dynamic Programming. Wiley, New York, 1994.
- (1994) Markov Decision Processes: Discrete Stochastic Dynamic Programming
- Puterman, M.L.¹

9
- 0010495476
- On bias and step size in temporal-difference learning
- New Haven, CT
- Sutton, R. S. and Singh, S. P. On bias and step size in temporal-difference learning. In Proceedings of the Eighth Yale Workshop on Adaptive and Learning Systems, pp. 91-96, New Haven, CT, 1994.
- (1994) Proceedings of the Eighth Yale Workshop on Adaptive and Learning Systems , pp. 91-96
- Sutton, R.S.¹ Singh, S.P.²

10
- 0004007508
- MIT Press
- Sutton, Richard and Barto, Andrew. Reinforcement Learning. MIT Press, 1998.
- (1998) Reinforcement Learning
- Sutton, R.¹ Barto, A.²

11
- 31844436266
- Bayesian sparse sampling for online reward optimization
- Wang, Tao, Lizotte, Daniel, Bowling, Michael and Schuurmans, Dale. Bayesian sparse sampling for online reward optimization. In ICML-05, 2005.
- (2005) ICML-05
- Wang, T.¹ Lizotte, D.² Bowling, M.³ Schuurmans, D.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.