SCOPUS 정보 검색 플랫폼

Volumn 9, Issue , 2010, Pages 241-248

Variational methods for reinforcement learning

a UNIVERSITY COLLEGE LONDON (United Kingdom)

Author keywords

[No Author keywords available]

Indexed keywords

APPROXIMATE SOLUTION; BAYESIAN ALTERNATIVES; EXPECTATION PROPAGATION; MARKOV DECISION PROCESSES; OPTIMAL DECISIONS; POINT ESTIMATE; TRANSITION MATRICES; TRANSITION MODEL; VARIATIONAL BAYES; VARIATIONAL METHODS;

ARTIFICIAL INTELLIGENCE; ESTIMATION; MARKOV PROCESSES; REINFORCEMENT LEARNING;

UNCERTAINTY ANALYSIS;

EID: 84862273812 PISSN: 15324435 EISSN: 15337928 Source Type: Journal
DOI: None Document Type: Conference Paper

Times cited : (46)

References (15)

1
- 84864030941
- An application of reinforcement learning to aerobatic helicopter flight
- P. Abbeel, A. Coates, M. Quigley, and A. Ng. An Application of Reinforcement Learning to Aerobatic Helicopter Flight. NIPS, 19:1-8, 2007.
- (2007) NIPS , vol.19 , pp. 1-8
- Abbeel, P.¹ Coates, A.² Quigley, M.³ Ng, A.⁴

2
- 13844295342
- The variational Bayesian em algorithm for incomplete data: With application to scoring graphical model structures
- Oxford University Press
- M. J. Beal and Z. Ghahramani. The Variational Bayesian EM Algorithm for Incomplete Data: with Application to Scoring Graphical Model Structures. In Bayesian Statistics, volume 7, pages 453-464. Oxford University Press, 2003.
- (2003) Bayesian Statistics , vol.7 , pp. 453-464
- Beal, M.J.¹ Ghahramani, Z.²

3
- 85156187730
- Improving elevator performance using reinforcement learning
- R. Crites and A. Barto. Improving Elevator Performance Using Reinforcement Learning. NIPS, 8: 1017-1023, 1995.
- (1995) NIPS , vol.8 , pp. 1017-1023
- Crites, R.¹ Barto, A.²

4
- 0346982426
- Using expectation-maximization for reinforcement learning
- P. Dayan and G. E. Hinton. Using Expectation-Maximization for Reinforcement Learning. Neural Computation, 9:271-278, 1997.
- (1997) Neural Computation , vol.9 , pp. 271-278
- Dayan, P.¹ Hinton, G.E.²

5
- 0031619316
- Bayesian Q learning
- R. Dearden, N. Friedman, and S. Russell. Bayesian Q learning. AAAI, 15:761-768, 1998.
- (1998) AAAI , vol.15 , pp. 761-768
- Dearden, R.¹ Friedman, N.² Russell, S.³

6
- 1942450858
- PhD thesis, University of Massachusetts Amherst
- M. Duff. Optimal Learning: Computational Procedures for Bayes-Adaptive Markov Decision Processes. PhD thesis, University of Massachusetts Amherst, 2002.
- (2002) Optimal Learning: Computational Procedures for Bayes-Adaptive Markov Decision Processes
- Duff, M.¹

7
- 84907554788
- Solving deterministic policy (PO)MPDs using Expectation-Maximisation and Antifreeze
- Workshop on Learning and data Mining for Robotics
- T. Furmston and D. Barber. Solving deterministic policy (PO)MPDs using Expectation-Maximisation and Antifreeze. European Conference on Machine Learning (ECML), 1:50-65, 2009. Workshop on Learning and data Mining for Robotics.
- (2009) European Conference on Machine Learning (ECML) , vol.1 , pp. 50-65
- Furmston, T.¹ Barber, D.²

8
- 85162074018
- Trans-dimensional MCMC for Bayesian policy learning
- M. Hoffman, A. Doucet, N. de Freitas, and A. Jasra. Trans-dimensional MCMC for Bayesian Policy Learning. NIPS, 20:665-672, 2008.
- (2008) NIPS , vol.20 , pp. 665-672
- Hoffman, M.¹ Doucet, A.² De Freitas, N.³ Jasra, A.⁴

9
- 84858754385
- Policy search for motor primitives in robotics
- J. Kober and J. Peters. Policy search for motor primitives in robotics. NIPS, 21:849-856, 2009.
- (2009) NIPS , vol.21 , pp. 849-856
- Kober, J.¹ Peters, J.²

10
- 0035246564
- Factor graphs and the sum-product algorithm
- F. R. Kschischang, B. J. Frey, and H-A. Loeliger. Factor graphs and the sum-product algorithm. IEEE Transactions on Information Theory, 47:498-519, 2001.
- (2001) IEEE Transactions on Information Theory , vol.47 , pp. 498-519
- Kschischang, F.R.¹ Frey, B.J.² Loeliger, H.-A.³

11
- 0345978970
- Expectation propagation for approximate Bayesian inference
- T. P. Minka. Expectation Propagation for approximate Bayesian inference. In UAI '01: Proceedings of the 17th Conference in Uncertainty in Artificial Intelligence, pages 362-369, 2001.
- (2001) UAI '01: Proceedings of the 17th Conference in Uncertainty in Artificial Intelligence , pp. 362-369
- Minka, T.P.¹

12
- 58449109750
- Probabilistic inference for fast learning in control
- S. Girgin, M. Loth, R. Munos, P. Preux, and D. Ryabko, editors
- C. Rasmussen and M. Deisenroth. Probabilistic inference for fast learning in control. In S. Girgin, M. Loth, R. Munos, P. Preux, and D. Ryabko, editors, Recent Advances in Reinforcement Learning, pages 229-242, 2008.
- (2008) Recent Advances in Reinforcement Learning , pp. 229-242
- Rasmussen, C.¹ Deisenroth, M.²

13
- 0004102479
- MIT Press
- R. S. Sutton and A. G. Barto. Reinforcement Learning: An Introduction. MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

14
- 51349153274
- Probabilistic inference for solving (PO)MDPs
- School of Informatics
- M. Toussaint, S. Harmeling, and A. Storkey. Probabilistic inference for solving (PO)MDPs. Research Report EDI-INF-RR-0934, University of Edinburgh, School of Informatics, 2006.
- (2006) Research Report EDI-INF-RR-0934, University of Edinburgh
- Toussaint, M.¹ Harmeling, S.² Storkey, A.³

15
- 65749118363
- Graphical models, exponential families, and variational inference
- M. J. Wainwright and M. I. Jordan. Graphical models, exponential families, and variational inference. Foundations and Trends in Machine Learning, 1(1-2):1-305, 2008.
- (2008) Foundations and Trends in Machine Learning , vol.1 , Issue.1-2 , pp. 1-305
- Wainwright, M.J.¹ Jordan, M.I.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.