2011, Pages 221-229

Efficient inference in Markov control problems

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE

EID: 80053139999     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 6

References (17)
  • 2. P. Dayan and G. E. Hinton. Using Expectation-Maximization for Reinforcement Learning. Neural Computation, 9(2):271-278, 1997.
  • 3. T. Furmston and D. Barber. Variational Methods for Reinforcement Learning. AISTATS, 9(13):241-248, 2010.
  • 4. M. Hoffman, A. Doucet, N. de Freitas, and A. Jasra. Trans-dimensional MCMC for Bayesian Policy Learning. NIPS, 20:665-672, 2008.
  • 5. M. Hoffman, N. de Freitas, A. Doucet, and J. Peters. An Expectation Maximization Algorithm for Continuous Markov Decision Processes with Arbitrary Rewards. AISTATS, 5(12):232-239, 2009.
  • 6. L. P. Kaelbling, M. L. Littman, and A. R. Cassandra. Planning and Acting in Partially Observable Stochastic Domains. Artificial Intelligence, 101(1-2):99-134, 1998.
  • 8. J. Kober and J. Peters. Policy Search for Motor Primitives in Robotics. NIPS, 21:849-856, 2009.
  • 10. R. Salakhutdinov, S. Roweis, and Z. Ghahramani. Optimization with EM and Expectation-Conjugate-Gradient. ICML, 20:672-679, 2003.
  • 12. R. Sutton, D. McAllester, S. Singh, and Y. Mansour. Policy Gradient Methods for Reinforcement Learning with Function Approximation. NIPS, 13, 2000.
  • 17. M. J. Wainwright and M. I. Jordan. Graphical Models, Exponential Families, and Variational Inference. Foundations and Trends in Machine Learning, 1(1-2):1-305, 2008.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.