1. Abbeel, P., Coates, A., Quigley, M., & Ng, A. Y. (2007). An application of reinforcement learning to aerobatic helicopter flight. In Proc. Neural Information Processing Systems.
3. Cooper, G. F. (1988). A method for using belief networks as influence diagrams. In Proc. 4th Workshop on Uncertainty in Artificial Intelligence (pp. 55-63), Minneapolis, Minnesota.
4. Dayan, P., & Hinton, G. E. (1997). Using expectation-maximization for reinforcement learning. Neural Computation, 9(2), 271-278. doi: 10.1162/neco.1997.9.2.271.
6. Hoffman, M., Doucet, A., de Freitas, N., & Jasra, A. (2008). Bayesian policy learning with trans-dimensional MCMC. In Proc. Neural Information Processing Systems.
7. Kim, Y., Kim, S. H., & Kwak, Y. K. (2005). Dynamic analysis of a nonholonomic two-wheeled inverted pendulum robot. Journal of Intelligent and Robotic Systems, 44(1), 25-46. doi: 10.1007/s10846-005-9022-4.
9. Martinez-Cantin, R., de Freitas, N., Castellanos, J. A., & Doucet, A. (2009). A Bayesian exploration-exploitation approach for optimal online sensing and planning with a visually guided mobile robot. Autonomous Robots. doi: 10.1007/s10514-009-9130-2.
10. Neal, R. M., & Hinton, G. E. (1998). A view of the EM algorithm that justifies incremental, sparse, and other variants. In M. I. Jordan (Ed.), Learning in graphical models (pp. 355-368). Dordrecht: Kluwer Academic.
12. Peters, J., & Schaal, S. (2008). Natural actor critic. Neurocomputing, 71(7-9), 1180-1190. doi: 10.1016/j.neucom.2007.11.026.
13. Peters, J., & Schaal, S. (2008). Reinforcement learning of motor skills with policy gradients. Neural Networks, 21(4), 682-697. doi: 10.1016/j.neunet.2008.02.003.
15. Riedmiller, M., Gabel, T., Hafner, R., & Lange, S. (2009). Reinforcement learning for robot soccer. Autonomous Robots, 27(1), 55-73. doi: 10.1007/s10514-009-9120-4. This issue, part A.
16. Rückstie, T., Felder, M., & Schmidhuber, J. (2008). State-dependent exploration for policy gradient methods. In Proc. European Conf. on Machine Learning.
19. Toussaint, M., & Storkey, A. (2006). Probabilistic inference for solving discrete and continuous state Markov decision processes. In Proc. Int. Conf. on Machine Learning.
21. Wei, G., & Tanner, M. (1990). A Monte Carlo implementation of the EM algorithm and the poor man's data augmentation algorithm. Journal of the American Statistical Association, 85, 699-704. doi: 10.2307/2290005.