SCOPUS 정보 검색 플랫폼

Robotics: Science and Systems

Volumn , Issue , 2014, Pages

Combining the benefits of function approximation and trajectory optimization

(2) Mordatch, Igor a Todorov, Emanuel a

a Department of Health Systems and Population Health (United States)

Author keywords

[No Author keywords available]

Indexed keywords

INVERSE PROBLEMS; LEARNING SYSTEMS; NEURAL NETWORKS; TEACHING; TRAJECTORIES;

APPROXIMATION OPTIMIZATION; FUNCTIONS APPROXIMATIONS; HARD PROBLEMS; HIGH-FIDELITY SOLUTIONS; IN-CONTROL; MACHINE-LEARNING; NEURAL-NETWORKS; OPTIMIZERS; ROBOTIC CONTROLS; TRAJECTORY OPTIMIZATION;

AERODYNAMICS;

EID: 85131220210 PISSN: None EISSN: 2330765X Source Type: Conference Proceeding
DOI: 10.15607/RSS.2014.X.052 Document Type: Conference Paper

Times cited : (128)

References (39)

1
- 85158005713
- An application of reinforcement learning to aerobatic helicopter flight
- P. Abbeel, A. Coates, M. Quigley, and A. Ng. An application of reinforcement learning to aerobatic helicopter flight. NIPS, 2006.
- (2006) NIPS
- Abbeel, P.¹ Coates, A.² Quigley, M.³ Ng, A.⁴

2
- 84879209923
- Trajectory optimization for full-body movements with complex contacts
- M. Al Borno, M. de Lasa, and A. Hertzmann. Trajectory optimization for full-body movements with complex contacts. IEEE Trans Visualization and Computer Graphics, 2013.
- (2013) IEEE Trans Visualization and Computer Graphics
- Al Borno, M.¹ de Lasa, M.² Hertzmann, A.³

3
- 84898945898
- A message-passing algorithm for multi-agent trajectory planning
- Jos Bento, Nate Derbinsky, Javier Alonso-Mora, and Jonathan S. Yedidia. A message-passing algorithm for multi-agent trajectory planning. In NIPS, pages 521–529, 2013.
- (2013) NIPS , pp. 521-529
- Bento, Jos¹ Derbinsky, Nate² Alonso-Mora, Javier³ Yedidia, Jonathan S.⁴

4
- 0003487482
- Athena Scientific, Belmont, MA
- D. Bertsekas and J. Tsitsiklis. Neuro-dynamic programming. Athena Scientific, Belmont, MA, 1997.
- (1997) Neuro-dynamic programming
- Bertsekas, D.¹ Tsitsiklis, J.²

5
- 79957583390
- Distributed optimization and statistical learning via the alternating direction method of multipliers
- S. Boyd, N. Pariks, E. Chu, B. Peleato, and J. Eckstein. Distributed optimization and statistical learning via the alternating direction method of multipliers. Foundations and Trends in Machine Learning, 2011.
- (2011) Foundations and Trends in Machine Learning
- Boyd, S.¹ Pariks, N.² Chu, E.³ Peleato, B.⁴ Eckstein, J.⁵

6
- 84890526837
- New types of deep neural network learning for speech recognition and related applications: An overview
- L. Deng, G. Hinton, and B. Kingsbury. New types of deep neural network learning for speech recognition and related applications: An overview. ICASSP, 2013.
- (2013) ICASSP
- Deng, L.¹ Hinton, G.² Kingsbury, B.³

7
- 77953234105
- A muscle-reflex model that encodes principles of legged mechanics produces human walking dynamics and muscle activities
- Hartmut Geyer and Hugh Herr. A muscle-reflex model that encodes principles of legged mechanics produces human walking dynamics and muscle activities. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 18(3), 2010.
- (2010) IEEE Transactions on Neural Systems and Rehabilitation Engineering , vol.18 , Issue.3
- Geyer, Hartmut¹ Herr, Hugh²

8
- 37249089713
- Quadrotor helicopter flight dynamics and control: Theory and experiment
- G. Hoffmann, H. Huang, S. Waslander, and C. Tomlin. Quadrotor helicopter flight dynamics and control: Theory and experiment. Proc AIAA, 2007.
- (2007) Proc AIAA
- Hoffmann, G.¹ Huang, H.² Waslander, S.³ Tomlin, C.⁴

9
- 40049092971
- Central pattern generators for locomotion control in animals and robots: a review
- A. Ijspeert. Central pattern generators for locomotion control in animals and robots: a review. Neural Networks, 2008.
- (2008) Neural Networks
- Ijspeert, A.¹

10
- 84876231242
- Imagenet classification with deep convolutional neural networks
- A. Krizhevsky, I. Sutskever, and G. Hinton. Imagenet classification with deep convolutional neural networks. NIPS, 2012.
- (2012) NIPS
- Krizhevsky, A.¹ Sutskever, I.² Hinton, G.³

11
- 33746100752
- Least-squares policy iteration
- M. Lagoudakis and R. Parr. Least-squares policy iteration. Journal of Machine Learning Research, 2003.
- (2003) Journal of Machine Learning Research
- Lagoudakis, M.¹ Parr, R.²

12
- 0032203257
- Gradient-based learning applied to document recognition
- Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner. Gradient-based learning applied to document recognition. Proc of the IEEE, 1998.
- (1998) Proc of the IEEE
- LeCun, Y.¹ Bottou, L.² Bengio, Y.³ Haffner, P.⁴

13
- 84898932265
- Variational policy search via trajectory optimization
- S. Levine and V. Koltun. Variational policy search via trajectory optimization. NIPS, 2013.
- (2013) NIPS
- Levine, S.¹ Koltun, V.²

14
- 84897529781
- Guided policy search
- S. Levine and V. Koltun. Guided policy search. ICML, 2013.
- (2013) ICML
- Levine, S.¹ Koltun, V.²

15
- 84919913608
- Learning complex neural network policies with trajectory optimization
- Sergey Levine and Vladlen Koltun. Learning complex neural network policies with trajectory optimization. In ICML ’14: Proceedings of the 31st International Conference on Machine Learning, 2014.
- (2014) ICML ’14: Proceedings of the 31st International Conference on Machine Learning
- Levine, Sergey¹ Koltun, Vladlen²

16
- 84860126760
- Trajectory generation and control for precise aggressive maneuvers with quadrotors
- D. Mellinger, N. Michael, and V. Kumar. Trajectory generation and control for precise aggressive maneuvers with quadrotors. Internatiional Journal of Robotics Research, 2012.
- (2012) Internatiional Journal of Robotics Research
- Mellinger, D.¹ Michael, N.² Kumar, V.³

17
- 0004059199
- MIT Press
- W. Miller, R. Sutton, and P. Werbos. Neural Networks for Control. MIT Press, 1990.
- (1990) Neural Networks for Control
- Miller, W.¹ Sutton, R.² Werbos, P.³

18
- 84988928384
- Contact-invariant optimization for hand manipulation
- I. Mordatch, Z. Popovic, and E Todorov. Contact-invariant optimization for hand manipulation. Symposium on Computer Animation, 2012.
- (2012) Symposium on Computer Animation
- Mordatch, I.¹ Popovic, Z.² Todorov, E³

19
- 84872224046
- Discovery of complex behaviors through contact-invariant optimization
- I. Mordatch, E Todorov, and Z. Popovic. Discovery of complex behaviors through contact-invariant optimization. SIGGRAPH, 2012.
- (2012) SIGGRAPH
- Mordatch, I.¹ Todorov, E² Popovic, Z.³

20
- 84887847814
- Animating human lower limbs using contact-invariant optimization
- I. Mordatch, J. Wang, E. Todorov, and V. Koltun. Animating human lower limbs using contact-invariant optimization. SIGGRAPH Asia, 2013.
- (2013) SIGGRAPH Asia
- Mordatch, I.¹ Wang, J.² Todorov, E.³ Koltun, V.⁴

21
- 84891310987
- Stochastic alternating direction method of multipliers
- JMLR.org
- Hua Ouyang, Niao He, Long Tran, and Alexander G. Gray. Stochastic alternating direction method of multipliers. In ICML (1), volume 28 of JMLR Proceedings, pages 80–88. JMLR.org, 2013.
- (2013) ICML (1), volume 28 of JMLR Proceedings , pp. 80-88
- Ouyang, Hua¹ He, Niao² Tran, Long³ Gray, Alexander G.⁴

22
- 40649106649
- Natural actor-critic
- J. Peters and S. Schaal. Natural actor-critic. Neurocom- puting, 71:1180–1190, 2008.
- (2008) Neurocom- puting , vol.71 , pp. 1180-1190
- Peters, J.¹ Schaal, S.²

23
- 84892720625
- A direct method for trajectory optimization of rigid bodies through contact
- M. Posa, C. Cantu, and R. Tedrake. A direct method for trajectory optimization of rigid bodies through contact. International Journal of Robotics Research, 2014.
- (2014) International Journal of Robotics Research
- Posa, M.¹ Cantu, C.² Tedrake, R.³

24
- 80053460450
- Contractive auto-encoders: Explicit invariance during feature extraction
- Salah Rifai, Pascal Vincent, Xavier Muller, Xavier Glo-rot, and Yoshua Bengio. Contractive auto-encoders: Explicit invariance during feature extraction. In ICML, pages 833–840, 2011.
- (2011) ICML , pp. 833-840
- Rifai, Salah¹ Vincent, Pascal² Muller, Xavier³ Glo-rot, Xavier⁴ Bengio, Yoshua⁵

25
- 84862273266
- A reduction of imitation learning and structured prediction to no-regret online learning
- JMLR.org
- Stphane Ross, Geoffrey J. Gordon, and Drew Bagnell. A reduction of imitation learning and structured prediction to no-regret online learning. In AISTATS, volume 15 of JMLR Proceedings, pages 627–635. JMLR.org, 2011.
- (2011) AISTATS, volume 15 of JMLR Proceedings , pp. 627-635
- Ross, Stphane¹ Gordon, Geoffrey J.² Bagnell, Drew³

26
- 0037471828
- Computational approaches to motor learning by imitation
- S. Schaal, A. Ijspeert, and A. Billard. Computational approaches to motor learning by imitation. Transactions of the Royal Society B, 2003.
- (2003) Transactions of the Royal Society B
- Schaal, S.¹ Ijspeert, A.² Billard, A.³

27
- 85083951635
- arXiv preprint
- P. Sermanet, D. Eigen, X. Zhang, M. Mathieu, R. Fergus, and Y. LeCun. Overfeat: Integrated recognition, localization and detection using convolutional networks. arXiv preprint, 2014.
- (2014) Overfeat: Integrated recognition, localization and detection using convolutional networks
- Sermanet, P.¹ Eigen, D.² Zhang, X.³ Mathieu, M.⁴ Fergus, R.⁵ LeCun, Y.⁶

28
- 85058603073
- Wiley-IEEE Press
- J. Si, A. Barto, W. Powell, and D. Wunsch. Handbook of Learning and Approximate Dynamic Programming. Wiley-IEEE Press, 2004.
- (2004) Handbook of Learning and Approximate Dynamic Programming
- Si, J.¹ Barto, A.² Powell, W.³ Wunsch, D.⁴

29
- 0005031076
- Transformation invariance in pattern recognition: Tangent distance and tangent propagation
- P. Simard, Y. LeCun, J. Denker, and B. Victorri. Transformation invariance in pattern recognition: Tangent distance and tangent propagation. Lecture Notes in Computer Science, 1998.
- (1998) Lecture Notes in Computer Science
- Simard, P.¹ LeCun, Y.² Denker, J.³ Victorri, B.⁴

30
- 0004294973
- Dover, New York
- R. Stengel. Optimal Control and Estimation. Dover, New York, 1994.
- (1994) Optimal Control and Estimation
- Stengel, R.¹

31
- 84898939480
- Policy gradient methods for reinforcement learning with function approximation
- R. Sutton, D. Mcallester, S. Singh, and Y. Mansour. Policy gradient methods for reinforcement learning with function approximation. In Advances in Neural Information Processing Systems, 2000.
- (2000) Advances in Neural Information Processing Systems
- Sutton, R.¹ Mcallester, D.² Singh, S.³ Mansour, Y.⁴

32
- 0004102479
- MIT Press, Cambridge MA
- R.S. Sutton and A.G. Barto. Reinforcement Learning: An Introduction. MIT Press, Cambridge MA, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

33
- 84872363924
- Synthesis and stabilization of complex behaviors through online trajectory optimization
- Y. Tassa, T. Erez, and E. Todorov. Synthesis and stabilization of complex behaviors through online trajectory optimization. IROS, 2012.
- (2012) IROS
- Tassa, Y.¹ Erez, T.² Todorov, E.³

34
- 84862011769
- Learning policy improvements with path integrals
- E. Theodorou, J. Buchli, and S. Schaal. Learning policy improvements with path integrals. AISTATS, 2010.
- (2010) AISTATS
- Theodorou, E.¹ Buchli, J.² Schaal, S.³

35
- 23944452693
- A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems
- E. Todorov and W. Li. A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems. American Control Conference, pages 300–306, 2005.
- (2005) American Control Conference , pp. 300-306
- Todorov, E.¹ Li, W.²

36
- 84872292044
- MuJoCo: A physics engine for model-based control
- E. Todorov, T. Erez, and Y. Tassa. MuJoCo: A physics engine for model-based control. IROS, 2012.
- (2012) IROS
- Todorov, E.¹ Erez, T.² Tassa, Y.³

37
- 71149083296
- Robot trajectory optimization using approximate inference
- M. Toussaint. Robot trajectory optimization using approximate inference. International Conference on Machine Learning, 26:1049–1056, 2009.
- (2009) International Conference on Machine Learning , vol.26 , pp. 1049-1056
- Toussaint, M.¹

38
- 85131218715
- SIG-GRAPH
- A. Witkin and M. Kass. Spacetime constraints. SIG-GRAPH, 1988.
- (1988) Spacetime constraints
- Witkin, A.¹ Kass, M.²

39
- 85131217346
- Value function approximation and model-predictive control
- M. Zhong, M. abd Johnson, Y. Tassa, T. Erez, and E Todorov. Value function approximation and model-predictive control. IEEE ADPRL, 2013.
- (2013) IEEE ADPRL
- Zhong, M.¹ abd Johnson, M.² Tassa, Y.³ Erez, T.⁴ Todorov, E⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.