메뉴 건너뛰기




Volumn , Issue , 2014, Pages

Combining the benefits of function approximation and trajectory optimization

Author keywords

[No Author keywords available]

Indexed keywords

INVERSE PROBLEMS; LEARNING SYSTEMS; NEURAL NETWORKS; TEACHING; TRAJECTORIES;

EID: 85131220210     PISSN: None     EISSN: 2330765X     Source Type: Conference Proceeding    
DOI: 10.15607/RSS.2014.X.052     Document Type: Conference Paper
Times cited : (128)

References (39)
  • 1
    • 85158005713 scopus 로고    scopus 로고
    • An application of reinforcement learning to aerobatic helicopter flight
    • P. Abbeel, A. Coates, M. Quigley, and A. Ng. An application of reinforcement learning to aerobatic helicopter flight. NIPS, 2006.
    • (2006) NIPS
    • Abbeel, P.1    Coates, A.2    Quigley, M.3    Ng, A.4
  • 6
    • 84890526837 scopus 로고    scopus 로고
    • New types of deep neural network learning for speech recognition and related applications: An overview
    • L. Deng, G. Hinton, and B. Kingsbury. New types of deep neural network learning for speech recognition and related applications: An overview. ICASSP, 2013.
    • (2013) ICASSP
    • Deng, L.1    Hinton, G.2    Kingsbury, B.3
  • 7
    • 77953234105 scopus 로고    scopus 로고
    • A muscle-reflex model that encodes principles of legged mechanics produces human walking dynamics and muscle activities
    • Hartmut Geyer and Hugh Herr. A muscle-reflex model that encodes principles of legged mechanics produces human walking dynamics and muscle activities. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 18(3), 2010.
    • (2010) IEEE Transactions on Neural Systems and Rehabilitation Engineering , vol.18 , Issue.3
    • Geyer, Hartmut1    Herr, Hugh2
  • 8
    • 37249089713 scopus 로고    scopus 로고
    • Quadrotor helicopter flight dynamics and control: Theory and experiment
    • G. Hoffmann, H. Huang, S. Waslander, and C. Tomlin. Quadrotor helicopter flight dynamics and control: Theory and experiment. Proc AIAA, 2007.
    • (2007) Proc AIAA
    • Hoffmann, G.1    Huang, H.2    Waslander, S.3    Tomlin, C.4
  • 9
    • 40049092971 scopus 로고    scopus 로고
    • Central pattern generators for locomotion control in animals and robots: a review
    • A. Ijspeert. Central pattern generators for locomotion control in animals and robots: a review. Neural Networks, 2008.
    • (2008) Neural Networks
    • Ijspeert, A.1
  • 10
    • 84876231242 scopus 로고    scopus 로고
    • Imagenet classification with deep convolutional neural networks
    • A. Krizhevsky, I. Sutskever, and G. Hinton. Imagenet classification with deep convolutional neural networks. NIPS, 2012.
    • (2012) NIPS
    • Krizhevsky, A.1    Sutskever, I.2    Hinton, G.3
  • 13
    • 84898932265 scopus 로고    scopus 로고
    • Variational policy search via trajectory optimization
    • S. Levine and V. Koltun. Variational policy search via trajectory optimization. NIPS, 2013.
    • (2013) NIPS
    • Levine, S.1    Koltun, V.2
  • 14
    • 84897529781 scopus 로고    scopus 로고
    • Guided policy search
    • S. Levine and V. Koltun. Guided policy search. ICML, 2013.
    • (2013) ICML
    • Levine, S.1    Koltun, V.2
  • 19
    • 84872224046 scopus 로고    scopus 로고
    • Discovery of complex behaviors through contact-invariant optimization
    • I. Mordatch, E Todorov, and Z. Popovic. Discovery of complex behaviors through contact-invariant optimization. SIGGRAPH, 2012.
    • (2012) SIGGRAPH
    • Mordatch, I.1    Todorov, E2    Popovic, Z.3
  • 20
    • 84887847814 scopus 로고    scopus 로고
    • Animating human lower limbs using contact-invariant optimization
    • I. Mordatch, J. Wang, E. Todorov, and V. Koltun. Animating human lower limbs using contact-invariant optimization. SIGGRAPH Asia, 2013.
    • (2013) SIGGRAPH Asia
    • Mordatch, I.1    Wang, J.2    Todorov, E.3    Koltun, V.4
  • 22
    • 40649106649 scopus 로고    scopus 로고
    • Natural actor-critic
    • J. Peters and S. Schaal. Natural actor-critic. Neurocom- puting, 71:1180–1190, 2008.
    • (2008) Neurocom- puting , vol.71 , pp. 1180-1190
    • Peters, J.1    Schaal, S.2
  • 24
  • 25
    • 84862273266 scopus 로고    scopus 로고
    • A reduction of imitation learning and structured prediction to no-regret online learning
    • JMLR.org
    • Stphane Ross, Geoffrey J. Gordon, and Drew Bagnell. A reduction of imitation learning and structured prediction to no-regret online learning. In AISTATS, volume 15 of JMLR Proceedings, pages 627–635. JMLR.org, 2011.
    • (2011) AISTATS, volume 15 of JMLR Proceedings , pp. 627-635
    • Ross, Stphane1    Gordon, Geoffrey J.2    Bagnell, Drew3
  • 33
    • 84872363924 scopus 로고    scopus 로고
    • Synthesis and stabilization of complex behaviors through online trajectory optimization
    • Y. Tassa, T. Erez, and E. Todorov. Synthesis and stabilization of complex behaviors through online trajectory optimization. IROS, 2012.
    • (2012) IROS
    • Tassa, Y.1    Erez, T.2    Todorov, E.3
  • 34
    • 84862011769 scopus 로고    scopus 로고
    • Learning policy improvements with path integrals
    • E. Theodorou, J. Buchli, and S. Schaal. Learning policy improvements with path integrals. AISTATS, 2010.
    • (2010) AISTATS
    • Theodorou, E.1    Buchli, J.2    Schaal, S.3
  • 35
    • 23944452693 scopus 로고    scopus 로고
    • A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems
    • E. Todorov and W. Li. A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems. American Control Conference, pages 300–306, 2005.
    • (2005) American Control Conference , pp. 300-306
    • Todorov, E.1    Li, W.2
  • 36
    • 84872292044 scopus 로고    scopus 로고
    • MuJoCo: A physics engine for model-based control
    • E. Todorov, T. Erez, and Y. Tassa. MuJoCo: A physics engine for model-based control. IROS, 2012.
    • (2012) IROS
    • Todorov, E.1    Erez, T.2    Tassa, Y.3
  • 37
    • 71149083296 scopus 로고    scopus 로고
    • Robot trajectory optimization using approximate inference
    • M. Toussaint. Robot trajectory optimization using approximate inference. International Conference on Machine Learning, 26:1049–1056, 2009.
    • (2009) International Conference on Machine Learning , vol.26 , pp. 1049-1056
    • Toussaint, M.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.