메뉴 건너뛰기




Volumn 2005, Issue , 2005, Pages 2381-2386

Poincaré-map-based reinforcement learning for biped walking

Author keywords

Biped walking; Poincar map; Reinforcement learning

Indexed keywords

BIPED LOCOMOTION; COMPUTER SIMULATION; CONFORMAL MAPPING; LEARNING ALGORITHMS; MATHEMATICAL MODELS; WALKING AIDS;

EID: 33846149670     PISSN: 10504729     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ROBOT.2005.1570469     Document Type: Conference Paper
Times cited : (53)

References (22)
  • 1
  • 2
    • 0036708925 scopus 로고    scopus 로고
    • Dynamic bipedal walking assisted by learning
    • C. Chew and G. A. Pratt. Dynamic bipedal walking assisted by learning. Robotica, 20:477-491, 2002.
    • (2002) Robotica , vol.20 , pp. 477-491
    • Chew, C.1    Pratt, G.A.2
  • 3
    • 0033629916 scopus 로고    scopus 로고
    • Reinforcement Learning in Continuous Time and Space
    • K. Doya. Reinforcement Learning in Continuous Time and Space. Neural Computation, 12(1):219-245, 2000.
    • (2000) Neural Computation , vol.12 , Issue.1 , pp. 219-245
    • Doya, K.1
  • 6
    • 0022417008 scopus 로고
    • The coordination of arm movements: An experimentally confirmed mathematical model
    • T. Flash and N. Hogan. The coordination of arm movements: An experimentally confirmed mathematical model. The Journal of Neuroscience, 5:1688-1703, 1985.
    • (1985) The Journal of Neuroscience , vol.5 , pp. 1688-1703
    • Flash, T.1    Hogan, N.2
  • 9
    • 33846137871 scopus 로고    scopus 로고
    • J. Morimoto and C. G. Atkeson. Robust low-torque biped walking using differential dynamic programming with a minimax criterion. In Philippe Bidaud and Faiz Ben Amar, editors, Proceedings of the 5th International Conference on Climbing and Walking Robots, pages 453-459. Professional Engineering Publishing, Bury St Edmunds and London, UK, 2002.
    • J. Morimoto and C. G. Atkeson. Robust low-torque biped walking using differential dynamic programming with a minimax criterion. In Philippe Bidaud and Faiz Ben Amar, editors, Proceedings of the 5th International Conference on Climbing and Walking Robots, pages 453-459. Professional Engineering Publishing, Bury St Edmunds and London, UK, 2002.
  • 10
    • 84898963340 scopus 로고    scopus 로고
    • Minimax differential dynamic programming: An application to robust biped walking
    • Suzanna Becker, Sebastian Thrun, and Klaus Obermayer, editors, MIT Press, Cambridge, MA
    • J. Morimoto and C. G. Atkeson. Minimax differential dynamic programming: An application to robust biped walking. In Suzanna Becker, Sebastian Thrun, and Klaus Obermayer, editors, Advances in Neural Information Processing Systems 15, pages 1563-1570. MIT Press, Cambridge, MA, 2003.
    • (2003) Advances in Neural Information Processing Systems 15 , pp. 1563-1570
    • Morimoto, J.1    Atkeson, C.G.2
  • 11
    • 3042678716 scopus 로고    scopus 로고
    • J. Morimoto, G. Cheng, C. G. Atkeson, and G. Zeglin. A Simple Reinforcememtn Learning Algorithm For Biped Walking. In Proceedings of the 2004 IEEE International Conference on Robotics and Automation, pages 3030-3035, 2004.
    • J. Morimoto, G. Cheng, C. G. Atkeson, and G. Zeglin. A Simple Reinforcememtn Learning Algorithm For Biped Walking. In Proceedings of the 2004 IEEE International Conference on Robotics and Automation, pages 3030-3035, 2004.
  • 12
    • 33846122824 scopus 로고    scopus 로고
    • Y. Nakamura, M. Sato, and S. Ishii. Reinforcement learning for biped robot. In Proceedings of the 2nd International Symposium on Adaptive Motion of Animals and Machines, pages ThP-II-5, 2003.
    • Y. Nakamura, M. Sato, and S. Ishii. Reinforcement learning for biped robot. In Proceedings of the 2nd International Symposium on Adaptive Motion of Animals and Machines, pages ThP-II-5, 2003.
  • 15
    • 0001108227 scopus 로고    scopus 로고
    • Constructive incremental learning from only local information
    • S. Schaal and C. G. Atkeson. Constructive incremental learning from only local information. Neural Computation, 10(8):2047-2084, 1998.
    • (1998) Neural Computation , vol.10 , Issue.8 , pp. 2047-2084
    • Schaal, S.1    Atkeson, C.G.2
  • 18
    • 0033170372 scopus 로고    scopus 로고
    • Between MDPs and semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning
    • R. S. Sutton, D. Precup, and S. Singh. Between MDPs and semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning. Artificial Intelligence, 112:181-211, 1999.
    • (1999) Artificial Intelligence , vol.112 , pp. 181-211
    • Sutton, R.S.1    Precup, D.2    Singh, S.3
  • 19
    • 14044262287 scopus 로고    scopus 로고
    • R. Tedrake, T. W. Zhang, and H. S. Seung. Stochastic policy gradient reinforcement learning on a simple 3d biped. In Proceedings of the 2004 IEEE/RSJ International Conference on Intelligen Robots and Systems, page (to appear), 2004.
    • R. Tedrake, T. W. Zhang, and H. S. Seung. Stochastic policy gradient reinforcement learning on a simple 3d biped. In Proceedings of the 2004 IEEE/RSJ International Conference on Intelligen Robots and Systems, page (to appear), 2004.
  • 21
    • 0029317915 scopus 로고
    • A theory for cursive handwriting based on the minimization principle
    • Y. Wada and M. Kawato. A theory for cursive handwriting based on the minimization principle. Biological Cybernetics, 73:3-15, 1995.
    • (1995) Biological Cybernetics , vol.73 , pp. 3-15
    • Wada, Y.1    Kawato, M.2
  • 22
    • 0000211627 scopus 로고
    • Development of a biped walking robot compensating for three-axis moment by trunk motion
    • J. Yamaguchi, A. Takanishi, and I. Kato. Development of a biped walking robot compensating for three-axis moment by trunk motion. Journal of the Robotics Society of Japan, 11(4):581-586, 1993.
    • (1993) Journal of the Robotics Society of Japan , vol.11 , Issue.4 , pp. 581-586
    • Yamaguchi, J.1    Takanishi, A.2    Kato, I.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.