Volume , Issue , 2009, Pages 226-232

Using reward-weighted imitations for robot reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

LEARNING TASKS; MOTOR PRIMITIVES; MOTOR SKILLS; NOVEL ALGORITHM; REINFORCEMENT LEARNING APPROACH; ROBOT ARMS;

EID: 67650458573     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ADPRL.2009.4927549     Document Type: Conference Paper
Times cited : (7)

References (25)
  • 4
    • 34948857495
    • Reinforcement learning for imitating constrained reaching movements
    • F. Guenter, M. Hersch, S. Calinon, and A. Billard. Reinforcement learning for imitating constrained reaching movements. RSJ Advanced Robotics, 21, 1521-1544, 2007. (Pubitemid 47529845)
    • (2007) Advanced Robotics , vol.21 , Issue.13 , pp. 1521-1544
    • Guenter, F.1    Hersch, M.2    Calinon, S.3    Billard, A.4
  • 6
    • 0000337576
    • Simple statistical gradient-following algorithms for connectionist reinforcement learning
    • R. J. Williams. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning, 8:229-256, 1992.
    • (1992) Machine Learning , vol.8 , pp. 229-256
    • Williams, R.J.1
  • 9
    • 34250635407
    • Policy gradient methods for robotics
    • DOI 10.1109/IROS.2006.282564, 4058714, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2006
    • J. Peters and S. Schaal. Policy gradient methods for robotics. In International Conference on Intelligent Robots and Systems (IROS), 2006. (Pubitemid 46928224)
    • (2006) IEEE International Conference on Intelligent Robots and Systems , pp. 2219-2225
    • Peters, J.1    Schaal, S.2
  • 10
    • 67650352549
    • Attention and motor skill learning
    • Champaign, IL
    • G. Wulf. Attention and motor skill learning. Human Kinetics, Champaign, IL, 2007.
    • (2007) Human Kinetics
    • Wulf, G.1
  • 12
    • 0346982426
    • Using expectation-maximization for reinforcement learning
    • P. Dayan and G. E. Hinton. Using expectation-maximization for reinforcement learning. Neural Computation, 9(2):271-278, 1997. (Pubitemid 127635391)
    • (1997) Neural Computation , vol.9 , Issue.2 , pp. 271-278
    • Dayan, P.1    Hinton, G.E.2
  • 16
    • 0039816976
    • Using local trajectory optimizers to speed up global optimization in dynamic programming
    • C. G. Atkeson. Using local trajectory optimizers to speed up global optimization in dynamic programming. In Advances in Neural Information Processing Systems (NIPS), 1994.
    • (1994) Advances in Neural Information Processing Systems (NIPS)
    • Atkeson, C.G.1
  • 19
    • 34848832311
    • Dynamics systems vs. optimal control - a unifying view
    • DOI 10.1016/S0079-6123(06)65027-9, PII S0079612306650279, Computational Neuroscience: Theoretical Insights into Brain Function
    • S. Schaal, P. Mohajerian, and A. Ijspeert. Dynamics systems vs. optimal control - a unifying view. Progress in Brain Research, 165(1):425-445, 2007. (Pubitemid 47513886)
    • (2007) Progress in Brain Research , vol.165 , pp. 425-445
    • Schaal, S.1    Mohajerian, P.2    Ijspeert, A.3
  • 20
    • 67650398856
    • Wikipedia, May 31
    • Wikipedia, May 31, 2008. http://en.wikipedia.org/wiki/Ball-in-aCUP
    • (2008)
  • 23
    • 0029753170
    • Canonical parameterization of excess motor degrees of freedom with self-organizing maps
    • PII S1045922796004870
    • D. DeMers and K. Kreutz-Delgado. Canonical parameterization of excess motor degrees of freedom with self-organizing maps. In IEEE Transactions on Neural Networks, Vol. 7, pp. 43-55, 1996. (Pubitemid 126811727)
    • (1996) IEEE Transactions on Neural Networks , vol.7 , Issue.1 , pp. 43-55
    • DeMers, D.1    Kreutz-Delgado, K.2
  • 25
    • 37249003309
    • A unifying framework for robot control with redundant DOFs
    • DOI 10.1007/s10514-007-9051-x
    • J. Peters, M. Mistry, F. Udwadia, R. Cory, J. Nakanishi, and S. Schaal. A unifying methodology for robot control with redundant DOFs. In Autonomous Robots, Vol. 24 (1), pp. 1-12, 2008. (Pubitemid 350276040)
    • (2008) Autonomous Robots , vol.24 , Issue.1 , pp. 1-12
    • Peters, J.1    Mistry, M.2    Udwadia, F.3    Nakanishi, J.4    Schaal, S.5


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.