메뉴 건너뛰기




Volumn 264, Issue , 2010, Pages 293-309

Motor learning at intermediate reynolds number: Experiments with policy gradient on the flapping flight of a rigid wing

Author keywords

[No Author keywords available]

Indexed keywords


EID: 74049113741     PISSN: 1860949X     EISSN: None     Source Type: Book Series    
DOI: 10.1007/978-3-642-05181-4_13     Document Type: Conference Paper
Times cited : (14)

References (19)
  • 1
    • 23844516492 scopus 로고    scopus 로고
    • Coherent locomotion as an attracting state for a free flapping body
    • Alben, S., Shelley, M.: Coherent locomotion as an attracting state for a free flapping body. Proceedings of the National Academy of Science 102, 11163-11166 (2005)
    • (2005) Proceedings of the National Academy of Science , vol.102 , pp. 11163-11166
    • Alben, S.1    Shelley, M.2
  • 2
    • 0000396062 scopus 로고    scopus 로고
    • Natural gradient works efficiently in learning
    • Amari, S.: Natural gradient works efficiently in learning. Neural Computation 10, 251-276 (1998) (Pubitemid 128463152)
    • (1998) Neural Computation , vol.10 , Issue.2 , pp. 251-276
    • Amari, S.-I.1
  • 5
    • 13844306287 scopus 로고    scopus 로고
    • Efficient bipedal robots based on passivedynamic walkers
    • Collins, S.H., Ruina, A., Tedrake, R., Wisse, M.: Efficient bipedal robots based on passivedynamic walkers. Science 307, 1082-1085 (2005)
    • (2005) Science , vol.307 , pp. 1082-1085
    • Collins, S.H.1    Ruina, A.2    Tedrake, R.3    Wisse, M.4
  • 6
    • 84897694817 scopus 로고    scopus 로고
    • Variance reduction techniques for gradient estimates in reinforcement learning
    • Greensmith, E., Bartlett, P.L., Baxter, J.: Variance reduction techniques for gradient estimates in reinforcement learning. Journal of Machine Learning Research 5, 1471-1530 (2004)
    • (2004) Journal of Machine Learning Research , vol.5 , pp. 1471-1530
    • Greensmith, E.1    Bartlett, P.L.2    Baxter, J.3
  • 7
    • 74049141404 scopus 로고    scopus 로고
    • Methods for learning control policies from variable-constraint demonstrations
    • Sigaud, O., Peters, J. (eds.). SCI. Springer, Heidelberg
    • Howard, M., Klanke, S., Gienger, M., Goerick, C., Vijayakumar, S.: Methods for learning control policies from variable-constraint demonstrations. In: Sigaud, O., Peters, J. (eds.) From Motor Learning to Interaction Learning in Robots. SCI, vol.264, pp. 253-291. Springer, Heidelberg (2010)
    • (2010) From Motor Learning to Interaction Learning in Robots , vol.264 , pp. 253-291
    • Howard, M.1    Klanke, S.2    Gienger, M.3    Goerick, C.4    Vijayakumar, S.5
  • 8
    • 0026712578 scopus 로고
    • Weight perturbation: An optimal architecture and learning technique for analog VLSI feedforward and recurrent multilayer networks
    • Jabri, M., Flower, B.:Weight perturbation: An optimal architecture and learning technique for analog VLSI feedforward and recurrent multilayer networks. IEEE Trans. Neural Netw. 3, 154-157 (1992)
    • (1992) IEEE Trans. Neural Netw. , vol.3 , pp. 154-157
    • Jabri, M.1    Flower, B.2
  • 10
    • 74049090442 scopus 로고    scopus 로고
    • Imitation and reinforcement learning for motor primitives with perceptual coupling
    • Sigaud, O., Peters, J. (eds.). SCI. Springer, Heidelberg
    • Kober, J., Mohler, B., Peters, J.: Imitation and reinforcement learning for motor primitives with perceptual coupling. In: Sigaud, O., Peters, J. (eds.) From Motor Learning to Interaction Learning in Robots. SCI, vol.264, pp. 209-225. Springer, Heidelberg (2010)
    • (2010) From Motor Learning to Interaction Learning in Robots , vol.264 , pp. 209-225
    • Kober, J.1    Mohler, B.2    Peters, J.3
  • 12
    • 74049092902 scopus 로고    scopus 로고
    • Policy gradient methods for robot control
    • University of Southern California
    • Peters, J., Vijayakumar, S., Schaal, S.: Policy gradient methods for robot control (Technical Report CS-03-787). University of Southern California (2003)
    • (2003) Technical Report CS-03-787
    • Peters, J.1    Vijayakumar, S.2    Schaal, S.3
  • 19
    • 0000337576 scopus 로고
    • Simple statistical gradient-following algorithms for connectionist reinforcement learning
    • Williams, R.: Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning 8, 229-256 (1992)
    • (1992) Machine Learning , vol.8 , pp. 229-256
    • Williams, R.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.