메뉴 건너뛰기




Volumn , Issue , 2004, Pages 623-630

Reinforcement learning for a CPG-driven biped robot

Author keywords

[No Author keywords available]

Indexed keywords

BIPED ROBOTS; CONTROL MECHANISM; POLICY GRADIENT METHODS; REINFORCEMENT LEARNING;

EID: 9444286978     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (57)

References (20)
  • 3
    • 0001771345 scopus 로고    scopus 로고
    • Linear least-squares algorithms for temporal difference learning
    • Bradtke, S., and Barto, A. 1996. Linear least-squares algorithms for temporal difference learning. Machine Learning 22.
    • (1996) Machine Learning , vol.22
    • Bradtke, S.1    Barto, A.2
  • 5
    • 85047678284 scopus 로고
    • A combined neuronal and mechanical model of fish swimming
    • Ekeberg, Ö. 1993. A combined neuronal and mechanical model of fish swimming. Biological Cybernetics 69:363-374.
    • (1993) Biological Cybernetics , vol.69 , pp. 363-374
    • Ekeberg, Ö.1
  • 6
    • 0037459319 scopus 로고    scopus 로고
    • Discrete coding of reward probability and uncertainty by dopamine neurons
    • Fiorillo, C. D.; Tobler, P. N.; and Schultz, W. 2003. Discrete coding of reward probability and uncertainty by dopamine neurons. SCIENCE 299:1898-1902.
    • (2003) SCIENCE , vol.299 , pp. 1898-1902
    • Fiorillo, C.D.1    Tobler, P.N.2    Schultz, W.3
  • 7
    • 0037645833 scopus 로고    scopus 로고
    • Adaptive dynamic walking of a quadruped robot on irregular terrain based on biological concepts
    • Fukuoka, Y.; Kimura, H.; and Cohen, A. H. 2003. Adaptive dynamic walking of a quadruped robot on irregular terrain based on biological concepts. the International Journal of Robotics Research 22:187-202.
    • (2003) The International Journal of Robotics Research , vol.22 , pp. 187-202
    • Fukuoka, Y.1    Kimura, H.2    Cohen, A.H.3
  • 8
    • 0026011636 scopus 로고
    • Neuronal network generating locomotor behavior in lamprey: Circuitry, transmitters, membrane properties and simulations
    • Grillner, S.; Wallen, P.; Brodin, L.; and Lansner, A. 1991. Neuronal network generating locomotor behavior in lamprey: circuitry, transmitters, membrane properties and simulations. Annual Review of Neuroscience 14:169-199.
    • (1991) Annual Review of Neuroscience , vol.14 , pp. 169-199
    • Grillner, S.1    Wallen, P.2    Brodin, L.3    Lansner, A.4
  • 10
    • 0035350209 scopus 로고    scopus 로고
    • A connectionist central pattern generator for the aquatic and terrestrial gaits of a simulated salamander
    • Ijspeert, A. J. 2001. A connectionist central pattern generator for the aquatic and terrestrial gaits of a simulated salamander. Biological Cybernetics 84:331-348.
    • (2001) Biological Cybernetics , vol.84 , pp. 331-348
    • Ijspeert, A.J.1
  • 11
    • 0042758707 scopus 로고    scopus 로고
    • phd thesis. Department of Electrical Engineering and Computer Science Massachusetts Institute of Technology
    • Konda, V. R. 2002. Actor-critic algorithms, phd thesis. Department of Electrical Engineering and Computer Science Massachusetts Institute of Technology.
    • (2002) Actor-critic Algorithms
    • Konda, V.R.1
  • 13
    • 0035979437 scopus 로고    scopus 로고
    • Acquisition of stand-up behavior by a real robot using hierarchical reinforcement learning
    • Morimoto, J., and Doya, K. 2001. Acquisition of stand-up behavior by a real robot using hierarchical reinforcement learning. Robotics and Autonomous Systems 36:37-51.
    • (2001) Robotics and Autonomous Systems , vol.36 , pp. 37-51
    • Morimoto, J.1    Doya, K.2
  • 14
    • 0035218760 scopus 로고    scopus 로고
    • Generation of human bipedal locomotion by a bio-mimetic neuro-musculo-skeletal model
    • Ogihara, N., and Yamazaki, N. 2001. Generation of human bipedal locomotion by a bio-mimetic neuro-musculo-skeletal model. Biological Cybernetics 84:1-11.
    • (2001) Biological Cybernetics , vol.84 , pp. 1-11
    • Ogihara, N.1    Yamazaki, N.2
  • 16
    • 0025020623 scopus 로고
    • A real time learning algorithm for recurrent analog neural networks
    • Sato, M. 1990. A real time learning algorithm for recurrent analog neural networks. Biological Cybernetics 62:237-241.
    • (1990) Biological Cybernetics , vol.62 , pp. 237-241
    • Sato, M.1
  • 19
    • 0026045478 scopus 로고
    • Self-organized control of bipedal locomotion by neural oscillators in unpredictable environment
    • Taga, G.; Yamaguchi, Y.; and Shimizu, H. 1991. Self-organized control of bipedal locomotion by neural oscillators in unpredictable environment. Biological Cybernetics 65:147-159.
    • (1991) Biological Cybernetics , vol.65 , pp. 147-159
    • Taga, G.1    Yamaguchi, Y.2    Shimizu, H.3
  • 20
    • 0000337576 scopus 로고
    • Simple statistical gradient following algorithms for connectionist reinforcement learning
    • Williams, R. 1992. Simple statistical gradient following algorithms for connectionist reinforcement learning. Machine Learning 8:279-292.
    • (1992) Machine Learning , vol.8 , pp. 279-292
    • Williams, R.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.