SCOPUS 정보 검색 플랫폼

Proceedings of the National Conference on Artificial Intelligence

Volumn , Issue , 2004, Pages 623-630

Reinforcement learning for a CPG-driven biped robot

(4) Mori, Takeshi a Nakamura, Yutaka a,c Sato, Masa Aki b,c Ishii, Shin a,c

a NARA INSTITUTE OF SCIENCE AND TECHNOLOGY (Japan)

b ADVANCED TELECOMMUNICATIONS RESEARCH INSTITUTE INTERNATIONAL (Japan)

c JAPAN SCIENCE AND TECHNOLOGY AGENCY (Japan)

Author keywords

[No Author keywords available]

Indexed keywords

BIPED ROBOTS; CONTROL MECHANISM; POLICY GRADIENT METHODS; REINFORCEMENT LEARNING;

APPROXIMATION THEORY; CONTROL EQUIPMENT; ERRORS; GENETIC ALGORITHMS; NEURAL NETWORKS; PROBLEM SOLVING; ROBOT APPLICATIONS; ROBUSTNESS (CONTROL SYSTEMS);

MOBILE ROBOTS;

EID: 9444286978 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (57)

References (20)

1
- 0032294861
- Hexapodal robot locomotion over uneven terrain
- Barnes, D. P. 1998. Hexapodal robot locomotion over uneven terrain. In Proceedings of IEEE Conference on Control Applications, 441-445.
- (1998) Proceedings of IEEE Conference on Control Applications , pp. 441-445
- Barnes, D.P.¹

2
- 0003487482
- Athena Scientific
- Bertsekas, D. P., and Tsitsiklis, J. N. 1996. Neuro-Dynamic Programming. Athena Scientific.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

3
- 0001771345
- Linear least-squares algorithms for temporal difference learning
- Bradtke, S., and Barto, A. 1996. Linear least-squares algorithms for temporal difference learning. Machine Learning 22.
- (1996) Machine Learning , vol.22
- Bradtke, S.¹ Barto, A.²

4
- 0031626085
- Scout: A simple quadruped that walks, climbs and runs
- Buehler, M.; Battaglia, R.; Cocosco, A.; Hawker, G.; Sarkis, J.; and Yamazaki, K. 1998. Scout: A simple quadruped that walks, climbs and runs. In Proceedings of the 1998 IEEE International Conference on Robotics & Automation.
- (1998) Proceedings of the 1998 IEEE International Conference on Robotics & Automation
- Buehler, M.¹ Battaglia, R.² Cocosco, A.³ Hawker, G.⁴ Sarkis, J.⁵ Yamazaki, K.⁶

5
- 85047678284
- A combined neuronal and mechanical model of fish swimming
- Ekeberg, Ö. 1993. A combined neuronal and mechanical model of fish swimming. Biological Cybernetics 69:363-374.
- (1993) Biological Cybernetics , vol.69 , pp. 363-374
- Ekeberg, Ö.¹

6
- 0037459319
- Discrete coding of reward probability and uncertainty by dopamine neurons
- Fiorillo, C. D.; Tobler, P. N.; and Schultz, W. 2003. Discrete coding of reward probability and uncertainty by dopamine neurons. SCIENCE 299:1898-1902.
- (2003) SCIENCE , vol.299 , pp. 1898-1902
- Fiorillo, C.D.¹ Tobler, P.N.² Schultz, W.³

7
- 0037645833
- Adaptive dynamic walking of a quadruped robot on irregular terrain based on biological concepts
- Fukuoka, Y.; Kimura, H.; and Cohen, A. H. 2003. Adaptive dynamic walking of a quadruped robot on irregular terrain based on biological concepts. the International Journal of Robotics Research 22:187-202.
- (2003) The International Journal of Robotics Research , vol.22 , pp. 187-202
- Fukuoka, Y.¹ Kimura, H.² Cohen, A.H.³

8
- 0026011636
- Neuronal network generating locomotor behavior in lamprey: Circuitry, transmitters, membrane properties and simulations
- Grillner, S.; Wallen, P.; Brodin, L.; and Lansner, A. 1991. Neuronal network generating locomotor behavior in lamprey: circuitry, transmitters, membrane properties and simulations. Annual Review of Neuroscience 14:169-199.
- (1991) Annual Review of Neuroscience , vol.14 , pp. 169-199
- Grillner, S.¹ Wallen, P.² Brodin, L.³ Lansner, A.⁴

9
- 0031638777
- The development of honda humanoid robot
- Hirai, K.; Hirose, M.; Haikawa, Y.; and Takenaka, T. 1998. The development of honda humanoid robot. In Proceedings of the 1998 IEEE International Conference on Robotics & Automation.
- (1998) Proceedings of the 1998 IEEE International Conference on Robotics & Automation
- Hirai, K.¹ Hirose, M.² Haikawa, Y.³ Takenaka, T.⁴

10
- 0035350209
- A connectionist central pattern generator for the aquatic and terrestrial gaits of a simulated salamander
- Ijspeert, A. J. 2001. A connectionist central pattern generator for the aquatic and terrestrial gaits of a simulated salamander. Biological Cybernetics 84:331-348.
- (2001) Biological Cybernetics , vol.84 , pp. 331-348
- Ijspeert, A.J.¹

11
- 0042758707
- phd thesis. Department of Electrical Engineering and Computer Science Massachusetts Institute of Technology
- Konda, V. R. 2002. Actor-critic algorithms, phd thesis. Department of Electrical Engineering and Computer Science Massachusetts Institute of Technology.
- (2002) Actor-critic Algorithms
- Konda, V.R.¹

12
- 0035249254
- Simulation-based optimization of markov reward processes
- Marbach, P., and Tsitsiklis, J. N. 2001. Simulation-based optimization of markov reward processes. IEEE Transactions on Automatic Control 46:191-209.
- (2001) IEEE Transactions on Automatic Control , vol.46 , pp. 191-209
- Marbach, P.¹ Tsitsiklis, J.N.²

13
- 0035979437
- Acquisition of stand-up behavior by a real robot using hierarchical reinforcement learning
- Morimoto, J., and Doya, K. 2001. Acquisition of stand-up behavior by a real robot using hierarchical reinforcement learning. Robotics and Autonomous Systems 36:37-51.
- (2001) Robotics and Autonomous Systems , vol.36 , pp. 37-51
- Morimoto, J.¹ Doya, K.²

14
- 0035218760
- Generation of human bipedal locomotion by a bio-mimetic neuro-musculo-skeletal model
- Ogihara, N., and Yamazaki, N. 2001. Generation of human bipedal locomotion by a bio-mimetic neuro-musculo-skeletal model. Biological Cybernetics 84:1-11.
- (2001) Biological Cybernetics , vol.84 , pp. 1-11
- Ogihara, N.¹ Yamazaki, N.²

15
- 0002997066
- Reinforcement learning based on on-line em algorithm
- Sato, M., and Ishii, S. 1999. Reinforcement learning based on on-line em algorithm. In Advances in Neural Information Processing Systems, volume 11, 1052-1058.
- (1999) Advances in Neural Information Processing Systems , vol.11 , pp. 1052-1058
- Sato, M.¹ Ishii, S.²

16
- 0025020623
- A real time learning algorithm for recurrent analog neural networks
- Sato, M. 1990. A real time learning algorithm for recurrent analog neural networks. Biological Cybernetics 62:237-241.
- (1990) Biological Cybernetics , vol.62 , pp. 237-241
- Sato, M.¹

17
- 0004102479
- MIT Press
- Sutton, R., and Barto, A. 1998. Reinforcement Learning: An Introduction. MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, A.²

18
- 84898939480
- Policy gradient methods for reinforcement learning with function approximation
- Sutton, R. S.; McAllester, D.; Singh, S.; and Mansour, Y. 2000. Policy gradient methods for reinforcement learning with function approximation. In Advances in Neural Information Processing Systems, volume 12, 1057-1063.
- (2000) Advances in Neural Information Processing Systems , vol.12 , pp. 1057-1063
- Sutton, R.S.¹ McAllester, D.² Singh, S.³ Mansour, Y.⁴

19
- 0026045478
- Self-organized control of bipedal locomotion by neural oscillators in unpredictable environment
- Taga, G.; Yamaguchi, Y.; and Shimizu, H. 1991. Self-organized control of bipedal locomotion by neural oscillators in unpredictable environment. Biological Cybernetics 65:147-159.
- (1991) Biological Cybernetics , vol.65 , pp. 147-159
- Taga, G.¹ Yamaguchi, Y.² Shimizu, H.³

20
- 0000337576
- Simple statistical gradient following algorithms for connectionist reinforcement learning
- Williams, R. 1992. Simple statistical gradient following algorithms for connectionist reinforcement learning. Machine Learning 8:279-292.
- (1992) Machine Learning , vol.8 , pp. 279-292
- Williams, R.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.