-
3
-
-
0033629916
-
Reinforcement learning in continuous time and space
-
Doya, K. 2000. Reinforcement learning in continuous time and space. Neural Computation 12:219-245.
-
(2000)
Neural Computation
, vol.12
, pp. 219-245
-
-
Doya, K.1
-
4
-
-
33846140190
-
Experimental studies of a neural oscillator for biped locomotion with QRIO
-
Endo, G.; Nakanishi, J.; Morimoto, J.; and Cheng, G. 2005. Experimental studies of a neural oscillator for biped locomotion with QRIO. In IEEE International Conference on Robotics and Automation, 598-604.
-
(2005)
IEEE International Conference on Robotics and Automation
, pp. 598-604
-
-
Endo, G.1
Nakanishi, J.2
Morimoto, J.3
Cheng, G.4
-
5
-
-
0032251175
-
Computer simulation of the ontogeny of biped walking
-
Hase, K., and Yamazaki, N. 1998. Computer simulation of the ontogeny of biped walking. Anthropological Science 106(4):327-347.
-
(1998)
Anthropological Science
, vol.106
, Issue.4
, pp. 327-347
-
-
Hase, K.1
Yamazaki, N.2
-
10
-
-
0008336447
-
An analysis of actor/critic algorithms using eligibility traces: Reinforcement learning with imperfect value function
-
Kimura, H., and Kobayashi, S. 1998. An analysis of actor/critic algorithms using eligibility traces: Reinforcement learning with imperfect value function. In International Conference on Machine Learning, 278-286.
-
(1998)
International Conference on Machine Learning
, pp. 278-286
-
-
Kimura, H.1
Kobayashi, S.2
-
13
-
-
33846174631
-
Learning feedback pathways in CPG with policy gradient for biped locomotion
-
Matsubara, T.; Morimoto, J.; Nakanishi, J.; and Doya, K. 2005. Learning feedback pathways in CPG with policy gradient for biped locomotion. In IEEE International Conference on Robotics and Automation, 4175-4180.
-
(2005)
IEEE International Conference on Robotics and Automation
, pp. 4175-4180
-
-
Matsubara, T.1
Morimoto, J.2
Nakanishi, J.3
Doya, K.4
-
14
-
-
0022390346
-
Sustained oscillations generated by mutually inhibiting neurons with adaptation
-
Matsuoka, K. 1985. Sustained oscillations generated by mutually inhibiting neurons with adaptation. Biological Cybernetics 52:345-353.
-
(1985)
Biological Cybernetics
, vol.52
, pp. 345-353
-
-
Matsuoka, K.1
-
15
-
-
9444286978
-
Reinforcement learning for a CPG-driven biped robot
-
Mori, T.; Nakamura, Y.; Sato, M.; and Ishii, S. 2004. Reinforcement learning for a CPG-driven biped robot. In Nineteenth National Conference on Artificial Intelligence, 623-630.
-
(2004)
Nineteenth National Conference on Artificial Intelligence
, pp. 623-630
-
-
Mori, T.1
Nakamura, Y.2
Sato, M.3
Ishii, S.4
-
17
-
-
84898939480
-
Policy gradient methods for reinforcement learning with function approximation
-
Sutton, R. S.; McAllester, D.; Singh, S.; and Mansour, Y. 2000. Policy gradient methods for reinforcement learning with function approximation. Advances in Neural Information Processing Systems 12:1057-1063.
-
(2000)
Advances in Neural Information Processing Systems
, vol.12
, pp. 1057-1063
-
-
Sutton, R.S.1
McAllester, D.2
Singh, S.3
Mansour, Y.4
-
18
-
-
0029337594
-
A model of the neuro-musculo-skeletal system for human locomotion I. Emergence of basic gait
-
Taga, G. 1995. A model of the neuro-musculo-skeletal system for human locomotion I. Emergence of basic gait. Biological Cybernetics 73:97-111.
-
(1995)
Biological Cybernetics
, vol.73
, pp. 97-111
-
-
Taga, G.1
-
20
-
-
0032191803
-
Neural control of rhythmic arm movements
-
Williamson, M. 1998. Neural control of rhythmic arm movements. Neural Networks 11(7-8):1379-1394.
-
(1998)
Neural Networks
, vol.11
, Issue.7-8
, pp. 1379-1394
-
-
Williamson, M.1