-
3
-
-
0001771345
-
Linear least-squares algorithms for temporal difference learning
-
Bradtke, S., and Barto, A. 1996. Linear least-squares algorithms for temporal difference learning. Machine Learning 22.
-
(1996)
Machine Learning
, vol.22
-
-
Bradtke, S.1
Barto, A.2
-
4
-
-
0031626085
-
Scout: A simple quadruped that walks, climbs and runs
-
Buehler, M.; Battaglia, R.; Cocosco, A.; Hawker, G.; Sarkis, J.; and Yamazaki, K. 1998. Scout: A simple quadruped that walks, climbs and runs. In Proceedings of the 1998 IEEE International Conference on Robotics & Automation.
-
(1998)
Proceedings of the 1998 IEEE International Conference on Robotics & Automation
-
-
Buehler, M.1
Battaglia, R.2
Cocosco, A.3
Hawker, G.4
Sarkis, J.5
Yamazaki, K.6
-
5
-
-
85047678284
-
A combined neuronal and mechanical model of fish swimming
-
Ekeberg, Ö. 1993. A combined neuronal and mechanical model of fish swimming. Biological Cybernetics 69:363-374.
-
(1993)
Biological Cybernetics
, vol.69
, pp. 363-374
-
-
Ekeberg, Ö.1
-
6
-
-
0037459319
-
Discrete coding of reward probability and uncertainty by dopamine neurons
-
Fiorillo, C. D.; Tobler, P. N.; and Schultz, W. 2003. Discrete coding of reward probability and uncertainty by dopamine neurons. SCIENCE 299:1898-1902.
-
(2003)
SCIENCE
, vol.299
, pp. 1898-1902
-
-
Fiorillo, C.D.1
Tobler, P.N.2
Schultz, W.3
-
7
-
-
0037645833
-
Adaptive dynamic walking of a quadruped robot on irregular terrain based on biological concepts
-
Fukuoka, Y.; Kimura, H.; and Cohen, A. H. 2003. Adaptive dynamic walking of a quadruped robot on irregular terrain based on biological concepts. the International Journal of Robotics Research 22:187-202.
-
(2003)
The International Journal of Robotics Research
, vol.22
, pp. 187-202
-
-
Fukuoka, Y.1
Kimura, H.2
Cohen, A.H.3
-
8
-
-
0026011636
-
Neuronal network generating locomotor behavior in lamprey: Circuitry, transmitters, membrane properties and simulations
-
Grillner, S.; Wallen, P.; Brodin, L.; and Lansner, A. 1991. Neuronal network generating locomotor behavior in lamprey: circuitry, transmitters, membrane properties and simulations. Annual Review of Neuroscience 14:169-199.
-
(1991)
Annual Review of Neuroscience
, vol.14
, pp. 169-199
-
-
Grillner, S.1
Wallen, P.2
Brodin, L.3
Lansner, A.4
-
10
-
-
0035350209
-
A connectionist central pattern generator for the aquatic and terrestrial gaits of a simulated salamander
-
Ijspeert, A. J. 2001. A connectionist central pattern generator for the aquatic and terrestrial gaits of a simulated salamander. Biological Cybernetics 84:331-348.
-
(2001)
Biological Cybernetics
, vol.84
, pp. 331-348
-
-
Ijspeert, A.J.1
-
11
-
-
0042758707
-
-
phd thesis. Department of Electrical Engineering and Computer Science Massachusetts Institute of Technology
-
Konda, V. R. 2002. Actor-critic algorithms, phd thesis. Department of Electrical Engineering and Computer Science Massachusetts Institute of Technology.
-
(2002)
Actor-critic Algorithms
-
-
Konda, V.R.1
-
13
-
-
0035979437
-
Acquisition of stand-up behavior by a real robot using hierarchical reinforcement learning
-
Morimoto, J., and Doya, K. 2001. Acquisition of stand-up behavior by a real robot using hierarchical reinforcement learning. Robotics and Autonomous Systems 36:37-51.
-
(2001)
Robotics and Autonomous Systems
, vol.36
, pp. 37-51
-
-
Morimoto, J.1
Doya, K.2
-
14
-
-
0035218760
-
Generation of human bipedal locomotion by a bio-mimetic neuro-musculo-skeletal model
-
Ogihara, N., and Yamazaki, N. 2001. Generation of human bipedal locomotion by a bio-mimetic neuro-musculo-skeletal model. Biological Cybernetics 84:1-11.
-
(2001)
Biological Cybernetics
, vol.84
, pp. 1-11
-
-
Ogihara, N.1
Yamazaki, N.2
-
16
-
-
0025020623
-
A real time learning algorithm for recurrent analog neural networks
-
Sato, M. 1990. A real time learning algorithm for recurrent analog neural networks. Biological Cybernetics 62:237-241.
-
(1990)
Biological Cybernetics
, vol.62
, pp. 237-241
-
-
Sato, M.1
-
18
-
-
84898939480
-
Policy gradient methods for reinforcement learning with function approximation
-
Sutton, R. S.; McAllester, D.; Singh, S.; and Mansour, Y. 2000. Policy gradient methods for reinforcement learning with function approximation. In Advances in Neural Information Processing Systems, volume 12, 1057-1063.
-
(2000)
Advances in Neural Information Processing Systems
, vol.12
, pp. 1057-1063
-
-
Sutton, R.S.1
McAllester, D.2
Singh, S.3
Mansour, Y.4
-
19
-
-
0026045478
-
Self-organized control of bipedal locomotion by neural oscillators in unpredictable environment
-
Taga, G.; Yamaguchi, Y.; and Shimizu, H. 1991. Self-organized control of bipedal locomotion by neural oscillators in unpredictable environment. Biological Cybernetics 65:147-159.
-
(1991)
Biological Cybernetics
, vol.65
, pp. 147-159
-
-
Taga, G.1
Yamaguchi, Y.2
Shimizu, H.3
-
20
-
-
0000337576
-
Simple statistical gradient following algorithms for connectionist reinforcement learning
-
Williams, R. 1992. Simple statistical gradient following algorithms for connectionist reinforcement learning. Machine Learning 8:279-292.
-
(1992)
Machine Learning
, vol.8
, pp. 279-292
-
-
Williams, R.1
|