-
3
-
-
0020970738
-
Neuronlike adaptive elements that can solve difficult learning control problems
-
Barto A.G., Sutton R.S., Anderson C.W. Neuronlike adaptive elements that can solve difficult learning control problems. In IEEE Transactions on Systems, Man, and Cybernetics. 3:1983;834-846.
-
(1983)
In IEEE Transactions on Systems, Man, and Cybernetics
, vol.3
, pp. 834-846
-
-
Barto, A.G.1
Sutton, R.S.2
Anderson, C.W.3
-
4
-
-
85156231814
-
Temporal difference learning in continuous time and space
-
D.S. Touretzky, M.C. Mozer, Hasselmo M.E. Cambridge, MA: MIT Press
-
Doya K. Temporal difference learning in continuous time and space. Touretzky D.S., Mozer M.C., Hasselmo M.E. Advances in neural information processing systems. 8:1996;1073-1079 MIT Press, Cambridge, MA.
-
(1996)
Advances in Neural Information Processing Systems
, vol.8
, pp. 1073-1079
-
-
Doya, K.1
-
5
-
-
0000406101
-
Efficient nonlinear control with actor-tutor architecture
-
M.C. Mozer, Jordan M.I. Cambridge, MA: MIT Press
-
Doya K. Efficient nonlinear control with actor-tutor architecture. Mozer M.C., Jordan M.I. Advances in neural information processing systems. 9:1997;1012-1018 MIT Press, Cambridge, MA.
-
(1997)
Advances in Neural Information Processing Systems
, vol.9
, pp. 1012-1018
-
-
Doya, K.1
-
6
-
-
0033629916
-
Reinforcement learning in continuous time and space
-
Doya K. Reinforcement learning in continuous time and space. Neural Computation. 12:2000;243-269.
-
(2000)
Neural Computation
, vol.12
, pp. 243-269
-
-
Doya, K.1
-
7
-
-
0022417008
-
The coordination of arm movements: An experimentally confirmed mathematical model
-
Flash T., Hogan N. The coordination of arm movements: An experimentally confirmed mathematical model. Journal of Neuroscience. 5:1985;1688-1703.
-
(1985)
Journal of Neuroscience
, vol.5
, pp. 1688-1703
-
-
Flash, T.1
Hogan, N.2
-
8
-
-
0025600638
-
A stochastic reinforcement learning algorithm for learning real-valued functions
-
Gullapalli V. A stochastic reinforcement learning algorithm for learning real-valued functions. Neural Networks. 3:1990;671-692.
-
(1990)
Neural Networks
, vol.3
, pp. 671-692
-
-
Gullapalli, V.1
-
9
-
-
0032552114
-
Signal-dependent noise determines motor planning
-
Harris C.M., Wolpert D.M. Signal-dependent noise determines motor planning. Nature. 394:(20):1998;780-784.
-
(1998)
Nature
, vol.394
, Issue.20
, pp. 780-784
-
-
Harris, C.M.1
Wolpert, D.M.2
-
10
-
-
72749118903
-
Models of trajectory formation and temporal interaction of reach and grasp
-
Hoff B., Arbib M.A. Models of trajectory formation and temporal interaction of reach and grasp. Journal of Motor Behavior. 25:(3):1993;175-192.
-
(1993)
Journal of Motor Behavior
, vol.25
, Issue.3
, pp. 175-192
-
-
Hoff, B.1
Arbib, M.A.2
-
11
-
-
0001246127
-
Optimization and learning in neural networks for formation and control of coordinated movement
-
D. Meyer, & S. Kornblum. Cambridge, MA: MIT Press
-
Kawato M. Optimization and learning in neural networks for formation and control of coordinated movement. Meyer D., Kornblum S. Attention and performance, XIV: synergies in experimental psychology, artificial intelligence, and cognitive neuroscience - A silver jubilee. 1992;821-849 MIT Press, Cambridge, MA.
-
(1992)
Attention and Performance, XIV: Synergies in Experimental Psychology, Artificial Intelligence, and Cognitive Neuroscience - A Silver Jubilee
, pp. 821-849
-
-
Kawato, M.1
-
12
-
-
0003543129
-
Macro-actions in reinforcement learning: An empirical analysis
-
University of Massachusetts, Department of Computer Science.
-
McGovern, A., Sutton, R.S (1998) Macro-actions in reinforcement learning: An empirical analysis. Technical Report 98-70, University of Massachusetts, Department of Computer Science.
-
(1998)
Technical Report 98-70
-
-
McGovern, A.1
Sutton, R.S.2
-
14
-
-
0032191729
-
A tennis serve and upswing learning robot based on dynamic optimization theory
-
Miyamoto H., Kawato M. A tennis serve and upswing learning robot based on dynamic optimization theory. Neural Networks. 11:(7-8):1998;1331-1344.
-
(1998)
Neural Networks
, vol.11
, Issue.78
, pp. 1331-1344
-
-
Miyamoto, H.1
Kawato, M.2
-
15
-
-
0030297195
-
A Kendama learning robot based on dynamic optimization theory
-
Miyamoto H., Schaal S., Gandolfo F., Gomi H., Koike Y., Osu R., Nakano E., Wada Y., Kawato M. A Kendama learning robot based on dynamic optimization theory. Neural Networks. 9:(8):1996;1281-1302.
-
(1996)
Neural Networks
, vol.9
, Issue.8
, pp. 1281-1302
-
-
Miyamoto, H.1
Schaal, S.2
Gandolfo, F.3
Gomi, H.4
Koike, Y.5
Osu, R.6
Nakano, E.7
Wada, Y.8
Kawato, M.9
-
17
-
-
0033151712
-
Is imitation learning the way to humanoid robots?
-
Schaal S. Is imitation learning the way to humanoid robots? Trends in Cognitive Sciences. 3:(6):1999;233-242.
-
(1999)
Trends in Cognitive Sciences
, vol.3
, Issue.6
, pp. 233-242
-
-
Schaal, S.1
-
20
-
-
0024314287
-
Formation and control of optimal trajectory in human multijoint arm movement - Minimum torque-change model
-
Uno Y., Kawato M., Suzuki R. Formation and control of optimal trajectory in human multijoint arm movement - minimum torque-change model. Biological Cybernetics. 61:1989;89-101.
-
(1989)
Biological Cybernetics
, vol.61
, pp. 89-101
-
-
Uno, Y.1
Kawato, M.2
Suzuki, R.3
-
22
-
-
0027884471
-
A neural network model for arm trajectory formation using forward and inverse dynamics models
-
Wada Y., Kawato M. A neural network model for arm trajectory formation using forward and inverse dynamics models. Neural Networks. 6:(7):1993;919-932.
-
(1993)
Neural Networks
, vol.6
, Issue.7
, pp. 919-932
-
-
Wada, Y.1
Kawato, M.2
|