-
1
-
-
0039816976
-
Using local trajectory optimizers to speed up global optimization in dynamic programming
-
Cowan, J. D., Tesauro, G., and Alspector, J., editors, Morgan Kaufmann, San Mateo, CA
-
Atkeson, C. G. (1994). Using local trajectory optimizers to speed up global optimization in dynamic programming. In Cowan, J. D., Tesauro, G., and Alspector, J., editors, Advances in Neural Information Processing Systems 6, pages 663-670. Morgan Kaufmann, San Mateo, CA.
-
(1994)
Advances in Neural Information Processing Systems
, vol.6
, pp. 663-670
-
-
Atkeson, C.G.1
-
2
-
-
0031074521
-
Locally weighted learning
-
Atkeson, C. G., Moore, A. W., and Schaal, S. (1997a). Locally weighted learning. Artificial Intelligence Review, 11:11-73.
-
(1997)
Artificial Intelligence Review
, vol.11
, pp. 11-73
-
-
Atkeson, C.G.1
Moore, A.W.2
Schaal, S.3
-
3
-
-
0031073475
-
Locally weighted learning for control
-
Atkeson, C. G., Moore, A. W., and Schaal, S. (1997b). Locally weighted learning for control. Artificial Intelligence Review, 11:75-113.
-
(1997)
Artificial Intelligence Review
, vol.11
, pp. 75-113
-
-
Atkeson, C.G.1
Moore, A.W.2
Schaal, S.3
-
5
-
-
0029210635
-
Learning to act using real-time dynamic programming
-
Barto, A. G., Bradtke, S. J., and Singh, S. P. (1995). Learning to act using real-time dynamic programming. Artificial Intelligence, 72(1):81-138.
-
(1995)
Artificial Intelligence
, vol.72
, Issue.1
, pp. 81-138
-
-
Barto, A.G.1
Bradtke, S.J.2
Singh, S.P.3
-
6
-
-
0026890244
-
Interactive spacetime control for animation
-
Cohen, M. F. (1992). Interactive spacetime control for animation. Computer Graphics, 26(2):293-302.
-
(1992)
Computer Graphics
, vol.26
, Issue.2
, pp. 293-302
-
-
Cohen, M.F.1
-
9
-
-
0029679044
-
Reinforcement learning: A survey
-
Kaelbling, L. P., Littman, M. L., and Moore, A. W. (1996). Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4:237-285.
-
(1996)
Journal of Artificial Intelligence Research
, vol.4
, pp. 237-285
-
-
Kaelbling, L.P.1
Littman, M.L.2
Moore, A.W.3
-
10
-
-
84992279217
-
Hierarchical spacetime control
-
Liu, Z., Gortler, S. J., and Cohen, M. F. (1994). Hierarchical spacetime control. Computer Graphics (SIGGRAPH '94 Proceedings), pages 35-42.
-
(1994)
Computer Graphics (SIGGRAPH '94 Proceedings)
, pp. 35-42
-
-
Liu, Z.1
Gortler, S.J.2
Cohen, M.F.3
-
11
-
-
0003474751
-
-
Cambridge University Press, New York, NY
-
Press, W. H., Teukolsky, S. A., Vetterling, W. T., and Flannery, B. P. (1988). Numerical Recipes in C. Cambridge University Press, New York, NY.
-
(1988)
Numerical Recipes in C
-
-
Press, W.H.1
Teukolsky, S.A.2
Vetterling, W.T.3
Flannery, B.P.4
-
12
-
-
84898995067
-
Learning from demonstration
-
Mozer, M. C., Jordan, M., and Petsche, T., editors, MIT Press, Cambridge, MA
-
Schaal, S. (1997). Learning from demonstration. In Mozer, M. C., Jordan, M., and Petsche, T., editors, Advances in Neural Information Processing Systems 9, pages 1040-1046. MIT Press, Cambridge, MA.
-
(1997)
Advances in Neural Information Processing Systems
, vol.9
, pp. 1040-1046
-
-
Schaal, S.1
-
13
-
-
0029255284
-
The swing up control problem for the acrobot
-
Spong, M. W. (1995). The swing up control problem for the acrobot. IEEE Control Systems Magazine, 15(1):49-55.
-
(1995)
IEEE Control Systems Magazine
, vol.15
, Issue.1
, pp. 49-55
-
-
Spong, M.W.1
-
14
-
-
85132026293
-
Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
-
Morgan Kaufmann, San Mateo, CA
-
Sutton, R. S. (1990). Integrated architectures for learning, planning, and reacting based on approximating dynamic programming. In Seventh International Machine Learning Workshop, pages 216-224. Morgan Kaufmann, San Mateo, CA. http://envy.cs.umass.edu/People/sutton/publications.html.
-
(1990)
Seventh International Machine Learning Workshop
, pp. 216-224
-
-
Sutton, R.S.1
-
15
-
-
0037631835
-
Dyna, an integrated architecture for learning, planning and reacting
-
151-155 and SIGART Bulletin
-
Sutton, R. S. (1991a). Dyna, an integrated architecture for learning, planning and reacting. Working Notes of the 1991 AAAI Spring Symposium on Integrated Intelligent Architectures, pp. 151-155, and SIGART Bulletin 2, pp. 160-163. http://envy.cs.umass.edu/People/sutton/publications.html.
-
(1991)
Working Notes of the 1991 AAAI Spring Symposium on Integrated Intelligent Architectures
, vol.2
, pp. 160-163
-
-
Sutton, R.S.1
-
16
-
-
85152618928
-
Planning by incremental dynamic programming
-
Morgan Kaufmann, San Mateo, CA
-
Sutton, R. S. (1991b). Planning by incremental dynamic programming. In Eighth International Machine Learning Workshop, pages 353-357. Morgan Kaufmann, San Mateo, CA. http://envy.cs.umass.edu/People/sutton/publications.html.
-
(1991)
Eighth International Machine Learning Workshop
, pp. 353-357
-
-
Sutton, R.S.1
-
17
-
-
0026852362
-
Reinforcement learning is direct adaptive optimal control
-
Sutton, R. S., Barto, A. G., and Williams, R. J. (1992). Reinforcement learning is direct adaptive optimal control. IEEE Control Systems Magazine, 12:19-22.
-
(1992)
IEEE Control Systems Magazine
, vol.12
, pp. 19-22
-
-
Sutton, R.S.1
Barto, A.G.2
Williams, R.J.3