SCOPUS Information Retrieval Platform

Volume , Issue , 1998, Pages 1008-1014

Nonparametric model-based reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

MODELS; PLANNING; TRAJECTORIES;

MINIMIZING COSTS; MODEL-BASED REINFORCEMENT LEARNING; NON-PARAMETRIC; OPTIMIZERS; PARAMETRIC MODELS; PLANNING ALGORITHMS;

REINFORCEMENT LEARNING;

EID: 49049119585 PISSN: 10495258 EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (21)

References (17)

2
- 0031074521
- Locally weighted learning
- Atkeson, C. G., Moore, A. W., and Schaal, S. (1997a). Locally weighted learning. Artificial Intelligence Review, 11:11-73.
- (1997) Artificial Intelligence Review , vol.11 , pp. 11-73
- Atkeson, C.G.¹ Moore, A.W.² Schaal, S.³

3
- 0031073475
- Locally weighted learning for control
- Atkeson, C. G., Moore, A. W., and Schaal, S. (1997b). Locally weighted learning for control. Artificial Intelligence Review, 11:75-113.
- (1997) Artificial Intelligence Review , vol.11 , pp. 75-113
- Atkeson, C.G.¹ Moore, A.W.² Schaal, S.³

4
- 0002130986
- Robot learning from demonstration
- Atkeson, C. G. and Schaal, S. (1997). Robot learning from demonstration. In Proceedings of the 1997 International Conference on Machine Learning.
- (1997) Proceedings of the 1997 International Conference on Machine Learning
- Atkeson, C.G.¹ Schaal, S.²

5
- 0029210635
- Learning to act using real-time dynamic programming
- Barto, A. G., Bradtke, S. J., and Singh, S. P. (1995). Learning to act using real-time dynamic programming. Artificial Intelligence, 72(1):81-138.
- (1995) Artificial Intelligence , vol.72 , Issue.1 , pp. 81-138
- Barto, A.G.¹ Bradtke, S.J.² Singh, S.P.³

6
- 0026890244
- Interactive spacetime control for animation
- Cohen, M. F. (1992). Interactive spacetime control for animation. Computer Graphics, 26(2):293-302.
- (1992) Computer Graphics , vol.26 , Issue.2 , pp. 293-302
- Cohen, M.F.¹

7
- 0004276055
- Academic, NY
- Dyer, P. and McReynolds, S. (1970). The Computational Theory of Optimal Control. Academic, NY.
- (1970) The Computational Theory of Optimal Control
- Dyer, P.¹ McReynolds, S.²

8
- 0004291983
- Elsevier, NY
- Jacobson, D. and Mayne, D. (1970). Differential Dynamic Programming. Elsevier, NY.
- (1970) Differential Dynamic Programming
- Jacobson, D.¹ Mayne, D.²

9
- 0029679044
- Reinforcement learning: A survey
- Kaelbling, L. P., Littman, M. L., and Moore, A. W. (1996). Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4:237-285.
- (1996) Journal of Artificial Intelligence Research , vol.4 , pp. 237-285
- Kaelbling, L.P.¹ Littman, M.L.² Moore, A.W.³

10
- 84992279217
- Hierarchical spacetime control
- Liu, Z., Gortler, S. J., and Cohen, M. F. (1994). Hierarchical spacetime control. Computer Graphics (SIGGRAPH '94 Proceedings), pages 35-42.
- (1994) Computer Graphics (SIGGRAPH '94 Proceedings) , pp. 35-42
- Liu, Z.¹ Gortler, S.J.² Cohen, M.F.³

11
- 0003474751
- Cambridge University Press, New York, NY
- Press, W. H., Teukolsky, S. A., Vetterling, W. T., and Flannery, B. P. (1988). Numerical Recipes in C. Cambridge University Press, New York, NY.
- (1988) Numerical Recipes in C
- Press, W.H.¹ Teukolsky, S.A.² Vetterling, W.T.³ Flannery, B.P.⁴

13
- 0029255284
- The swing up control problem for the acrobot
- Spong, M. W. (1995). The swing up control problem for the acrobot. IEEE Control Systems Magazine, 15(1):49-55.
- (1995) IEEE Control Systems Magazine , vol.15 , Issue.1 , pp. 49-55
- Spong, M.W.¹

17
- 0026852362
- Reinforcement learning is direct adaptive optimal control
- Sutton, R. S., Barto, A. G., and Williams, R. J. (1992). Reinforcement learning is direct adaptive optimal control. IEEE Control Systems Magazine, 12:19-22.
- (1992) IEEE Control Systems Magazine , vol.12 , pp. 19-22
- Sutton, R.S.¹ Barto, A.G.² Williams, R.J.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.