메뉴 건너뛰기




Volumn , Issue , 1998, Pages 1008-1014

Nonparametric model-based reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

MODELS; PLANNING; TRAJECTORIES;

EID: 49049119585     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (21)

References (17)
  • 1
    • 0039816976 scopus 로고
    • Using local trajectory optimizers to speed up global optimization in dynamic programming
    • Cowan, J. D. Tesauro, G. and Alspector, J. editors., Morgan Kaufmann, San Mateo, CA
    • Atkeson, C. G. (1994). Using local trajectory optimizers to speed up global optimization in dynamic programming. In Cowan, J. D., Tesauro, G., and Alspector, J., editors. Advances in Neural Information Processing Systems 6, pages 663-670. Morgan Kaufmann, San Mateo, CA.
    • (1994) Advances in Neural Information Processing Systems , vol.6 , pp. 663-670
    • Atkeson, C.G.1
  • 5
    • 0029210635 scopus 로고
    • Learning to act using real-time dynamic programming
    • Barto, A. G., Bradtke, S. J., and Singh, S. P. (1995). Learning to act using real-time dynamic programming. Artificial Intelligence, 72( 1):81-138.
    • (1995) Artificial Intelligence , vol.72 , Issue.1 , pp. 81-138
    • Barto, A.G.1    Bradtke, S.J.2    Singh, S.P.3
  • 6
    • 0026890244 scopus 로고
    • Interactive spacetime control for animation
    • Cohen, M. F. (1992). Interactive spacetime control for animation. Computer Graphics, 26(2):293-302.
    • (1992) Computer Graphics , vol.26 , Issue.2 , pp. 293-302
    • Cohen, M.F.1
  • 12
    • 84898995067 scopus 로고    scopus 로고
    • Learning from demonstration
    • Mozer, M. C. Jordan, M. and Petsche, T. editors, MIT Press, Cambridge, MA
    • Schaal, S. (1997). Learning from demonstration. In Mozer, M. C., Jordan, M., and Petsche, T., editors, Advances in Neural Information Processing Systems 9, pages 1040-1046. MIT Press, Cambridge, MA.
    • (1997) Advances in Neural Information Processing Systems , vol.9 , pp. 1040-1046
    • Schaal, S.1
  • 13
    • 0029255284 scopus 로고
    • The swing up control problem for the acrobot
    • Spong, M. W. (1995). The swing up control problem for the acrobot. IEEE Control Systems Magazine, 15(1):49-55.
    • (1995) IEEE Control Systems Magazine , vol.15 , Issue.1 , pp. 49-55
    • Spong, M.W.1
  • 14
    • 85132026293 scopus 로고
    • Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
    • Morgan Kaufmann, San Mateo, CA
    • Sutton, R. S. (1990). Integrated architectures for learning, planning, and reacting based on approximating dynamic programming. In Seventh International Machine Learning Workshop, pages 216-224. Morgan Kaufmann, San Mateo, CA. http://envy.cs.umass.edu/People/sutton/publications.html.
    • (1990) Seventh International Machine Learning Workshop , pp. 216-224
    • Sutton, R.S.1
  • 15
    • 0037631835 scopus 로고
    • Dyna, an integrated architecture for learning, planning and reacting
    • 151-155 and SIGART Bulletin
    • Sutton R. S. (1991a). Dyna, an integrated architecture for learning, planning and reacting. http://envy.cs.umass.edu/People/sutton/publications.html, Working Notes of the 1991 AAAI Spring Symposium on Integrated Intelligent Architectures pp. 151-155 and SIGART Bulletin 2, pp. 160-163.
    • (1991) Working Notes of the 1991 AAAI Spring Symposium on Integrated Intelligent Architectures , vol.2 , pp. 160-163
    • Sutton, R.S.1
  • 16
    • 85152618928 scopus 로고
    • Planning by incremental dynamic programming
    • Morgan Kaufmann, San Mateo, CA
    • Sutton, R. S. (1991b). Planning by incremental dynamic programming. In Eighth International Machine Learning Workshop, pages 353-357. Morgan Kaufmann, San Mateo, CA. http://envy.cs.umass.edu/People/sutton/publications.html.
    • (1991) Eighth International Machine Learning Workshop , pp. 353-357
    • Sutton, R.S.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.