Volume , Issue , 2002, Pages 1611-1618

Nonparametric Representation of Policies and Value Functions: A Trajectory-Based Approach

Author keywords

[No Author keywords available]

Indexed keywords

REINFORCEMENT LEARNING;

EID: 85156195508     PISSN: None     EISSN: None     Source Type: Conference Proceeding
DOI: None     Document Type: Conference Paper
Times cited: 14

References (15)
  • 1
    • Richard S. Sutton. Integrated architectures for learning, planning and reacting based on approximating dynamic programming. In Proceedings of the 7th International Conference on Machine Learning, 1990.
  • 3
    • Christopher G. Atkeson. Using local trajectory optimizers to speed up global optimization in dynamic programming. In Jack D. Cowan, Gerald Tesauro, and Joshua Alspector, editors, Advances in Neural Information Processing Systems, volume 6, pages 663-670. Morgan Kaufmann Publishers, Inc., 1994.
  • 10
    • M. Garcia, A. Chatterjee, and A. Ruina. Efficiency, speed, and scaling of two-dimensional passive-dynamic walking. Dynamics and Stability of Systems, 15(2):75-99, 2000.
  • 12
    • J. Morimoto and K. Doya. Robust reinforcement learning. In Todd K. Leen, Thomas G. Dietterich, and Volker Tresp, editors, Advances in Neural Information Processing Systems 13, pages 1061-1067. MIT Press, Cambridge, MA, 2001.
  • 13
    • R. Neuneier and O. Mihatsch. Risk sensitive reinforcement learning. In M. S. Kearns, S. A. Solla, and D. A. Cohn, editors, Advances in Neural Information Processing Systems 11, pages 1031-1037. MIT Press, Cambridge, MA, USA, 1998.
  • 14
    • S. P. Coraluppi and S. I. Marcus. Risk-sensitive and minmax control of discrete-time finite-state Markov decision processes. Automatica, 35:301-309, 1999.
  • 15
    • J. Morimoto and C. Atkeson. Minimax differential dynamic programming: An application to robust biped walking. In Advances in Neural Information Processing Systems 15. MIT Press, Cambridge, MA, 2002.


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.