SCOPUS Information Retrieval Platform

Volume 2006, 2006, Pages 2997-3002

Quasi-online reinforcement learning for robots

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; COMPUTATION THEORY; FUNCTION EVALUATION; ONLINE SYSTEMS; PROBABILISTIC LOGICS;

COMPUTATION TIME; PROBABILISTIC MODEL; REINFORCEMENT LEARNING;

ROBOT LEARNING;

EID: 33845607326 PISSN: 10504729 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ROBOT.2006.1642157 Document Type: Conference Paper

Times cited: 32

References (12)

1
- 0346149797
- A robot that reinforcement-learns to identify and memorize important previous observations
- B. Bakker, V. Zhumatiy, G. Gruener, and J. Schmidhuber. A robot that reinforcement-learns to identify and memorize important previous observations. In Proceedings of the 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems, pages 430-435, 2003.
- (2003) Proceedings of the 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems , pp. 430-435
- Bakker, B.¹ Zhumatiy, V.² Gruener, G.³ Schmidhuber, J.⁴

4
- 0027684215
- Prioritized sweeping: Reinforcement learning with less data and less time
- A. Moore and C. Atkeson. Prioritized sweeping: Reinforcement learning with less data and less time. Machine Learning, 13:103-130, 1993.
- (1993) Machine Learning , vol.13 , pp. 103-130
- Moore, A.¹ Atkeson, C.²

5
- 0141596576
- Policy invariance under reward transformations: Theory and application to reward shaping
- A. Y. Ng, D. Harada, and S. Russell. Policy invariance under reward transformations: theory and application to reward shaping. In Proc. 16th International Conf. on Machine Learning, pages 278-287, 1999.
- (1999) Proc. 16th International Conf. on Machine Learning , pp. 278-287
- Ng, A.Y.¹ Harada, D.² Russell, S.³

6
- 84898980684
- Autonomous helicopter flight via reinforcement learning
- A. Y. Ng, H. J. Kim, M. Jordan, and S. Sastry. Autonomous helicopter flight via reinforcement learning. In Advances in Neural Information Processing Systems 16, 2004.
- (2004) Advances in Neural Information Processing Systems , vol.16
- Ng, A.Y.¹ Kim, H.J.² Jordan, M.³ Sastry, S.⁴

7
- 84977063352
- Efficient learning and planning within the dyna framework
- J. Peng and R. J. Williams. Efficient learning and planning within the dyna framework. Adaptive Behavior, 1 (4):437-454, 1993.
- (1993) Adaptive Behavior , vol.1 , Issue.4 , pp. 437-454
- Peng, J.¹ Williams, R.J.²

10
- 85132026293
- Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
- R. S. Sutton. Integrated architectures for learning, planning, and reacting based on approximating dynamic programming. In Proc. 7th ICML, pages 216-224, 1990.
- (1990) Proc. 7th ICML , pp. 216-224
- Sutton, R.S.¹

11
- 0004102479
- Reinforcement learning: An introduction
- R. S. Sutton and A. G. Barto. Reinforcement learning: An introduction. MIT Press, Cambridge, MA, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

12
- 0004049893
- Learning from delayed rewards
- C. J. C. H. Watkins. Learning from delayed rewards. PhD thesis, Cambridge University, 1989.
- (1989) Learning from Delayed Rewards
- Watkins, C.J.C.H.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.