Volume 2006, 2006, Pages 2997-3002

Quasi-online reinforcement learning for robots

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; COMPUTATION THEORY; FUNCTION EVALUATION; ONLINE SYSTEMS; PROBABILISTIC LOGICS

EID: 33845607326     PISSN: 10504729     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ROBOT.2006.1642157     Document Type: Conference Paper
Times cited: 32

References (12)
  • 3
    • 0000827179
    • BOXES: An experiment in adaptive control
    • E. Dale and D. Michie, editors, Edinburgh, Oliver and Boyd
    • D. Michie and R. A. Chambers. BOXES: An experiment in adaptive control. In E. Dale and D. Michie, editors, Machine Intelligence 2, pages 137-152, Edinburgh, 1968. Oliver and Boyd.
    • (1968) Machine Intelligence , vol.2 , pp. 137-152
    • Michie, D.1    Chambers, R.A.2
  • 4
    • 0027684215
    • Prioritized sweeping: Reinforcement learning with less data and less time
    • A. Moore and C. Atkeson. Prioritized sweeping: Reinforcement learning with less data and less time. Machine Learning, 13:103-130, 1993.
    • (1993) Machine Learning , vol.13 , pp. 103-130
    • Moore, A.1    Atkeson, C.2
  • 5
    • 0141596576
    • Policy invariance under reward transformations: Theory and application to reward shaping
    • A. Y. Ng, D. Harada, and S. Russell. Policy invariance under reward transformations: theory and application to reward shaping. In Proc. 16th International Conf. on Machine Learning, pages 278-287, 1999.
    • (1999) Proc. 16th International Conf. on Machine Learning , pp. 278-287
    • Ng, A.Y.1    Harada, D.2    Russell, S.3
  • 7
    • 84977063352
    • Efficient learning and planning within the Dyna framework
    • J. Peng and R. J. Williams. Efficient learning and planning within the Dyna framework. Adaptive Behavior, 1(4):437-454, 1993.
    • (1993) Adaptive Behavior , vol.1 , Issue.4 , pp. 437-454
    • Peng, J.1    Williams, R.J.2
  • 9
    • 0001898381
    • Practical reinforcement learning in continuous spaces
    • Morgan Kaufmann, San Francisco, CA
    • W. D. Smart and L. P. Kaelbling. Practical reinforcement learning in continuous spaces. In Proc. 17th International Conf. on Machine Learning, pages 903-910. Morgan Kaufmann, San Francisco, CA, 2000.
    • (2000) Proc. 17th International Conf. on Machine Learning , pp. 903-910
    • Smart, W.D.1    Kaelbling, L.P.2
  • 10
    • 85132026293
    • Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
    • R. S. Sutton. Integrated architectures for learning, planning, and reacting based on approximating dynamic programming. In Proc. 7th ICML, pages 216-224, 1990.
    • (1990) Proc. 7th ICML , pp. 216-224
    • Sutton, R.S.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.