



Volume 3, 2003, Pages 1108-1113

Multitask reinforcement learning on the distribution of MDPs

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; AUTOMATION; ROBOTICS

EID: 84863336191     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/CIRA.2003.1222152     Document Type: Conference Paper
Times cited: 63

References (11)
  • 3
    • 0027684215
    • Prioritized sweeping: Reinforcement learning with less data and less real time
    • A. W. Moore and C. G. Atkeson. Prioritized Sweeping: Reinforcement Learning with Less Data and Less Real Time. Machine Learning, 13:103-130, 1993.
    • (1993) Machine Learning , vol.13 , pp. 103-130
    • Moore, A.W.1    Atkeson, C.G.2
  • 4
    • 84977063352
    • Efficient learning and planning within the Dyna framework
    • J. Peng and R. J. Williams. Efficient Learning and Planning Within the Dyna Framework. Adaptive Behavior, 1(4):437-454, 1993.
    • (1993) Adaptive Behavior , vol.1 , Issue.4 , pp. 437-454
    • Peng, J.1    Williams, R.J.2
  • 5
    • 0001027894
    • Transfer of learning by composing solutions of elemental sequential tasks
    • S. P. Singh. Transfer of Learning by Composing Solutions of Elemental Sequential Tasks. Machine Learning, 8:323-339, 1992.
    • (1992) Machine Learning , vol.8 , pp. 323-339
    • Singh, S.P.1
  • 6
    • 85132026293
    • Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
    • R. S. Sutton. Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming. In Proceedings of the 7th International Conference on Machine Learning, pages 216-224, 1990.
    • (1990) Proceedings of the 7th International Conference on Machine Learning , pp. 216-224
    • Sutton, R.S.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.