메뉴 건너뛰기




Volumn 16, Issue 7, 2003, Pages 985-994

Inter-module credit assignment in modular reinforcement learning

Author keywords

Hierarchical modular architecture; Inter module credit assignment; Modular reward; MOSAIC non linear control task; Reinforcement learning

Indexed keywords

COMPUTER SIMULATION; LEARNING SYSTEMS; LINEAR CONTROL SYSTEMS; OPTIMIZATION;

EID: 0742324926     PISSN: 08936080     EISSN: None     Source Type: Journal    
DOI: 10.1016/S0893-6080(02)00235-6     Document Type: Article
Times cited : (44)

References (12)
  • 2
    • 0033629916 scopus 로고    scopus 로고
    • Reinforcement learning in continuous time and space
    • Doya K. Reinforcement learning in continuous time and space. Neural Computation. 12:2000;219-245.
    • (2000) Neural Computation , vol.12 , pp. 219-245
    • Doya, K.1
  • 7
    • 85152618928 scopus 로고
    • Plannning by incremental dynamic programing
    • L.A. Birnbaum, R.S. Sutton, & G.C. Collins. San Mateo, CA: Morgan Kaufmann
    • Sutton R.S. Plannning by incremental dynamic programing. Birnbaum L.A., Sutton R.S., Collins G.C. Proceedings of the Eighteenth International Workshop on Machine Learning. 1991;353-357 Morgan Kaufmann, San Mateo, CA.
    • (1991) Proceedings of the Eighteenth International Workshop on Machine Learning , pp. 353-357
    • Sutton, R.S.1
  • 8
    • 0001027894 scopus 로고
    • Transfer of learning by composing solutions of elemental sequential tasks
    • Singh S. Transfer of learning by composing solutions of elemental sequential tasks. Machine Learning. 8:1992;323-339.
    • (1992) Machine Learning , vol.8 , pp. 323-339
    • Singh, S.1
  • 9
    • 0033170372 scopus 로고    scopus 로고
    • Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • Sutton R., Precup D., Singh S. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence. 112:1999;181-211.
    • (1999) Artificial Intelligence , vol.112 , pp. 181-211
    • Sutton, R.1    Precup, D.2    Singh, S.3
  • 12
    • 0032192424 scopus 로고    scopus 로고
    • Multiple paired forward and inverse models for motor control
    • Wolpert D.M., Kawato M. Multiple paired forward and inverse models for motor control. Neural Networks. 11:1998;1317-1329.
    • (1998) Neural Networks , vol.11 , pp. 1317-1329
    • Wolpert, D.M.1    Kawato, M.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.