Volume 1864, 2000, Pages 26-44

An overview of MAXQ hierarchical reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

ABSTRACTING; DECISION MAKING; PROBLEM SOLVING; STOCHASTIC SYSTEMS

EID: 84942867726     PISSN: 03029743     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1007/3-540-44914-0_2     Document Type: Conference Paper
Times cited: 61

References (15)
  • 2
    • 85156187730
    • Improving elevator performance using reinforcement learning
    • San Francisco, CA: Morgan Kaufmann
    • Crites, R. H., & Barto, A. G. (1995). Improving elevator performance using reinforcement learning. In Advances in Neural Information Processing Systems, Vol. 8, pp. 1017-1023. San Francisco, CA: Morgan Kaufmann.
    • (1995) Advances in Neural Information Processing Systems, vol. 8, pp. 1017-1023
    • Crites, R.H.; Barto, A.G.
  • 3
  • 5
    • 0002278788
    • Hierarchical reinforcement learning with the MAXQ value function decomposition
    • Dietterich, T. G. (2000). Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research. To appear.
    • (2000) Journal of Artificial Intelligence Research
    • Dietterich, T.G.
  • 7
    • 0027684215
    • Prioritized sweeping: Reinforcement learning with less data and less time
    • Moore, A. W., & Atkeson, C. G. (1993). Prioritized sweeping: Reinforcement learning with less data and less time. Machine Learning, 13, 103.
    • (1993) Machine Learning, vol. 13, pp. 103
    • Moore, A.W.; Atkeson, C.G.
  • 9
    • 84898956770
    • Reinforcement learning with hierarchies of machines
    • Cambridge, MA: MIT Press
    • Parr, R., & Russell, S. (1998). Reinforcement learning with hierarchies of machines. In Advances in Neural Information Processing Systems, Vol. 10, pp. 1043-1049. Cambridge, MA: MIT Press.
    • (1998) Advances in Neural Information Processing Systems, vol. 10, pp. 1043-1049
    • Parr, R.; Russell, S.
  • 11
    • 0001027894
    • Transfer of learning by composing solutions of elemental sequential tasks
    • Singh, S. P. (1992). Transfer of learning by composing solutions of elemental sequential tasks. Machine Learning, 8, 323.
    • (1992) Machine Learning, vol. 8, pp. 323
    • Singh, S.P.
  • 14
    • 0029276036
    • Temporal difference learning and TD-Gammon
    • Tesauro, G. (1995). Temporal difference learning and TD-Gammon. Communications of the ACM, 38 (3), 58-68.
    • (1995) Communications of the ACM, vol. 38, issue 3, pp. 58-68
    • Tesauro, G.


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.