메뉴 건너뛰기




Volumn , Issue , 2000, Pages 994-1000

State abstraction in MAXQ hierarchical reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

REINFORCEMENT LEARNING;

EID: 0003506152     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (50)

References (12)
  • 1
    • 84899003140 scopus 로고    scopus 로고
    • Multi-time models for temporally abstract planning
    • The MIT Press
    • D. Precup and R. S. Sutton, "Multi-time models for temporally abstract planning," in NIPS 10, The MIT Press, 1998.
    • (1998) NIPS 10
    • Precup, D.1    Sutton, R.S.2
  • 2
    • 0003899594 scopus 로고    scopus 로고
    • Between mdps and semi-mdps: Learning, planning, and representing knowledge at multiple temporal scales
    • Amherst, MA
    • R. S. Sutton, D. Precup, and S. Singh, "Between MDPs and Semi-MDPs: Learning, planning, and representing knowledge at multiple temporal scales," tech. rep., Univ. Mass., Dept. Comp. Inf. Sci., Amherst, MA, 1998.
    • (1998) Tech. Rep., Univ. Mass., Dept. Comp. Inf. Sci
    • Sutton, R.S.1    Precup, D.2    Singh, S.3
  • 3
    • 84898956770 scopus 로고    scopus 로고
    • Reinforcement learning with hierarchies of machines
    • The MIT Press
    • R. Parr and S. Russell, "Reinforcement learning with hierarchies of machines," in NIPS-10, The MIT Press, 1998.
    • (1998) NIPS-10
    • Parr, R.1    Russell, S.2
  • 4
    • 0001027894 scopus 로고
    • Transfer of learning by composing solutions of elemental sequential tasks
    • S. P. Singh, "Transfer of learning by composing solutions of elemental sequential tasks," Machine Learning, vol. 8, p. 323, 1992.
    • (1992) Machine Learning , vol.8 , pp. 323
    • Singh, S.P.1
  • 5
    • 0000908087 scopus 로고
    • Hierarchical reinforcement learning: Preliminary results
    • Morgan Kaufmann
    • L. P. Kaelbling, "Hierarchical reinforcement learning: Preliminary results," in Proceedings ICML-10, pp. 167-173, Morgan Kaufmann, 1993.
    • (1993) Proceedings ICML-10 , pp. 167-173
    • Kaelbling, L.P.1
  • 7
    • 0001234682 scopus 로고
    • Feudal reinforcement learning
    • San Francisco, CA: Morgan Kaufmann
    • P. Dayan and G. Hinton, "Feudal reinforcement learning," in NIPS-5, pp. 271-278, San Francisco, CA: Morgan Kaufmann, 1993.
    • (1993) NIPS-5 , pp. 271-278
    • Dayan, P.1    Hinton, G.2
  • 8
    • 0001806701 scopus 로고    scopus 로고
    • The MAXQ method for hierarchical reinforcement learning
    • Morgan Kaufmann
    • T. G. Dietterich, "The MAXQ method for hierarchical reinforcement learning," in ICML-15, Morgan Kaufmann, 1998.
    • (1998) ICML-15
    • Dietterich, T.G.1
  • 11
    • 0000439891 scopus 로고
    • On the convergence of stochastic iterative dynamic programming algorithms
    • T. Jaakkola, M. I. Jordan, and S. P. Singh, "On the convergence of stochastic iterative dynamic programming algorithms," Neur. Comp., vol. 6, no. 6, pp. 1185-1201, 1994.
    • (1994) Neur. Comp. , vol.6 , Issue.6 , pp. 1185-1201
    • Jaakkola, T.1    Jordan, M.I.2    Singh, S.P.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.