메뉴 건너뛰기




Volumn , Issue , 2006, Pages 332-340

A compact, hierarchically optimal Q-function decomposition

Author keywords

[No Author keywords available]

Indexed keywords

COMPLEX ENVIRONMENTS; CONCISE REPRESENTATIONS; HIERARCHICAL REINFORCEMENT LEARNING; NONLOCAL; Q-FUNCTIONS; RUNTIME ARCHITECTURE; STATE ABSTRACTION; STATE DISTRIBUTIONS; STRUCTURAL CONDITION; VALUE FUNCTIONS;

EID: 80053178447     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (6)

References (10)
  • 1
    • 0012312949 scopus 로고    scopus 로고
    • State abstraction for programmable reinforcement learning agents
    • D. Andre and S. Russell. State abstraction for programmable reinforcement learning agents. In AAAI, 2002.
    • (2002) AAAI
    • Andre, D.1    Russell, S.2
  • 3
    • 0346942368 scopus 로고    scopus 로고
    • Decision-theoretic planning: Structural assumptions and computational leverage
    • C. Boutilier, T. Dean, and S. Hanks. Decision theoretic planning: structural assumptions and computational leverage. Journal of Artificial Intelligence Research, 11:1-94, 1999. (Pubitemid 129628760)
    • (1999) Journal of Artificial Intelligence Research , vol.11 , pp. 1-94
    • Boutilier, C.1    Dean, T.2    Hanks, S.3
  • 4
    • 85168151397 scopus 로고
    • Decomposition techniques for planning in stochastic domains
    • T. Dean and S.-H. Lin. Decomposition techniques for planning in stochastic domains. In IJCAI, 1995.
    • (1995) IJCAI
    • Dean, T.1    Lin, S.-H.2
  • 5
    • 0002278788 scopus 로고    scopus 로고
    • Hierarchical reinforcement learning with the MAXQ value function decomposition
    • T. Dietterich. Hierarchical reinforcement learning with the maxq value function decomposition. JAIR, 13:227-303, 2000. (Pubitemid 33682087)
    • (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 227-303
    • Dietterich, T.G.1
  • 6
  • 7
    • 33746100681 scopus 로고    scopus 로고
    • Stochastic over-subscription planning using hierarchies of MDPs
    • N. Meuleau, R. Brafman, and E. Benaz-era. Stochastic over-subscription planning using hierarchies of MDPs. In ICAPS, 2006.
    • (2006) ICAPS
    • Meuleau, N.1    Brafman, R.2    Benazera, E.3
  • 8
    • 0001070375 scopus 로고    scopus 로고
    • Reinforcement learning with hierarchies of machines
    • R. Parr and S. Russell. Reinforcement learning with hierarchies of machines. In NIPS, 1997.
    • (1997) NIPS
    • Parr, R.1    Russell, S.2
  • 9
    • 0346738900 scopus 로고    scopus 로고
    • Flexible decomposition algorithms for weakly coupled Markov decision processes
    • R. Parr. Flexible decomposition algorithms for weakly coupled Markov decision processes. In UAI, 1998.
    • (1998) UAI
    • Parr, R.1
  • 10
    • 84899003140 scopus 로고    scopus 로고
    • Multi-time models for temporally abstract planning
    • D. Precup and R. Sutton. Multi-time models for temporally abstract planning. In NIPS, 1998.
    • (1998) NIPS
    • Precup, D.1    Sutton, R.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.