SCOPUS Information Search Platform

Proceedings of the 22nd Conference on Uncertainty in Artificial Intelligence, UAI 2006

2006, Pages 332-340

A compact, hierarchically optimal Q-function decomposition

Authors (3): Marthi, Bhaskara (a); Russell, Stuart (a); Andre, David (b)

(a) University of California (United States)

(b) BodyMedia Inc (United States)

Author keywords

[No Author keywords available]

Indexed keywords

COMPLEX ENVIRONMENTS; CONCISE REPRESENTATIONS; HIERARCHICAL REINFORCEMENT LEARNING; NONLOCAL; Q-FUNCTIONS; RUNTIME ARCHITECTURE; STATE ABSTRACTION; STATE DISTRIBUTIONS; STRUCTURAL CONDITION; VALUE FUNCTIONS;

REINFORCEMENT LEARNING;

ARTIFICIAL INTELLIGENCE;

EID: 80053178447
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: None
Document Type: Conference Paper

Times cited: 6

References (10)

1
- 0012312949
- State abstraction for programmable reinforcement learning agents
- D. Andre and S. Russell. State abstraction for programmable reinforcement learning agents. In AAAI, 2002.
- (2002) AAAI
- Andre, D.¹ Russell, S.²

2
- 32144459726
- Programmable Reinforcement Learning Agents
- D. Andre. Programmable Reinforcement Learning Agents. PhD thesis, UC Berkeley, 2003.
- (2003) Programmable Reinforcement Learning Agents
- Andre, D.¹

3
- 0346942368
- Decision-theoretic planning: Structural assumptions and computational leverage
- C. Boutilier, T. Dean, and S. Hanks. Decision-theoretic planning: Structural assumptions and computational leverage. Journal of Artificial Intelligence Research, 11:1-94, 1999. (Pubitemid 129628760)
- (1999) Journal of Artificial Intelligence Research , vol.11 , pp. 1-94
- Boutilier, C.¹ Dean, T.² Hanks, S.³

4
- 85168151397
- Decomposition techniques for planning in stochastic domains
- T. Dean and S.-H. Lin. Decomposition techniques for planning in stochastic domains. In IJCAI, 1995.
- (1995) IJCAI
- Dean, T.¹ Lin, S.-H.²

5
- 0002278788
- Hierarchical reinforcement learning with the MAXQ value function decomposition
- T. Dietterich. Hierarchical reinforcement learning with the MAXQ value function decomposition. JAIR, 13:227-303, 2000. (Pubitemid 33682087)
- (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 227-303
- Dietterich, T.G.¹

6
- 0006419533
- Hierarchical solution of Markov Decision processes using macro-actions
- M. Hauskrecht, N. Meuleau, C. Boutilier, L. Kaelbling, and T. Dean. Hierarchical solution of Markov decision processes using macro-actions. In UAI, 1998.
- (1998) UAI
- Hauskrecht, M.¹ Meuleau, N.² Boutilier, C.³ Kaelbling, L.⁴ Dean, T.⁵

7
- 33746100681
- Stochastic over-subscription planning using hierarchies of MDPs
- N. Meuleau, R. Brafman, and E. Benazera. Stochastic over-subscription planning using hierarchies of MDPs. In ICAPS, 2006.
- (2006) ICAPS
- Meuleau, N.¹ Brafman, R.² Benazera, E.³

8
- 0001070375
- Reinforcement learning with hierarchies of machines
- R. Parr and S. Russell. Reinforcement learning with hierarchies of machines. In NIPS, 1997.
- (1997) NIPS
- Parr, R.¹ Russell, S.²

9
- 0346738900
- Flexible decomposition algorithms for weakly coupled Markov decision processes
- R. Parr. Flexible decomposition algorithms for weakly coupled Markov decision processes. In UAI, 1998.
- (1998) UAI
- Parr, R.¹

10
- 84899003140
- Multi-time models for temporally abstract planning
- D. Precup and R. Sutton. Multi-time models for temporally abstract planning. In NIPS, 1998.
- (1998) NIPS
- Precup, D.¹ Sutton, R.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.