SCOPUS 정보 검색 플랫폼

Volumn , Issue , 2005, Pages 69-74

Reinforcement learning acceleration through autonomous subgoal discovery

Author keywords

[No Author keywords available]

Indexed keywords

ACTION SPACES; HIERARCHICAL STATE; LEARNING TIME; REINFORCEMENT LEARNING AGENTS; STATE SPACES; SUBGOALS; TIME SPENT;

REINFORCEMENT; REINFORCEMENT LEARNING; ROBOT LEARNING;

EDUCATION;

EID: 60749083025 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (1)

References (10)

2
- 84942867726
- An overview of maxq hierarchical reinforcement learning
- T. G. Dietterich, "An overview of maxq hierarchical reinforcement learning," Lecture Notes in Computer Science, vol. 1864, 2000.
- (2000) Lecture Notes in Computer Science , vol.1864
- Dietterich, T.G.¹

3
- 0007907759
- Emergent hierarchical control structures: Learning reactive / hierarchical relationships in reinforcement environments
- B. Digney, "Emergent hierarchical control structures: Learning reactive / hierarchical relationships in reinforcement environments," in Proceedings of the Fourth Conference on the Simulation of Adaptive Behavior, 1996.
- (1996) Proceedings of the Fourth Conference on the Simulation of Adaptive Behavior
- Digney, B.¹

4
- 10044266681
- Proceedings of the 17th International FLAIRS Conference. AAAI
- M. Asadi and M. Huber, "State space reduction for hierarchical reinforcement learning," in In Proceedings of the 17th International FLAIRS Conference. AAAI, 2004, pp. 509-514.
- (2004) State space reduction for hierarchical reinforcement learning , pp. 509-514
- Asadi, M.¹ Huber, M.²

5
- 29344435556
- Proceedings of the 16th International FLAIRS Conference. AAAI
- S. Goel and M. Huber, "Subgoal discovery for hierarchical reinforcement learning using learned policies," in In Proceedings of the 16th International FLAIRS Conference. AAAI, 2003, pp. 346-350.
- (2003) Subgoal discovery for hierarchical reinforcement learning using learned policies , pp. 346-350
- Goel, S.¹ Huber, M.²

6
- 0034272032
- Bounded-parameter markov decision processes
- R. Givan, S. Leach, and T. Dean, "Bounded-parameter markov decision processes," Artificial Intelligence, vol. 122, no. 1-2, pp. 71-109, 2000.
- (2000) Artificial Intelligence , vol.122 , Issue.1-2 , pp. 71-109
- Givan, R.¹ Leach, S.² Dean, T.³

7
- 0346942368
- Decision-theoretic planning: Structural assumptions and computational leverage
- C. Boutilier, T. Dean, and S. Hanks, "Decision-theoretic planning: Structural assumptions and computational leverage," Journal of Artificial Intelligence Research, vol. 11, pp. 1-94, 1999.
- (1999) Journal of Artificial Intelligence Research , vol.11 , pp. 1-94
- Boutilier, C.¹ Dean, T.² Hanks, S.³

8
- 0033170372
- Between MDPs and Semi-MDPs: Learning, planning, and representing knowledge at multiple temporal scales
- R. Sutton, D. Precup, and S. Singh, "Between MDPs and Semi-MDPs: Learning, planning, and representing knowledge at multiple temporal scales," Artificial Intelligence, vol. 112, pp. 181-211, 1999.
- (1999) Artificial Intelligence , vol.112 , pp. 181-211
- Sutton, R.¹ Precup, D.² Singh, S.³

9
- 0038178323
- Solving factored MDPs using non-homogeneous partitions
- K. Kim and T. Dean, "Solving factored MDPs using non-homogeneous partitions," Artificial Intelligence, vol. 147, pp. 225-251, 2003.
- (2003) Artificial Intelligence , vol.147 , pp. 225-251
- Kim, K.¹ Dean, T.²

10
- 0038517214
- Equivalence notions and model minimization in markov decision processes
- T. Dean, R. Givan, and M. Greig, "Equivalence notions and model minimization in markov decision processes," in Special issue on planning with uncertainty and incomplete information, 2003, pp. 163-223.
- (2003) Special issue on planning with uncertainty and incomplete information , pp. 163-223
- Dean, T.¹ Givan, R.² Greig, M.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.