SCOPUS Information Retrieval Platform

Volume 2415 LNCS, 2002, Pages 813-818

Speeding-up reinforcement learning with multi-step actions

Author keywords

[No Author keywords available]

Indexed keywords

LEARNING ALGORITHMS; MACHINE LEARNING; NEURAL NETWORKS; TIME MEASUREMENT;

BENCH-MARK PROBLEMS; DIFFERENT TIME SCALE; LEARNING SPEED; MULTI-STEP; SPECIAL STRUCTURE; SUBTASKS; TEMPORAL ABSTRACTION; TIME-SCALES; HEIDELBERG;

REINFORCEMENT LEARNING;

EID: 35248820427 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/3-540-46084-5_132 Document Type: Conference Paper

Times cited: 14

References (6)

1
- 0002278788
- Hierarchical reinforcement learning with the MAXQ value function decomposition
- T. G. Dietterich. Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research, 13:227-303, 2000.
- (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 227-303
- Dietterich, T.G.

2
- 0002654557
- Roles of macro-actions in accelerating reinforcement learning
- A. McGovern, R.S. Sutton, and A.H. Fagg. Roles of macro-actions in accelerating reinforcement learning. In Grace Hopper Celebration of Women in Computing, 1997.
- (1997) Grace Hopper Celebration of Women in Computing
- McGovern, A.; Sutton, R.S.; Fagg, A.H.

4
- 0003989214
- Hierarchical Control and Learning for Markov Decision Processes
- R. E. Parr. Hierarchical Control and Learning for Markov Decision Processes. PhD thesis, University of California, Berkeley, CA, 1998.
- (1998) PhD thesis, University of California, Berkeley, CA
- Parr, R.E.

6
- 0033170372
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- R. S. Sutton, D. Precup, and S. Singh. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112:181-211, 1999.
- (1999) Artificial Intelligence , vol.112 , pp. 181-211
- Sutton, R.S.; Precup, D.; Singh, S.

* This information was extracted and analyzed by KISTI from Elsevier's SCOPUS database.