



Volume 2, 2010, Pages 747-754

Basis function construction for hierarchical reinforcement learning

Author keywords

Hierarchical Reinforcement Learning; Representation Discovery; Semi Markov Decision Processes

Indexed keywords

AUTONOMOUS AGENTS; FUNCTIONS; MARKOV PROCESSES; MULTI AGENT SYSTEMS; SPECTRUM ANALYSIS; TELECOMMUNICATION NETWORKS;

EID: 84899441331     PISSN: 15488403     EISSN: 15582914     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (9)

References (14)
  • 2
    • 84899421594
    • Number 92 in CBMS Regional Conference Series in Mathematics. American Mathematical Society
    • F. Chung. Spectral Graph Theory. Number 92 in CBMS Regional Conference Series in Mathematics. American Mathematical Society, 1997.
    • (1997) Spectral Graph Theory
    • Chung, F.1
  • 3
    • 17444366585
    • Laplacians and the Cheeger inequality for directed graphs
    • F. Chung. Laplacians and the Cheeger inequality for directed graphs. Annals of Combinatorics, 9:1-19, 2005.
    • (2005) Annals of Combinatorics , vol.9 , pp. 1-19
    • Chung, F.1
  • 5
    • 0002278788
    • Hierarchical reinforcement learning with the MAXQ value function decomposition
    • T. Dietterich. Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research, 13:277-303, 2000.
    • (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 277-303
    • Dietterich, T.1
  • 11
  • 13
    • 0033170372
    • Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • R. Sutton, D. Precup, and S. Singh. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112:181-211, 1999.
    • (1999) Artificial Intelligence , vol.112 , pp. 181-211
    • Sutton, R.1    Precup, D.2    Singh, S.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.