메뉴 건너뛰기




Volumn 2837, Issue , 2003, Pages 96-107

Iteratively extending time horizon reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; APPROXIMATION THEORY; CLOSED LOOP CONTROL SYSTEMS; CONVERGENCE OF NUMERICAL METHODS; FUNCTIONS; ITERATIVE METHODS; OPTIMAL CONTROL SYSTEMS; PROBLEM SOLVING; RANDOM PROCESSES; STANDARDS; TIME DOMAIN ANALYSIS; VECTORS; APPROXIMATION ALGORITHMS; ARTIFICIAL INTELLIGENCE; LEARNING SYSTEMS; REINFORCEMENT LEARNING; STOCHASTIC CONTROL SYSTEMS; STOCHASTIC SYSTEMS; SUPERVISED LEARNING;

EID: 9444250519     PISSN: 03029743     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1007/978-3-540-39857-8_11     Document Type: Conference Paper
Times cited : (19)

References (11)
  • 2
    • 0030211964 scopus 로고    scopus 로고
    • Bagging predictors
    • L. Breiman. Bagging predictors. Machine Learning, 24(2): 123-140, 1996.
    • (1996) Machine Learning , vol.24 , Issue.2 , pp. 123-140
    • Breiman, L.1
  • 3
    • 0035478854 scopus 로고    scopus 로고
    • Random forests
    • L. Breiman. Random forests. Machine Learning, 45:5-32, 2001.
    • (2001) Machine Learning , vol.45 , pp. 5-32
    • Breiman, L.1
  • 7
    • 9444276220 scopus 로고    scopus 로고
    • Extremely randomized trees
    • University of Liège
    • P. Geurts. Extremely randomized trees. Technical report, University of Liège, 2003.
    • (2003) Technical Report
    • Geurts, P.1
  • 8
    • 0027684215 scopus 로고
    • Prioritized sweeping: Reinforcement learning with less data and less real time
    • A. Moore and C. Atkeson. Prioritized Sweeping: Reinforcement Learning with Less Data and Less Real Time. Machine Learning, 13:103-130, 1993.
    • (1993) Machine Learning , vol.13 , pp. 103-130
    • Moore, A.1    Atkeson, C.2
  • 9
    • 8744262572 scopus 로고    scopus 로고
    • Supervised learning combined with an actor-critic architecture
    • University of Massachusetts, Department of Computer Science
    • M. T. Rosenstein and A. G. Barto. Supervised learning combined with an actor-critic architecture. Technical report, University of Massachusetts, Department of Computer Science, 2002.
    • (2002) Technical Report
    • Rosenstein, M.T.1    Barto, A.G.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.