메뉴 건너뛰기




Volumn 148, Issue , 2006, Pages 833-840

An intrinsic reward mechanism for efficient exploration

Author keywords

[No Author keywords available]

Indexed keywords

EFFICIENT EXPLORATION; INTRINSIC REWARD MECHANISM; MARKOV DECISION PROCESS; OPTIMAL POLICY;

EID: 34250703734     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1143844.1143949     Document Type: Conference Paper
Times cited : (71)

References (19)
  • 10
    • 0027684215 scopus 로고
    • Prioritized sweeping: Reinforcement learning with less data and less real time
    • Moore, A., & Atkeson, C. G. (1993). Prioritized sweeping: Reinforcement learning with less data and less real time. Machine Learning, 13, 103-130.
    • (1993) Machine Learning , vol.13 , pp. 103-130
    • Moore, A.1    Atkeson, C.G.2
  • 11
    • 84977063352 scopus 로고
    • Efficient learning and planning within the dyna framework
    • Peng, J., & Williams, R. J. (1993). Efficient learning and planning within the dyna framework. Adaptive Behavior, 2, 437-454.
    • (1993) Adaptive Behavior , vol.2 , pp. 437-454
    • Peng, J.1    Williams, R.J.2
  • 13
    • 34250764144 scopus 로고    scopus 로고
    • Schmidhuber, J., & Storck, J. (1993). Reinforcement driven information acquisition in nondeterministic environments. Technical report, Fakultat fur Informatik, Technische Universit at Munchen.
    • Schmidhuber, J., & Storck, J. (1993). Reinforcement driven information acquisition in nondeterministic environments. Technical report, Fakultat fur Informatik, Technische Universit at Munchen.
  • 18
    • 0033170372 scopus 로고    scopus 로고
    • Between MDPs and Semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • Sutton, R. S., Precup, D., & Singh, S. P. (1999). Between MDPs and Semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112, 181-211.
    • (1999) Artificial Intelligence , vol.112 , pp. 181-211
    • Sutton, R.S.1    Precup, D.2    Singh, S.P.3
  • 19
    • 0003411271 scopus 로고
    • Efficient exploration in reinforcement learning
    • CMU-CS-92-102, Carnegie-Mellon University
    • Thrun, S. (1992). Efficient exploration in reinforcement learning (Technical Report CMU-CS-92-102). Carnegie-Mellon University.
    • (1992) Technical Report
    • Thrun, S.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.