메뉴 건너뛰기




Volumn 5863 LNCS, Issue PART 1, 2009, Pages 530-537

A meta-learning method based on temporal difference error

Author keywords

Inverted pendulum control problem; Maze search problem; Meta learning; Meta parameter; Reinforcement learning; TD error

Indexed keywords

DISCOUNT RATES; EXTERNAL ENVIRONMENTS; INVERTED PENDULUM; LEARNING RATES; META-PARAMETER; METALEARNING; NEUROMODULATORS; SEARCH PROBLEM; TEMPORAL DIFFERENCE ERRORS; TEMPORAL DIFFERENCES;

EID: 76649092973     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-642-10677-4_60     Document Type: Conference Paper
Times cited : (12)

References (6)
  • 2
    • 0030896968 scopus 로고    scopus 로고
    • A Neural Substrate of Prediction and Reward
    • Schultz, W., Dayan, P., Montague, P.R.: A Neural Substrate of Prediction and Reward. Science 275, 1593-1599 (1997)
    • (1997) Science , vol.275 , pp. 1593-1599
    • Schultz, W.1    Dayan, P.2    Montague, P.R.3
  • 3
    • 0036592023 scopus 로고    scopus 로고
    • Metalearning and Neuromodulation
    • Doya, K.: Metalearning and Neuromodulation. Neural Networks 15, 495-506 (2002)
    • (2002) Neural Networks , vol.15 , pp. 495-506
    • Doya, K.1
  • 4
    • 0037258402 scopus 로고    scopus 로고
    • Meta-learning in Reinforcement Learning
    • Schweighofer, N., Doya, K.: Meta-learning in Reinforcement Learning. Neural Networks 16(1), 5-9 (2003)
    • (2003) Neural Networks , vol.16 , Issue.1 , pp. 5-9
    • Schweighofer, N.1    Doya, K.2
  • 6
    • 0036592028 scopus 로고    scopus 로고
    • Control of Exploitation- Exploration Meta-parameter in Reinforcement Learning
    • Ishii, S., Yoshida, W., Yoshimoto, J.: Control of Exploitation- Exploration Meta-parameter in Reinforcement Learning. Neural Networks 15(4-6), 665-687 (2002)
    • (2002) Neural Networks , vol.15 , Issue.4-6 , pp. 665-687
    • Ishii, S.1    Yoshida, W.2    Yoshimoto, J.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.