SCOPUS Information Search Platform

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volume 5863 LNCS, Issue PART 1, 2009, Pages 530-537

A meta-learning method based on temporal difference error

Authors (4): Kobayashi, Kunikazu (a); Mizoue, Hiroyuki (a); Kuremoto, Takashi (a); Obayashi, Masanao (a)

(a) Yamaguchi University (Japan)

Author keywords

Inverted pendulum control problem; Maze search problem; Meta learning; Meta parameter; Reinforcement learning; TD error

Indexed keywords

DISCOUNT RATES; EXTERNAL ENVIRONMENTS; INVERTED PENDULUM; LEARNING RATES; META-PARAMETER; METALEARNING; NEUROMODULATORS; SEARCH PROBLEM; TEMPORAL DIFFERENCE ERRORS; TEMPORAL DIFFERENCES;

COMPUTER SIMULATION; DATA PROCESSING; LEARNING ALGORITHMS; REINFORCEMENT; REINFORCEMENT LEARNING;

PENDULUMS;

EID: 76649092973
PISSN: 03029743
EISSN: 16113349
Source Type: Book Series
DOI: 10.1007/978-3-642-10677-4_60
Document Type: Conference Paper

Times cited: 12

References (6)

1
- 0004102479
- Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (1998)

2
- 0030896968
- Schultz, W., Dayan, P., Montague, P.R.: A Neural Substrate of Prediction and Reward. Science 275, 1593-1599 (1997)

3
- 0036592023
- Doya, K.: Metalearning and Neuromodulation. Neural Networks 15, 495-506 (2002)

4
- 0037258402
- Schweighofer, N., Doya, K.: Meta-learning in Reinforcement Learning. Neural Networks 16(1), 5-9 (2003)

5
- 34249833101
- Watkins, C.J.C.H., Dayan, P.: Q-learning. Machine Learning 8(3-4), 279-292 (1992)

6
- 0036592028
- Ishii, S., Yoshida, W., Yoshimoto, J.: Control of Exploitation-Exploration Meta-parameter in Reinforcement Learning. Neural Networks 15(4-6), 665-687 (2002)

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.