SCOPUS 정보 검색 플랫폼

Neural Networks

Volumn 16, Issue 1, 2003, Pages 5-9

Meta-learning in reinforcement learning

(2) Schweighofer, Nicolas a Doya, Kenji a,b

a JAPAN SCIENCE AND TECHNOLOGY AGENCY (Japan)

b ADVANCED TELECOMMUNICATIONS RESEARCH INSTITUTE INTERNATIONAL (Japan)

Author keywords

Dopamine; Dynamic environment; Meta learning; Meta parameters; Neuromodulation; Reinforcement learning; TD error

Indexed keywords

ADAPTIVE SYSTEMS; ALGORITHMS; MARKOV PROCESSES; ROBUSTNESS (CONTROL SYSTEMS); SIGNAL ENCODING;

REINFORCEMENT LEARNING;

LEARNING SYSTEMS;

DOPAMINE;

ALGORITHM; ARTICLE; DECISION MAKING; DOPAMINERGIC ACTIVITY; DOPAMINERGIC NERVE CELL; ENVIRONMENTAL FACTOR; LEARNING; MENTAL TASK; NEUROMODULATION; NONLINEAR SYSTEM; PRIORITY JOURNAL; PROBABILITY; REINFORCEMENT; SIMULATION; TASK PERFORMANCE; TIME;

ALGORITHMS; ARTIFICIAL INTELLIGENCE; DECISION THEORY; MARKOV CHAINS; REINFORCEMENT (PSYCHOLOGY); STOCHASTIC PROCESSES;

EID: 0037258402 PISSN: 08936080 EISSN: None Source Type: Journal
DOI: 10.1016/S0893-6080(02)00228-9 Document Type: Article

Times cited : (216)

References (17)

1
- 0019855733
- Activity of norepinephrine-containing locus coeruleus neurons in behaving rats anticipates fluctuations in the sleep-waking cycle
- Aston-Jones G., Bloom F.E. Activity of norepinephrine-containing locus coeruleus neurons in behaving rats anticipates fluctuations in the sleep-waking cycle. Journal of Neuroscience. 1:(8):1981;876-886.
- (1981) Journal of Neuroscience , vol.1 , Issue.8 , pp. 876-886
- Aston-Jones, G.¹ Bloom, F.E.²

2
- 0036592008
- Opponent interactions between serotonin and dopamine
- Daw N.D., Kakade S., Dayan P. Opponent interactions between serotonin and dopamine. Neural Networks. 15:2002;603-616.
- (2002) Neural Networks , vol.15 , pp. 603-616
- Daw, N.D.¹ Kakade, S.² Dayan, P.³

3
- 0027299420
- Dopaminergic regulation of cortical acetylcholine release: Effects of dopamine receptor agonists
- Day J., Fibiger H.C. Dopaminergic regulation of cortical acetylcholine release: effects of dopamine receptor agonists. Neuroscience. 54:(3):1993;643-648.
- (1993) Neuroscience , vol.54 , Issue.3 , pp. 643-648
- Day, J.¹ Fibiger, H.C.²

4
- 0033629916
- Reinforcement learning in continuous time and space
- Doya K. Reinforcement learning in continuous time and space. Neural Computations. 12:(1):2000;219-245.
- (2000) Neural Computations , vol.12 , Issue.1 , pp. 219-245
- Doya, K.¹

5
- 0036592023
- Metalearning and neuromodulation
- Doya K. Metalearning and neuromodulation. Neural Networks. 15:2002;495-506.
- (2002) Neural Networks , vol.15 , pp. 495-506
- Doya, K.¹

6
- 0025600638
- A stochastic reinforcement learning algorithm for learning real-valued functions
- Gullapalli V. A stochastic reinforcement learning algorithm for learning real-valued functions. Neural Networks. 3:1990;671-692.
- (1990) Neural Networks , vol.3 , pp. 671-692
- Gullapalli, V.¹

7
- 0034742514
- D2-like dopamine receptor activation excites rat dorsal raphe 5-HT neurons in vitro
- Haj-Dahmane S. D2-like dopamine receptor activation excites rat dorsal raphe 5-HT neurons in vitro. European Journal of Neuroscience. 14:(1):2001;125-134.
- (2001) European Journal of Neuroscience , vol.14 , Issue.1 , pp. 125-134
- Haj-Dahmane, S.¹

8
- 0036592028
- Control of exploitation-exploration meta-parameters in reinforcement learning
- Ishii S., Yoshida W., Yoshimoto J. Control of exploitation-exploration meta-parameters in reinforcement learning. Neural Networks. 15:2002;665-687.
- (2002) Neural Networks , vol.15 , pp. 665-687
- Ishii, S.¹ Yoshida, W.² Yoshimoto, J.³

9
- 0027250812
- 5-HT and motor control: A hypothesis
- Jacobs B.L., Fornal C.A. 5-HT and motor control: a hypothesis. Trends in Neuroscience. 16:(9):1993;346-352.
- (1993) Trends in Neuroscience , vol.16 , Issue.9 , pp. 346-352
- Jacobs, B.L.¹ Fornal, C.A.²

10
- 0002290970
- On the complexity of solving Markov decision problems
- Littman, M. L., Dean, T. L., et al (1995). On the complexity of solving Markov decision problems. Eleventh International Conference on Uncertainty in Artificial Intelligence.
- (1995) Eleventh International Conference on Uncertainty in Artificial Intelligence
- Littman, M.L.¹ Dean, T.L.²

11
- 0026019121
- Monoaminergic interaction in the central nervous system: A morphological analysis in the locus coeruleus of the rat
- Maeda T., Kojima Y., Arai R., Fujimiya M., Kimura H., Kitahama A., Geffard M. Monoaminergic interaction in the central nervous system: a morphological analysis in the locus coeruleus of the rat. Compartative Biochemistry and Physiology C. 98:(1):1991;193-202.
- (1991) Compartative Biochemistry and Physiology C , vol.98 , Issue.1 , pp. 193-202
- Maeda, T.¹ Kojima, Y.² Arai, R.³ Fujimiya, M.⁴ Kimura, H.⁵ Kitahama, A.⁶ Geffard, M.⁷

12
- 0031867046
- Predictive reward signal of dopamine neurons
- Schultz W. Predictive reward signal of dopamine neurons. Journal of Neurophysiology. 80:(1):1998;1-27.
- (1998) Journal of Neurophysiology , vol.80 , Issue.1 , pp. 1-27
- Schultz, W.¹

13
- 0031939094
- A model of cerebellar metaplasticity
- Schweighofer N., Arbib M.A. A model of cerebellar metaplasticity. Learning Memory. (4):1998;421-428.
- (1998) Learning Memory , Issue.4 , pp. 421-428
- Schweighofer, N.¹ Arbib, M.A.²

14
- 0016045280
- An opponent process theory of motivation. I. Temporal dynamics of affect
- Solomon R.L., Corbit J.D. An opponent process theory of motivation. I. Temporal dynamics of affect. Psychological Review. 81:1974;119-145.
- (1974) Psychological Review , vol.81 , pp. 119-145
- Solomon, R.L.¹ Corbit, J.D.²

15
- 0026971570
- Adapting bias by gradient descent: An incremental version of the delta-bar-delta
- Cambridge, MA: MIT Press
- Sutton, R (1992). Adapting bias by gradient descent: an incremental version of the delta-bar-delta. Tenth National Conference on Artificial Intelligence. Cambridge, MA: MIT Press.
- (1992) Tenth National Conference on Artificial Intelligence
- Sutton, R.¹

16
- 0012919807
- Functional MRI study of short-term and long-term prediction of reward
- Tanaka S., Doya K., Okada G., Ueda K., Okamoto Y., Yamawaki S. Functional MRI study of short-term and long-term prediction of reward. Proceedings of the Eighth International Conference on Functional Mapping of the Human Brain, Sendai, Japan. 2002;1062.
- (2002) Proceedings of the Eighth International Conference on Functional Mapping of the Human Brain, Sendai, Japan , pp. 1062
- Tanaka, S.¹ Doya, K.² Okada, G.³ Ueda, K.⁴ Okamoto, Y.⁵ Yamawaki, S.⁶

17
- 0003544743
- D.A. White, & D.A. Dofge. Florence, Kentucky: Van Nostrand
- Thrun S.B. White D.A., Dofge D.A. The role of exploration in learning control. Handbook of intelligent control: Neural, fuzzy, and adaptive approaches. 1992;Van Nostrand, Florence, Kentucky.
- (1992) The role of exploration in learning control. Handbook of intelligent control: Neural, fuzzy, and adaptive approaches
- Thrun, S.B.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.