SCOPUS 정보 검색 플랫폼

Volumn 15, Issue 4-6, 2002, Pages 665-687

Control of exploitation-exploration meta-parameter in reinforcement learning

(3) Ishii, Shin a,b Yoshida, Wako a,b Yoshimoto, Junichiro a,b

a NARA INSTITUTE OF SCIENCE AND TECHNOLOGY (Japan)

b JAPAN SCIENCE AND TECHNOLOGY AGENCY (Japan)

Author keywords

Attention; Exploitation exploration problem; Neuromodulator; Partially observable Markov decision process; Reinforcement learning

Indexed keywords

BRAIN; NEUROLOGY; SENSORY PERCEPTION;

NEURONS;

NEURAL NETWORKS;

ALGORITHM; COMPUTER SIMULATION; DECISION MAKING; EXPLORATORY BEHAVIOR; INFORMATION PROCESSING; LEARNING; LOCUS CERULEUS; MAZE TEST; NEUROTRANSMISSION; NORADRENERGIC NERVE; PERCEPTION; PRIORITY JOURNAL; REINFORCEMENT; REVIEW; SELECTIVE ATTENTION; TECHNIQUE; ANIMAL; BIOLOGICAL MODEL; BRAIN; HUMAN; PHYSIOLOGY;

ANIMALS; BRAIN; EXPLORATORY BEHAVIOR; HUMANS; LEARNING; MODELS, BIOLOGICAL; REINFORCEMENT (PSYCHOLOGY);

EID: 0036592028 PISSN: 08936080 EISSN: None Source Type: Journal
DOI: 10.1016/S0893-6080(02)00056-4 Document Type: Article

Times cited : (194)

References (65)

1
- 0025788205
- Discharge of noradrenergic locus coeruleus neurons in behaving rats and monkeys suggests a role in vigilance
- (1991) Progress in Brain Research , vol.88 , pp. 501-520
- Aston-Jones, G.¹ Chiang, C.² Alexinsky, T.³

2
- 0028292799
- Locus coeruleus neurons in monkey are selectively activated by attended cues in a vigilance task
- (1994) The Journal of Neuroscience , vol.14 , pp. 4467-4480
- Aston-Jones, G.¹ Rajkowski, J.² Kubiak, P.³ Alexinsky, T.⁴

3
- 0024325284
- Architecture and intrinsic connections of the pre-frontal cortex in the rhesus monkey
- (1989) The Journal of Comparative Neurology , vol.286 , pp. 353-375
- Barbas, H.¹ Pandya, D.N.²

4
- 0020970738
- Neuronlike elements that can solve difficult learning control problems
- (1983) IEEE Transactions on Systems, Man, and Cybernetics , vol.13 , pp. 835-846
- Barto, A.G.¹ Sutton, R.S.² Anderson, C.W.³

5
- 0033547441
- Conflict monitoring versus selection-for-action in anterior cingulate cortex
- (1999) Nature , vol.402 , pp. 179-181
- Botvinick, M.¹ Nystrom, L.E.² Fissell, K.³ Carter, C.S.⁴ Cohen, J.D.⁵

6
- 2142715567
- Brafman, R.I., & Tennenholtz, M (2001). R-max: A general polynomial time algorithm for near-optimal reinforcement learning. Proceedings of the 17th International Joint Conference on Artificial Intelligence (pp. 953-958).

7
- 0034874639
- Anterior cingulate cortex and response conflict: Effects of frequency, inhibition and errors
- (2001) Cerebral Cortex , vol.11 , pp. 825-836
- Braver, T.S.¹ Barch, D.M.² Gray, J.R.³ Molfese, D.J.⁴ Snyder, A.⁵

8
- 0037039469
- Dorsal anterior cingulate cortex: A role in reward-based decision making
- (2002) Proceedings of the National Academy of Sciences, USA , vol.99 , pp. 507-512
- Bush, G.¹ Vogt, B.A.² Holmes, J.³ Dale, A.M.⁴ Greve, D.⁵ Jenike, M.A.⁶ Rosen, B.R.⁷

9
- 0032076255
- Anterior cingulate cortex, error detection, and the online monitoring of performance
- (1998) Science , vol.280 , pp. 747-749
- Carter, C.S.¹ Braver, T.S.² Barch, D.M.³ Botvinick, M.M.⁴ Noll, D.⁵ Cohen, J.D.⁶

10
- 0024455725
- Electrical coupling synchronizes subthreshold activity in locus coeruleus neurons in vitro from neonatal rat
- (1989) The Journal of Neuroscience , vol.9 , pp. 3584-3589
- Christie, M.J.¹ Williams, J.T.² North, R.A.³

11
- 0033941060
- Anterior cingulate and prefrontal cortex: Who's in control?
- (2000) Nature Neuroscience , vol.3 , pp. 421-423
- Cohen, J.D.¹ Botvinick, M.² Carter, C.S.³

12
- 0030260201
- Exploration bonuses and dual control
- (1996) Machine Learning , vol.25 , pp. 5-22
- Dayan, P.¹ Sejnowski, T.J.²

13
- 2142828108
- Proceedings of the 15th Conference on Uncertainty in Artificial Intelligence, San Francisco, CA: Morgan Kaufman, pp. 150-159
- (1999)
- Dearden, R.¹ Friedman, N.² Andre, D.³

14
- 0033629916
- Reinforcement learning in continuous time and space
- (2000) Neural Computation , vol.12 , pp. 219-245
- Doya, K.¹

15
- 0002337786
- Metalearning, neuromodulation, and emotion
- Hatano G., Okada N., Takabe H. (Eds.), Affective minds, Amsterdam: Elsevier
- (2000) , pp. 101-104
- Doya, K.¹

16
- 0034016783
- Dissociable functions in the medial and lateral orbitofrontal cortex: Evidence from human neuroimaging studies
- (2000) Cerebral Cortex , vol.10 , pp. 308-317
- Elliott, R.¹ Dolan, R.J.² Frith, C.D.³

17
- 2142660331
- Optimal control systems, New York, NY: Academic Press
- (1965)
- Fe'ldbaum, A.A.¹

18
- 0020555603
- Nucleus locus coeruleus: New evidence of anatomical and physiological specificity
- (1983) Physiological Reviews , vol.63 , pp. 844-914
- Foote, S.L.¹ Bloom, F.E.² Aston-Jones, G.³

19
- 84965511150
- A neural system for error detection and compensation
- (1993) Psychological Science , vol.4 , pp. 385-390
- Gehring, W.J.¹ Goss, B.² Coles, M.G.H.³ Meyer, D.E.⁴ Donchin, E.⁵

20
- 0025279571
- Amygdalonigral pathway: An anterograde study in the rat with Phaseolus vulgaris leucoagglutinin (PHA-L)
- (1990) The Journal of Comparative Neurology , vol.297 , pp. 182-200
- Gonzales, C.¹ Chesselet, M.F.²

21
- 0002370418
- A tutorial on learning with Bayesian networks
- Jordan M.I. (Ed.), Learning in graphical models, Cambridge, MA: MIT Press
- (1999) , pp. 301-354
- Heckerman, D.¹

22
- 0034077644
- Neuronal activity in the primate prefrontal cortex in the process of motor selection based on two behavioral rules
- (2000) Journal of Neurophysiology , vol.83 , pp. 2355-2373
- Hoshi, E.¹ Shima, K.² Tanji, J.³

23
- 0029783330
- Synchronous activity in locus coeruleus results from dendritic interactions in pericoerulear regions
- (1996) The Journal of Neuroscience , vol.16 , pp. 5196-5204
- Ishimaru, M.¹ Williams, J.T.²

24
- 0028283937
- Motor sequence learning: A study with positron emission tomography
- (1994) The Journal of Neuroscience , vol.14 , pp. 3775-3790
- Jenkins, I.H.¹ Brooks, D.J.² Nixon, P.D.³ Frackowiak, R.S.J.⁴ Passingham, R.E.⁵

25
- 0030910639
- Anatomy of motor learning. I. Frontal cortex and attention to action
- (1997) The Journal of Neurophysiology , vol.77 , pp. 1313-1324
- Jueptner, M.¹ Stephan, K.M.² Frith, C.D.³ Brooks, D.J.⁴ Frackowiak, R.S.⁵ Passingham, R.E.⁶

26
- 2142809796
- Learning in embedded systems, Cambridge, MA: MIT Press
- (1993)
- Kaelbling, L.¹

27
- 0032073263
- Planning and acting in partially observable stochastic domains
- (1998) Artificial Intelligence , vol.101 , pp. 99-134
- Kaelbling, L.P.¹ Littman, M.L.² Cassandra, A.R.³

28
- 2142657495
- Proceedings of the 15th International Conference on Machine Learning, San Mateo, CA: Morgan Kaufmann, pp. 260-268
- (1998)
- Kearns, M.¹ Singh, S.²

29
- 0034691042
- Dissociating the role of the medial and lateral anterior prefrontal cortex in human planning
- (2000) Proceedings of the National Academy of Sciences, USA , vol.97 , pp. 7651-7656
- Koechlin, E.¹ Corrado, G.² Pietrini, P.³ Grafman, J.⁴

30
- 0033213255
- Effect of expected reward magnitude on the response of neurons in the dorsolateral prefrontal cortex of the macaque
- (1999) Neuron , vol.24 , pp. 415-425
- Leon, M.I.¹ Shadlen, M.N.²

31
- 2142756901
- Proceedings of the Fifth International Conference on Autonomous Agents, New York, NY: ACM, pp. 39-40
- (2001)
- Matsuno, Y.¹ Yamazaki, T.² Matsuda, J.³ Ishii, S.⁴

32
- 0030844802
- Effects of orbital frontal and anterior cingulate lesions on object and spatial memory in rhesus monkeys
- (1997) Neuropsychologia , vol.35 , pp. 999-1015
- Meunier, M.¹ Bachevalier, J.² Mishkin, M.³

33
- 0027684215
- Prioritized sweeping: Reinforcement learning with less data and less real time
- (1993) Machine Learning , vol.13 , pp. 103-130
- Moore, A.W.¹ Atkeson, C.G.²

34
- 0022645522
- Noradrenergic and serotonergic innervation of cortical, thalamic and tectal visual structures in old and new world monkeys
- (1986) The Journal of Comparative Neurology , vol.243 , pp. 117-128
- Morrison, J.¹ Foote, S.²

35
- 0035152958
- Abstract reward and punishment representations in the human orbitofrontal cortex
- (2001) Nature Neuroscience , vol.4 , pp. 95-102
- O'Doherty, J.¹ Kringelbach, M.L.² Rolls, E.T.³ Hornak, J.⁴ Andrews, C.⁵

36
- 0028237718
- The nucleus accumbens as a complex of functionally distinct neuronal ensembles: An integration of behavioural, electrophysiological and anatomical data
- (1994) Progress in Neurobiology , vol.42 , pp. 719-761
- Pennartz, C.M.¹ Groenewegen, H.J.² Lopez de Silva, F.H.³

37
- 0029954370
- Motor areas of the medial wall: A review of their location and functional activation
- (1996) Cerebral Cortex , vol.6 , pp. 342-353
- Picard, N.¹ Strick, P.L.²

38
- 2142822470
- Images of mind, Washington, DC: Scientific American Books, revised
- (1996)
- Posner, M.I.¹ Raichle, M.²

39
- 2142813956
- Prominent projections from the anterior cingulate cortex to the locus coeruleus in rhesus monkey
- (2000) Society of Neuroscience Abstract , vol.26 , pp. 2230
- Rajkowski, J.¹ Lu, W.² Zhu, Y.³ Cohen, J.⁴ Aston-Jones, G.⁵

40
- 0030953250
- Integration of what and where in the primate prefrontal cortex
- (1997) Science , vol.276 , pp. 821-824
- Rao, S.C.¹ Rainer, G.² Miller, E.K.³

41
- 0030605999
- The orbitofrontal cortex
- (1996) Philosophical Transactions of the Royal Society of London. Series B: Biological sciences , vol.351 , pp. 1433-1443
- Rolls, E.T.¹

42
- 0028618395
- Emotion-related learning in patients with social and emotional changes associated with frontal lobe damage
- (1994) Journal of Neurology, Neurosurgery, and Psychiatry , vol.57 , pp. 1518-1524
- Rolls, E.T.¹ Hornak, J.² Wade, D.³ McGrath, J.⁴

43
- 0000147488
- On-line model selection based on the variational Bayes
- (2001) Neural Computation , vol.13 , pp. 1649-1681
- Sato, M.¹

44
- 0032081988
- Orbitofrontal cortex and basolateral amygdala encode expected outcomes during learning
- (1998) Nature Neuroscience , vol.1 , pp. 155-159
- Schoenbaum, G.¹ Chiba, A.A.² Gallagher, M.³

45
- 0031867046
- Predictive reward signal of dopamine neurons
- (1998) The Journal of Neurophysiology , vol.80 , pp. 1-27
- Schultz, W.¹

46
- 0030896968
- A neural substrate of prediction and reward
- (1997) Science , vol.275 , pp. 1593-1599
- Schultz, W.¹ Dayan, P.² Montague, R.P.³

47
- 0025118803
- A network model if catecholamine effects: Gain, signal-to-noise ratio, and behavior
- (1990) Science , vol.249 , pp. 892-895
- Servan-Schreiber, D.¹ Printz, H.² Cohen, J.D.³

48
- 0032515019
- Role for cingulate motor area cells in voluntary movement selection based on reward
- (1998) Science , vol.282 , pp. 1335-1338
- Shima, K.¹ Tanji, J.²

49
- 2142812536
- Proceedings of the 11th International Conference on Machine Learning, San Francisco, CA: Morgan Kaufmann, pp. 284-292
- (1994)
- Singh, S.P.¹ Jaakkola, T.² Jordan, M.I.³

50
- 0008541180
- The hippocampal formation participates in novel picture encoding: Evidence from functional magnetic resonance imaging
- (1996) Proceedings of the National Academy of Sciences, USA , vol.93 , pp. 8660-8665
- Stern, C.E.¹ Corkin, S.² Gonzalez, R.G.³ Guimaraes, A.R.⁴ Baker, J.R.⁵ Jennings, P.J.⁶ Carr, C.A.⁷ Sugiura, R.M.⁸ Vedantham, V.⁹ Rosen, B.R.¹⁰

51
- 0034759167
- Anterior prefrontal cortex mediates rule learning in humans
- (2001) Cerebral Cortex , vol.11 , pp. 1040-1046
- Strange, B.A.¹ Henson, R.N.A.² Friston, K.J.³ Dolan, R.J.⁴

52
- 33847202724
- Learning to predict by the methods of temporal differences
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, R.S.¹

53
- 2142712659
- Sutton, R.S (1990). Integrated architectures for learning, planning, and reacting based on approximating dynamic programming. Machine Learning: Proceeding of the Seventh International Conference (pp. 216-224).

54
- 2142641879
- Reinforcement learning: An introduction, Cambridge, MA: MIT Press
- (1998)
- Sutton, R.S.¹ Barto, A.G.²

55
- 0035312863
- Behavioral planning in the prefrontal cortex
- (2001) Current Opinion in Neurobiology , vol.11 , pp. 164-170
- Tanji, J.¹ Hoshi, E.²

56
- 0022495758
- 6-Hydroxydopamine lesions of the nucleus accumbens, but not of the caudate nucleus, attenuate enhanced responding with reward-related stimuli produced by intra-accumbens d-amphetamine
- (1986) Psychopharmacology , vol.90 , pp. 390-397
- Taylor, J.R.¹ Robbins, T.W.²

57
- 2142696808
- Handbook of intelligent control: Neural, fuzzy and adaptive approaches, Frorence, KY: Van Nostrand Reinhold
- (1992)
- Thrun, S.B.¹

58
- 2642661876
- Novelty and familiarity activations in PET studies of memory encoding and retrieval
- (1996) Cerebral Cortex , vol.6 , pp. 71-79
- Tulving, E.¹ Markowitsch, H.J.² Craik, F.E.³ Habib, R.⁴ Houle, S.⁵

59
- 0033593588
- The role of locus coeruleus in the regulation of cognitive performance
- (1999) Science , vol.283 , pp. 549-554
- Usher, M.¹ Cohen, J.D.² Servan-Schreiber, D.³ Rajkowski, J.⁴ Aston-Jones, G.⁵

60
- 0026951344
- Functional heterogeneity in cingulate cortex: The anterior executive and posterior evaluative regions
- (1992) Cerebral Cortex , vol.2 , pp. 435-443
- Vogt, B.A.¹ Finch, D.M.² Olson, C.R.³

61
- 0035811464
- Dopamine responses comply with basic assumptions of formal learning theory
- (2001) Nature , vol.412 , pp. 43-48
- Waelti, P.¹ Dickinson, A.² Schultz, W.³

62
- 0029782802
- Reward expectancy in primate prefrontal neurons
- (1996) Nature , vol.382 , pp. 629-632
- Watanabe, M.¹

63
- 34249833101
- Technical note: Q-learning
- (1992) Machine Learning , vol.8 , pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.²

64
- 0027245697
- The effects of stimulus novelty and familiarity on neuronal activity in the amygdala of monkeys performing recognition memory tasks
- (1993) Experimental Brain Research , vol.93 , pp. 367-382
- Wilson, F.A.W.¹ Rolls, E.T.²

65
- 0024492688
- Low doses of accumbens dopamine modulate amygdala suppression of spontaneous exploratory activity in rats
- (1989) Brain Research , vol.477 , pp. 202-210
- Yim, C.Y.¹ Mogenson, G.J.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.