SCOPUS Information Retrieval Platform

Journal of Physiology Paris

Volume 105, Issue 1-3, 2011, Pages 36-44

A reinforcement learning approach to instrumental contingency degradation in rats

Authors (3): Dutech, Alain (a); Coutureau, Etienne (b,c); Marchand, Alain R. (b,c)

a LORIA (France)

b UNIVERSITÉ DE BORDEAUX (France)

c UMR 5287 (France)

Author keywords

Contingency degradation; Instrumental; Model free learning; Prefrontal cortex; Rats; SARSA; Simulation

Indexed keywords

ALGORITHM; ANIMAL BEHAVIOR; ANIMAL EXPERIMENT; ANIMAL MODEL; ARTICLE; BRAIN DAMAGE; CONCEPTUAL FRAMEWORK; CONTINGENCY DEGRADATION; CONTROLLED STUDY; DECISION MAKING; FOOD INTAKE; INSTRUMENTAL CONDITIONING; MATHEMATICAL MODEL; MEDIAL PREFRONTAL CORTEX; MOTIVATION; NONHUMAN; PREFRONTAL CORTEX; PROBABILITY; RAT; REINFORCEMENT; TIME PERCEPTION;

ANIMALS; BEHAVIOR, ANIMAL; CONDITIONING, OPERANT; EXTINCTION, PSYCHOLOGICAL; MODELS, NEUROLOGICAL; NEURONS; RATS; REINFORCEMENT (PSYCHOLOGY); TOUCH; TOUCH PERCEPTION;

EID: 82555190923 PISSN: 09284257 EISSN: 17697115 Source Type: Journal
DOI: 10.1016/j.jphysparis.2011.07.017 Document Type: Article

Times cited: 10

References (30)

1
- 2642576040
- Exteroceptive control of response under delayed reinforcement
- Azzi R., Fix D.S.R., Keller F.S., Rocha e Silva M.I. Exteroceptive control of response under delayed reinforcement. J. Exp. Anal. Behav. 1964, 7:159-162.
- (1964) J. Exp. Anal. Behav. , vol.7 , pp. 159-162
- Azzi, R.¹ Fix, D.S.R.² Keller, F.S.³ Rocha e Silva, M.I.⁴

2
- 0031801210
- Goal-directed instrumental action: contingency and incentive learning and their cortical substrates
- Balleine B.W., Dickinson A. Goal-directed instrumental action: contingency and incentive learning and their cortical substrates. Neuropharmacology 1998, 37:407-419.
- (1998) Neuropharmacology , vol.37 , pp. 407-419
- Balleine, B.W.¹ Dickinson, A.²

3
- 72049125602
- Human and rodent homologies in action control: corticostriatal determinants of goal-directed and habitual action
- Balleine B.W., O'Doherty J.P. Human and rodent homologies in action control: corticostriatal determinants of goal-directed and habitual action. Neuropsychopharmacology 2010, 35:48-69.
- (2010) Neuropsychopharmacology , vol.35 , pp. 48-69
- Balleine, B.W.¹ O'Doherty, J.P.²

4
- 0000409272
- Reinforcement learning methods for continuous-time Markov decision problems.
- In: Proc. Adv. Neural Inform. Process. Systems (NIPS).
- Bradtke, S.J., Duff, M.O., 1995. Reinforcement learning methods for continuous-time Markov decision problems. In: Proc. Adv. Neural Inform. Process. Systems (NIPS).
- (1995)
- Bradtke, S.J.¹ Duff, M.O.²

5
- 78049314878
- Delay discounting of qualitatively different reinforcers in rats
- Calvert A.L., Green L., Myerson J. Delay discounting of qualitatively different reinforcers in rats. J. Exp. Anal. Behav. 2010, 93:171-184.
- (2010) J. Exp. Anal. Behav. , vol.93 , pp. 171-184
- Calvert, A.L.¹ Green, L.² Myerson, J.³

6
- 0344393788
- The role of prelimbic cortex in instrumental conditioning
- Corbit L.H., Balleine B.W. The role of prelimbic cortex in instrumental conditioning. Behav. Brain Res. 2003, 146:145-157.
- (2003) Behav. Brain Res. , vol.146 , pp. 145-157
- Corbit, L.H.¹ Balleine, B.W.²

7
- 65249114697
- Goal-directed responding is sensitive to lesions to the prelimbic cortex or basolateral nucleus of the amygdala but not to their disconnection
- Coutureau E., Marchand A.R., Di Scala G. Goal-directed responding is sensitive to lesions to the prelimbic cortex or basolateral nucleus of the amygdala but not to their disconnection. Behav. Neurosci. 2009, 123:443-448.
- (2009) Behav. Neurosci. , vol.123 , pp. 443-448
- Coutureau, E.¹ Marchand, A.R.² Di Scala, G.³

8
- 28044450875
- Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
- Daw N.D., Niv Y., Dayan P. Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nat. Neurosci. 2005, 8:1704-1711.
- (2005) Nat. Neurosci. , vol.8 , pp. 1704-1711
- Daw, N.D.¹ Niv, Y.² Dayan, P.³

9
- 33745787929
- Representation and timing in theories of the dopamine system
- Daw N.D., Courville A.C., Touretzky D.S. Representation and timing in theories of the dopamine system. Neural Comput. 2006, 18:1637-1677.
- (2006) Neural Comput. , vol.18 , pp. 1637-1677
- Daw, N.D.¹ Courville, A.C.² Touretzky, D.S.³

10
- 70449715719
- Instructional control of reinforcement learning: a behavioral and neurocomputational investigation
- Doll B.B., Jacobs W.J., Sanfey A.G., Frank M.J. Instructional control of reinforcement learning: a behavioral and neurocomputational investigation. Brain Res. 2009, 1299:74-94.
- (2009) Brain Res. , vol.1299 , pp. 74-94
- Doll, B.B.¹ Jacobs, W.J.² Sanfey, A.G.³ Frank, M.J.⁴

11
- 0037459319
- Discrete coding of reward probability and uncertainty by dopamine neurons
- Fiorillo C.D., Tobler P.N., Schultz W. Discrete coding of reward probability and uncertainty by dopamine neurons. Science 2003, 299:1898-1902.
- (2003) Science , vol.299 , pp. 1898-1902
- Fiorillo, C.D.¹ Tobler, P.N.² Schultz, W.³

12
- 33744550336
- Anatomy of a decision: striato-orbitofrontal interactions in reinforcement learning, decision making, and reversal
- Frank M.J., Claus E.D. Anatomy of a decision: striato-orbitofrontal interactions in reinforcement learning, decision making, and reversal. Psychol. Rev. 2006, 113:300-326.
- (2006) Psychol. Rev. , vol.113 , pp. 300-326
- Frank, M.J.¹ Claus, E.D.²

13
- 0034973745
- The prefrontal cortex-an update: time is of the essence
- Fuster J.M. The prefrontal cortex-an update: time is of the essence. Neuron 2001, 30:319-333.
- (2001) Neuron , vol.30 , pp. 319-333
- Fuster, J.M.¹

14
- 33746001395
- The role of the rat prelimbic/infralimbic cortex in working memory: not involved in the short-term maintenance but in monitoring and processing functions
- Gisquet-Verrier P., Delatour B. The role of the rat prelimbic/infralimbic cortex in working memory: not involved in the short-term maintenance but in monitoring and processing functions. Neuroscience 2006, 141:585-596.
- (2006) Neuroscience , vol.141 , pp. 585-596
- Gisquet-Verrier, P.¹ Delatour, B.²

15
- 77953260848
- States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning
- Gläscher J., Daw N., Dayan P., O'Doherty J.P. States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning. Neuron 2010, 66:585-595.
- (2010) Neuron , vol.66 , pp. 585-595
- Gläscher, J.¹ Daw, N.² Dayan, P.³ O'Doherty, J.P.⁴

16
- 84980210366
- The effect of contingency upon the appetitive conditioning of free-operant behavior
- Hammond L.J. The effect of contingency upon the appetitive conditioning of free-operant behavior. J. Exp. Anal. Behav. 1980, 34:297-304.
- (1980) J. Exp. Anal. Behav. , vol.34 , pp. 297-304
- Hammond, L.J.¹

17
- 0036592026
- Actor-critic models of the basal ganglia: new anatomical and computational perspectives
- Joel D., Niv Y., Ruppin E. Actor-critic models of the basal ganglia: new anatomical and computational perspectives. Neural Networks 2002, 15:535-547.
- (2002) Neural Networks , vol.15 , pp. 535-547
- Joel, D.¹ Niv, Y.² Ruppin, E.³

18
- 0037382264
- Coordination of actions and habits in the medial prefrontal cortex of rats
- Killcross S., Coutureau E. Coordination of actions and habits in the medial prefrontal cortex of rats. Cereb. Cortex 2003, 13:400-408.
- (2003) Cereb. Cortex , vol.13 , pp. 400-408
- Killcross, S.¹ Coutureau, E.²

19
- 72449150120
- Role of striatum in updating values of chosen actions
- Kim H., Sul J.H., Huh N., Lee D., Jung M.W. Role of striatum in updating values of chosen actions. J. Neurosci. 2009, 29:14701-14712.
- (2009) J. Neurosci. , vol.29 , pp. 14701-14712
- Kim, H.¹ Sul, J.H.² Huh, N.³ Lee, D.⁴ Jung, M.W.⁵

20
- 50349093022
- Influence of reward delays on responses of dopamine neurons
- Kobayashi S., Schultz W. Influence of reward delays on responses of dopamine neurons. J. Neurosci. 2008, 28:7837-7846.
- (2008) J. Neurosci. , vol.28 , pp. 7837-7846
- Kobayashi, S.¹ Schultz, W.²

21
- 72449166356
- Reinforcement learning, conditioning, and the brain: successes and challenges
- Maia T.V. Reinforcement learning, conditioning, and the brain: successes and challenges. Cogn. Affect. Behav. Neurosci. 2009, 9:343-364.
- (2009) Cogn. Affect. Behav. Neurosci. , vol.9 , pp. 343-364
- Maia, T.V.¹

22
- 1842785707
- The role of the medial prefrontal cortex in achieving goals
- Matsumoto K., Tanaka K. The role of the medial prefrontal cortex in achieving goals. Curr. Opin. Neurobiol. 2004, 14:178-185.
- (2004) Curr. Opin. Neurobiol. , vol.14 , pp. 178-185
- Matsumoto, K.¹ Tanaka, K.²

23
- 66249103342
- A role for medial prefrontal dopaminergic innervation in instrumental conditioning
- Naneix F., Marchand A.R., Di Scala G., Pape J.R., Coutureau E. A role for medial prefrontal dopaminergic innervation in instrumental conditioning. J. Neurosci. 2009, 29:6599-6606.
- (2009) J. Neurosci. , vol.29 , pp. 6599-6606
- Naneix, F.¹ Marchand, A.R.² Di Scala, G.³ Pape, J.R.⁴ Coutureau, E.⁵

24
- 33747628123
- Choice values
- Niv Y., Daw N.D., Dayan P. Choice values. Nat. Neurosci. 2006, 9:987-988.
- (2006) Nat. Neurosci. , vol.9 , pp. 987-988
- Niv, Y.¹ Daw, N.D.² Dayan, P.³

25
- 33847675011
- Tonic dopamine: opportunity costs and the control of response vigor
- Niv Y., Daw N.D., Joel D., Dayan P. Tonic dopamine: opportunity costs and the control of response vigor. Psychopharmacology 2007, 191:507-520.
- (2007) Psychopharmacology , vol.191 , pp. 507-520
- Niv, Y.¹ Daw, N.D.² Joel, D.³ Dayan, P.⁴

26
- 67649342617
- Evidence of action sequence chunking in goal-directed instrumental conditioning and its dependence on the dorsomedial prefrontal cortex
- Ostlund S.B., Winterbauer N.E., Balleine B.W. Evidence of action sequence chunking in goal-directed instrumental conditioning and its dependence on the dorsomedial prefrontal cortex. J. Neurosci. 2009, 29:8280-8287.
- (2009) J. Neurosci. , vol.29 , pp. 8280-8287
- Ostlund, S.B.¹ Winterbauer, N.E.² Balleine, B.W.³

27
- 0004102479
- MIT Press, Cambridge, MA, London, UK
- Sutton R.S., Barto A.G. Reinforcement Learning: An Introduction 1998, MIT Press, Cambridge, MA, London, UK.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

28
- 48549088919
- Calculating consequences: brain systems that encode the causal effects of actions
- Tanaka S.C., Balleine B.W., O'Doherty J.P. Calculating consequences: brain systems that encode the causal effects of actions. J. Neurosci. 2008, 28:6750-6755.
- (2008) J. Neurosci. , vol.28 , pp. 6750-6755
- Tanaka, S.C.¹ Balleine, B.W.² O'Doherty, J.P.³

29
- 28944442093
- Inactivation of dorsolateral striatum enhances sensitivity to changes in the action-outcome contingency in instrumental conditioning
- Yin H.H., Knowlton B.J., Balleine B.W. Inactivation of dorsolateral striatum enhances sensitivity to changes in the action-outcome contingency in instrumental conditioning. Behav. Brain Res. 2006, 166:189-196.
- (2006) Behav. Brain Res. , vol.166 , pp. 189-196
- Yin, H.H.¹ Knowlton, B.J.² Balleine, B.W.³

30
- 41149128874
- Prefrontal cortex and hippocampus subserve different components of working memory in rats
- Yoon T., Okada J., Jung M.W., Kim J.J. Prefrontal cortex and hippocampus subserve different components of working memory in rats. Learn. Memory 2008, 15:97-105.
- (2008) Learn. Memory , vol.15 , pp. 97-105
- Yoon, T.¹ Okada, J.² Jung, M.W.³ Kim, J.J.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.