메뉴 건너뛰기




Volumn 105, Issue 1-3, 2011, Pages 36-44

A reinforcement learning approach to instrumental contingency degradation in rats

Author keywords

Contingency degradation; Instrumental; Model free learning; Prefrontal cortex; Rats; SARSA; Simulation

Indexed keywords

ALGORITHM; ANIMAL BEHAVIOR; ANIMAL EXPERIMENT; ANIMAL MODEL; ARTICLE; BRAIN DAMAGE; CONCEPTUAL FRAMEWORK; CONTINGENCY DEGRADATION; CONTROLLED STUDY; DECISION MAKING; FOOD INTAKE; INSTRUMENTAL CONDITIONING; MATHEMATICAL MODEL; MEDIAL PREFRONTAL CORTEX; MOTIVATION; NONHUMAN; PREFRONTAL CORTEX; PROBABILITY; RAT; REINFORCEMENT; TIME PERCEPTION;

EID: 82555190923     PISSN: 09284257     EISSN: 17697115     Source Type: Journal    
DOI: 10.1016/j.jphysparis.2011.07.017     Document Type: Article
Times cited : (10)

References (30)
  • 2
    • 0031801210 scopus 로고    scopus 로고
    • Goal-directed instrumental action: contingency and incentive learning and their cortical substrates
    • Balleine B.W., Dickinson A. Goal-directed instrumental action: contingency and incentive learning and their cortical substrates. Neuropharmacology 1998, 37:407-419.
    • (1998) Neuropharmacology , vol.37 , pp. 407-419
    • Balleine, B.W.1    Dickinson, A.2
  • 3
    • 72049125602 scopus 로고    scopus 로고
    • Human and rodent homologies in action control: corticostriatal determinants of goal-directed and habitual action
    • Balleine B.W., O'Doherty J.P. Human and rodent homologies in action control: corticostriatal determinants of goal-directed and habitual action. Neuropsychopharmacology 2010, 35:48-69.
    • (2010) Neuropsychopharmacology , vol.35 , pp. 48-69
    • Balleine, B.W.1    O'Doherty, J.P.2
  • 4
    • 0000409272 scopus 로고
    • Reinforcement learning methods for continuous-time Markov decision problems.
    • In: Proc. Adv. Neural Inform. Process. Systems (NIPS).
    • Bradtke, S.J., Duff, M.O., 1995. Reinforcement learning methods for continuous-time Markov decision problems. In: Proc. Adv. Neural Inform. Process. Systems (NIPS).
    • (1995)
    • Bradtke, S.J.1    Duff, M.O.2
  • 5
    • 78049314878 scopus 로고    scopus 로고
    • Delay discounting of qualitatively different reinforcers in rats
    • Calvert A.L., Green L., Myerson J. Delay discounting of qualitatively different reinforcers in rats. J. Exp. Anal. Behav. 2010, 93:171-184.
    • (2010) J. Exp. Anal. Behav. , vol.93 , pp. 171-184
    • Calvert, A.L.1    Green, L.2    Myerson, J.3
  • 6
    • 0344393788 scopus 로고    scopus 로고
    • The role of prelimbic cortex in instrumental conditioning
    • Corbit L.H., Balleine B.W. The role of prelimbic cortex in instrumental conditioning. Behav. Brain Res. 2003, 146:145-157.
    • (2003) Behav. Brain Res. , vol.146 , pp. 145-157
    • Corbit, L.H.1    Balleine, B.W.2
  • 7
    • 65249114697 scopus 로고    scopus 로고
    • Goal-directed responding is sensitive to lesions to the prelimbic cortex or basolateral nucleus of the amygdala but not to their disconnection
    • Coutureau E., Marchand A.R., Di Scala G. Goal-directed responding is sensitive to lesions to the prelimbic cortex or basolateral nucleus of the amygdala but not to their disconnection. Behav. Neurosci. 2009, 123:443-448.
    • (2009) Behav. Neurosci. , vol.123 , pp. 443-448
    • Coutureau, E.1    Marchand, A.R.2    Di Scala, G.3
  • 8
    • 28044450875 scopus 로고    scopus 로고
    • Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
    • Daw N.D., Niv Y., Dayan P. Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nat. Neurosci. 2005, 8:1704-1711.
    • (2005) Nat. Neurosci. , vol.8 , pp. 1704-1711
    • Daw, N.D.1    Niv, Y.2    Dayan, P.3
  • 9
    • 33745787929 scopus 로고    scopus 로고
    • Representation and timing in theories of the dopamine system
    • Daw N.D., Courville A.C., Touretzky D.S. Representation and timing in theories of the dopamine system. Neural Comput. 2006, 18:1637-1677.
    • (2006) Neural Comput. , vol.18 , pp. 1637-1677
    • Daw, N.D.1    Courville, A.C.2    Touretzky, D.S.3
  • 10
    • 70449715719 scopus 로고    scopus 로고
    • Instructional control of reinforcement learning: a behavioral and neurocomputational investigation
    • Doll B.B., Jacobs W.J., Sanfey A.G., Frank M.J. Instructional control of reinforcement learning: a behavioral and neurocomputational investigation. Brain Res. 2009, 1299:74-94.
    • (2009) Brain Res. , vol.1299 , pp. 74-94
    • Doll, B.B.1    Jacobs, W.J.2    Sanfey, A.G.3    Frank, M.J.4
  • 11
    • 0037459319 scopus 로고    scopus 로고
    • Discrete coding of reward probability and uncertainty by dopamine neurons
    • Fiorillo C.D., Tobler P.N., Schultz W. Discrete coding of reward probability and uncertainty by dopamine neurons. Science 2003, 299:1898-1902.
    • (2003) Science , vol.299 , pp. 1898-1902
    • Fiorillo, C.D.1    Tobler, P.N.2    Schultz, W.3
  • 12
    • 33744550336 scopus 로고    scopus 로고
    • Anatomy of a decision: striato-orbitofrontal interactions in reinforcement learning, decision making, and reversal
    • Frank M.J., Claus E.D. Anatomy of a decision: striato-orbitofrontal interactions in reinforcement learning, decision making, and reversal. Psychol. Rev. 2006, 113:300-326.
    • (2006) Psychol. Rev. , vol.113 , pp. 300-326
    • Frank, M.J.1    Claus, E.D.2
  • 13
    • 0034973745 scopus 로고    scopus 로고
    • The prefrontal cortex-an update: time is of the essence
    • Fuster J.M. The prefrontal cortex-an update: time is of the essence. Neuron 2001, 30:319-333.
    • (2001) Neuron , vol.30 , pp. 319-333
    • Fuster, J.M.1
  • 14
    • 33746001395 scopus 로고    scopus 로고
    • The role of the rat prelimbic/infralimbic cortex in working memory: not involved in the short-term maintenance but in monitoring and processing functions
    • Gisquet-Verrier P., Delatour B. The role of the rat prelimbic/infralimbic cortex in working memory: not involved in the short-term maintenance but in monitoring and processing functions. Neuroscience 2006, 141:585-596.
    • (2006) Neuroscience , vol.141 , pp. 585-596
    • Gisquet-Verrier, P.1    Delatour, B.2
  • 15
    • 77953260848 scopus 로고    scopus 로고
    • States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning
    • Gläscher J., Daw N., Dayan P., O'Doherty J.P. States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning. Neuron 2010, 66:585-595.
    • (2010) Neuron , vol.66 , pp. 585-595
    • Gläscher, J.1    Daw, N.2    Dayan, P.3    O'Doherty, J.P.4
  • 16
    • 84980210366 scopus 로고
    • The effect of contingency upon the appetitive conditioning of free-operant behavior
    • Hammond L.J. The effect of contingency upon the appetitive conditioning of free-operant behavior. J. Exp. Anal. Behav. 1980, 34:297-304.
    • (1980) J. Exp. Anal. Behav. , vol.34 , pp. 297-304
    • Hammond, L.J.1
  • 17
    • 0036592026 scopus 로고    scopus 로고
    • Actor-critic models of the basal ganglia: new anatomical and computational perspectives
    • Joel D., Niv Y., Ruppin E. Actor-critic models of the basal ganglia: new anatomical and computational perspectives. Neural Networks 2002, 15:535-547.
    • (2002) Neural Networks , vol.15 , pp. 535-547
    • Joel, D.1    Niv, Y.2    Ruppin, E.3
  • 18
    • 0037382264 scopus 로고    scopus 로고
    • Coordination of actions and habits in the medial prefrontal cortex of rats
    • Killcross S., Coutureau E. Coordination of actions and habits in the medial prefrontal cortex of rats. Cereb. Cortex 2003, 13:400-408.
    • (2003) Cereb. Cortex , vol.13 , pp. 400-408
    • Killcross, S.1    Coutureau, E.2
  • 19
    • 72449150120 scopus 로고    scopus 로고
    • Role of striatum in updating values of chosen actions
    • Kim H., Sul J.H., Huh N., Lee D., Jung M.W. Role of striatum in updating values of chosen actions. J. Neurosci. 2009, 29:14701-14712.
    • (2009) J. Neurosci. , vol.29 , pp. 14701-14712
    • Kim, H.1    Sul, J.H.2    Huh, N.3    Lee, D.4    Jung, M.W.5
  • 20
    • 50349093022 scopus 로고    scopus 로고
    • Influence of reward delays on responses of dopamine neurons
    • Kobayashi S., Schultz W. Influence of reward delays on responses of dopamine neurons. J. Neurosci. 2008, 28:7837-7846.
    • (2008) J. Neurosci. , vol.28 , pp. 7837-7846
    • Kobayashi, S.1    Schultz, W.2
  • 21
    • 72449166356 scopus 로고    scopus 로고
    • Reinforcement learning, conditioning, and the brain: successes and challenges
    • Maia T.V. Reinforcement learning, conditioning, and the brain: successes and challenges. Cogn. Affect. Behav. Neurosci. 2009, 9:343-364.
    • (2009) Cogn. Affect. Behav. Neurosci. , vol.9 , pp. 343-364
    • Maia, T.V.1
  • 22
    • 1842785707 scopus 로고    scopus 로고
    • The role of the medial prefrontal cortex in achieving goals
    • Matsumoto K., Tanaka K. The role of the medial prefrontal cortex in achieving goals. Curr. Opin. Neurobiol. 2004, 14:178-185.
    • (2004) Curr. Opin. Neurobiol. , vol.14 , pp. 178-185
    • Matsumoto, K.1    Tanaka, K.2
  • 23
    • 66249103342 scopus 로고    scopus 로고
    • A role for medial prefrontal dopaminergic innervation in instrumental conditioning
    • Naneix F., Marchand A.R., Di Scala G., Pape J.R., Coutureau E. A role for medial prefrontal dopaminergic innervation in instrumental conditioning. J. Neurosci. 2009, 29:6599-6606.
    • (2009) J. Neurosci. , vol.29 , pp. 6599-6606
    • Naneix, F.1    Marchand, A.R.2    Di Scala, G.3    Pape, J.R.4    Coutureau, E.5
  • 25
    • 33847675011 scopus 로고    scopus 로고
    • Tonic dopamine: opportunity costs and the control of response vigor
    • Niv Y., Daw N.D., Joel D., Dayan P. Tonic dopamine: opportunity costs and the control of response vigor. Psychopharmacology 2007, 191:507-520.
    • (2007) Psychopharmacology , vol.191 , pp. 507-520
    • Niv, Y.1    Daw, N.D.2    Joel, D.3    Dayan, P.4
  • 26
    • 67649342617 scopus 로고    scopus 로고
    • Evidence of action sequence chunking in goal-directed instrumental conditioning and its dependence on the dorsomedial prefrontal cortex
    • Ostlund S.B., Winterbauer N.E., Balleine B.W. Evidence of action sequence chunking in goal-directed instrumental conditioning and its dependence on the dorsomedial prefrontal cortex. J. Neurosci. 2009, 29:8280-8287.
    • (2009) J. Neurosci. , vol.29 , pp. 8280-8287
    • Ostlund, S.B.1    Winterbauer, N.E.2    Balleine, B.W.3
  • 28
    • 48549088919 scopus 로고    scopus 로고
    • Calculating consequences: brain systems that encode the causal effects of actions
    • Tanaka S.C., Balleine B.W., O'Doherty J.P. Calculating consequences: brain systems that encode the causal effects of actions. J. Neurosci. 2008, 28:6750-6755.
    • (2008) J. Neurosci. , vol.28 , pp. 6750-6755
    • Tanaka, S.C.1    Balleine, B.W.2    O'Doherty, J.P.3
  • 29
    • 28944442093 scopus 로고    scopus 로고
    • Inactivation of dorsolateral striatum enhances sensitivity to changes in the action-outcome contingency in instrumental conditioning
    • Yin H.H., Knowlton B.J., Balleine B.W. Inactivation of dorsolateral striatum enhances sensitivity to changes in the action-outcome contingency in instrumental conditioning. Behav. Brain Res. 2006, 166:189-196.
    • (2006) Behav. Brain Res. , vol.166 , pp. 189-196
    • Yin, H.H.1    Knowlton, B.J.2    Balleine, B.W.3
  • 30
    • 41149128874 scopus 로고    scopus 로고
    • Prefrontal cortex and hippocampus subserve different components of working memory in rats
    • Yoon T., Okada J., Jung M.W., Kim J.J. Prefrontal cortex and hippocampus subserve different components of working memory in rats. Learn. Memory 2008, 15:97-105.
    • (2008) Learn. Memory , vol.15 , pp. 97-105
    • Yoon, T.1    Okada, J.2    Jung, M.W.3    Kim, J.J.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.