메뉴 건너뛰기




Volumn 35, Issue 7, 2012, Pages 1036-1051

Habits, action sequences and reinforcement learning

Author keywords

Action sequence; Goal directed action; Habitual action; Reinforcement learning

Indexed keywords

ADAPTIVE BEHAVIOR; ARTICLE; BEHAVIOR; CORPUS STRIATUM; DECISION MAKING; DORSOLATERAL STRIATUM; GOAL DIRECTED ACTION; HABIT; HUMAN; INSTRUMENTAL CONDITIONING; LEARNING; NONHUMAN; PRIORITY JOURNAL; REACTION TIME; REINFORCEMENT; REINFORCEMENT LEARNING; REWARD; SENSORIMOTOR CORTEX; SENSORIMOTOR STRIATUM; STIMULUS RESPONSE; TASK PERFORMANCE;

EID: 84859341150     PISSN: 0953816X     EISSN: 14609568     Source Type: Journal    
DOI: 10.1111/j.1460-9568.2012.08050.x     Document Type: Article
Times cited : (231)

References (116)
  • 1
    • 84946268134 scopus 로고
    • Variations in the sensitivity of instrumental responding to reinforcer devaluation
    • Adams, C.D. (1982) Variations in the sensitivity of instrumental responding to reinforcer devaluation. Q. J. Exp. Psychol. B, 34B, 77-98.
    • (1982) Q. J. Exp. Psychol. B , vol.34 B , pp. 77-98
    • Adams, C.D.1
  • 2
    • 0025321039 scopus 로고
    • Functional architecture of basal ganglia circuits: neural substrates of parallel processing
    • Alexander, G.E. & Crutcher, M.D. (1990) Functional architecture of basal ganglia circuits: neural substrates of parallel processing. Trends Neurosci., 13, 266-271.
    • (1990) Trends Neurosci. , vol.13 , pp. 266-271
    • Alexander, G.E.1    Crutcher, M.D.2
  • 4
    • 84857207526 scopus 로고    scopus 로고
    • Mechanisms of hierarchical reinforcement learning in cortico-striatal circuits 2: evidence from fMRI
    • Badre, D. & Frank, M.J. (2012) Mechanisms of hierarchical reinforcement learning in cortico-striatal circuits 2: evidence from fMRI. Cereb. Cortex, 22, 527-536.
    • (2012) Cereb. Cortex , vol.22 , pp. 527-536
    • Badre, D.1    Frank, M.J.2
  • 5
    • 32544433751 scopus 로고    scopus 로고
    • The role of striatum in initiation and execution of learned action sequences in rats
    • Bailey, K.R. & Mair, R.G. (2006) The role of striatum in initiation and execution of learned action sequences in rats. J. Neurosci., 26, 1016-1025.
    • (2006) J. Neurosci. , vol.26 , pp. 1016-1025
    • Bailey, K.R.1    Mair, R.G.2
  • 6
    • 34249994602 scopus 로고    scopus 로고
    • Effects of frontal cortex lesions on action sequence learning in the rat
    • Bailey, K.R. & Mair, R.G. (2007) Effects of frontal cortex lesions on action sequence learning in the rat. Eur. J. Neurosci., 25, 2905-2915.
    • (2007) Eur. J. Neurosci. , vol.25 , pp. 2905-2915
    • Bailey, K.R.1    Mair, R.G.2
  • 8
    • 72049125602 scopus 로고    scopus 로고
    • Human and rodent homologies in action control: corticostriatal determinants of goal-directed and habitual action
    • Balleine, B.W. & O'Doherty, J.P. (2010) Human and rodent homologies in action control: corticostriatal determinants of goal-directed and habitual action. Neuropsychopharmacology, 35, 48-69.
    • (2010) Neuropsychopharmacology , vol.35 , pp. 48-69
    • Balleine, B.W.1    O'Doherty, J.P.2
  • 9
    • 34547670815 scopus 로고    scopus 로고
    • The role of the dorsal striatum in reward and decision-making
    • Balleine, B.W., Delgado, M.R. & Hikosaka, O. (2007) The role of the dorsal striatum in reward and decision-making. J. Neurosci., 27, 8161-8165.
    • (2007) J. Neurosci. , vol.27 , pp. 8161-8165
    • Balleine, B.W.1    Delgado, M.R.2    Hikosaka, O.3
  • 10
    • 61349114222 scopus 로고    scopus 로고
    • The integrative function of the basal ganglia in instrumental conditioning
    • Balleine, B.W., Liljeholm, M. & Ostlund, S.B. (2009) The integrative function of the basal ganglia in instrumental conditioning. Behav. Brain Res., 199, 43-52.
    • (2009) Behav. Brain Res. , vol.199 , pp. 43-52
    • Balleine, B.W.1    Liljeholm, M.2    Ostlund, S.B.3
  • 11
    • 31644449663 scopus 로고    scopus 로고
    • Multiple forward model architecture for sequence processing
    • Sun, R. & Giles, C.L. (Eds), Springer Verlag, NY -
    • Bapi, R.S. & Doya, K. (2001) Multiple forward model architecture for sequence processing. In Sun, R. & Giles, C.L. (Eds), Sequence learning: paradigms, algorithms, and applications. Springer Verlag, NY, pp. 309-320.
    • (2001) Sequence learning: paradigms, algorithms, and applications , pp. 309-320
    • Bapi, R.S.1    Doya, K.2
  • 12
    • 27144542972 scopus 로고    scopus 로고
    • Activity of striatal neurons reflects dynamic encoding and recoding of procedural memories
    • Barnes, T.D., Kubota, Y., Hu, D., Jin, D.Z. & Graybiel, A.M. (2005) Activity of striatal neurons reflects dynamic encoding and recoding of procedural memories. Nature, 437, 1158-1161.
    • (2005) Nature , vol.437 , pp. 1158-1161
    • Barnes, T.D.1    Kubota, Y.2    Hu, D.3    Jin, D.Z.4    Graybiel, A.M.5
  • 13
    • 0037288370 scopus 로고    scopus 로고
    • Recent advances in hierarchical reinforcement learning
    • Barto, A.G. & Mahadevan, S. (2003) Recent advances in hierarchical reinforcement learning. Discrete Event Dyn. S., 13, 41-77.
    • (2003) Discrete Event Dyn. S. , vol.13 , pp. 41-77
    • Barto, A.G.1    Mahadevan, S.2
  • 14
    • 0031931935 scopus 로고    scopus 로고
    • A computational model of how the basal ganglia produce sequences
    • Berns, G.S. & Sejnowski, T.J. (1998) A computational model of how the basal ganglia produce sequences. J. Cogn. Neurosci., 10, 108-121.
    • (1998) J. Cogn. Neurosci. , vol.10 , pp. 108-121
    • Berns, G.S.1    Sejnowski, T.J.2
  • 15
    • 55949101421 scopus 로고    scopus 로고
    • Combining modalities with different latencies for optimal motor control
    • Bissmarck, F., Nakahara, H., Doya, K. & Hikosaka, O. (2008) Combining modalities with different latencies for optimal motor control. J. Cogn. Neurosci., 20, 1966-1979.
    • (2008) J. Cogn. Neurosci. , vol.20 , pp. 1966-1979
    • Bissmarck, F.1    Nakahara, H.2    Doya, K.3    Hikosaka, O.4
  • 16
    • 43049099970 scopus 로고    scopus 로고
    • Hierarchical models of behavior and prefrontal function
    • Botvinick, M.M. (2008) Hierarchical models of behavior and prefrontal function. Trends Cogn. Sci., 12, 201-208.
    • (2008) Trends Cogn. Sci. , vol.12 , pp. 201-208
    • Botvinick, M.M.1
  • 17
    • 70350566799 scopus 로고    scopus 로고
    • Hierarchically organized behavior and its neural foundations: a reinforcement learning perspective
    • Botvinick, M.M., Niv, Y. & Barto, A.G. (2009) Hierarchically organized behavior and its neural foundations: a reinforcement learning perspective. Cognition, 113, 262-280.
    • (2009) Cognition , vol.113 , pp. 262-280
    • Botvinick, M.M.1    Niv, Y.2    Barto, A.G.3
  • 18
    • 4444276205 scopus 로고    scopus 로고
    • Characterization of motor skill and instrumental learning time scales in a skilled reaching task in rat
    • Buitrago, M.M., Ringer, T., Schulz, J.B., Dichgans, J. & Luft, A.R. (2004a) Characterization of motor skill and instrumental learning time scales in a skilled reaching task in rat. Behav. Brain Res., 155, 249-256.
    • (2004) Behav. Brain Res. , vol.155 , pp. 249-256
    • Buitrago, M.M.1    Ringer, T.2    Schulz, J.B.3    Dichgans, J.4    Luft, A.R.5
  • 19
    • 1842686143 scopus 로고    scopus 로고
    • Short and long-term motor skill learning in an accelerated rotarod training paradigm
    • Buitrago, M.M., Schulz, J.B., Dichgans, J. & Luft, A.R. (2004b) Short and long-term motor skill learning in an accelerated rotarod training paradigm. Neurobiol. Learn. Mem., 81, 211-216.
    • (2004) Neurobiol. Learn. Mem. , vol.81 , pp. 211-216
    • Buitrago, M.M.1    Schulz, J.B.2    Dichgans, J.3    Luft, A.R.4
  • 20
    • 1842583998 scopus 로고    scopus 로고
    • Inactivation of dorsolateral striatum impairs acquisition of response learning in cue-deficient, but not cue-available, conditions
    • Chang, Q. & Gold, P.E. (2004) Inactivation of dorsolateral striatum impairs acquisition of response learning in cue-deficient, but not cue-available, conditions. Behav. Neurosci., 118, 383-388.
    • (2004) Behav. Neurosci. , vol.118 , pp. 383-388
    • Chang, Q.1    Gold, P.E.2
  • 21
    • 3142570811 scopus 로고    scopus 로고
    • Differential corticostriatal plasticity during fast and slow motor skill learning in mice
    • Costa, R.M., Cohen, D. & Nicolelis, M.A.L. (2004) Differential corticostriatal plasticity during fast and slow motor skill learning in mice. Curr. Biol., 14, 1124-1134.
    • (2004) Curr. Biol. , vol.14 , pp. 1124-1134
    • Costa, R.M.1    Cohen, D.2    Nicolelis, M.A.L.3
  • 22
    • 33749591279 scopus 로고    scopus 로고
    • Rapid alterations in corticostriatal ensemble coordination during acute dopamine-dependent motor dysfunction
    • Costa, R.M., Lin, S.-C., Sotnikova, T.D., Cyr, M., Gainetdinov, R.R., Caron, M.G. & Nicolelis, M.A.L. (2006) Rapid alterations in corticostriatal ensemble coordination during acute dopamine-dependent motor dysfunction. Neuron, 52, 359-369.
    • (2006) Neuron , vol.52 , pp. 359-369
    • Costa, R.M.1    Lin, S.-C.2    Sotnikova, T.D.3    Cyr, M.4    Gainetdinov, R.R.5    Caron, M.G.6    Nicolelis, M.A.L.7
  • 23
    • 0033722074 scopus 로고    scopus 로고
    • Behavioral considerations suggest an average reward TD model of the dopamine system
    • Daw, N.D. & Touretzky, D.S. (2000) Behavioral considerations suggest an average reward TD model of the dopamine system. Neurocomputing, 32-33, 679-684.
    • (2000) Neurocomputing , vol.32-33 , pp. 679-684
    • Daw, N.D.1    Touretzky, D.S.2
  • 24
    • 0036835734 scopus 로고    scopus 로고
    • Long-term reward prediction in TD models of the dopamine system
    • Daw, N.D. & Touretzky, D.S. (2002) Long-term reward prediction in TD models of the dopamine system. Neural Comput., 14, 2567-2583.
    • (2002) Neural Comput. , vol.14 , pp. 2567-2583
    • Daw, N.D.1    Touretzky, D.S.2
  • 25
    • 28044450875 scopus 로고    scopus 로고
    • Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
    • Daw, N.D., Niv, Y. & Dayan, P. (2005) Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nat. Neurosci., 8, 1704-1711.
    • (2005) Nat. Neurosci. , vol.8 , pp. 1704-1711
    • Daw, N.D.1    Niv, Y.2    Dayan, P.3
  • 26
    • 0037057808 scopus 로고    scopus 로고
    • Reward, motivation, and reinforcement learning
    • Dayan, P. & Balleine, B.W. (2002) Reward, motivation, and reinforcement learning. Neuron, 36, 285-298.
    • (2002) Neuron , vol.36 , pp. 285-298
    • Dayan, P.1    Balleine, B.W.2
  • 27
    • 84898964259 scopus 로고    scopus 로고
    • Explaining away in weight space
    • Leen, T.K., Dietterich, T.G. & Tresp, V. (Eds), MIT Press, Denver, CO, USA -
    • Dayan, P. & Kakade, S. (2001) Explaining away in weight space. In Leen, T.K., Dietterich, T.G. & Tresp, V. (Eds), Advances in Neural Information Processing Systems 13. MIT Press, Denver, CO, USA, pp. 451-457.
    • (2001) Advances in Neural Information Processing Systems 13 , pp. 451-457
    • Dayan, P.1    Kakade, S.2
  • 29
    • 58149439676 scopus 로고
    • Resistance to extinction as a function of the discrimination habit established during fixed-ratio reinforcement
    • Denny, M.R., Wells, R.H. & Maatsch, J.L. (1957) Resistance to extinction as a function of the discrimination habit established during fixed-ratio reinforcement. J. Exp. Psychol., 54, 451-456.
    • (1957) J. Exp. Psychol. , vol.54 , pp. 451-456
    • Denny, M.R.1    Wells, R.H.2    Maatsch, J.L.3
  • 30
    • 77953197543 scopus 로고    scopus 로고
    • Motor sequences and the basal ganglia: kinematics, not habits
    • Desmurget, M. & Turner, R.S. (2010) Motor sequences and the basal ganglia: kinematics, not habits. J. Neurosci., 30, 7685-7690.
    • (2010) J. Neurosci. , vol.30 , pp. 7685-7690
    • Desmurget, M.1    Turner, R.S.2
  • 31
    • 0033119117 scopus 로고    scopus 로고
    • Parallel information processing in the dorsal striatum: relation to hippocampal function
    • Devan, B.D. & White, N.M. (1999) Parallel information processing in the dorsal striatum: relation to hippocampal function. J. Neurosci., 19, 2789-2798.
    • (1999) J. Neurosci. , vol.19 , pp. 2789-2798
    • Devan, B.D.1    White, N.M.2
  • 32
    • 0033061121 scopus 로고    scopus 로고
    • Effects of medial and lateral caudate-putamen lesions on place- and cue-guided behaviors in the water maze: relation to thigmotaxis
    • Devan, B.D., McDonald, R.J. & White, N.M. (1999) Effects of medial and lateral caudate-putamen lesions on place- and cue-guided behaviors in the water maze: relation to thigmotaxis. Behav. Brain Res., 100, 5-14.
    • (1999) Behav. Brain Res. , vol.100 , pp. 5-14
    • Devan, B.D.1    McDonald, R.J.2    White, N.M.3
  • 33
    • 84943255384 scopus 로고
    • Instrumental conditioning
    • Mackintosh, N.J. (Ed.) Academic Press, London -
    • Dickinson, A. (1994) Instrumental conditioning. In Mackintosh, N.J. (Ed.) Animal Cognition and Learning. Academic Press, London, pp. 4-79.
    • (1994) Animal Cognition and Learning , pp. 4-79
    • Dickinson, A.1
  • 35
    • 80052628191 scopus 로고    scopus 로고
    • From movements to actions: two mechanisms for learning action sequences
    • Endress, A.D. & Wood, J. (2011) From movements to actions: two mechanisms for learning action sequences. Cogn. Psychol., 63, 141-171.
    • (2011) Cogn. Psychol. , vol.63 , pp. 141-171
    • Endress, A.D.1    Wood, J.2
  • 36
    • 15244346900 scopus 로고    scopus 로고
    • Lesion to the nigrostriatal dopamine system disrupts stimulus-response habit formation
    • Faure, A., Haberland, U., Condé, F. & El Massioui, N. (2005) Lesion to the nigrostriatal dopamine system disrupts stimulus-response habit formation. J. Neurosci., 25, 2771-2780.
    • (2005) J. Neurosci. , vol.25 , pp. 2771-2780
    • Faure, A.1    Haberland, U.2    Condé, F.3    El Massioui, N.4
  • 37
    • 0442323622 scopus 로고    scopus 로고
    • Dorsal striatum and stimulus-response learning: lesions of the dorsolateral, but not dorsomedial, striatum impair acquisition of a stimulus-response-based instrumental discrimination task, while sparing conditioned place preference learning
    • Featherstone, R.E. & McDonald, R.J. (2004) Dorsal striatum and stimulus-response learning: lesions of the dorsolateral, but not dorsomedial, striatum impair acquisition of a stimulus-response-based instrumental discrimination task, while sparing conditioned place preference learning. Neuroscience, 124, 23-31.
    • (2004) Neuroscience , vol.124 , pp. 23-31
    • Featherstone, R.E.1    McDonald, R.J.2
  • 38
    • 27744588775 scopus 로고    scopus 로고
    • Lesions of the dorsolateral striatum impair the acquisition of a simplified stimulus-response dependent conditional discrimination task
    • Featherstone, R.E. & McDonald, R.J. (2005) Lesions of the dorsolateral striatum impair the acquisition of a simplified stimulus-response dependent conditional discrimination task. Neuroscience, 136, 387-395.
    • (2005) Neuroscience , vol.136 , pp. 387-395
    • Featherstone, R.E.1    McDonald, R.J.2
  • 39
    • 33745927445 scopus 로고    scopus 로고
    • A mechanistic account of striatal dopamine function in human cognition: psychopharmacological studies with cabergoline and haloperidol
    • Frank, M.J. & O'Reilly, R.C. (2006) A mechanistic account of striatal dopamine function in human cognition: psychopharmacological studies with cabergoline and haloperidol. Behav. Neurosci., 120, 497-517.
    • (2006) Behav. Neurosci. , vol.120 , pp. 497-517
    • Frank, M.J.1    O'Reilly, R.C.2
  • 40
    • 0032123567 scopus 로고    scopus 로고
    • The basal ganglia and chunking of action repertoires
    • Graybiel, A.M. (1998) The basal ganglia and chunking of action repertoires. Neurobiol. Learn. Mem., 70, 119-136.
    • (1998) Neurobiol. Learn. Mem. , vol.70 , pp. 119-136
    • Graybiel, A.M.1
  • 41
    • 36348943183 scopus 로고    scopus 로고
    • Reinforcement learning for mixed open-loop and closed loop control
    • Mozer, M., Jordan, M.I. & Petsche, T. (Eds)., The MIT Press, Denver, CO, USA -
    • Hansen, E.A., Barto, A.G. & Zilberstein, S. (1996) Reinforcement learning for mixed open-loop and closed loop control. In Mozer, M., Jordan, M.I. & Petsche, T. (Eds). Advances in Neural Information Processing Systems NIPS, Vol. 9. The MIT Press, Denver, CO, USA, pp. 1026-1032.
    • (1996) Advances in Neural Information Processing Systems NIPS , vol.9 , pp. 1026-1032
    • Hansen, E.A.1    Barto, A.G.2    Zilberstein, S.3
  • 42
    • 33749080272 scopus 로고    scopus 로고
    • Heterarchical reinforcement-learning model for integration of multiple cortico-striatal loops: fMRI examination in stimulus-action-reward association learning
    • Haruno, M. & Kawato, M. (2006) Heterarchical reinforcement-learning model for integration of multiple cortico-striatal loops: fMRI examination in stimulus-action-reward association learning. Neural. Netw., 19, 1242-1254.
    • (2006) Neural. Netw. , vol.19 , pp. 1242-1254
    • Haruno, M.1    Kawato, M.2
  • 43
    • 33749368172 scopus 로고    scopus 로고
    • Dynamic shifts in corticostriatal expression patterns of the immediate early genes Homer 1a and Zif268 during early and late phases of instrumental training
    • Hernandez, P.J., Schiltz, C.A. & Kelley, A.E. (2006) Dynamic shifts in corticostriatal expression patterns of the immediate early genes Homer 1a and Zif268 during early and late phases of instrumental training. Learn. Mem., 13, 599-608.
    • (2006) Learn. Mem. , vol.13 , pp. 599-608
    • Hernandez, P.J.1    Schiltz, C.A.2    Kelley, A.E.3
  • 44
    • 0028972201 scopus 로고
    • Learning of sequential movements in the monkey: process of learning and retention of memory
    • Hikosaka, O., Rand, M.K., Miyachi, S. & Miyashita, K. (1995) Learning of sequential movements in the monkey: process of learning and retention of memory. J. Neurophysiol., 74, 1652-1661.
    • (1995) J. Neurophysiol. , vol.74 , pp. 1652-1661
    • Hikosaka, O.1    Rand, M.K.2    Miyachi, S.3    Miyashita, K.4
  • 46
    • 0000148778 scopus 로고
    • A heuristic approach to the discovery of macro-operators
    • Iba, G.A. (1989) A heuristic approach to the discovery of macro-operators. Mach. Learn., 3, 285-317.
    • (1989) Mach. Learn. , vol.3 , pp. 285-317
    • Iba, G.A.1
  • 47
    • 77954925944 scopus 로고    scopus 로고
    • Start/stop signals emerge in nigrostriatal circuits during sequence learning
    • Jin, X. & Costa, R.M. (2010) Start/stop signals emerge in nigrostriatal circuits during sequence learning. Nature, 466, 457-462.
    • (2010) Nature , vol.466 , pp. 457-462
    • Jin, X.1    Costa, R.M.2
  • 50
    • 0038618950 scopus 로고    scopus 로고
    • The cognitive and neural architecture of sequence representation
    • Keele, S.W., Ivry, R., Mayr, U., Hazeltine, E. & Heuer, H. (2003) The cognitive and neural architecture of sequence representation. Psychol. Rev., 110, 316-339.
    • (2003) Psychol. Rev. , vol.110 , pp. 316-339
    • Keele, S.W.1    Ivry, R.2    Mayr, U.3    Hazeltine, E.4    Heuer, H.5
  • 51
    • 0344440897 scopus 로고    scopus 로고
    • Macro-architecture of basal ganglia loops with the cerebral cortex: use of rabies virus to reveal multisynaptic circuits
    • Kelly, R.M. & Strick, P.L. (2004) Macro-architecture of basal ganglia loops with the cerebral cortex: use of rabies virus to reveal multisynaptic circuits. Prog. Brain Res., 143, 449-459.
    • (2004) Prog. Brain Res. , vol.143 , pp. 449-459
    • Kelly, R.M.1    Strick, P.L.2
  • 52
    • 79958143780 scopus 로고    scopus 로고
    • Speed/accuracy trade-off between the habitual and the goal-directed processes
    • Keramati, M., Dezfouli, A. & Piray, P. (2011) Speed/accuracy trade-off between the habitual and the goal-directed processes. PLoS Comput. Biol., 7, e1002055.
    • (2011) PLoS Comput. Biol. , vol.7
    • Keramati, M.1    Dezfouli, A.2    Piray, P.3
  • 53
    • 77955819329 scopus 로고    scopus 로고
    • A probabilistic approach to mixed open-loop and closed-loop control, with application to extreme autonomous driving. Proc. of the IEEE Int.Conf.on Robotics & Automation (ICRA ). Anchorage, Alaska, USA
    • Kolter, Z., Plagemann, C., Jackson, D.T., Ng, A. & Thrun, S. (2010) A probabilistic approach to mixed open-loop and closed-loop control, with application to extreme autonomous driving. Proc. of the IEEE Int.Conf.on Robotics & Automation (ICRA ). Anchorage, Alaska, USA.
    • (2010)
    • Kolter, Z.1    Plagemann, C.2    Jackson, D.T.3    Ng, A.4    Thrun, S.5
  • 54
    • 0022045044 scopus 로고
    • Macro-operators: a weak method for learning
    • Korf, R.E. (1985) Macro-operators: a weak method for learning. Artif. Intell., 26, 35-77.
    • (1985) Artif. Intell. , vol.26 , pp. 35-77
    • Korf, R.E.1
  • 55
    • 70350279035 scopus 로고    scopus 로고
    • Stable encoding of task structure coexists with flexible coding of task events in sensorimotor striatum
    • Kubota, Y., Liu, J., Hu, D., DeCoteau, W.E., Eden, U.T., Smith, A.C. & Graybiel, A.M. (2009) Stable encoding of task structure coexists with flexible coding of task events in sensorimotor striatum. J. Neurophysiol., 102, 2142-2160.
    • (2009) J. Neurophysiol. , vol.102 , pp. 2142-2160
    • Kubota, Y.1    Liu, J.2    Hu, D.3    DeCoteau, W.E.4    Eden, U.T.5    Smith, A.C.6    Graybiel, A.M.7
  • 56
    • 0001990073 scopus 로고
    • The problem of serial order in behavior
    • Jeffress, L.A. (Ed.) Wiley, NY -
    • Lashley, K.S. (1951) The problem of serial order in behavior. In Jeffress, L.A. (Ed.) Cerebral Mechanisms in Behavior. Wiley, NY, pp. 112-136.
    • (1951) Cerebral Mechanisms in Behavior , pp. 112-136
    • Lashley, K.S.1
  • 58
    • 0033174825 scopus 로고    scopus 로고
    • Nigrostriatal dopamine system in learning to perform sequential motor tasks in a predictive manner
    • Matsumoto, N., Hanakawa, T., Maki, S., Graybiel, A.M. & Kimura, M. (1999) Nigrostriatal dopamine system in learning to perform sequential motor tasks in a predictive manner. J. Neurophysiol., 82, 978-998.
    • (1999) J. Neurophysiol. , vol.82 , pp. 978-998
    • Matsumoto, N.1    Hanakawa, T.2    Maki, S.3    Graybiel, A.M.4    Kimura, M.5
  • 59
    • 0029752592 scopus 로고    scopus 로고
    • Average reward reinforcement learning: foundations, algorithms, and empirical results
    • Mahadevan, S. (1996) Average reward reinforcement learning: foundations, algorithms, and empirical results. Mach. Learn., 22, 159-195.
    • (1996) Mach. Learn. , vol.22 , pp. 159-195
    • Mahadevan, S.1
  • 60
    • 33846916162 scopus 로고    scopus 로고
    • Skill representation in the primary motor cortex after long-term practice
    • Matsuzaka, Y., Picard, N. & Strick, P.L. (2007) Skill representation in the primary motor cortex after long-term practice. J. Neurophysiol., 97, 1819-1832.
    • (2007) J. Neurophysiol. , vol.97 , pp. 1819-1832
    • Matsuzaka, Y.1    Picard, N.2    Strick, P.L.3
  • 62
    • 0030910984 scopus 로고    scopus 로고
    • Differential roles of monkey striatum in learning of sequential hand movement
    • Miyachi, S., Hikosaka, O., Miyashita, K., Kárádi, Z. & Rand, M.K. (1997) Differential roles of monkey striatum in learning of sequential hand movement. Exp. Brain Res., 115, 1-5.
    • (1997) Exp. Brain Res. , vol.115 , pp. 1-5
    • Miyachi, S.1    Hikosaka, O.2    Miyashita, K.3    Kárádi, Z.4    Rand, M.K.5
  • 63
    • 0036361088 scopus 로고    scopus 로고
    • Differential activation of monkey striatal neurons in the early and late stages of procedural learning
    • Miyachi, S., Hikosaka, O. & Lu, X. (2002) Differential activation of monkey striatal neurons in the early and late stages of procedural learning. Exp. Brain Res., 146, 122-126.
    • (2002) Exp. Brain Res. , vol.146 , pp. 122-126
    • Miyachi, S.1    Hikosaka, O.2    Lu, X.3
  • 64
    • 9444282076 scopus 로고    scopus 로고
    • Anticipatory saccades in sequential procedural learning in monkeys
    • Miyashita, K., Rand, M.K., Miyachi, S. & Hikosaka, O. (1996) Anticipatory saccades in sequential procedural learning in monkeys. J. Neurophysiol., 76, 1361-1366.
    • (1996) J. Neurophysiol. , vol.76 , pp. 1361-1366
    • Miyashita, K.1    Rand, M.K.2    Miyachi, S.3    Hikosaka, O.4
  • 65
    • 0029981543 scopus 로고    scopus 로고
    • A framework for mesencephalic dopamine systems based on predictive Hebbian learning
    • Montague, P.R., Dayan, P. & Sejnowski, T.J. (1996) A framework for mesencephalic dopamine systems based on predictive Hebbian learning. J. Neurosci., 16, 1936-1947.
    • (1996) J. Neurosci. , vol.16 , pp. 1936-1947
    • Montague, P.R.1    Dayan, P.2    Sejnowski, T.J.3
  • 66
    • 79959466871 scopus 로고    scopus 로고
    • Behavior contributions of dorsal striatal subregions to spatial alternation behavior
    • Moussa, R., Poucet, B., Amalric, M. & Sargolini, F. (2011) Behavior contributions of dorsal striatal subregions to spatial alternation behavior. Learn. Mem., 18, 444-451.
    • (2011) Learn. Mem. , vol.18 , pp. 444-451
    • Moussa, R.1    Poucet, B.2    Amalric, M.3    Sargolini, F.4
  • 67
    • 0000983889 scopus 로고
    • Habit strength as a function of the pattern of reinforcement
    • Mowrer, O.H. & Jones, H. (1945) Habit strength as a function of the pattern of reinforcement. J. Exp. Psychol., 35, 293-311.
    • (1945) J. Exp. Psychol. , vol.35 , pp. 293-311
    • Mowrer, O.H.1    Jones, H.2
  • 68
    • 0035399093 scopus 로고    scopus 로고
    • Parallel cortico-basal ganglia mechanisms for acquisition and execution of visuomotor sequences - a computational approach
    • Nakahara, H., Doya, K. & Hikosaka, O. (2001) Parallel cortico-basal ganglia mechanisms for acquisition and execution of visuomotor sequences - a computational approach. J. Cogn. Neurosci., 13, 626-647.
    • (2001) J. Cogn. Neurosci. , vol.13 , pp. 626-647
    • Nakahara, H.1    Doya, K.2    Hikosaka, O.3
  • 69
    • 10844248448 scopus 로고    scopus 로고
    • Reinforced variability in animals and people: implications for adaptive action
    • Neuringer, A. (2004) Reinforced variability in animals and people: implications for adaptive action. Am. Psychol., 59, 891-906.
    • (2004) Am. Psychol. , vol.59 , pp. 891-906
    • Neuringer, A.1
  • 70
    • 77955326088 scopus 로고    scopus 로고
    • Operant variability and voluntary action
    • Neuringer, A. & Jensen, G. (2010) Operant variability and voluntary action. Psychol. Rev., 117, 972-993.
    • (2010) Psychol. Rev. , vol.117 , pp. 972-993
    • Neuringer, A.1    Jensen, G.2
  • 71
    • 0035229321 scopus 로고    scopus 로고
    • Time scales in motor learning and development
    • Newell, K.M., Liu, Y.T. & Mayer-Kress, G. (2001) Time scales in motor learning and development. Psychol. Rev., 108, 57-82.
    • (2001) Psychol. Rev. , vol.108 , pp. 57-82
    • Newell, K.M.1    Liu, Y.T.2    Mayer-Kress, G.3
  • 72
    • 0003109533 scopus 로고
    • Attentional requirements of learning: performance measures evidence from
    • Nissen, M.J. & Bullemer, P. (1987) Attentional requirements of learning: performance measures evidence from. Cogn. Psychol., 19, 1-32.
    • (1987) Cogn. Psychol. , vol.19 , pp. 1-32
    • Nissen, M.J.1    Bullemer, P.2
  • 73
    • 33847675011 scopus 로고    scopus 로고
    • Tonic dopamine: opportunity costs and the control of response vigor
    • Niv, Y., Daw, N.D., Joel, D. & Dayan, P. (2007) Tonic dopamine: opportunity costs and the control of response vigor. Psychopharmacology, 191, 507-520.
    • (2007) Psychopharmacology , vol.191 , pp. 507-520
    • Niv, Y.1    Daw, N.D.2    Joel, D.3    Dayan, P.4
  • 74
    • 1942520195 scopus 로고    scopus 로고
    • Dissociable roles of ventral and dorsal striatum in instrumental conditioning
    • O'Doherty, J.P., Dayan, P., Schultz, J., Deichmann, R., Friston, K. & Dolan, R.J. (2004) Dissociable roles of ventral and dorsal striatum in instrumental conditioning. Science, 304, 452-454.
    • (2004) Science , vol.304 , pp. 452-454
    • O'Doherty, J.P.1    Dayan, P.2    Schultz, J.3    Deichmann, R.4    Friston, K.5    Dolan, R.J.6
  • 76
    • 67649342617 scopus 로고    scopus 로고
    • Evidence of action sequence chunking in goal-directed instrumental conditioning and its dependence on the dorsomedial prefrontal cortex
    • Ostlund, S.B., Winterbauer, N.E. & Balleine, B.W. (2009) Evidence of action sequence chunking in goal-directed instrumental conditioning and its dependence on the dorsomedial prefrontal cortex. J. Neurosci., 29, 8280-8287.
    • (2009) J. Neurosci. , vol.29 , pp. 8280-8287
    • Ostlund, S.B.1    Winterbauer, N.E.2    Balleine, B.W.3
  • 77
    • 49549148711 scopus 로고
    • The free-operant partial reinforcement effect: a discrimination analysis
    • Overmann, S.R. & Denny, M.R. (1974) The free-operant partial reinforcement effect: a discrimination analysis. Learn. Motiv., 5, 248-257.
    • (1974) Learn. Motiv. , vol.5 , pp. 248-257
    • Overmann, S.R.1    Denny, M.R.2
  • 78
    • 0029972847 scopus 로고    scopus 로고
    • Inactivation of hippocampus or caudate nucleus with lidocaine differentially affects expression of place and response learning
    • Packard, M.G. & McGaugh, J.L. (1996) Inactivation of hippocampus or caudate nucleus with lidocaine differentially affects expression of place and response learning. Neurobiol. Learn. Mem., 65, 65-72.
    • (1996) Neurobiol. Learn. Mem. , vol.65 , pp. 65-72
    • Packard, M.G.1    McGaugh, J.L.2
  • 79
    • 58149406298 scopus 로고
    • A hierarchical response-unit analysis of resistance to extinction following fixed-number and fixed-consecutive-number reinforcement
    • Platt, J.R. & Day, R.B. (1979) A hierarchical response-unit analysis of resistance to extinction following fixed-number and fixed-consecutive-number reinforcement. J. Exp. Psychol. Anim. Behav. Process., 5, 307-320.
    • (1979) J. Exp. Psychol. Anim. Behav. Process. , vol.5 , pp. 307-320
    • Platt, J.R.1    Day, R.B.2
  • 82
    • 0007972375 scopus 로고    scopus 로고
    • Learning macro-actions in reinforcement learning
    • Kearns, M.J., Solla, S.A. & Cohn, D.A. (Eds). MIT Press, Denver, CO, USA -
    • Randløv, J. (1998) Learning macro-actions in reinforcement learning. In Kearns, M.J., Solla, S.A. & Cohn, D.A. (Eds). Advances in Neural Information Processing Systems 11. MIT Press, Denver, CO, USA, pp. 1045-1051.
    • (1998) Advances in Neural Information Processing Systems 11 , pp. 1045-1051
    • Randløv, J.1
  • 83
    • 45749098894 scopus 로고    scopus 로고
    • A framework for studying the neurobiology of value-based decision making
    • Rangel, A., Camerer, C. & Montague, P.R. (2008) A framework for studying the neurobiology of value-based decision making. Nat. Rev. Neurosci., 9, 545-556.
    • (2008) Nat. Rev. Neurosci. , vol.9 , pp. 545-556
    • Rangel, A.1    Camerer, C.2    Montague, P.R.3
  • 84
    • 48349092693 scopus 로고    scopus 로고
    • A unified framework for addiction: vulnerabilities in the decision process
    • discussion 437-487
    • Redish, A.D., Jensen, S. & Johnson, A. (2008) A unified framework for addiction: vulnerabilities in the decision process. Behav. Brain Sci., 31, 415-437; discussion 437-487.
    • (2008) Behav. Brain Sci. , vol.31 , pp. 415-437
    • Redish, A.D.1    Jensen, S.2    Johnson, A.3
  • 85
    • 0000564572 scopus 로고
    • Discrimination of cues in mazes: a resolution of the place-vs.-response question
    • Restle, F. (1957) Discrimination of cues in mazes: a resolution of the place-vs.-response question. Psychol. Rev., 64, 217-228.
    • (1957) Psychol. Rev. , vol.64 , pp. 217-228
    • Restle, F.1
  • 86
    • 0035817882 scopus 로고    scopus 로고
    • A cellular mechanism of reward-related learning
    • Reynolds, J.N., Hyland, B.I. & Wickens, J.R. (2001) A cellular mechanism of reward-related learning. Nature, 413, 67-70.
    • (2001) Nature , vol.413 , pp. 67-70
    • Reynolds, J.N.1    Hyland, B.I.2    Wickens, J.R.3
  • 88
    • 0002196210 scopus 로고
    • Studies in spatial learning: VIII. Place performance and the acquisition of place dispositions
    • Ritchie, B.F., Aeschliman, B. & Pierce, P. (1950) Studies in spatial learning: VIII. Place performance and the acquisition of place dispositions. J. Comp. Physiol. Psychol., 43, 73-85.
    • (1950) J. Comp. Physiol. Psychol. , vol.43 , pp. 73-85
    • Ritchie, B.F.1    Aeschliman, B.2    Pierce, P.3
  • 89
    • 33646082742 scopus 로고    scopus 로고
    • Impaired sequential egocentric and allocentric memories in forebrain-specific-NMDA receptor knock-out mice during a new task dissociating strategies of navigation
    • Rondi-Reig, L., Petit, G.H., Tobin, C., Tonegawa, S., Mariani, J. & Berthoz, A. (2006) Impaired sequential egocentric and allocentric memories in forebrain-specific-NMDA receptor knock-out mice during a new task dissociating strategies of navigation. J. Neurosci., 26, 4071-4081.
    • (2006) J. Neurosci. , vol.26 , pp. 4071-4081
    • Rondi-Reig, L.1    Petit, G.H.2    Tobin, C.3    Tonegawa, S.4    Mariani, J.5    Berthoz, A.6
  • 90
    • 0141564821 scopus 로고    scopus 로고
    • Chunking during human visuomotor sequence learning
    • Sakai, K., Kitaguchi, K. & Hikosaka, O. (2003) Chunking during human visuomotor sequence learning. Exp. Brain Res., 152, 229-242.
    • (2003) Exp. Brain Res. , vol.152 , pp. 229-242
    • Sakai, K.1    Kitaguchi, K.2    Hikosaka, O.3
  • 91
    • 8544268868 scopus 로고    scopus 로고
    • Neuronal activity in the rodent dorsal striatum in sequential navigation: separation of spatial and reward responses on the multiple T task
    • Schmitzer-Torbert, N. & Redish, A.D. (2004) Neuronal activity in the rodent dorsal striatum in sequential navigation: separation of spatial and reward responses on the multiple T task. J. Neurophysiol., 91, 2259-2272.
    • (2004) J. Neurophysiol. , vol.91 , pp. 2259-2272
    • Schmitzer-Torbert, N.1    Redish, A.D.2
  • 92
    • 0034078011 scopus 로고    scopus 로고
    • Neuronal coding of prediction errors
    • Schultz, W. & Dickinson, A. (2000) Neuronal coding of prediction errors. Annu. Rev. Neurosci., 23, 473-500.
    • (2000) Annu. Rev. Neurosci. , vol.23 , pp. 473-500
    • Schultz, W.1    Dickinson, A.2
  • 93
    • 0030896968 scopus 로고    scopus 로고
    • A neural substrate of prediction and reward
    • Schultz, W., Dayan, P. & Montague, P.R. (1997) A neural substrate of prediction and reward. Science, 275, 1593-1599.
    • (1997) Science , vol.275 , pp. 1593-1599
    • Schultz, W.1    Dayan, P.2    Montague, P.R.3
  • 94
    • 61349098831 scopus 로고    scopus 로고
    • Rodent models of serial reaction time tasks and their implementation in neurobiological research
    • Schwarting, R.K.W. (2009) Rodent models of serial reaction time tasks and their implementation in neurobiological research. Behav. Brain Res., 199, 76-88.
    • (2009) Behav. Brain Res. , vol.199 , pp. 76-88
    • Schwarting, R.K.W.1
  • 96
    • 70449698526 scopus 로고    scopus 로고
    • Effect on movement selection of an evolving sensory representation: a multiple controller model of skill acquisition
    • Elsevier B.V
    • Shah, A. & Barto, A.G. (2009). Effect on movement selection of an evolving sensory representation: a multiple controller model of skill acquisition. Brain Res., 1299, 55-73. Elsevier B.V.
    • (2009) Brain Res. , vol.1299 , pp. 55-73
    • Shah, A.1    Barto, A.G.2
  • 97
    • 2142646065 scopus 로고    scopus 로고
    • Bouts of responding from variable-interval reinforcement of lever pressing by rats
    • Shull, R.L. & Grimes, J.A. (2003) Bouts of responding from variable-interval reinforcement of lever pressing by rats. J. Exp. Anal. Behav., 80, 159-171.
    • (2003) J. Exp. Anal. Behav. , vol.80 , pp. 159-171
    • Shull, R.L.1    Grimes, J.A.2
  • 98
    • 0036562853 scopus 로고    scopus 로고
    • Response rate viewed as engagement bouts: resistance to extinction
    • Shull, R.L., Gaynor, S.T. & Grimes, J.A. (2002) Response rate viewed as engagement bouts: resistance to extinction. J. Exp. Anal. Behav., 77, 211-231.
    • (2002) J. Exp. Anal. Behav. , vol.77 , pp. 211-231
    • Shull, R.L.1    Gaynor, S.T.2    Grimes, J.A.3
  • 102
    • 33847182102 scopus 로고    scopus 로고
    • Changes in activity of the striatum during formation of a motor habit
    • Tang, C., Pawlak, A.P., Prokopenko, V. & West, M.O. (2007) Changes in activity of the striatum during formation of a motor habit. Eur. J. Neurosci., 25, 1212-1227.
    • (2007) Eur. J. Neurosci. , vol.25 , pp. 1212-1227
    • Tang, C.1    Pawlak, A.P.2    Prokopenko, V.3    West, M.O.4
  • 103
    • 0025982683 scopus 로고
    • Chunking during serial learning by a pigeon: I. Basic evidence
    • Terrace, H.S. (1991) Chunking during serial learning by a pigeon: I. Basic evidence. J. Exp. Psychol. Anim. Behav. Process., 17, 81-93.
    • (1991) J. Exp. Psychol. Anim. Behav. Process. , vol.17 , pp. 81-93
    • Terrace, H.S.1
  • 104
    • 0002096715 scopus 로고
    • Studies in spatial learning: II. Place learning versus response learning
    • Tolman, E.C., Ritchie, B.F. & Kalish, D. (1946) Studies in spatial learning: II. Place learning versus response learning. J. Exp. Psychol., 36, 221-229.
    • (1946) J. Exp. Psychol. , vol.36 , pp. 221-229
    • Tolman, E.C.1    Ritchie, B.F.2    Kalish, D.3
  • 105
    • 0033221519 scopus 로고    scopus 로고
    • Average cost temporal-difference learning
    • Tsitsiklis, J.N. & Roy, B.V. (1999) Average cost temporal-difference learning. Automatica, 35, 1799-1808.
    • (1999) Automatica , vol.35 , pp. 1799-1808
    • Tsitsiklis, J.N.1    Roy, B.V.2
  • 106
    • 78651481149 scopus 로고    scopus 로고
    • Basal ganglia contributions to motor control: a vigorous tutor
    • Turner, R.S. & Desmurget, M. (2010) Basal ganglia contributions to motor control: a vigorous tutor. Curr. Opin. Neurobiol., 20, 704-716.
    • (2010) Curr. Opin. Neurobiol. , vol.20 , pp. 704-716
    • Turner, R.S.1    Desmurget, M.2
  • 108
    • 0032116811 scopus 로고    scopus 로고
    • A neuropsychological theory of motor skill learning
    • Willingham, D.B. (1998) A neuropsychological theory of motor skill learning. Psychol. Rev., 105, 558-584.
    • (1998) Psychol. Rev. , vol.105 , pp. 558-584
    • Willingham, D.B.1
  • 112
    • 4043144491 scopus 로고    scopus 로고
    • Contributions of striatal subregions to place and response learning
    • Yin, H.H. & Knowlton, B.J. (2004) Contributions of striatal subregions to place and response learning. Learn. Mem., 11, 459-463.
    • (2004) Learn. Mem. , vol.11 , pp. 459-463
    • Yin, H.H.1    Knowlton, B.J.2
  • 113
    • 1642580578 scopus 로고    scopus 로고
    • Lesions of dorsolateral striatum preserve outcome expectancy but disrupt habit formation in instrumental learning
    • Yin, H.H., Knowlton, B.J. & Balleine, B.W. (2004) Lesions of dorsolateral striatum preserve outcome expectancy but disrupt habit formation in instrumental learning. Eur. J. Neurosci., 19, 181-189.
    • (2004) Eur. J. Neurosci. , vol.19 , pp. 181-189
    • Yin, H.H.1    Knowlton, B.J.2    Balleine, B.W.3
  • 114
    • 23244461369 scopus 로고    scopus 로고
    • Blockade of NMDA receptors in the dorsomedial striatum prevents action-outcome learning in instrumental conditioning
    • Yin, H.H., Knowlton, B.J. & Balleine, B.W. (2005) Blockade of NMDA receptors in the dorsomedial striatum prevents action-outcome learning in instrumental conditioning. Eur. J. Neurosci., 22, 505-512.
    • (2005) Eur. J. Neurosci. , vol.22 , pp. 505-512
    • Yin, H.H.1    Knowlton, B.J.2    Balleine, B.W.3
  • 115
    • 28944442093 scopus 로고    scopus 로고
    • Inactivation of dorsolateral striatum enhances sensitivity to changes in the action-outcome contingency in instrumental conditioning
    • Yin, H.H., Knowlton, B.J. & Balleine, B.W. (2006) Inactivation of dorsolateral striatum enhances sensitivity to changes in the action-outcome contingency in instrumental conditioning. Behav. Brain Res., 166, 189-196.
    • (2006) Behav. Brain Res. , vol.166 , pp. 189-196
    • Yin, H.H.1    Knowlton, B.J.2    Balleine, B.W.3
  • 116
    • 53949118376 scopus 로고    scopus 로고
    • Reward-guided learning beyond dopamine in the nucleus accumbens: the integrative functions of cortico-basal ganglia networks
    • Yin, H.H., Ostlund, S.B. & Balleine, B.W. (2008) Reward-guided learning beyond dopamine in the nucleus accumbens: the integrative functions of cortico-basal ganglia networks. Eur. J. Neurosci., 28, 1437-1448.
    • (2008) Eur. J. Neurosci. , vol.28 , pp. 1437-1448
    • Yin, H.H.1    Ostlund, S.B.2    Balleine, B.W.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.