메뉴 건너뛰기




Volumn 4, Issue MAR, 2013, Pages

The mixed instrumental controller: Using value of information to combine habitual choice and mental simulation

Author keywords

Exploration exploitation; Forward sweeps; Goal directed decision making; Hippocampus; Model based reinforcement learning; Value of information; Ventral striatum

Indexed keywords


EID: 84878783112     PISSN: None     EISSN: 16641078     Source Type: Journal    
DOI: 10.3389/fpsyg.2013.00092     Document Type: Article
Times cited : (116)

References (109)
  • 1
    • 80053265513 scopus 로고    scopus 로고
    • Medial prefrontal cortex as an action-outcome predictor
    • Alexander, W. H., and Brown, J. W. (2011). Medial prefrontal cortex as an action-outcome predictor. Nat. Neurosci. 14, 1338-1344.
    • (2011) Nat. Neurosci. , vol.14 , pp. 1338-1344
    • Alexander, W.H.1    Brown, J.W.2
  • 2
    • 23244432007 scopus 로고    scopus 로고
    • An integrative theory of locus coeruleus-norepinephrine function: adaptive gain and optimal performance
    • Aston-Jones, G., and Cohen, J. D. (2005). An integrative theory of locus coeruleus-norepinephrine function: adaptive gain and optimal performance. Annu. Rev. Neurosci. 28, 403-450.
    • (2005) Annu. Rev. Neurosci. , vol.28 , pp. 403-450
    • Aston-Jones, G.1    Cohen, J.D.2
  • 3
    • 72049125602 scopus 로고    scopus 로고
    • Human and rodent homologies in action control: corticostriatal determinants of goal-directed and habitual action
    • Balleine, B., and O'Doherty, J. (2009). Human and rodent homologies in action control: corticostriatal determinants of goal-directed and habitual action. Neuropsychopharmacology 35, 48-69.
    • (2009) Neuropsychopharmacology , vol.35 , pp. 48-69
    • Balleine, B.1    O'Doherty, J.2
  • 4
    • 0031801210 scopus 로고    scopus 로고
    • Goal-directed instrumental action: contingency and incentive learning and their cortical substrates
    • Balleine, B. W., and Dickinson, A. (1998). Goal-directed instrumental action: contingency and incentive learning and their cortical substrates. Neuropharmacology 37, 407-419.
    • (1998) Neuropharmacology , vol.37 , pp. 407-419
    • Balleine, B.W.1    Dickinson, A.2
  • 6
    • 0031351658 scopus 로고    scopus 로고
    • A Bayesian approach to relevance in game playing
    • Baum, E. B., and Smith, W. D. (1997). A Bayesian approach to relevance in game playing. Artif. Intell. 97, 195-242.
    • (1997) Artif. Intell. , vol.97 , pp. 195-242
    • Baum, E.B.1    Smith, W.D.2
  • 8
    • 78650972934 scopus 로고    scopus 로고
    • Spontaneous cortical activity reveals hallmarks of an optimal internal model of the environment
    • Berkes, P., Orbn, G., Lengyel, M., and Fiser, J. (2011). Spontaneous cortical activity reveals hallmarks of an optimal internal model of the environment. Science 331, 83-87.
    • (2011) Science , vol.331 , pp. 83-87
    • Berkes, P.1    Orbn, G.2    Lengyel, M.3    Fiser, J.4
  • 10
    • 79960203266 scopus 로고    scopus 로고
    • Multiplicity of control in the basal ganglia: computational roles of striatal subregions
    • Bornstein, A. M., and Daw, N. D. (2011). Multiplicity of control in the basal ganglia: computational roles of striatal subregions. Curr. Opin. Neurobiol. 21, 374-380.
    • (2011) Curr. Opin. Neurobiol. , vol.21 , pp. 374-380
    • Bornstein, A.M.1    Daw, N.D.2
  • 11
    • 70350566799 scopus 로고    scopus 로고
    • Hierarchically organized behavior and its neural foundations: a reinforcement learning perspective
    • Botvinick, M., Niv, Y., and Barto, A. (2009). Hierarchically organized behavior and its neural foundations: a reinforcement learning perspective. Cognition 113, 262-280.
    • (2009) Cognition , vol.113 , pp. 262-280
    • Botvinick, M.1    Niv, Y.2    Barto, A.3
  • 12
    • 43049099970 scopus 로고    scopus 로고
    • Hierarchical models of behavior and prefrontal function
    • Botvinick, M. M. (2008). Hierarchical models of behavior and prefrontal function. Trends Cogn. Sci. (Regul. Ed.) 12, 201-208.
    • (2008) Trends Cogn. Sci. (Regul. Ed.) , vol.12 , pp. 201-208
    • Botvinick, M.M.1
  • 15
    • 80052211211 scopus 로고    scopus 로고
    • Lateral habenula neurons signal errors in the prediction of reward information
    • Bromberg-Martin, E. S., and Hikosaka, O. (2011). Lateral habenula neurons signal errors in the prediction of reward information. Nat. Neurosci. 14, 1209-1216.
    • (2011) Nat. Neurosci. , vol.14 , pp. 1209-1216
    • Bromberg-Martin, E.S.1    Hikosaka, O.2
  • 16
    • 77449143455 scopus 로고    scopus 로고
    • The role of the hippocampus in prediction and imagination
    • C1-C8
    • Buckner, R. L. (2010). The role of the hippocampus in prediction and imagination. Annu. Rev. Psychol. 61, 27-48, C1-C8.
    • (2010) Annu. Rev. Psychol. , vol.61 , pp. 27-48
    • Buckner, R.L.1
  • 17
    • 34250757050 scopus 로고    scopus 로고
    • The gateway hypothesis of rostral prefrontal cortex (area 10) function
    • Burgess, P. W., Dumontheil, I., and Gilbert, S. J. (2007). The gateway hypothesis of rostral prefrontal cortex (area 10) function. Trends Cogn. Sci. (Regul. Ed.) 11, 290-298.
    • (2007) Trends Cogn. Sci. (Regul. Ed.) , vol.11 , pp. 290-298
    • Burgess, P.W.1    Dumontheil, I.2    Gilbert, S.J.3
  • 18
    • 79251550466 scopus 로고    scopus 로고
    • Hippocampal replay in the awake state: a potential substrate for memory consolidation and retrieval
    • Carr, M., Jadhav, S., and Frank, L. (2011). Hippocampal replay in the awake state: a potential substrate for memory consolidation and retrieval. Nat. Neurosci. 14, 147-153.
    • (2011) Nat. Neurosci. , vol.14 , pp. 147-153
    • Carr, M.1    Jadhav, S.2    Frank, L.3
  • 19
    • 84875878352 scopus 로고    scopus 로고
    • A spiking neuron model of the cortico-basal ganglia circuits for goal-directed and habitual action learning
    • doi:10.1016/j.neunet.2012.11.009. [Epub ahead of print]
    • Chersi, F., Mirolli, M., Pezzulo, G., and Baldassarre, G. (2012). A spiking neuron model of the cortico-basal ganglia circuits for goal-directed and habitual action learning. Neural Netw. doi:10.1016/j.neunet.2012.11.009. [Epub ahead of print].
    • (2012) Neural Netw.
    • Chersi, F.1    Mirolli, M.2    Pezzulo, G.3    Baldassarre, G.4
  • 20
    • 84872823292 scopus 로고    scopus 로고
    • Using hippocampal-striatal loops for spatial navigation and goal-directed decision-making
    • Chersi, F., and Pezzulo, G. (2012). Using hippocampal-striatal loops for spatial navigation and goal-directed decision-making. Cogn. Process. 13, 125-129.
    • (2012) Cogn. Process. , vol.13 , pp. 125-129
    • Chersi, F.1    Pezzulo, G.2
  • 21
    • 33748883565 scopus 로고    scopus 로고
    • Integrated neural processes for defining potential actions and deciding between them: a computational model
    • Cisek, P. (2006). Integrated neural processes for defining potential actions and deciding between them: a computational model. J. Neurosci. 26, 9761-9770.
    • (2006) J. Neurosci. , vol.26 , pp. 9761-9770
    • Cisek, P.1
  • 22
    • 84878210201 scopus 로고    scopus 로고
    • Making decisions through a distributed consensus
    • Cisek, P. (2012). Making decisions through a distributed consensus. Curr. Opin. Neurobiol. 22, 927-936.
    • (2012) Curr. Opin. Neurobiol. , vol.22 , pp. 927-936
    • Cisek, P.1
  • 23
    • 14644435687 scopus 로고    scopus 로고
    • Neural correlates of reaching decisions in dorsal premotor cortex: specification of multiple direction choices and final selection of action
    • Cisek, P., and Kalaska, J. F. (2005). Neural correlates of reaching decisions in dorsal premotor cortex: specification of multiple direction choices and final selection of action. Neuron 45, 801-814.
    • (2005) Neuron , vol.45 , pp. 801-814
    • Cisek, P.1    Kalaska, J.F.2
  • 24
    • 77956987126 scopus 로고    scopus 로고
    • Neural mechanisms for interacting with a world full of action choices
    • Cisek, P., and Kalaska, J. F. (2010). Neural mechanisms for interacting with a world full of action choices. Annu. Rev. Neurosci. 33, 269-298.
    • (2010) Annu. Rev. Neurosci. , vol.33 , pp. 269-298
    • Cisek, P.1    Kalaska, J.F.2
  • 25
    • 34250348767 scopus 로고    scopus 로고
    • Should i stay or should i go? how the human brain manages the trade-off between exploitation and exploration
    • Cohen, J. D., McClure, S. M., and Yu, A. J. (2007). Should i stay or should i go? how the human brain manages the trade-off between exploitation and exploration. Philos. Trans. R. Soc. Lond. B Biol. Sci. 362, 933-942.
    • (2007) Philos. Trans. R. Soc. Lond. B Biol. Sci. , vol.362 , pp. 933-942
    • Cohen, J.D.1    McClure, S.M.2    Yu, A.J.3
  • 26
    • 84877264809 scopus 로고    scopus 로고
    • Model-based reinforcement learning as cognitive search: neurocomputational theories
    • eds P. Todd and T. Robbins (MIT Press)
    • Daw, N. (2012). "Model-based reinforcement learning as cognitive search: neurocomputational theories," in Cognitive Search: Evolution, Algorithms and the Brain, eds P. Todd and T. Robbins (MIT Press).
    • (2012) Cognitive Search: Evolution, Algorithms and the Brain
    • Daw, N.1
  • 27
    • 28044450875 scopus 로고    scopus 로고
    • Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
    • Daw, N. D., Niv, Y., and Dayan, P. (2005). Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nat. Neurosci. 8, 1704-1711.
    • (2005) Nat. Neurosci. , vol.8 , pp. 1704-1711
    • Daw, N.D.1    Niv, Y.2    Dayan, P.3
  • 28
    • 33745223257 scopus 로고    scopus 로고
    • Cortical substrates for exploratory decisions in humans
    • Daw, N. D., O'Doherty, J. P., Dayan, P., Seymour, B., and Dolan, R. J. (2006). Cortical substrates for exploratory decisions in humans. Nature 441, 876-879.
    • (2006) Nature , vol.441 , pp. 876-879
    • Daw, N.D.1    O'Doherty, J.P.2    Dayan, P.3    Seymour, B.4    Dolan, R.J.5
  • 29
    • 67349170462 scopus 로고    scopus 로고
    • Goal-directed control and its antipodes
    • Dayan, P. (2009). Goal-directed control and its antipodes. Neural Netw. 22, 213-219.
    • (2009) Neural Netw. , vol.22 , pp. 213-219
    • Dayan, P.1
  • 30
    • 84859341150 scopus 로고    scopus 로고
    • Habits, action sequences and reinforcement learning
    • Dezfouli, A., and Balleine, B. W. (2012). Habits, action sequences and reinforcement learning. Eur. J. Neurosci. 35, 1036-1051.
    • (2012) Eur. J. Neurosci. , vol.35 , pp. 1036-1051
    • Dezfouli, A.1    Balleine, B.W.2
  • 31
    • 34748889593 scopus 로고    scopus 로고
    • Forward and reverse hippocampal place-cell sequences during ripples
    • Diba, K., and Buzski, G. (2007). Forward and reverse hippocampal place-cell sequences during ripples. Nat. Neurosci. 10, 1241-1242.
    • (2007) Nat. Neurosci. , vol.10 , pp. 1241-1242
    • Diba, K.1    Buzski, G.2
  • 32
    • 0002692217 scopus 로고
    • Actions and habits: the development of behavioural autonomy
    • Dickinson, A. (1985). Actions and habits: the development of behavioural autonomy. Philos. Trans. R. Soc. Lond. B Biol. Sci. 308, 67-78.
    • (1985) Philos. Trans. R. Soc. Lond. B Biol. Sci. , vol.308 , pp. 67-78
    • Dickinson, A.1
  • 33
    • 84881058774 scopus 로고    scopus 로고
    • Motor simulation via coupled internal models using sequential Monte Carlo
    • Dindo, H., Zambuto, D., and Pezzulo, G. (2011). "Motor simulation via coupled internal models using sequential Monte Carlo," in Proceedings of IJCAI 2011, Barcelona, 2113-2119.
    • (2011) Proceedings of IJCAI 2011, Barcelona , pp. 2113-2119
    • Dindo, H.1    Zambuto, D.2    Pezzulo, G.3
  • 34
    • 78649417324 scopus 로고    scopus 로고
    • Caudate encodes multiple computations for perceptual decisions
    • Ding, L., and Gold, J. I. (2010). Caudate encodes multiple computations for perceptual decisions. J. Neurosci. 30, 15747-15759.
    • (2010) J. Neurosci. , vol.30 , pp. 15747-15759
    • Ding, L.1    Gold, J.I.2
  • 35
    • 0001460136 scopus 로고    scopus 로고
    • On sequential Monte Carlo sampling methods for Bayesian filtering
    • Doucet, A., Godsill, S., and Andrieu, C. (2000). On sequential Monte Carlo sampling methods for Bayesian filtering. Stat. Comput. 10, 197-208.
    • (2000) Stat. Comput. , vol.10 , pp. 197-208
    • Doucet, A.1    Godsill, S.2    Andrieu, C.3
  • 37
    • 84858863489 scopus 로고    scopus 로고
    • A goal-directed spatial navigation model using forward trajectory planning based on grid cells
    • Erdem, U. M., and Hasselmo, M. (2012). A goal-directed spatial navigation model using forward trajectory planning based on grid cells. Eur. J. Neurosci. 35, 916-931.
    • (2012) Eur. J. Neurosci. , vol.35 , pp. 916-931
    • Erdem, U.M.1    Hasselmo, M.2
  • 38
    • 76749113376 scopus 로고    scopus 로고
    • Statistically optimal perception and learning: from behavior to neural representations
    • Fiser, J., Berkes, P., Orbn, G., and Lengyel, M. (2010). Statistically optimal perception and learning: from behavior to neural representations. Trends Cogn. Sci. (Regul. Ed.) 14, 119-130.
    • (2010) Trends Cogn. Sci. (Regul. Ed.) , vol.14 , pp. 119-130
    • Fiser, J.1    Berkes, P.2    Orbn, G.3    Lengyel, M.4
  • 39
    • 33645458694 scopus 로고    scopus 로고
    • Reverse replay of behavioural sequences in hippocampal place cells during the awake state
    • Foster, D., and Wilson, M. (2006). Reverse replay of behavioural sequences in hippocampal place cells during the awake state. Nature 440, 680-683.
    • (2006) Nature , vol.440 , pp. 680-683
    • Foster, D.1    Wilson, M.2
  • 40
    • 84857211334 scopus 로고    scopus 로고
    • Mechanisms of hierarchical reinforcement learning in corticostriatal circuits 1: computational analysis
    • Frank, M. J., and Badre, D. (2012). Mechanisms of hierarchical reinforcement learning in corticostriatal circuits 1: computational analysis. Cereb. Cortex 22, 509-526.
    • (2012) Cereb. Cortex , vol.22 , pp. 509-526
    • Frank, M.J.1    Badre, D.2
  • 42
    • 84877994945 scopus 로고    scopus 로고
    • Perception action and utility: the tangled skein
    • (MIT Press), eds M. Rabinovich, M. K. Friston, and P. Varona
    • Gershman, S., and Daw, N. (2011). "Perception, action and utility: the tangled skein," in Principles of Brain Dynamics: Global State Interactions, eds M. Rabinovich, M. K. Friston, and P. Varona (MIT Press).
    • (2011) Principles of Brain Dynamics: Global State Interactions
    • Gershman, S.1    Daw, N.2
  • 43
    • 84878779561 scopus 로고    scopus 로고
    • Retrospective revaluation in sequential decision making: a tale of two systems
    • PMID:23230992. [Epub ahead of print]
    • Gershman, S., Markman, A., and Otto, A. (2012). Retrospective revaluation in sequential decision making: a tale of two systems. J. Exp. Psychol. Gen. PMID:23230992. [Epub ahead of print].
    • (2012) J. Exp. Psychol. Gen.
    • Gershman, S.1    Markman, A.2    Otto, A.3
  • 44
    • 77953260848 scopus 로고    scopus 로고
    • States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning
    • Glascher, J., Daw, N., Dayan, P., and O'Doherty, J. P. (2010). States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning. Neuron 66, 585-595.
    • (2010) Neuron , vol.66 , pp. 585-595
    • Glascher, J.1    Daw, N.2    Dayan, P.3    O'Doherty, J.P.4
  • 46
    • 0035155538 scopus 로고    scopus 로고
    • Neural computations that underlie decisions about sensory stimuli
    • Gold, J., and Shadlen, M. (2001). Neural computations that underlie decisions about sensory stimuli. Trends Cogn. Sci. (Regul. Ed.) 5, 10-16.
    • (2001) Trends Cogn. Sci. (Regul. Ed.) , vol.5 , pp. 10-16
    • Gold, J.1    Shadlen, M.2
  • 47
    • 34347361793 scopus 로고    scopus 로고
    • The neural basis of decision making
    • Gold, J. I., and Shadlen, M. N. (2007). The neural basis of decision making. Annu. Rev. Neurosci. 30, 535-574.
    • (2007) Annu. Rev. Neurosci. , vol.30 , pp. 535-574
    • Gold, J.I.1    Shadlen, M.N.2
  • 49
    • 77649151242 scopus 로고    scopus 로고
    • Hippocampal replay is not a simple function of experience
    • Gupta, A. S., van der Meer, M. A. A., Touretzky, D. S., and Redish, A. D. (2010). Hippocampal replay is not a simple function of experience. Neuron 65, 695-705.
    • (2010) Neuron , vol.65 , pp. 695-705
    • Gupta, A.S.1    van der Meer, M.A.A.2    Touretzky, D.S.3    Redish, A.D.4
  • 50
    • 0002861883 scopus 로고
    • A model of how the basal ganglia generates and uses neural signals that predict reinforcement
    • (Cambridge: MIT Press), eds J. C. Houk, J. Davis, and D. Beiser
    • Houk, J. C., Adams, J. L., and Barto, A. G. (1995). "A model of how the basal ganglia generates and uses neural signals that predict reinforcement," in Models of Information Processing in the Basal Ganglia, eds J. C. Houk, J. Davis, and D. Beiser (Cambridge: MIT Press), 249-270.
    • (1995) Models of Information Processing in the Basal Ganglia , pp. 249-270
    • Houk, J.C.1    Adams, J.L.2    Barto, A.G.3
  • 52
    • 79960212793 scopus 로고    scopus 로고
    • Multiple representations and algorithms for reinforcement learning in the cortico-basal ganglia circuit
    • Ito, M., and Doya, K. (2011). Multiple representations and algorithms for reinforcement learning in the cortico-basal ganglia circuit. Curr. Opin. Neurobiol. 21, 368-373.
    • (2011) Curr. Opin. Neurobiol. , vol.21 , pp. 368-373
    • Ito, M.1    Doya, K.2
  • 53
    • 79951514570 scopus 로고    scopus 로고
    • A neuromorphic model of spatial lookahead planning
    • Ivey, R., Bullock, D., and Grossberg, S. (2011). A neuromorphic model of spatial lookahead planning. Neural Netw. 24, 257-266.
    • (2011) Neural Netw. , vol.24 , pp. 257-266
    • Ivey, R.1    Bullock, D.2    Grossberg, S.3
  • 55
    • 36048937548 scopus 로고    scopus 로고
    • Neural ensembles in ca3 transiently encode paths forward of the animal at a decision point
    • Johnson, A., and Redish, A. D. (2007). Neural ensembles in ca3 transiently encode paths forward of the animal at a decision point. J. Neurosci. 27, 12176-12189.
    • (2007) J. Neurosci. , vol.27 , pp. 12176-12189
    • Johnson, A.1    Redish, A.D.2
  • 56
    • 51649116802 scopus 로고    scopus 로고
    • Neural correlates, computation and behavioural impact of decision confidence
    • Kepecs, A., Uchida, N., Zariwala, H. A., and Mainen, Z. F. (2008). Neural correlates, computation and behavioural impact of decision confidence. Nature 455, 227-231.
    • (2008) Nature , vol.455 , pp. 227-231
    • Kepecs, A.1    Uchida, N.2    Zariwala, H.A.3    Mainen, Z.F.4
  • 57
    • 79958143780 scopus 로고    scopus 로고
    • Speed/accuracy trade-off between the habitual and the goal-directed processes
    • doi:10.1371/journal.pcbi.1002055
    • Keramati, M., Dezfouli, A., and Piray, P. (2011). Speed/accuracy trade-off between the habitual and the goal-directed processes. PLoS Comput. Biol. 7:e1002055. doi:10.1371/journal.pcbi.1002055
    • (2011) PLoS Comput. Biol. , vol.7
    • Keramati, M.1    Dezfouli, A.2    Piray, P.3
  • 58
    • 34249278114 scopus 로고    scopus 로고
    • An information theoretical approach to prefrontal executive function
    • Koechlin, E., and Summerfield, C. (2007). An information theoretical approach to prefrontal executive function. Trends Cogn. Sci. (Regul. Ed.) 11, 229-235.
    • (2007) Trends Cogn. Sci. (Regul. Ed.) , vol.11 , pp. 229-235
    • Koechlin, E.1    Summerfield, C.2
  • 59
    • 40649116624 scopus 로고    scopus 로고
    • Reversed and forward buffering of behavioral spike sequences enables retrospective and prospective retrieval in hippocampal regions ca3 and ca1
    • Koene, R. A., and Hasselmo, M. E. (2008). Reversed and forward buffering of behavioral spike sequences enables retrospective and prospective retrieval in hippocampal regions ca3 and ca1. Neural Netw. 21, 276-288.
    • (2008) Neural Netw. , vol.21 , pp. 276-288
    • Koene, R.A.1    Hasselmo, M.E.2
  • 61
    • 69349092175 scopus 로고    scopus 로고
    • Hippocampus leads ventral striatum in replay of place-reward information
    • doi:10.1371/journal.pbio.1000173
    • Lansink, C. S., Goltstein, P. M., Lankelma, J. V., McNaughton, B. L., and Pennartz, C. M. A. (2009). Hippocampus leads ventral striatum in replay of place-reward information. PLoS Biol. 7:e1000173. doi:10.1371/journal.pbio.1000173
    • (2009) PLoS Biol. , vol.7
    • Lansink, C.S.1    Goltstein, P.M.2    Lankelma, J.V.3    McNaughton, B.L.4    Pennartz, C.M.A.5
  • 62
    • 85162020429 scopus 로고    scopus 로고
    • Hippocampal contributions to control: the third way
    • (Cambridge, MA: MIT Press), Vol. 20, eds J. Platt, D. Koller, Y. Singer, and S. Roweis
    • Lengyel, M., and Dayan, P. (2008). "Hippocampal contributions to control: the third way," in Advances in Neural Information Processing Systems, Vol. 20, eds J. Platt, D. Koller, Y. Singer, and S. Roweis (Cambridge, MA: MIT Press), 889-896.
    • (2008) in Advances in Neural Information Processing Systems , pp. 889-896
    • Lengyel, M.1    Dayan, P.2
  • 63
    • 84874204648 scopus 로고    scopus 로고
    • The basal ganglia optimize decision making over general perceptual hypotheses
    • Lepora, N. F., and Gurney, K. N. (2012). The basal ganglia optimize decision making over general perceptual hypotheses. Neural Comput. 24, 2924-2945.
    • (2012) Neural Comput. , vol.24 , pp. 2924-2945
    • Lepora, N.F.1    Gurney, K.N.2
  • 64
    • 33750437292 scopus 로고    scopus 로고
    • Bayesian inference with probabilistic population codes
    • Ma, W. J., Beck, J. M., Latham, P. E., and Pouget, A. (2006). Bayesian inference with probabilistic population codes. Nat. Neurosci. 9, 1432-1438.
    • (2006) Nat. Neurosci. , vol.9 , pp. 1432-1438
    • Ma, W.J.1    Beck, J.M.2    Latham, P.E.3    Pouget, A.4
  • 66
    • 0027684215 scopus 로고
    • Prioritized sweeping: reinforcement learning with less data and less real time
    • Moore, A. W., and Atkeson, C. (1993). Prioritized sweeping: reinforcement learning with less data and less real time. Mach. Learn. 13, 103-130.
    • (1993) Mach. Learn. , vol.13 , pp. 103-130
    • Moore, A.W.1    Atkeson, C.2
  • 67
    • 0019957779 scopus 로고
    • Place navigation impaired in rats with hippocampal lesions
    • Morris, R. G., Garrud, P., Rawlins, J. N., and O'Keefe, J. (1982). Place navigation impaired in rats with hippocampal lesions. Nature 297, 681-683.
    • (1982) Nature , vol.297 , pp. 681-683
    • Morris, R.G.1    Garrud, P.2    Rawlins, J.N.3    O'Keefe, J.4
  • 69
    • 80052232280 scopus 로고    scopus 로고
    • On the value of information and other rewards
    • Niv, Y., and Chan, S. (2011). On the value of information and other rewards. Nat. Neurosci. 14, 1095D-1097D.
    • (2011) Nat. Neurosci. , vol.14
    • Niv, Y.1    Chan, S.2
  • 71
    • 84859311497 scopus 로고    scopus 로고
    • Beyond simple reinforcement learning: the computational neurobiology of reward-learning and valuation
    • O'Doherty, J. P. (2012). Beyond simple reinforcement learning: the computational neurobiology of reward-learning and valuation. Eur. J. Neurosci. 35, 987-990.
    • (2012) Eur. J. Neurosci. , vol.35 , pp. 987-990
    • O'Doherty, J.P.1
  • 72
    • 0015145985 scopus 로고
    • The hippocampus as a spatial map, preliminary evidence from unit activity in the freely-moving rat
    • O'Keefe, J., and Dostrovsky, J. (1971). The hippocampus as a spatial map. preliminary evidence from unit activity in the freely-moving rat. Brain Res. 34, 171-175.
    • (1971) Brain Res. , vol.34 , pp. 171-175
    • O'Keefe, J.1    Dostrovsky, J.2
  • 73
    • 33646566317 scopus 로고    scopus 로고
    • Neurons in the orbitofrontal cortex encode economic value
    • Padoa-Schioppa, C., and Assad, J. A. (2006). Neurons in the orbitofrontal cortex encode economic value. Nature 441, 223-226.
    • (2006) Nature , vol.441 , pp. 223-226
    • Padoa-Schioppa, C.1    Assad, J.A.2
  • 75
    • 83155191145 scopus 로고    scopus 로고
    • Neural systems analysis of decision making during goal-directed navigation
    • Penner, M. R., and Mizumori, S. J. Y. (2012). Neural systems analysis of decision making during goal-directed navigation. Prog. Neurobiol. 96, 96-135.
    • (2012) Prog. Neurobiol. , vol.96 , pp. 96-135
    • Penner, M.R.1    Mizumori, S.J.Y.2
  • 76
    • 67649818089 scopus 로고    scopus 로고
    • Replay of rule-learning related neural patterns in the prefrontal cortex during sleep
    • Peyrache, A., Khamassi, M., Benchenane, K., Wiener, S., and Battaglia, F. (2009). Replay of rule-learning related neural patterns in the prefrontal cortex during sleep. Nat. Neurosci. 12, 12, 919-926.
    • (2009) Nat. Neurosci. , vol.12 , Issue.12 , pp. 919-926
    • Peyrache, A.1    Khamassi, M.2    Benchenane, K.3    Wiener, S.4    Battaglia, F.5
  • 77
    • 44349151557 scopus 로고    scopus 로고
    • Coordinating with the future: the anticipatory nature of representation
    • Pezzulo, G. (2008). Coordinating with the future: the anticipatory nature of representation. Minds Mach. 18, 179-225.
    • (2008) Minds Mach. , vol.18 , pp. 179-225
    • Pezzulo, G.1
  • 78
    • 78651458058 scopus 로고    scopus 로고
    • Grounding procedural and declarative knowledge in sensorimotor anticipation
    • Pezzulo, G. (2011). Grounding procedural and declarative knowledge in sensorimotor anticipation. Mind Lang. 26, 78-114.
    • (2011) Mind Lang. , vol.26 , pp. 78-114
    • Pezzulo, G.1
  • 79
    • 67449113542 scopus 로고    scopus 로고
    • Thinking as the control of imagination: a conceptual framework for goal-directed systems
    • Pezzulo, G., and Castelfranchi, C. (2009). Thinking as the control of imagination: a conceptual framework for goal-directed systems. Psychol. Res. 73, 559-577.
    • (2009) Psychol. Res. , vol.73 , pp. 559-577
    • Pezzulo, G.1    Castelfranchi, C.2
  • 81
    • 34249728989 scopus 로고    scopus 로고
    • Anticipation and anticipatory behavior
    • Pezzulo, G., Hoffmann, J., and Falcone, R. (2007). Anticipation and anticipatory behavior. Cogn. Process. 8, 67-70.
    • (2007) Cogn. Process. , vol.8 , pp. 67-70
    • Pezzulo, G.1    Hoffmann, J.2    Falcone, R.3
  • 82
    • 84857290855 scopus 로고    scopus 로고
    • The value of foresight: how prospection affects decision-making
    • doi:10.3389/fnins.2011.00079
    • Pezzulo, G., and Rigoli, F. (2011). The value of foresight: how prospection affects decision-making. Front. Neurosci. 5:79. doi:10.3389/fnins.2011.00079
    • (2011) Front. Neurosci. , vol.5 , pp. 79
    • Pezzulo, G.1    Rigoli, F.2
  • 83
    • 79960241771 scopus 로고    scopus 로고
    • Decision making under uncertainty: a neural model based on partially observable Markov decision processes
    • doi:10.3389/fncom.2010.00146
    • Rao, R. P. N. (2010). Decision making under uncertainty: a neural model based on partially observable Markov decision processes. Front. Comput. Neurosci. 4:146. doi:10.3389/fncom.2010.00146
    • (2010) Front. Comput. Neurosci. , vol.4 , pp. 146
    • Rao, R.P.N.1
  • 84
    • 58149404021 scopus 로고
    • A theory of memory retrieval
    • Ratcliff, R. (1978). A theory of memory retrieval. Psychol. Rev. 85, 59-108.
    • (1978) Psychol. Rev. , vol.85 , pp. 59-108
    • Ratcliff, R.1
  • 85
    • 0032973437 scopus 로고    scopus 로고
    • The basal ganglia: a vertebrate solution to the selection problem?
    • Redgrave, P., Prescott, T. J., and Gurney, K. (1999). The basal ganglia: a vertebrate solution to the selection problem? Neuroscience 89, 1009-1023.
    • (1999) Neuroscience , vol.89 , pp. 1009-1023
    • Redgrave, P.1    Prescott, T.J.2    Gurney, K.3
  • 86
    • 84870953994 scopus 로고    scopus 로고
    • Aversive pavlovian responses affect human instrumental motor performance
    • doi:10.3389/fnins.2012.00134
    • Rigoli, F., Pavone, E. F., and Pezzulo, G. (2012). Aversive pavlovian responses affect human instrumental motor performance. Front. Neurosci. 6:134. doi:10.3389/fnins.2012.00134
    • (2012) Front. Neurosci. , vol.6 , pp. 134
    • Rigoli, F.1    Pavone, E.F.2    Pezzulo, G.3
  • 87
    • 34548013298 scopus 로고    scopus 로고
    • Remembering the past to imagine the future: the prospective brain
    • Schacter, D. L., Addis, D. R., and Buckner, R. L. (2007). Remembering the past to imagine the future: the prospective brain. Nat. Rev. Neurosci. 8, 657-661.
    • (2007) Nat. Rev. Neurosci. , vol.8 , pp. 657-661
    • Schacter, D.L.1    Addis, D.R.2    Buckner, R.L.3
  • 89
    • 0030896968 scopus 로고    scopus 로고
    • A neural substrate of prediction and reward
    • Schultz, W., Dayan, P., and Montague, P. (1997). A neural substrate of prediction and reward. Science 275, 1593-1599.
    • (1997) Science , vol.275 , pp. 1593-1599
    • Schultz, W.1    Dayan, P.2    Montague, P.3
  • 90
    • 0034796381 scopus 로고    scopus 로고
    • Neural basis of a perceptual decision in the parietal cortex (area lip) of the rhesus monkey
    • Shadlen, M. N., and Newsome, W. T. (2001). Neural basis of a perceptual decision in the parietal cortex (area lip) of the rhesus monkey. J. Neurophysiol. 86, 1916-1936.
    • (2001) J. Neurophysiol. , vol.86 , pp. 1916-1936
    • Shadlen, M.N.1    Newsome, W.T.2
  • 91
    • 78651437982 scopus 로고    scopus 로고
    • Control of movements and temporal discounting of reward
    • Shadmehr, R. (2010). Control of movements and temporal discounting of reward. Curr. Opin. Neurobiol. 20, 726-730.
    • (2010) Curr. Opin. Neurobiol. , vol.20 , pp. 726-730
    • Shadmehr, R.1
  • 92
    • 84899438881 scopus 로고    scopus 로고
    • Monte-Carlo planning in large POMDPs
    • NIPS, eds J. D. Lafferty, C. K. I. Williams, J. Shawe-Taylor, R. S. Zemel, and A. Culotta (Curran Associates, Inc)
    • Silver, D., and Veness, J. (2010). "Monte-Carlo planning in large POMDPs," in NIPS, eds J. D. Lafferty, C. K. I. Williams, J. Shawe-Taylor, R. S. Zemel, and A. Culotta (Curran Associates, Inc), 2164-2172.
    • (2010) , pp. 2164-2172
    • Silver, D.1    Veness, J.2
  • 93
    • 85162526341 scopus 로고    scopus 로고
    • Environmental statistics and the trade-off between model-based and td learning in humans
    • NIPS, eds J. Shawe-Taylor, R. S. Zemel, P. L. Bartlett, F. C. N. Pereira, and K. Q. Weinberger (Granada)
    • Simon, D. A., and Daw, N. D. (2011a). "Environmental statistics and the trade-off between model-based and td learning in humans," in NIPS, eds J. Shawe-Taylor, R. S. Zemel, P. L. Bartlett, F. C. N. Pereira, and K. Q. Weinberger (Granada), 127-135.
    • (2011) , pp. 127-135
    • Simon, D.A.1    Daw, N.D.2
  • 94
    • 79955709936 scopus 로고    scopus 로고
    • Neural correlates of forward planning in a spatial decision task in humans
    • Simon, D. A., and Daw, N. D. (2011b). Neural correlates of forward planning in a spatial decision task in humans. J. Neurosci. 31, 5526-5539.
    • (2011) J. Neurosci. , vol.31 , pp. 5526-5539
    • Simon, D.A.1    Daw, N.D.2
  • 95
    • 84859737036 scopus 로고    scopus 로고
    • Goal-directed decision making as probabilistic inference: a computational framework and potential neural correlates
    • Solway, A., and Botvinick, M. M. (2012). Goal-directed decision making as probabilistic inference: a computational framework and potential neural correlates. Psychol. Rev. 119, 120-154.
    • (2012) Psychol. Rev. , vol.119 , pp. 120-154
    • Solway, A.1    Botvinick, M.M.2
  • 98
    • 0001394603 scopus 로고
    • An adaptive network that constructs and uses an internal model of its environment
    • Sutton, R. S., and Barto, A. G. (1981). An adaptive network that constructs and uses an internal model of its environment. Cogn. Brain Theory 4, 217-246.
    • (1981) Cogn. Brain Theory , vol.4 , pp. 217-246
    • Sutton, R.S.1    Barto, A.G.2
  • 100
    • 58149442669 scopus 로고
    • Cognitive maps in rats and men
    • Tolman, E. C. (1948). Cognitive maps in rats and men. Psychol. Rev. 55, 189-208.
    • (1948) Psychol. Rev. , vol.55 , pp. 189-208
    • Tolman, E.C.1
  • 101
    • 84859344211 scopus 로고    scopus 로고
    • Information processing in decision-making systems
    • van der Meer, M., Kurth-Nelson, Z., and Redish, A. D. (2012). Information processing in decision-making systems. Neuroscientist 18, 342-359.
    • (2012) Neuroscientist , vol.18 , pp. 342-359
    • van der Meer, M.1    Kurth-Nelson, Z.2    Redish, A.D.3
  • 102
    • 79960218850 scopus 로고    scopus 로고
    • Expectancies in decision making, reinforcement learning, and ventral striatum
    • doi:10.3389/neuro.01.006.2010
    • van der Meer, M. A. A., and Redish, A. (2010). Expectancies in decision making, reinforcement learning, and ventral striatum. Front. Neurosci. 4:6. doi:10.3389/neuro.01.006.2010
    • (2010) Front. Neurosci. , vol.4 , pp. 6
    • van der Meer, M.A.A.1    Redish, A.2
  • 103
    • 79960270857 scopus 로고    scopus 로고
    • Ventral striatum: a critical look at models of learning and evaluation
    • van der Meer, M. A. A., and Redish, A. (2011). Ventral striatum: a critical look at models of learning and evaluation. Curr. Opin. Neurobiol. 21, 387-392.
    • (2011) Curr. Opin. Neurobiol. , vol.21 , pp. 387-392
    • van der Meer, M.A.A.1    Redish, A.2
  • 104
    • 84890869062 scopus 로고    scopus 로고
    • Covert expectation-of-reward in rat ventral striatum at decision points
    • doi:10.3389/neuro.07.001.2009
    • van der Meer, M. A. A., and Redish, A. D. (2009). Covert expectation-of-reward in rat ventral striatum at decision points. Front. Integr. Neurosci. 3:1. doi:10.3389/neuro.07.001.2009
    • (2009) Front. Integr. Neurosci. , vol.3 , pp. 1
    • van der Meer, M.A.A.1    Redish, A.D.2
  • 105
    • 0142136737 scopus 로고    scopus 로고
    • Environmentally mediated synergy between perception and behaviour in mobile robots
    • Verschure, P. F. M. J., Voegtlin, T., and Douglas, R. J. (2003). Environmentally mediated synergy between perception and behaviour in mobile robots. Nature 425, 620-624.
    • (2003) Nature , vol.425 , pp. 620-624
    • Verschure, P.F.M.J.1    Voegtlin, T.2    Douglas, R.J.3
  • 107
    • 56349093255 scopus 로고    scopus 로고
    • Forward frontal fields: phylogeny and fundamental function
    • Wise, S. P. (2008). Forward frontal fields: phylogeny and fundamental function. Trends Neurosci. 31, 599-608.
    • (2008) Trends Neurosci. , vol.31 , pp. 599-608
    • Wise, S.P.1
  • 108
    • 1642580578 scopus 로고    scopus 로고
    • Lesions of dorsolateral striatum preserve outcome expectancy but disrupt habit formation in instrumental learning
    • Yin, H. H., Knowlton, B. J., and Balleine, B. W. (2004). Lesions of dorsolateral striatum preserve outcome expectancy but disrupt habit formation in instrumental learning. Eur. J. Neurosci. 19, 181-189.
    • (2004) Eur. J. Neurosci. , vol.19 , pp. 181-189
    • Yin, H.H.1    Knowlton, B.J.2    Balleine, B.W.3
  • 109
    • 53949118376 scopus 로고    scopus 로고
    • Reward-guided learning beyond dopamine in the nucleus accumbens: the integrative functions of cortico-basal ganglia networks
    • Yin, H. H., Ostlund, S. B., and Balleine, B. W. (2008). Reward-guided learning beyond dopamine in the nucleus accumbens: the integrative functions of cortico-basal ganglia networks. Eur. J. Neurosci. 28, 1437-1448.
    • (2008) Eur. J. Neurosci. , vol.28 , pp. 1437-1448
    • Yin, H.H.1    Ostlund, S.B.2    Balleine, B.W.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.