Volume 7, Issue 3, 2015, Pages 391-415

Novelty and Inductive Generalization in Human Reinforcement Learning

Author keywords

Bayesian inference; Exploration exploitation dilemma; Neophilia; Neophobia; Reinforcement learning

Indexed keywords

DOPAMINE

EID: 84939246893     PISSN: 17568757     EISSN: 17568765     Source Type: Journal    
DOI: 10.1111/tops.12138     Document Type: Article
Times cited: 70

References (89)
  • 1
    • 78651226963
    • Structure learning in human sequential decision-making
    • Acuña, D., & Schrater, P. (2010). Structure learning in human sequential decision-making. PLoS Computational Biology, 6(12), 221-229.
    • (2010) PLoS Computational Biology , vol.6 , Issue.12 , pp. 221-229
    • Acuña, D.1    Schrater, P.2
  • 2
    • 0000705894
    • The adaptive nature of human categorization
    • Anderson, J. (1991). The adaptive nature of human categorization. Psychological Review, 98(3), 409-429.
    • (1991) Psychological Review , vol.98 , Issue.3 , pp. 409-429
    • Anderson, J.1
  • 3
    • 0034493070
    • Conditioned place preference: What does it add to our preclinical understanding of drug reward
    • Bardo, M., & Bevins, R. (2000). Conditioned place preference: What does it add to our preclinical understanding of drug reward? Psychopharmacology, 153(1), 31-43.
    • (2000) Psychopharmacology , vol.153 , Issue.1 , pp. 31-43
    • Bardo, M.1    Bevins, R.2
  • 4
    • 0001296523
    • Experiments on "neophobia" in wild and laboratory rats
    • Barnett, S. (1958). Experiments on "neophobia" in wild and laboratory rats. British Journal of Psychology, 49(3), 195-201.
    • (1958) British Journal of Psychology , vol.49 , Issue.3 , pp. 195-201
    • Barnett, S.1
  • 5
    • 0000541213
    • Adaptive critics and the basal ganglia
    • Barto, A. (1995). Adaptive critics and the basal ganglia. In J. Houk, J. Davis, & D. Beiser (Eds.), Models of information processing in the basal ganglia (pp. 215-232). Cambridge, MA: MIT Press.
    • (1995) Models of information processing in the basal ganglia , pp. 215-232
    • Barto, A.1
  • 6
    • 21544435722
    • Midbrain dopamine neurons encode a quantitative reward prediction error signal
    • Bayer, H., & Glimcher, P. (2005). Midbrain dopamine neurons encode a quantitative reward prediction error signal. Neuron, 47(1), 129-141.
    • (2005) Neuron , vol.47 , Issue.1 , pp. 129-141
    • Bayer, H.1    Glimcher, P.2
  • 7
    • 34548295327
    • Learning the value of information in an uncertain world
    • Behrens, T., Woolrich, M., Walton, M., & Rushworth, M. (2007). Learning the value of information in an uncertain world. Nature Neuroscience, 10(9), 1214-1221.
    • (2007) Nature Neuroscience , vol.10 , Issue.9 , pp. 1214-1221
    • Behrens, T.1    Woolrich, M.2    Walton, M.3    Rushworth, M.4
  • 9
    • 0013931617
    • Curiosity and exploration
    • Berlyne, D. (1966). Curiosity and exploration. Science, 153(3731), 25-33.
    • (1966) Science , vol.153 , Issue.3731 , pp. 25-33
    • Berlyne, D.1
  • 11
    • 0035540669
    • Novelty seeking and reward: Implications for the study of high-risk behaviors
    • Bevins, R. (2001). Novelty seeking and reward: Implications for the study of high-risk behaviors. Current Directions in Psychological Science, 10(6), 189-193.
    • (2001) Current Directions in Psychological Science , vol.10 , Issue.6 , pp. 189-193
    • Bevins, R.1
  • 13
    • 70350566799
    • Hierarchically organized behavior and its neural foundations: A reinforcement learning perspective
    • Botvinick, M., Niv, Y., & Barto, A. (2009). Hierarchically organized behavior and its neural foundations: A reinforcement learning perspective. Cognition, 113(3), 262-280.
    • (2009) Cognition , vol.113 , Issue.3 , pp. 262-280
    • Botvinick, M.1    Niv, Y.2    Barto, A.3
  • 14
    • 0020965801
    • Contextual control of the extinction of conditioned fear: Tests for the associative value of the context
    • Bouton, M., & King, D. (1983). Contextual control of the extinction of conditioned fear: Tests for the associative value of the context. Journal of Experimental Psychology: Animal Behavior Processes, 9(3), 248-265.
    • (1983) Journal of Experimental Psychology: Animal Behavior Processes , vol.9 , Issue.3 , pp. 248-265
    • Bouton, M.1    King, D.2
  • 15
    • 0041965975
    • R-max - a general polynomial time algorithm for near-optimal reinforcement learning
    • Brafman, R., & Tennenholtz, M. (2003). R-max - a general polynomial time algorithm for near-optimal reinforcement learning. The Journal of Machine Learning Research, 3(3), 213-231.
    • (2003) The Journal of Machine Learning Research , vol.3 , Issue.3 , pp. 213-231
    • Brafman, R.1    Tennenholtz, M.2
  • 16
    • 0030612822
    • The psychophysics toolbox
    • Brainard, D. (1997). The psychophysics toolbox. Spatial Vision, 10(4), 433-436.
    • (1997) Spatial Vision , vol.10 , Issue.4 , pp. 433-436
    • Brainard, D.1
  • 17
    • 34250348767
    • Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration
    • Cohen, J., McClure, S., & Yu, A. (2007). Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration. Philosophical Transactions of the Royal Society B: Biological Sciences, 362(1481), 933-942.
    • (2007) Philosophical Transactions of the Royal Society B: Biological Sciences , vol.362 , Issue.1481 , pp. 933-942
    • Cohen, J.1    McClure, S.2    Yu, A.3
  • 18
    • 0018233873
    • The determinants of exploration and neophobia
    • Corey, D. (1978). The determinants of exploration and neophobia. Neuroscience & Biobehavioral Reviews, 2(4), 235-253.
    • (1978) Neuroscience & Biobehavioral Reviews , vol.2 , Issue.4 , pp. 235-253
    • Corey, D.1
  • 19
    • 33746365099
    • Bayesian theories of conditioning in a changing world
    • Courville, A., Daw, N., & Touretzky, D. (2006). Bayesian theories of conditioning in a changing world. Trends in Cognitive Sciences, 10(7), 294-300.
    • (2006) Trends in Cognitive Sciences , vol.10 , Issue.7 , pp. 294-300
    • Courville, A.1    Daw, N.2    Touretzky, D.3
  • 20
    • 0017288709
    • The new object reaction of Rattus rattus L.: The relative importance of various cues
    • Cowan, P. (1976). The new object reaction of Rattus rattus L.: The relative importance of various cues. Behavioral Biology, 16(1), 31-44.
    • (1976) Behavioral Biology , vol.16 , Issue.1 , pp. 31-44
    • Cowan, P.1
  • 21
    • 33745787929
    • Representation and timing in theories of the dopamine system
    • Daw, N., Courville, A., & Touretzky, D. (2006a). Representation and timing in theories of the dopamine system. Neural Computation, 18(7), 1637-1677.
    • (2006) Neural Computation , vol.18 , Issue.7 , pp. 1637-1677
    • Daw, N.1    Courville, A.2    Touretzky, D.3
  • 22
    • 33745223257
    • Cortical substrates for exploratory decisions in humans
    • Daw, N., O'Doherty, J., Dayan, P., Seymour, B., & Dolan, R. (2006b). Cortical substrates for exploratory decisions in humans. Nature, 441(7095), 876-879.
    • (2006) Nature , vol.441 , Issue.7095 , pp. 876-879
    • Daw, N.1    O'Doherty, J.2    Dayan, P.3    Seymour, B.4    Dolan, R.5
  • 23
    • 34547536392
    • Associative learning mediates dynamic shifts in dopamine signaling in the nucleus accumbens
    • Day, J., Roitman, M., Wightman, R., & Carelli, R. (2007). Associative learning mediates dynamic shifts in dopamine signaling in the nucleus accumbens. Nature Neuroscience, 10(8), 1020-1028.
    • (2007) Nature Neuroscience , vol.10 , Issue.8 , pp. 1020-1028
    • Day, J.1    Roitman, M.2    Wightman, R.3    Carelli, R.4
  • 24
    • 33749055062
    • The misbehavior of value and the discipline of the will
    • Dayan, P., Niv, Y., Seymour, B., & Daw, N. D. (2006). The misbehavior of value and the discipline of the will. Neural Networks, 19(8), 1153-1160.
    • (2006) Neural Networks , vol.19 , Issue.8 , pp. 1153-1160
    • Dayan, P.1    Niv, Y.2    Seymour, B.3    Daw, N.D.4
  • 27
    • 1942421151
    • Bayes meets Bellman: The Gaussian process approach to temporal difference learning
    • Engel, Y., Mannor, S., & Meir, R. (2003). Bayes meets Bellman: The Gaussian process approach to temporal difference learning. In T. Fawcett & N. Mishra (Eds.), International Conference on Machine Learning (Vol. 20, pp. 154-162). Washington, DC.
    • (2003) International Conference on Machine Learning , vol.20 , pp. 154-162
    • Engel, Y.1    Mannor, S.2    Meir, R.3
  • 28
    • 0023691166
    • A new one-trial test for neurobiological studies of memory in rats. 1: Behavioral data
    • Ennaceur, A., & Delacour, J. (1988). A new one-trial test for neurobiological studies of memory in rats. 1: Behavioral data. Behavioural Brain Research, 31(1), 47-59.
    • (1988) Behavioural Brain Research , vol.31 , Issue.1 , pp. 47-59
    • Ennaceur, A.1    Delacour, J.2
  • 29
    • 0000996526
    • The effects of hunger and familiarity of locale on exploration
    • Fehrer, E. (1956). The effects of hunger and familiarity of locale on exploration. Journal of Comparative and Physiological Psychology, 49(6), 549-552.
    • (1956) Journal of Comparative and Physiological Psychology , vol.49 , Issue.6 , pp. 549-552
    • Fehrer, E.1
  • 30
    • 0015427497
    • Effects of time of day and food deprivation on exploratory activity in the rat
    • File, S., & Day, S. (1972). Effects of time of day and food deprivation on exploratory activity in the rat. Animal Behaviour, 20(4), 758-762.
    • (1972) Animal Behaviour , vol.20 , Issue.4 , pp. 758-762
    • File, S.1    Day, S.2
  • 31
    • 57149113922
    • Hierarchical models in the brain
    • Friston, K. (2008). Hierarchical models in the brain. PLoS Computational Biology, 4(11), e1000211.
    • (2008) PLoS Computational Biology , vol.4 , Issue.11 , pp. e1000211
    • Friston, K.1
  • 32
    • 74049117596
    • Context, learning, and extinction
    • Gershman, S., Blei, D., & Niv, Y. (2010). Context, learning, and extinction. Psychological Review, 117(1), 197-209.
    • (2010) Psychological Review , vol.117 , Issue.1 , pp. 197-209
    • Gershman, S.1    Blei, D.2    Niv, Y.3
  • 33
    • 77952541839
    • Learning latent structure: Carving nature at its joints
    • Gershman, S., & Niv, Y. (2010). Learning latent structure: Carving nature at its joints. Current Opinion in Neurobiology, 20(2), 251-256.
    • (2010) Current Opinion in Neurobiology , vol.20 , Issue.2 , pp. 251-256
    • Gershman, S.1    Niv, Y.2
  • 34
    • 70350521769
    • Human reinforcement learning subdivides structured action spaces by learning effector-specific values
    • Gershman, S., Pesaran, B., & Daw, N. (2009). Human reinforcement learning subdivides structured action spaces by learning effector-specific values. Journal of Neuroscience, 29(43), 13524-13531.
    • (2009) Journal of Neuroscience , vol.29 , Issue.43 , pp. 13524-13531
    • Gershman, S.1    Pesaran, B.2    Daw, N.3
  • 36
    • 84995019952
    • Stimulus generalization and representation in adaptive network models of category learning
    • Gluck, M. (1991). Stimulus generalization and representation in adaptive network models of category learning. Psychological Science, 2(1), 50-55.
    • (1991) Psychological Science , vol.2 , Issue.1 , pp. 50-55
    • Gluck, M.1
  • 37
    • 0001011344
    • Evaluating an adaptive network model of human learning
    • Gluck, M., & Bower, G. (1988a). Evaluating an adaptive network model of human learning. Journal of Memory and Language, 27(2), 166-195.
    • (1988) Journal of Memory and Language , vol.27 , Issue.2 , pp. 166-195
    • Gluck, M.1    Bower, G.2
  • 38
    • 0024077113
    • From conditioning to category learning: An adaptive network model
    • Gluck, M., & Bower, G. (1988b). From conditioning to category learning: An adaptive network model. Journal of Experimental Psychology: General, 117(3), 227-247.
    • (1988) Journal of Experimental Psychology: General , vol.117 , Issue.3 , pp. 227-247
    • Gluck, M.1    Bower, G.2
  • 39
    • 0037219403
    • Learning, prediction and causal Bayes nets
    • Glymour, C. (2003). Learning, prediction and causal Bayes nets. Trends in Cognitive Sciences, 7(1), 43-48.
    • (2003) Trends in Cognitive Sciences , vol.7 , Issue.1 , pp. 43-48
    • Glymour, C.1
  • 40
    • 0034280528
    • Detecting blickets: How young children use information about novel causal powers in categorization and induction
    • Gopnik, A., & Sobel, D. (2000). Detecting blickets: How young children use information about novel causal powers in categorization and induction. Child Development, 71(5), 1205-1222.
    • (2000) Child Development , vol.71 , Issue.5 , pp. 1205-1222
    • Gopnik, A.1    Sobel, D.2
  • 41
    • 77955281093
    • Probabilistic models of cognition: Exploring representations and inductive biases
    • Griffiths, T., Chater, N., Kemp, C., Perfors, A., & Tenenbaum, J. (2010). Probabilistic models of cognition: Exploring representations and inductive biases. Trends in Cognitive Sciences, 14(8), 357-364.
    • (2010) Trends in Cognitive Sciences , vol.14 , Issue.8 , pp. 357-364
    • Griffiths, T.1    Chater, N.2    Kemp, C.3    Perfors, A.4    Tenenbaum, J.5
  • 42
    • 0017621814
    • Influence of experiential factors and gonadal hormones on pituitary-adrenal response of the mouse to novelty and electric shock
    • Hennessy, J., Levin, R., & Levine, S. (1977). Influence of experiential factors and gonadal hormones on pituitary-adrenal response of the mouse to novelty and electric shock. Journal of Comparative and Physiological Psychology, 91(4), 770-777.
    • (1977) Journal of Comparative and Physiological Psychology , vol.91 , Issue.4 , pp. 770-777
    • Hennessy, J.1    Levin, R.2    Levine, S.3
  • 43
    • 33644688754
    • Dopamine neurons report an error in the temporal prediction of reward during learning
    • Hollerman, J., & Schultz, W. (1998). Dopamine neurons report an error in the temporal prediction of reward during learning. Nature Neuroscience, 1(4), 304-309.
    • (1998) Nature Neuroscience , vol.1 , Issue.4 , pp. 304-309
    • Hollerman, J.1    Schultz, W.2
  • 44
    • 0030757872
    • Burst activity of ventral tegmental dopamine neurons is elicited by sensory stimuli in the awake cat
    • Horvitz, J., Stewart, T., & Jacobs, B. (1997). Burst activity of ventral tegmental dopamine neurons is elicited by sensory stimuli in the awake cat. Brain Research, 759(2), 251-258.
    • (1997) Brain Research , vol.759 , Issue.2 , pp. 251-258
    • Horvitz, J.1    Stewart, T.2    Jacobs, B.3
  • 45
    • 0002861883
    • A model of how the basal ganglia generate and use neural signals that predict reinforcement
    • Houk, J., Adams, J., & Barto, A. (1995). A model of how the basal ganglia generate and use neural signals that predict reinforcement. In J. Houk, J. Davis, & D. Beiser (Eds.), Models of information processing in the basal ganglia (pp. 249-270). Cambridge, MA: MIT Press.
    • (1995) Models of information processing in the basal ganglia , pp. 249-270
    • Houk, J.1    Adams, J.2    Barto, A.3
  • 47
    • 33846987175
    • Neotic preferences in laboratory rodents: Issues, assessment and substrates
    • Hughes, R. (2007). Neotic preferences in laboratory rodents: Issues, assessment and substrates. Neuroscience & Biobehavioral Reviews, 31(3), 441-464.
    • (2007) Neuroscience & Biobehavioral Reviews , vol.31 , Issue.3 , pp. 441-464
    • Hughes, R.1
  • 48
    • 85047672086
    • Acquisition and extinction in autoshaping
    • Kakade, S., & Dayan, P. (2002a). Acquisition and extinction in autoshaping. Psychological Review, 109(3), 533-544.
    • (2002) Psychological Review , vol.109 , Issue.3 , pp. 533-544
    • Kakade, S.1    Dayan, P.2
  • 49
    • 0036592029
    • Dopamine: Generalization and bonuses
    • Kakade, S., & Dayan, P. (2002b). Dopamine: Generalization and bonuses. Neural Networks, 15(4-6), 549-559.
    • (2002) Neural Networks , vol.15 , Issue.4-6 , pp. 549-559
    • Kakade, S.1    Dayan, P.2
  • 50
    • 78650147550
    • Learning to learn causal models
    • Kemp, C., Goodman, N., & Tenenbaum, J. (2010). Learning to learn causal models. Cognitive Science, 34(7), 1185-1243.
    • (2010) Cognitive Science , vol.34 , Issue.7 , pp. 1185-1243
    • Kemp, C.1    Goodman, N.2    Tenenbaum, J.3
  • 51
    • 34247368422
    • Learning overhypotheses with hierarchical Bayesian models
    • Kemp, C., Perfors, A., & Tenenbaum, J. (2007). Learning overhypotheses with hierarchical Bayesian models. Developmental Science, 10(3), 307-321.
    • (2007) Developmental Science , vol.10 , Issue.3 , pp. 307-321
    • Kemp, C.1    Perfors, A.2    Tenenbaum, J.3
  • 53
    • 0034146693
    • Distinguishing genuine from spurious causes: A coherence hypothesis
    • Lien, Y., & Cheng, P. (2000). Distinguishing genuine from spurious causes: A coherence hypothesis. Cognitive Psychology, 40(2), 87-137.
    • (2000) Cognitive Psychology , vol.40 , Issue.2 , pp. 87-137
    • Lien, Y.1    Cheng, P.2
  • 54
    • 1942539715
    • SUSTAIN: A network model of category learning
    • Love, B., Medin, D., & Gureckis, T. (2004). SUSTAIN: A network model of category learning. Psychological Review, 111(2), 309.
    • (2004) Psychological Review , vol.111 , Issue.2 , pp. 309
    • Love, B.1    Medin, D.2    Gureckis, T.3
  • 55
    • 77953204481
    • Learning the form of causal relationships using hierarchical Bayesian models
    • Lucas, C., & Griffiths, T. (2010). Learning the form of causal relationships using hierarchical Bayesian models. Cognitive Science, 34(1), 113-147.
    • (2010) Cognitive Science , vol.34 , Issue.1 , pp. 113-147
    • Lucas, C.1    Griffiths, T.2
  • 56
    • 0003834557
    • Vision
    • Marr, D. (1982). Vision. Cambridge, MA: Freeman.
    • (1982) Vision
    • Marr, D.1
  • 58
    • 0029981543
    • A framework for mesencephalic dopamine systems based on predictive Hebbian learning
    • Montague, P., Dayan, P., & Sejnowski, T. (1996). A framework for mesencephalic dopamine systems based on predictive Hebbian learning. The Journal of Neuroscience, 16(5), 1936-1947.
    • (1996) The Journal of Neuroscience , vol.16 , Issue.5 , pp. 1936-1947
    • Montague, P.1    Dayan, P.2    Sejnowski, T.3
  • 59
    • 0009804852
    • Failure to find a learned drive based on hunger; evidence for learning motivated by exploration
    • Myers, A., & Miller, N. (1954). Failure to find a learned drive based on hunger; evidence for learning motivated by exploration. Journal of Comparative and Physiological Psychology, 47(6), 428-436.
    • (1954) Journal of Comparative and Physiological Psychology , vol.47 , Issue.6 , pp. 428-436
    • Myers, A.1    Miller, N.2
  • 60
    • 0141596576
    • Policy invariance under reward transformations: Theory and application to reward shaping
    • Ng, A., Harada, D., & Russell, S. (1999). Policy invariance under reward transformations: Theory and application to reward shaping. In I. Bratko, & S. Dzeroski (Eds.), Proceedings of the Sixteenth International Conference on Machine Learning. Bled, Slovenia.
    • (1999) Proceedings of the Sixteenth International Conference on Machine Learning
    • Ng, A.1    Harada, D.2    Russell, S.3
  • 61
    • 0009692872
    • A study of exploratory behavior in the white rat by means of the obstruction method
    • Nissen, H. (1930). A study of exploratory behavior in the white rat by means of the obstruction method. Journal of Genetic Psychology, 37(3), 361-376.
    • (1930) Journal of Genetic Psychology , vol.37 , Issue.3 , pp. 361-376
    • Nissen, H.1
  • 62
    • 67349283062
    • Reinforcement learning in the brain
    • Niv, Y. (2009). Reinforcement learning in the brain. Journal of Mathematical Psychology, 53(3), 139-154.
    • (2009) Journal of Mathematical Psychology , vol.53 , Issue.3 , pp. 139-154
    • Niv, Y.1
  • 63
    • 0022686961
    • Attention, similarity, and the identification-categorization relationship
    • Nosofsky, R. (1986). Attention, similarity, and the identification-categorization relationship. Journal of Experimental Psychology: General, 115(1), 39-57.
    • (1986) Journal of Experimental Psychology: General , vol.115 , Issue.1 , pp. 39-57
    • Nosofsky, R.1
  • 64
    • 79551573880
    • Risk, unexpected uncertainty, and estimation uncertainty: Bayesian learning in unstable settings
    • Payzan-LeNestour, E., & Bossaerts, P. (2011). Risk, unexpected uncertainty, and estimation uncertainty: Bayesian learning in unstable settings. PLoS Computational Biology, 7(1), e1001048.
    • (2011) PLoS Computational Biology , vol.7 , Issue.1
    • Payzan-LeNestour, E.1    Bossaerts, P.2
  • 66
    • 0016273271
    • Effects of prenatal exposure to auditory or visual stimulation on postnatal distress vocalizations in chicks
    • Rajecki, D. (1974). Effects of prenatal exposure to auditory or visual stimulation on postnatal distress vocalizations in chicks. Behavioral Biology, 11(4), 525-536.
    • (1974) Behavioral Biology , vol.11 , Issue.4 , pp. 525-536
    • Rajecki, D.1
  • 67
    • 79960241771
    • Decision making under uncertainty: A neural model based on partially observable Markov decision processes
    • Rao, R. (2010). Decision making under uncertainty: A neural model based on partially observable Markov decision processes. Frontiers in Computational Neuroscience, 4(4), 146.
    • (2010) Frontiers in Computational Neuroscience , vol.4 , Issue.4 , pp. 146
    • Rao, R.1
  • 68
    • 34548837994
    • Reconciling reinforcement learning models with behavioral extinction and renewal: implications for addiction, relapse, and problem gambling
    • Redish, A., Jensen, S., Johnson, A., & Kurth-Nelson, Z. (2007). Reconciling reinforcement learning models with behavioral extinction and renewal: implications for addiction, relapse, and problem gambling. Psychological Review, 114(3), 784-805.
    • (2007) Psychological Review , vol.114 , Issue.3 , pp. 784-805
    • Redish, A.1    Jensen, S.2    Johnson, A.3    Kurth-Nelson, Z.4
  • 69
    • 0030023561
    • Intrinsic reinforcing properties of putatively neutral stimuli in an instrumental two-lever discrimination task
    • Reed, P., Mitchell, C., & Nokes, T. (1996). Intrinsic reinforcing properties of putatively neutral stimuli in an instrumental two-lever discrimination task. Animal Learning and Behavior, 24(1), 38-45.
    • (1996) Animal Learning and Behavior , vol.24 , Issue.1 , pp. 38-45
    • Reed, P.1    Mitchell, C.2    Nokes, T.3
  • 70
    • 40749085540
    • Competition between the conditioned rewarding effects of cocaine and novelty
    • Reichel, C., & Bevins, R. (2008). Competition between the conditioned rewarding effects of cocaine and novelty. Behavioral Neuroscience, 122(1), 140-150.
    • (2008) Behavioral Neuroscience , vol.122 , Issue.1 , pp. 140-150
    • Reichel, C.1    Bevins, R.2
  • 71
    • 0002109138
    • Variations in the effectiveness of reinforcement and nonreinforcement
    • Rescorla, R., & Wagner, A. (1972). Variations in the effectiveness of reinforcement and nonreinforcement. In A. H. Black, & W. F. Prokasy (Eds.), Classical conditioning. II: Current research and theory (pp. 64-99). New York: Appleton-Century-Crofts.
    • (1972) Classical conditioning. II: Current research and theory , pp. 64-99
    • Rescorla, R.1    Wagner, A.2
  • 74
    • 36348966690
    • Reinforcement learning signals in the human striatum distinguish learners from nonlearners during reward-based decision making
    • Schönberg, T., Daw, N., Joel, D., & O'Doherty, J. (2007). Reinforcement learning signals in the human striatum distinguish learners from nonlearners during reward-based decision making. The Journal of Neuroscience, 27(47), 12860-12867.
    • (2007) The Journal of Neuroscience , vol.27 , Issue.47 , pp. 12860-12867
    • Schönberg, T.1    Daw, N.2    Joel, D.3    O'Doherty, J.4
  • 75
    • 0031867046
    • Predictive reward signal of dopamine neurons
    • Schultz, W. (1998). Predictive reward signal of dopamine neurons. Journal of Neurophysiology, 80(1), 1-27.
    • (1998) Journal of Neurophysiology , vol.80 , Issue.1 , pp. 1-27
    • Schultz, W.1
  • 76
    • 0030896968
    • A neural substrate of prediction and reward
    • Schultz, W., Dayan, P., & Montague, P. (1997). A neural substrate of prediction and reward. Science, 275(5306), 1593-1599.
    • (1997) Science , vol.275 , Issue.5306 , pp. 1593-1599
    • Schultz, W.1    Dayan, P.2    Montague, P.3
  • 77
    • 0000337027
    • Preference for familiar versus novel stimuli as a function of the familiarity of the environment
    • Sheldon, A. (1969). Preference for familiar versus novel stimuli as a function of the familiarity of the environment. Journal of Comparative and Physiological Psychology, 67(4), 516-521.
    • (1969) Journal of Comparative and Physiological Psychology , vol.67 , Issue.4 , pp. 516-521
    • Sheldon, A.1
  • 78
    • 0023223978
    • Toward a universal law of generalization for psychological science
    • Shepard, R. (1987). Toward a universal law of generalization for psychological science. Science, 237(4820), 1317-1323.
    • (1987) Science , vol.237 , Issue.4820 , pp. 1317-1323
    • Shepard, R.1
  • 79
    • 77957277972
    • Exemplar models as a mechanism for performing Bayesian inference
    • Shi, L., Griffiths, T., Feldman, N., & Sanborn, A. (2010). Exemplar models as a mechanism for performing Bayesian inference. Psychonomic Bulletin & Review, 17(4), 443-464.
    • (2010) Psychonomic Bulletin & Review , vol.17 , Issue.4 , pp. 443-464
    • Shi, L.1    Griffiths, T.2    Feldman, N.3    Sanborn, A.4
  • 80
    • 67349268975
    • A Bayesian analysis of human decision-making on bandit problems
    • Steyvers, M., Lee, M., & Wagenmakers, E. (2009). A Bayesian analysis of human decision-making on bandit problems. Journal of Mathematical Psychology, 53(3), 168-179.
    • (2009) Journal of Mathematical Psychology , vol.53 , Issue.3 , pp. 168-179
    • Steyvers, M.1    Lee, M.2    Wagenmakers, E.3
  • 81
    • 0032930935
    • A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task
    • Suri, R., & Schultz, W. (1999). A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task. Neuroscience, 91(3), 871-890.
    • (1999) Neuroscience , vol.91 , Issue.3 , pp. 871-890
    • Suri, R.1    Schultz, W.2
  • 83
    • 72449172543
    • On the generality and limits of abstraction in rats and humans
    • Urcelay, G., & Miller, R. (2010). On the generality and limits of abstraction in rats and humans. Animal Cognition, 13(1), 21-32.
    • (2010) Animal Cognition , vol.13 , Issue.1 , pp. 21-32
    • Urcelay, G.1    Miller, R.2
  • 84
    • 33646550876
    • Categories and causality: The neglected direction
    • Waldmann, M., & Hagmayer, Y. (2006). Categories and causality: The neglected direction. Cognitive Psychology, 53(1), 27-58.
    • (2006) Cognitive Psychology , vol.53 , Issue.1 , pp. 27-58
    • Waldmann, M.1    Hagmayer, Y.2
  • 85
    • 50549177661
    • The aetiology of food reward in monkeys
    • Weiskrantz, L., & Cowey, A. (1963). The aetiology of food reward in monkeys. Animal Behaviour, 11(2-3), 225-234.
    • (1963) Animal Behaviour , vol.11 , Issue.2-3 , pp. 225-234
    • Weiskrantz, L.1    Cowey, A.2
  • 86
    • 0000251860
    • Novelty, familiarity, and the development of infant attention
    • Weizmann, F., Cohen, L., & Pratt, R. (1971). Novelty, familiarity, and the development of infant attention. Developmental Psychology, 4(2), 149-154.
    • (1971) Developmental Psychology , vol.4 , Issue.2 , pp. 149-154
    • Weizmann, F.1    Cohen, L.2    Pratt, R.3
  • 87
    • 0002278965
    • Adaptive switching circuits
    • Widrow, B., & Hoff, M. (1960). Adaptive switching circuits. In IRE WESCON convention record (Vol. 4, pp. 96-104). New York: IRE-New York.
    • (1960) IRE WESCON convention record , vol.4 , pp. 96-104
    • Widrow, B.1    Hoff, M.2
  • 88
    • 45249097567
    • Striatal activity underlies novelty based choice in humans
    • Wittmann, B., Daw, N., Seymour, B., & Dolan, R. (2008). Striatal activity underlies novelty based choice in humans. Neuron, 58(6), 967-973.
    • (2008) Neuron , vol.58 , Issue.6 , pp. 967-973
    • Wittmann, B.1    Daw, N.2    Seymour, B.3    Dolan, R.4
  • 89
    • 0035540661
    • Mere exposure: A gateway to the subliminal
    • Zajonc, R. (2001). Mere exposure: A gateway to the subliminal. Current Directions in Psychological Science, 10(6), 224-228.
    • (2001) Current Directions in Psychological Science , vol.10 , Issue.6 , pp. 224-228
    • Zajonc, R.1


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.