SCOPUS 정보 검색 플랫폼

Topics in Cognitive Science

Volumn 7, Issue 3, 2015, Pages 391-415

Novelty and Inductive Generalization in Human Reinforcement Learning

(2) Gershman, Samuel J a Niv, Yael b

a MASSACHUSETTS INSTITUTE OF TECHNOLOGY (United States)

b PRINCETON UNIVERSITY (United States)

Author keywords

Bayesian inference; Exploration exploitation dilemma; Neophilia; Neophobia; Reinforcement learning

Indexed keywords

DOPAMINE;

ALGORITHM; ANIMAL; COGNITION; DECISION MAKING; ENVIRONMENT; EXPLORATORY BEHAVIOR; HUMAN; LEARNING; METABOLISM; PHOBIA; PHYSIOLOGY; PRISONER DILEMMA; PSYCHOLOGICAL MODEL; PSYCHOLOGY; REINFORCEMENT; REWARD;

ALGORITHMS; ANIMALS; COGNITION; DECISION MAKING; DOPAMINE; ENVIRONMENT; EXPLORATORY BEHAVIOR; HUMANS; LEARNING; MODELS, PSYCHOLOGICAL; PHOBIC DISORDERS; PRISONER DILEMMA; REINFORCEMENT (PSYCHOLOGY); REWARD;

EID: 84939246893 PISSN: 17568757 EISSN: 17568765 Source Type: Journal
DOI: 10.1111/tops.12138 Document Type: Article

Times cited : (70)

References (89)

1
- 78651226963
- Structure learning in human sequential decision-making
- Acuña, D., & Schrater, P. (2010). Structure learning in human sequential decision-making. PLoS Computational Biology, 6(12), 221-229.
- (2010) PLoS Computational Biology , vol.6 , Issue.12 , pp. 221-229
- Acuña, D.¹ Schrater, P.²

2
- 0000705894
- The adaptive nature of human categorization
- Anderson, J. (1991). The adaptive nature of human categorization. Psychological Review, 98(3), 409-429.
- (1991) Psychological Review , vol.98 , Issue.3 , pp. 409-429
- Anderson, J.¹

3
- 0034493070
- Conditioned place preference: What does it add to our preclinical understanding of drug reward
- Bardo, M., & Bevins, R. (2000). Conditioned place preference: What does it add to our preclinical understanding of drug reward? Psychopharmacology, 153(1), 31-43.
- (2000) Psychopharmacology , vol.153 , Issue.1 , pp. 31-43
- Bardo, M.¹ Bevins, R.²

4
- 0001296523
- Experiments on "neophobia' in wild and laboratory rats
- Barnett, S. (1958). Experiments on "neophobia' in wild and laboratory rats. British Journal of Psychology, 49(3), 195-201.
- (1958) British Journal of Psychology , vol.49 , Issue.3 , pp. 195-201
- Barnett, S.¹

5
- 0000541213
- Adaptive critics and the basal ganglia
- In J. Houk, J. Davis, & D. Beiser, Eds.), Cambridge, MA: MIT Press.
- Barto, A. (1995). Adaptive critics and the basal ganglia. In J. Houk, J. Davis, & D. Beiser, (Eds.), Models of information processing in the basal ganglia (pp. 215-232). Cambridge, MA: MIT Press.
- (1995) Models of information processing in the basal ganglia , pp. 215-232
- Barto, A.¹

6
- 21544435722
- Midbrain dopamine neurons encode a quantitative reward prediction error signal
- Bayer, H., & Glimcher, P. (2005). Midbrain dopamine neurons encode a quantitative reward prediction error signal. Neuron, 47(1), 129-141.
- (2005) Neuron , vol.47 , Issue.1 , pp. 129-141
- Bayer, H.¹ Glimcher, P.²

7
- 34548295327
- Learning the value of information in an uncertain world
- Behrens, T., Woolrich, M., Walton, M., & Rushworth, M. (2007). Learning the value of information in an uncertain world. Nature Neuroscience, 10(9), 1214-1221.
- (2007) Nature Neuroscience , vol.10 , Issue.9 , pp. 1214-1221
- Behrens, T.¹ Woolrich, M.² Walton, M.³ Rushworth, M.⁴

8
- 0003722409
- New York: McGraw-Hill.
- Berlyne, D. (1960). Conflict, arousal, and curiosity. New York: McGraw-Hill.
- (1960) Conflict, arousal, and curiosity
- Berlyne, D.¹

9
- 0013931617
- Curiosity and exploration
- Berlyne, D. (1966). Curiosity and exploration. Science, 153(3731), 25-33.
- (1966) Science , vol.153 , Issue.3731 , pp. 25-33
- Berlyne, D.¹

10
- 0013955837
- Novelty, arousal, and the reinforcement of diversive exploration in the rat
- Berlyne, D., Koenig, I., & Hirota, T. (1966). Novelty, arousal, and the reinforcement of diversive exploration in the rat. Journal of Comparative and Physiological Psychology, 62(2), 222-226.
- (1966) Journal of Comparative and Physiological Psychology , vol.62 , Issue.2 , pp. 222-226
- Berlyne, D.¹ Koenig, I.² Hirota, T.³

11
- 0035540669
- Novelty seeking and reward: Implications for the study of high-risk behaviors
- Bevins, R. (2001). Novelty seeking and reward: Implications for the study of high-risk behaviors. Current Directions in Psychological Science, 10(6), 189-193.
- (2001) Current Directions in Psychological Science , vol.10 , Issue.6 , pp. 189-193
- Bevins, R.¹

12
- 0016324843
- Defensive reactions and exploratory behavior in rats
- Blanchard, R., Kelley, M., & Blanchard, D. (1974). Defensive reactions and exploratory behavior in rats. Journal of Comparative and Physiological Psychology, 87(6), 1129-1133.
- (1974) Journal of Comparative and Physiological Psychology , vol.87 , Issue.6 , pp. 1129-1133
- Blanchard, R.¹ Kelley, M.² Blanchard, D.³

13
- 70350566799
- Hierarchically organized behavior and its neural foundations: A reinforcement learning perspective
- Botvinick, M., Niv, Y., & Barto, A. (2009). Hierarchically organized behavior and its neural foundations: A reinforcement learning perspective. Cognition, 113(3), 262-280.
- (2009) Cognition , vol.113 , Issue.3 , pp. 262-280
- Botvinick, M.¹ Niv, Y.² Barto, A.³

14
- 0020965801
- Contextual control of the extinction of conditioned fear: Tests for the associative value of the context
- Bouton, M., & King, D. (1983). Contextual control of the extinction of conditioned fear: Tests for the associative value of the context. Journal of Experimental Psychology: Animal Behavior Processes, 9(3), 248-265.
- (1983) Journal of Experimental Psychology: Animal Behavior Processes , vol.9 , Issue.3 , pp. 248-265
- Bouton, M.¹ King, D.²

15
- 0041965975
- R-max-a general polynomial time algorithm for nearoptimal reinforcement learning
- Brafman, R., & Tennenholtz, M. (2003). R-max-a general polynomial time algorithm for nearoptimal reinforcement learning. The Journal of Machine Learning Research, 3(3), 213-231.
- (2003) The Journal of Machine Learning Research , vol.3 , Issue.3 , pp. 213-231
- Brafman, R.¹ Tennenholtz, M.²

16
- 0030612822
- The psychophysics toolbox
- Brainard, D. (1997). The psychophysics toolbox. Spatial Vision, 10(4), 433-436.
- (1997) Spatial Vision , vol.10 , Issue.4 , pp. 433-436
- Brainard, D.¹

17
- 34250348767
- Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration
- Cohen, J., McClure, S., & Yu, A. (2007). Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration. Philosophical Transactions of the Royal Society B: Biological Sciences, 362(1481), 933-942.
- (2007) Philosophical Transactions of the Royal Society B: Biological Sciences , vol.362 , Issue.1481 , pp. 933-942
- Cohen, J.¹ McClure, S.² Yu, A.³

18
- 0018233873
- The determinants of exploration and neophobia
- Corey, D. (1978). The determinants of exploration and neophobia. Neuroscience & Biobehavioral Reviews, 2(4), 235-253.
- (1978) Neuroscience & Biobehavioral Reviews , vol.2 , Issue.4 , pp. 235-253
- Corey, D.¹

19
- 33746365099
- Bayesian theories of conditioning in a changing world
- Courville, A., Daw, N., & Touretzky, D. (2006). Bayesian theories of conditioning in a changing world. Trends in Cognitive Sciences, 10(7), 294-300.
- (2006) Trends in Cognitive Sciences , vol.10 , Issue.7 , pp. 294-300
- Courville, A.¹ Daw, N.² Touretzky, D.³

20
- 0017288709
- The new object reaction of Rattus rattus L.: The relative importance of various cues
- Cowan, P. (1976). The new object reaction of Rattus rattus L.: The relative importance of various cues. Behavioral Biology, 16(1), 31-44.
- (1976) Behavioral Biology , vol.16 , Issue.1 , pp. 31-44
- Cowan, P.¹

21
- 33745787929
- Representation and timing in theories of the dopamine system
- Daw, N., Courville, A., & Touretzky, D. (2006a). Representation and timing in theories of the dopamine system. Neural Computation, 18(7), 1637-1677.
- (2006) Neural Computation , vol.18 , Issue.7 , pp. 1637-1677
- Daw, N.¹ Courville, A.² Touretzky, D.³

22
- 33745223257
- Cortical substrates for exploratory decisions in humans
- Daw, N., O'Doherty, J., Dayan, P., Seymour, B., & Dolan, R. (2006b). Cortical substrates for exploratory decisions in humans. Nature, 441(7095), 876-879.
- (2006) Nature , vol.441 , Issue.7095 , pp. 876-879
- Daw, N.¹ O'Doherty, J.² Dayan, P.³ Seymour, B.⁴ Dolan, R.⁵

23
- 34547536392
- Associative learning mediates dynamic shifts in dopamine signaling in the nucleus accumbens
- Day, J., Roitman, M., Wightman, R., & Carelli, R. (2007). Associative learning mediates dynamic shifts in dopamine signaling in the nucleus accumbens. Nature Neuroscience, 10(8), 1020-1028.
- (2007) Nature Neuroscience , vol.10 , Issue.8 , pp. 1020-1028
- Day, J.¹ Roitman, M.² Wightman, R.³ Carelli, R.⁴

24
- 33749055062
- The misbehavior of value and the discipline of the will
- Dayan, P., Niv, Y., Seymour, B., & Daw, N. D. (2006). The misbehavior of value and the discipline of the will. Neural Networks, 19(8), 1153-1160.
- (2006) Neural Networks , vol.19 , Issue.8 , pp. 1153-1160
- Dayan, P.¹ Niv, Y.² Seymour, B.³ Daw, N.D.⁴

25
- 0031619316
- Bayesian q-learning
- In J. Mostow & C. Rich (Eds.), Madison, WI.
- Dearden, R., Friedman, N., & Russell, S. (1998). Bayesian q-learning. In J. Mostow & C. Rich (Eds.), Proceedings of the National Conference on Artificial Intelligence (pp. 761-768). Madison, WI.
- (1998) Proceedings of the National Conference on Artificial Intelligence , pp. 761-768
- Dearden, R.¹ Friedman, N.² Russell, S.³

26
- 0004025040
- Stamford, CT: Thomson/Wadsworth.
- Domjan, M. (2003). The principles of learning and behavior. Stamford, CT: Thomson/Wadsworth.
- (2003) The principles of learning and behavior
- Domjan, M.¹

27
- 1942421151
- Bayes meets Bellman: The Gaussian process approach to temporal difference learning
- In T. Fawcett & N. Mishra (Eds.), Washington, DC.
- Engel, Y., Mannor, S., & Meir, R. (2003). Bayes meets Bellman: The Gaussian process approach to temporal difference learning. In T. Fawcett & N. Mishra (Eds.), International Conference on Machine Learning (Vol. 20, pp. 154-162). Washington, DC.
- (2003) International Conference on Machine Learning , vol.20 , pp. 154-162
- Engel, Y.¹ Mannor, S.² Meir, R.³

28
- 0023691166
- A new one-trial test for neurobiological studies of memory in rats. 1: Behavioral data
- Ennaceur, A., & Delacour, J. (1988). A new one-trial test for neurobiological studies of memory in rats. 1: Behavioral data. Behavioural Brain Research, 31(1), 47-59.
- (1988) Behavioural Brain Research , vol.31 , Issue.1 , pp. 47-59
- Ennaceur, A.¹ Delacour, J.²

29
- 0000996526
- The effects of hunger and familiarity of locale on exploration
- Fehrer, E. (1956). The effects of hunger and familiarity of locale on exploration. Journal of Comparative and Physiological Psychology, 49(6), 549-552.
- (1956) Journal of Comparative and Physiological Psychology , vol.49 , Issue.6 , pp. 549-552
- Fehrer, E.¹

30
- 0015427497
- Effects of time of day and food deprivation on exploratory activity in the rat
- File, S., & Day, S. (1972). Effects of time of day and food deprivation on exploratory activity in the rat. Animal Behaviour, 20(4), 758-762.
- (1972) Animal Behaviour , vol.20 , Issue.4 , pp. 758-762
- File, S.¹ Day, S.²

31
- 57149113922
- Hierarchical models in the brain
- Friston, K. (2008). Hierarchical models in the brain. PLoS Computational Biology, 4(11), e1000211.
- (2008) PLoS Computational Biology , vol.4 , Issue.11 , pp. e1000211
- Friston, K.¹

32
- 74049117596
- Context, learning, and extinction
- Gershman, S., Blei, D., & Niv, Y. (2010). Context, learning, and extinction. Psychological Review, 117(1), 197-209.
- (2010) Psychological Review , vol.117 , Issue.1 , pp. 197-209
- Gershman, S.¹ Blei, D.² Niv, Y.³

33
- 77952541839
- Learning latent structure: Carving nature at its joints
- Gershman, S. & Niv, Y. (2010). Learning latent structure: Carving nature at its joints. Current Opinion in Neurobiology, 20(2), 251-256.
- (2010) Current Opinion in Neurobiology , vol.20 , Issue.2 , pp. 251-256
- Gershman, S.¹ Niv, Y.²

34
- 70350521769
- Human reinforcement learning subdivides structured action spaces by learning effector-specific values
- Gershman, S., Pesaran, B., & Daw, N. (2009). Human reinforcement learning subdivides structured action spaces by learning effector-specific values. Journal of Neuroscience, 29(43), 13524-13531.
- (2009) Journal of Neuroscience , vol.29 , Issue.43 , pp. 13524-13531
- Gershman, S.¹ Pesaran, B.² Daw, N.³

35
- 84891584370
- Chichester, England: John Wiley & Sons Inc.
- Gittins, J. (1989). Multi-armed bandit allocation indices. Chichester, England: John Wiley & Sons Inc.
- (1989) Multi-armed bandit allocation indices
- Gittins, J.¹

36
- 84995019952
- Stimulus generalization and representation in adaptive network models of category learning
- Gluck, M., (1991). Stimulus generalization and representation in adaptive network models of category learning. Psychological Science, 2(1), 50-55.
- (1991) Psychological Science , vol.2 , Issue.1 , pp. 50-55
- Gluck, M.¹

37
- 0001011344
- Evaluating an adaptive network model of human learning
- Gluck, M., & Bower, G. (1988a). Evaluating an adaptive network model of human learning. Journal of Memory and Language, 27(2), 166-195.
- (1988) Journal of Memory and Language , vol.27 , Issue.2 , pp. 166-195
- Gluck, M.¹ Bower, G.²

38
- 0024077113
- From conditioning to category learning: An adaptive network model
- Gluck, M., & Bower, G. (1988b). From conditioning to category learning: An adaptive network model. Journal of Experimental Psychology: General, 117(3), 227-247.
- (1988) Journal of Experimental Psychology: General , vol.117 , Issue.3 , pp. 227-247
- Gluck, M.¹ Bower, G.²

39
- 0037219403
- Learning, prediction and causal bayes nets
- Glymour, C. (2003). Learning, prediction and causal bayes nets. Trends in Cognitive Sciences, 7(1), 43-48.
- (2003) Trends in Cognitive Sciences , vol.7 , Issue.1 , pp. 43-48
- Glymour, C.¹

40
- 0034280528
- Detecting blickets: How young children use information about novel causal powers in categorization and induction
- Gopnik, A., & Sobel, D. (2000). Detecting blickets: How young children use information about novel causal powers in categorization and induction. Child Development, 7(5), 1205-1222.
- (2000) Child Development , vol.7 , Issue.5 , pp. 1205-1222
- Gopnik, A.¹ Sobel, D.²

41
- 77955281093
- Probabilistic models of cognition: Exploring representations and inductive biases
- Griffiths, T., Chater, N., Kemp, C., Perfors, A., & Tenenbaum, J. (2010). Probabilistic models of cognition: Exploring representations and inductive biases. Trends in Cognitive Sciences, 14(8), 357-364.
- (2010) Trends in Cognitive Sciences , vol.14 , Issue.8 , pp. 357-364
- Griffiths, T.¹ Chater, N.² Kemp, C.³ Perfors, A.⁴ Tenenbaum, J.⁵

42
- 0017621814
- Influence of experiential factors and gonadal hormones on pituitary-adrenal response of the mouse to novelty and electric shock
- Hennessy, J., Levin, R., & Levine, S. (1977). Influence of experiential factors and gonadal hormones on pituitary-adrenal response of the mouse to novelty and electric shock. Journal of Comparative and Physiological Psychology, 91(4), 770-777.
- (1977) Journal of Comparative and Physiological Psychology , vol.91 , Issue.4 , pp. 770-777
- Hennessy, J.¹ Levin, R.² Levine, S.³

43
- 33644688754
- Dopamine neurons report an error in the temporal prediction of reward during learning
- Hollerman, J., & Schultz, W. (1998). Dopamine neurons report an error in the temporal prediction of reward during learning. Nature Neuroscience, 1(4), 304-309.
- (1998) Nature Neuroscience , vol.1 , Issue.4 , pp. 304-309
- Hollerman, J.¹ Schultz, W.²

44
- 0030757872
- Burst activity of ventral tegmental dopamine neurons is elicited by sensory stimuli in the awake cat
- Horvitz, J., Stewart, T., & Jacobs, B. (1997). Burst activity of ventral tegmental dopamine neurons is elicited by sensory stimuli in the awake cat. Brain Research, 759(2), 251-258.
- (1997) Brain Research , vol.759 , Issue.2 , pp. 251-258
- Horvitz, J.¹ Stewart, T.² Jacobs, B.³

45
- 0002861883
- A model of how the basal ganglia generate and use neural signals that predict reinforcement
- In J. Houk, J. Davis, & D. Beiser (Eds.), Cambridge, MA: MIT Press.
- Houk, J., Adams, J., & Barto, A. (1995). A model of how the basal ganglia generate and use neural signals that predict reinforcement. In J. Houk, J. Davis, & D. Beiser (Eds.), Models of information processing in the basal ganglia (pp. 249-270). Cambridge, MA: MIT Press.
- (1995) Models of information processing in the basal ganglia , pp. 249-270
- Houk, J.¹ Adams, J.² Barto, A.³

46
- 84939003870
- Information value theory
- Howard, R. (1966). Information value theory. IEEE Transactions on Systems Science and Cybernetics, 2(1), 22-26.
- (1966) IEEE Transactions on Systems Science and Cybernetics , vol.2 , Issue.1 , pp. 22-26
- Howard, R.¹

47
- 33846987175
- Neotic preferences in laboratory rodents: Issues, assessment and substrates
- Hughes, R. (2007). Neotic preferences in laboratory rodents: Issues, assessment and substrates. Neuroscience & Biobehavioral Reviews, 31(3), 441-464.
- (2007) Neuroscience & Biobehavioral Reviews , vol.31 , Issue.3 , pp. 441-464
- Hughes, R.¹

48
- 85047672086
- Acquisition and extinction in autoshaping
- Kakade, S., & Dayan, P. (2002a). Acquisition and extinction in autoshaping. Psychological Review, 109(3), 533-544.
- (2002) Psychological Review , vol.109 , Issue.3 , pp. 533-544
- Kakade, S.¹ Dayan, P.²

49
- 0036592029
- Dopamine: Generalization and bonuses
- Kakade, S., & Dayan, P. (2002b). Dopamine: Generalization and bonuses. Neural Networks, 15(4-6), 549-559.
- (2002) Neural Networks , vol.15 , Issue.4-6 , pp. 549-559
- Kakade, S.¹ Dayan, P.²

50
- 78650147550
- Learning to learn causal models
- Kemp, C., Goodman, N., & Tenenbaum, J. (2010). Learning to learn causal models. Cognitive Science, 34(7), 1185-1243.
- (2010) Cognitive Science , vol.34 , Issue.7 , pp. 1185-1243
- Kemp, C.¹ Goodman, N.² Tenenbaum, J.³

51
- 34247368422
- Learning overhypotheses with hierarchical Bayesian models
- Kemp, C., Perfors, A., & Tenenbaum, J. (2007). Learning overhypotheses with hierarchical Bayesian models. Developmental Science, 10(3), 307-321.
- (2007) Developmental Science , vol.10 , Issue.3 , pp. 307-321
- Kemp, C.¹ Perfors, A.² Tenenbaum, J.³

52
- 0015729185
- Effect of trials on "emotionality' behavior of the rat and mouse
- King, D., & Appelbaum, J. (1973). Effect of trials on "emotionality' behavior of the rat and mouse. Journal of Comparative and Physiological Psychology, 85(1), 186-194.
- (1973) Journal of Comparative and Physiological Psychology , vol.85 , Issue.1 , pp. 186-194
- King, D.¹ Appelbaum, J.²

53
- 0034146693
- Distinguishing genuine from spurious causes: A coherence hypothesis
- Lien, Y., & Cheng, P. (2000). Distinguishing genuine from spurious causes: A coherence hypothesis. Cognitive Psychology, 40(2), 87-137.
- (2000) Cognitive Psychology , vol.40 , Issue.2 , pp. 87-137
- Lien, Y.¹ Cheng, P.²

54
- 1942539715
- Sustain: A network model of category learning
- Love, B., Medin, D., & Gureckis, T. (2004). Sustain: A network model of category learning. Psychological Review, 111(2), 309.
- (2004) Psychological Review , vol.111 , Issue.2 , pp. 309
- Love, B.¹ Medin, D.² Gureckis, T.³

55
- 77953204481
- Learning the form of causal relationships using hierarchical Bayesian models
- Lucas, C., & Griffiths, T. (2010). Learning the form of causal relationships using hierarchical Bayesian models. Cognitive Science, 34(1), 113-147.
- (2010) Cognitive Science , vol.34 , Issue.1 , pp. 113-147
- Lucas, C.¹ Griffiths, T.²

56
- 0003834557
- Cambridge, MA: Freeman.
- Marr, D. (1982). Vision. Cambridge, MA: Freeman.
- (1982) Vision
- Marr, D.¹

57
- 0004255908
- Boston: McGraw-Hill.
- Mitchell, T. (1997). Machine learning. Boston: McGraw-Hill.
- (1997) Machine learning
- Mitchell, T.¹

58
- 0029981543
- A framework for mesencephalic dopamine systems based on predictive hebbian learning
- Montague, P., Dayan, P., & Sejnowski, T. (1996). A framework for mesencephalic dopamine systems based on predictive hebbian learning. The Journal of Neuroscience, 16(5), 1936-1947.
- (1996) The Journal of Neuroscience , vol.16 , Issue.5 , pp. 1936-1947
- Montague, P.¹ Dayan, P.² Sejnowski, T.³

59
- 0009804852
- Failure to find a learned drive based on hunger; evidence for learning motivated by exploration
- Myers, A., & Miller, N. (1954). Failure to find a learned drive based on hunger; evidence for learning motivated by exploration. Journal of Comparative and Physiological Psychology, 47(6), 428-436.
- (1954) Journal of Comparative and Physiological Psychology , vol.47 , Issue.6 , pp. 428-436
- Myers, A.¹ Miller, N.²

60
- 0141596576
- Policy invariance under reward transformations: Theory and application to reward shaping
- In I. Bratko, & S. Dzeroski (Eds.), Bled, Slovenia.
- Ng, A., Harada, D., & Russell, S. (1999). Policy invariance under reward transformations: Theory and application to reward shaping. In I. Bratko, & S. Dzeroski (Eds.), Proceedings of the Sixteenth International Conference on Machine Learning. Bled, Slovenia.
- (1999) Proceedings of the Sixteenth International Conference on Machine Learning
- Ng, A.¹ Harada, D.² Russell, S.³

61
- 0009692872
- A study of exploratory behavior in the white rat by means of the obstruction method
- Nissen, H. (1930). A study of exploratory behavior in the white rat by means of the obstruction method. Journal of Genetic Psychology, 37(3), 361-376.
- (1930) Journal of Genetic Psychology , vol.37 , Issue.3 , pp. 361-376
- Nissen, H.¹

62
- 67349283062
- Reinforcement learning in the brain
- Niv, Y. (2009). Reinforcement learning in the brain. Journal of Mathematical Psychology, 53(3), 139-154.
- (2009) Journal of Mathematical Psychology , vol.53 , Issue.3 , pp. 139-154
- Niv, Y.¹

63
- 0022686961
- Attention, similarity, and the identification-categorization relationship
- Nosofsky, R. (1986). Attention, similarity, and the identification-categorization relationship. Journal of Experimental Psychology: General, 115(1), 39-57.
- (1986) Journal of Experimental Psychology: General , vol.115 , Issue.1 , pp. 39-57
- Nosofsky, R.¹

64
- 79551573880
- Risk, unexpected uncertainty, and estimation uncertainty: Bayesian learning in unstable settings
- Payzan-LeNestour, E., & Bossaerts, P. (2011). Risk, unexpected uncertainty, and estimation uncertainty: Bayesian learning in unstable settings. PLoS Computational Biology, 7(1), e1001048.
- (2011) PLoS Computational Biology , vol.7 , Issue.1
- Payzan-LeNestour, E.¹ Bossaerts, P.²

65
- 0003391330
- San Francisco, CA: Morgan Kaufmann.
- Pearl, J. (1988). Probabilistic reasoning in intelligent systems: Networks of plausible inference. San Francisco, CA: Morgan Kaufmann.
- (1988) Probabilistic reasoning in intelligent systems: Networks of plausible inference
- Pearl, J.¹

66
- 0016273271
- Effects of prenatal exposure to auditory or visual stimulation on postnatal distress vocalizations in chicks
- Rajecki, D. (1974). Effects of prenatal exposure to auditory or visual stimulation on postnatal distress vocalizations in chicks. Behavioral Biology, 11(4), 525-536.
- (1974) Behavioral Biology , vol.11 , Issue.4 , pp. 525-536
- Rajecki, D.¹

67
- 79960241771
- Decision making under uncertainty: A neural model based on partially observable markov decision processes
- Rao, R. (2010). Decision making under uncertainty: A neural model based on partially observable markov decision processes. Frontiers in Computational Neuroscience, 4(4), 146.
- (2010) Frontiers in Computational Neuroscience , vol.4 , Issue.4 , pp. 146
- Rao, R.¹

68
- 34548837994
- Reconciling reinforcement learning models with behavioral extinction and renewal: implications for addiction, relapse, and problem gambling
- Redish, A., Jensen, S., Johnson, A., & Kurth-Nelson, Z. (2007). Reconciling reinforcement learning models with behavioral extinction and renewal: implications for addiction, relapse, and problem gambling. Psychological Review, 114(3), 784-805.
- (2007) Psychological Review , vol.114 , Issue.3 , pp. 784-805
- Redish, A.¹ Jensen, S.² Johnson, A.³ Kurth-Nelson, Z.⁴

69
- 0030023561
- Intrinsic reinforcing properties of putatively neutral stimuli in an instrumental two-lever discrimination task
- Reed, P., Mitchell, C., & Nokes, T. (1996). Intrinsic reinforcing properties of putatively neutral stimuli in an instrumental two-lever discrimination task. Animal Learning and Behavior, 24(1), 38-45.
- (1996) Animal Learning and Behavior , vol.24 , Issue.1 , pp. 38-45
- Reed, P.¹ Mitchell, C.² Nokes, T.³

70
- 40749085540
- Competition between the conditioned rewarding effects of cocaine and novelty
- Reichel, C., & Bevins, R. (2008). Competition between the conditioned rewarding effects of cocaine and novelty. Behavioral Neuroscience, 122(1), 140-150.
- (2008) Behavioral Neuroscience , vol.122 , Issue.1 , pp. 140-150
- Reichel, C.¹ Bevins, R.²

71
- 0002109138
- Variations in the effectiveness of reinforcement and nonreinforcement
- In A. H. Black, & W. F. Prokasy (Eds.), New York: Appleton-Century-Crofts.
- Rescorla, R. & Wagner, A. (1972). Variations in the effectiveness of reinforcement and nonreinforcement. In A. H. Black, & W. F. Prokasy (Eds.), Classical conditioning. II: Current research and theory (pp. 64-99). New York: Appleton-Century-Crofts.
- (1972) Classical conditioning. II: Current research and theory , pp. 64-99
- Rescorla, R.¹ Wagner, A.²

72
- 79960637995
- A neural signature of hierarchical reinforcement learning
- Ribas-Fernandes, J., Solway, A., Diuk, C., McGuire, J., Barto, A., Niv, Y., & Botvinick, M. (2011). A neural signature of hierarchical reinforcement learning. Neuron, 71(2), 370-379.
- (2011) Neuron , vol.71 , Issue.2 , pp. 370-379
- Ribas-Fernandes, J.¹ Solway, A.² Diuk, C.³ McGuire, J.⁴ Barto, A.⁵ Niv, Y.⁶ Botvinick, M.⁷

73
- 0003919677
- New York: Springer Verlag.
- Robert, C., & Casella, G. (2004). Monte Carlo statistical methods. New York: Springer Verlag.
- (2004) Monte Carlo statistical methods
- Robert, C.¹ Casella, G.²

74
- 36348966690
- Reinforcement learning signals in the human striatum distinguish learners from nonlearners during reward-based decision making
- Schönberg, T., Daw, N., Joel, D., & O'Doherty, J. (2007). Reinforcement learning signals in the human striatum distinguish learners from nonlearners during reward-based decision making. The Journal of Neuroscience, 27(47), 12860-12867.
- (2007) The Journal of Neuroscience , vol.27 , Issue.47 , pp. 12860-12867
- Schönberg, T.¹ Daw, N.² Joel, D.³ O'Doherty, J.⁴

75
- 0031867046
- Predictive reward signal of dopamine neurons
- Schultz, W. (1998). Predictive reward signal of dopamine neurons. Journal of Neurophysiology, 80(1), 1-27.
- (1998) Journal of Neurophysiology , vol.80 , Issue.1 , pp. 1-27
- Schultz, W.¹

76
- 0030896968
- A neural substrate of prediction and reward
- Schultz, W., Dayan, P., & Montague, P. (1997). A neural substrate of prediction and reward. Science, 275(5306), 1593-1599.
- (1997) Science , vol.275 , Issue.5306 , pp. 1593-1599
- Schultz, W.¹ Dayan, P.² Montague, P.³

77
- 0000337027
- Preference for familiar versus novel stimuli as a function of the familiarity of the environment
- Sheldon, A. (1969). Preference for familiar versus novel stimuli as a function of the familiarity of the environment. Journal of Comparative and Physiological Psychology, 67(4), 516-521.
- (1969) Journal of Comparative and Physiological Psychology , vol.67 , Issue.4 , pp. 516-521
- Sheldon, A.¹

78
- 0023223978
- Toward a universal law of generalization for psychological science
- Shepard, R. (1987). Toward a universal law of generalization for psychological science. Science, 237(4820), 1317-1323.
- (1987) Science , vol.237 , Issue.4820 , pp. 1317-1323
- Shepard, R.¹

79
- 77957277972
- Exemplar models as a mechanism for performing bayesian inference
- Shi, L., Griffiths, T., Feldman, N., & Sanborn, A. (2010). Exemplar models as a mechanism for performing bayesian inference. Psychonomic Bulletin & Review, 17(4), 443-464.
- (2010) Psychonomic Bulletin & Review , vol.17 , Issue.4 , pp. 443-464
- Shi, L.¹ Griffiths, T.² Feldman, N.³ Sanborn, A.⁴

80
- 67349268975
- A Bayesian analysis of human decision-making on bandit problems
- Steyvers, M., Lee, M., & Wagenmakers, E. (2009). A Bayesian analysis of human decision-making on bandit problems. Journal of Mathematical Psychology, 53(3), 168-179.
- (2009) Journal of Mathematical Psychology , vol.53 , Issue.3 , pp. 168-179
- Steyvers, M.¹ Lee, M.² Wagenmakers, E.³

81
- 0032930935
- A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task
- Suri, R., Schultz, W., et al. (1999). A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task. Neuroscience, 91(3), 871-890.
- (1999) Neuroscience , vol.91 , Issue.3 , pp. 871-890
- Suri, R.¹ Schultz, W.²

82
- 0004102479
- Cambridge, MA: MIT Press.
- Sutton, R., & Barto, A. (1998). Reinforcement learning: An introduction. Cambridge, MA: MIT Press.
- (1998) Reinforcement learning: An introduction
- Sutton, R.¹ Barto, A.²

83
- 72449172543
- On the generality and limits of abstraction in rats and humans
- Urcelay, G., & Miller, R. (2010). On the generality and limits of abstraction in rats and humans. Animal Cognition, 13(1), 21-32.
- (2010) Animal Cognition , vol.13 , Issue.1 , pp. 21-32
- Urcelay, G.¹ Miller, R.²

84
- 33646550876
- Categories and causality: The neglected direction
- Waldmann, M., & Hagmayer, Y. (2006). Categories and causality: The neglected direction. Cognitive Psychology, 53(1), 27-58.
- (2006) Cognitive Psychology , vol.53 , Issue.1 , pp. 27-58
- Waldmann, M.¹ Hagmayer, Y.²

85
- 50549177661
- The aetiology of food reward in monkeys
- Weiskrantz, L., & Cowey, A. (1963). The aetiology of food reward in monkeys. Animal Behaviour, 11(2-3), 225-234.
- (1963) Animal Behaviour , vol.11 , Issue.2-3 , pp. 225-234
- Weiskrantz, L.¹ Cowey, A.²

86
- 0000251860
- Novelty, familiarity, and the development of infant attention
- Weizmann, F., Cohen, L., & Pratt, R. (1971). Novelty, familiarity, and the development of infant attention. Developmental Psychology, 4(2), 149-154.
- (1971) Developmental Psychology , vol.4 , Issue.2 , pp. 149-154
- Weizmann, F.¹ Cohen, L.² Pratt, R.³

87
- 0002278965
- Adaptive switching circuits
- In New York: IRE-New York.
- Widrow, B., & Hoff, M. (1960). Adaptive switching circuits. In IRE WES CON convention record (Vol. 4, pp. 96-104). New York: IRE-New York.
- (1960) IRE WES CON convention record , vol.4 , pp. 96-104
- Widrow, B.¹ Hoff, M.²

88
- 45249097567
- Striatal activity underlies novelty based choice in humans
- Wittmann, B., Daw, N., Seymour, B., & Dolan, R. (2008). Striatal activity underlies novelty based choice in humans. Neuron, 58(6), 967-973.
- (2008) Neuron , vol.58 , Issue.6 , pp. 967-973
- Wittmann, B.¹ Daw, N.² Seymour, B.³ Dolan, R.⁴

89
- 0035540661
- Mere exposure: A gateway to the subliminal
- Zajonc, R. (2001). Mere exposure: A gateway to the subliminal. Current Directions in Psychological Science, 10(6), 224-228.
- (2001) Current Directions in Psychological Science , vol.10 , Issue.6 , pp. 224-228
- Zajonc, R.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.