메뉴 건너뛰기




Volumn 40, Issue 3, 2012, Pages 305-319

Evaluating the TD model of classical conditioning

Author keywords

Associative learning; Classical conditioning; Reinforcement learning; Timing

Indexed keywords

ALGORITHM; ANIMAL; ARTICLE; COMPUTER SIMULATION; CONDITIONED REFLEX; LEARNING; PSYCHOLOGICAL MODEL; REINFORCEMENT; STATISTICS; TIME;

EID: 84870910490     PISSN: 15434494     EISSN: 15434508     Source Type: Journal    
DOI: 10.3758/s13420-012-0082-6     Document Type: Article
Times cited : (78)

References (54)
  • 1
    • 46649106964 scopus 로고    scopus 로고
    • Cs-us temporal relations in blocking
    • Amundson, J. C., & Miller, R. R. (2008). CS-US temporal relations in blocking. Learning & Behavior, 36, 92-103.
    • (2008) Learning & Behavior , vol.36 , pp. 92-103
    • Amundson, J.C.1    Miller, R.R.2
  • 2
    • 0033508899 scopus 로고    scopus 로고
    • How the basal ganglia use parallel excitatory and inhibitory learning pathways to selectively respond to unexpected rewarding cues
    • Brown, J., Bullock, D., & Grossberg, S. (1999). How the basal ganglia use parallel excitatory and inhibitory learning pathways to selectively respond to unexpected rewarding cues. Journal of Neuroscience, 19, 10502-10511. (Pubitemid 30228056)
    • (1999) Journal of Neuroscience , vol.19 , Issue.23 , pp. 10502-10511
    • Brown, J.1    Bullock, D.2    Grossberg, S.3
  • 3
    • 0032974481 scopus 로고    scopus 로고
    • Timing in simple conditioning and occasion setting: A neural network approach
    • DOI 10.1016/S0376-6357(99)00008-X, PII S037663579900008X
    • Buhusi, C. V., & Schmajuk, N. A. (1999). Timing in simple conditioning and occasion setting: A neural network approach. Behavioural Processes, 45, 33-57. (Pubitemid 29134913)
    • (1999) Behavioural Processes , vol.45 , Issue.1-3 , pp. 33-57
    • Buhusi, C.V.1    Schmajuk, N.A.2
  • 4
    • 0000333399 scopus 로고    scopus 로고
    • Theories of conditioning and timing
    • In R. R. Mowrer & S. B. Klein (Eds.). Hillsdale, NJ: Erlbaum
    • Church, R. M., & Kirkpatrick, K. (2001). Theories of conditioning and timing. In R. R. Mowrer & S. B. Klein (Eds.), Contemporary learning: Theory and applications (pp. 211-253). Hillsdale, NJ: Erlbaum.
    • (2001) Contemporary Learning: Theory and Applications , pp. 211-253
    • Church, R.M.1    Kirkpatrick, K.2
  • 5
    • 35748963871 scopus 로고    scopus 로고
    • Temporal-Difference Prediction Errors and Pavlovian Fear Conditioning: Role of NMDA and Opioid Receptors
    • DOI 10.1037/0735-7044.121.5.1043, PII S0735704407605424
    • Cole, S., & McNally, G. P. (2007). Temporal-difference prediction errors and Pavlovian fear conditioning: Role of NMDA and opioid receptors. Behavioral Neuroscience, 121, 1043-1052. (Pubitemid 350051306)
    • (2007) Behavioral Neuroscience , vol.121 , Issue.5 , pp. 1043-1052
    • Cole, S.1    McNally, G.P.2
  • 6
    • 33745787929 scopus 로고    scopus 로고
    • Representation and timing in theories of the dopamine system
    • DOI 10.1162/neco.2006.18.7.1637
    • Daw, N. D., Courville, A. C., & Touretzky, D. S. (2006). Representation and timing in theories of the dopamine system. Neural Computation, 18, 1637-1677. (Pubitemid 44024733)
    • (2006) Neural Computation , vol.18 , Issue.7 , pp. 1637-1677
    • Daw, N.D.1    Courville, A.C.2    Touretzky, D.S.3
  • 7
    • 28044450875 scopus 로고    scopus 로고
    • Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
    • DOI 10.1038/nn1560, PII N1560
    • Daw, N. D., Niv, Y., & Dayan, P. (2005). Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nature Neuroscience, 8, 1704-1711. (Pubitemid 41683198)
    • (2005) Nature Neuroscience , vol.8 , Issue.12 , pp. 1704-1711
    • Daw, N.D.1    Niv, Y.2    Dayan, P.3
  • 8
    • 0023947554 scopus 로고
    • Adaptive timing in neural networks: The conditioned response
    • Desmond, J. E., & Moore, J. W. (1988). Adaptive timing in neural networks: The conditioned response. Biological Cybernetics, 58, 405-415.
    • (1988) Biological Cybernetics , vol.58 , pp. 405-415
    • Desmond, J.E.1    Moore, J.W.2
  • 10
    • 0011255655 scopus 로고
    • Secondary reinforcement in rats as a function of information value and reliability of the stimulus
    • Egger, M. D., & Miller, N. E. (1962). Secondary reinforcement in rats as a function of information value and reliability of the stimulus. Journal of Experimental Psychology, 64, 97-104.
    • (1962) Journal of Experimental Psychology , vol.64 , pp. 97-104
    • Egger, M.D.1    Miller, N.E.2
  • 11
    • 0001152235 scopus 로고
    • Model of conditioning incorporating the rescorla-wagner associative axiom, a dynamic attention process, and a catastrophe rule
    • Frey, P. W., & Sears, R. J. (1978). Model of conditioning incorporating the Rescorla-Wagner associative axiom, a dynamic attention process, and a catastrophe rule. Psychological Review, 85, 321-348.
    • (1978) Psychological Review , vol.85 , pp. 321-348
    • Frey, P.W.1    Sears, R.J.2
  • 13
    • 0024775767 scopus 로고
    • Neural dynamics of adaptive timing and temporal discrimination during associative learning
    • Grossberg, S., & Schmajuk, N. A. (1989). Neural dynamics of adaptive timing and temporal discrimination during associative learning. Neural Networks, 2, 79-102.
    • (1989) Neural Networks , vol.2 , pp. 79-102
    • Grossberg, S.1    Schmajuk, N.A.2
  • 16
    • 31544463783 scopus 로고    scopus 로고
    • Interval duration effects on blocking in appetitive conditioning
    • DOI 10.1016/j.beproc.2005.11.007, PII S0376635705002299, Interval Timing: The Current Status
    • Jennings, D. J., & Kirkpatrick, K. (2006). Interval duration effects on blocking in appetitive conditioning. Behavioural Processes, 71, 318-329. (Pubitemid 43163028)
    • (2006) Behavioural Processes , vol.71 , Issue.2-3 , pp. 318-329
    • Jennings, D.1    Kirkpatrick, K.2
  • 18
    • 43449123616 scopus 로고    scopus 로고
    • Magnitude and Timing of Nictitating Membrane Movements During Classical Conditioning of the Rabbit (Oryctolagus cuniculus)
    • DOI 10.1037/0735-7044.122.2.471, PII S0735704408600786
    • Kehoe, E. J., Ludvig, E. A., Dudeney, J. E., Neufeld, J., & Sutton, R. S. (2008). Magnitude and timing of nictitating membrane movements during classical conditioning of the rabbit (Oryctolagus cuniculus). Behavioral Neuroscience, 122, 471-476. (Pubitemid 351672338)
    • (2008) Behavioral Neuroscience , vol.122 , Issue.2 , pp. 471-476
    • Kehoe, E.J.1    Ludvig, E.A.2    Dudeney, J.E.3    Neufeld, J.4    Sutton, R.S.5
  • 19
    • 70350090173 scopus 로고    scopus 로고
    • Magnitude and timing of crs in delay and trace classical conditioning of the nictitating membrane response of the rabbit (oryctolagus cuniculus
    • Kehoe, E. J., Ludvig, E. A., & Sutton, R. S. (2009a). Magnitude and timing of CRs in delay and trace classical conditioning of the nictitating membrane response of the rabbit (Oryctolagus cuniculus). Behavioral Neuroscience, 123, 1095-1101.
    • (2009) Behavioral Neuroscience , vol.123 , pp. 1095-1101
    • Kehoe, E.J.1    Ludvig, E.A.2    Sutton, R.S.3
  • 20
    • 60549096084 scopus 로고    scopus 로고
    • Scalar timing varies with response magnitude in classical conditioning of the nictitating membrane response of the rabbit (oryctolagus cuniculus
    • Kehoe, E. J., Olsen, K. N., Ludvig, E. A., & Sutton, R. S. (2009b). Scalar timing varies with response magnitude in classical conditioning of the nictitating membrane response of the rabbit (Oryctolagus cuniculus). Behavioral Neuroscience, 123, 212-217.
    • (2009) Behavioral Neuroscience , vol.123 , pp. 212-217
    • Kehoe, E.J.1    Olsen, K.N.2    Ludvig, E.A.3    Sutton, R.S.4
  • 21
    • 0013619751 scopus 로고
    • Blocking acquisition of the rabbit's nictitating membrane response to serial conditioned stimuli
    • Kehoe, E. J., Schreurs, B. G., & Amodei, N. (1981). Blocking acquisition of the rabbit's nictitating membrane response to serial conditioned stimuli. Learning and Motivation, 12, 92-108.
    • (1981) Learning and Motivation , vol.12 , pp. 92-108
    • Kehoe, E.J.1    Schreurs, B.G.2    Amodei, N.3
  • 22
    • 0023501932 scopus 로고
    • Temporal primacy overrides prior training in serial compound conditioning of the rabbit's nictitating membrane response
    • Kehoe, E. J., Schreurs, B. G., & Graham, P. (1987). Temporal primacy overrides prior training in serial compound conditioning of the rabbit's nictitating membrane response. Animal Learning & Behavior, 15, 455-464. (Pubitemid 18000790)
    • (1987) Animal Learning and Behavior , vol.15 , Issue.4 , pp. 455-464
    • Kehoe, E.J.1    Schreurs, B.G.2    Graham, P.3
  • 23
    • 0036075297 scopus 로고    scopus 로고
    • Extinction revisited: Similarities between extinction and reductions in US intensity in classical conditioning of the rabbit's nictitating membrane response
    • Kehoe, E. J., & White, N. E. (2002). Extinction revisited: Similarities between extinction and reductions in US intensity in classical conditioning of the rabbit's nictitating membrane response. Animal Learning & Behavior, 30, 96-111. (Pubitemid 34651254)
    • (2002) Animal Learning and Behavior , vol.30 , Issue.2 , pp. 96-111
    • Kehoe, E.J.1    White, N.E.2
  • 27
    • 57349130536 scopus 로고    scopus 로고
    • Stimulus representation and the timing of reward-prediction errors in models of the dopamine system
    • Ludvig, E. A., Sutton, R. S., & Kehoe, E. J. (2008). Stimulus representation and the timing of reward-prediction errors in models of the dopamine system. Neural Computation, 20, 3034-3054.
    • (2008) Neural Computation , vol.20 , pp. 3034-3054
    • Ludvig, E.A.1    Sutton, R.S.2    Kehoe, E.J.3
  • 29
    • 80051879076 scopus 로고    scopus 로고
    • Hippocampal "time cells" bridge the gap in memory for discontiguous events
    • MacDonald, C. J., Lepage, K. Q., Eden, U. T., & Eichenbaum, H. (2011). Hippocampal "time cells" bridge the gap in memory for discontiguous events. Neuron, 71, 737-749.
    • (2011) Neuron , vol.71 , pp. 737-749
    • MacDonald, C.J.1    Lepage, K.Q.2    Eden, U.T.3    Eichenbaum, H.4
  • 30
    • 0031110595 scopus 로고    scopus 로고
    • Learning the Temporal Dynamics of Behavior
    • Machado, A. (1997). Learning the temporal dynamics of behavior. Psychological Review, 104, 241-265. (Pubitemid 127455260)
    • (1997) Psychological Review , vol.104 , Issue.2 , pp. 241-265
    • Machado, A.1
  • 31
    • 72449166356 scopus 로고    scopus 로고
    • Reinforcement learning, conditioning, and the brain: Successes and challenges
    • Maia, T. V. (2009). Reinforcement learning, conditioning, and the brain: Successes and challenges. Cognitive, Affective, & Behavioral Neuroscience, 9, 343-364.
    • (2009) Cognitive, Affective, & Behavioral Neuroscience , vol.9 , pp. 343-364
    • Maia, T.V.1
  • 32
    • 0029981543 scopus 로고    scopus 로고
    • A framework for mesencephalic dopamine systems based on predictive Hebbian learning
    • Montague, P. R., Dayan, P., & Sejnowski, T. J. (1996). A framework for mesencephalic dopamine systems based on predictive Hebbian learning. Journal of Neuroscience, 16, 1936-1947. (Pubitemid 26145969)
    • (1996) Journal of Neuroscience , vol.16 , Issue.5 , pp. 1936-1947
    • Montague, P.R.1    Dayan, P.2    Sejnowski, T.J.3
  • 33
    • 77956766137 scopus 로고    scopus 로고
    • The td model of classical conditioning: Response topography and brain implementation
    • In J. W. Donahoe & V. P. Dorsel (Eds.). Amsterdam: North-Holland/Elsevier
    • Moore, J. W., & Choi, J. S. (1997). The TD model of classical conditioning: Response topography and brain implementation. In J. W. Donahoe & V. P. Dorsel (Eds.), Neural-network models of cognition, biobehavioral foundations (Advances in Psychology (pp, Vol. 121, pp. 387-405). Amsterdam: North-Holland/Elsevier.
    • (1997) Neural-Network Models of Cognition, Biobehavioral Foundations Advances in Psychology , vol.121 , pp. 387-405
    • Moore, J.W.1    Choi, J.S.2
  • 34
    • 0022486566 scopus 로고
    • Simulation of the classically conditioned nictitating membrane response by a neuron-like adaptive element: Response topography, neuronal firing, and interstimulus intervals
    • DOI 10.1016/0166-4328(86)90092-6
    • Moore, J. W., Desmond, J. E., Berthier, N. E., Blazis, D. E. J., Sutton, R. S., & Barto, A. G. (1986). Simulation of the classically conditioned nictitating membrane response by a neuron-like adaptive element: Response topography, neuronal firing and inter-stimulus intervals. Behavioral Brain Research, 21, 143-154. (Pubitemid 16042818)
    • (1986) Behavioural Brain Research , vol.21 , Issue.2 , pp. 143-154
    • Moore, J.W.1    Desmond, J.E.2    Berthier, N.E.3
  • 35
    • 67349283062 scopus 로고    scopus 로고
    • Reinforcement learning in the brain
    • Niv, Y. (2009). Reinforcement learning in the brain. Journal of Mathematical Psychology, 53, 139-154.
    • (2009) Journal of Mathematical Psychology , vol.53 , pp. 139-154
    • Niv, Y.1
  • 36
    • 55749102442 scopus 로고    scopus 로고
    • Tripartite mechanism of extinction suggested by dopamine neuron activity and temporal difference model
    • Pan, W. X., Schmidt, R., Wickens, J. R., & Hyland, B. I. (2008). Tripartite mechanism of extinction suggested by dopamine neuron activity and temporal difference model. Journal of Neuroscience, 28, 9619-9631.
    • (2008) Journal of Neuroscience , vol.28 , pp. 9619-9631
    • Pan, W.X.1    Schmidt, R.2    Wickens, J.R.3    Hyland, B.I.4
  • 37
    • 0023233591 scopus 로고
    • A model of stimulus generalization for pavlovian conditioning
    • Pearce, J. M. (1987). A model of stimulus generalization for Pavlovian conditioning. Psychological Review, 94, 61-73.
    • (1987) Psychological Review , vol.94 , pp. 61-73
    • Pearce, J.M.1
  • 38
    • 0028526748 scopus 로고
    • Similarity and discrimination: A selective review and a connectionist model
    • Pearce, J. M. (1994). Similarity and discrimination: A selective review and a connectionist model. Psychological Review, 101, 587-607. (Pubitemid 24976818)
    • (1994) Psychological Review , vol.101 , Issue.4 , pp. 587-607
    • Pearce, J.M.1
  • 39
    • 0019089514 scopus 로고
    • A model for pavlovian learning: Variations in the effectiveness of conditioned but not of unconditioned stimuli
    • Pearce, J. M., & Hall, G. (1980). A model for Pavlovian learning: Variations in the effectiveness of conditioned but not of unconditioned stimuli. Psychological Review, 87, 532-552.
    • (1980) Psychological Review , vol.87 , pp. 532-552
    • Pearce, J.M.1    Hall, G.2
  • 40
    • 0002109138 scopus 로고
    • A theory of pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement
    • In A. H. Black & W. F. Prokasy (Eds.). New York: Appleton-Century- Crofts
    • Rescorla, R. A., & Wagner, A. R. (1972). A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement. In A. H. Black & W. F. Prokasy (Eds.), Classical conditioning II (pp. 64-99). New York: Appleton-Century-Crofts.
    • (1972) Classical Conditioning II , pp. 64-99
    • Rescorla, R.A.1    Wagner, A.R.2
  • 42
    • 4243979578 scopus 로고
    • The effects of changes in the cs-us interval during compound conditioning upon an otherwise blocked element
    • Schreurs, B. G., & Westbrook, R. F. (1982). The effects of changes in the CS-US interval during compound conditioning upon an otherwise blocked element. Quarterly Journal of Experimental Psychology, 34B, 19-30.
    • (1982) Quarterly Journal of Experimental Psychology , vol.34 B , pp. 19-30
    • Schreurs, B.G.1    Westbrook, R.F.2
  • 43
    • 32444439058 scopus 로고    scopus 로고
    • Behavioral theories and the neurophysiology of reward
    • DOI 10.1146/annurev.psych.56.091103.070229
    • Schultz, W. (2006). Behavioral theories and the neurophysiology of reward. Annual Review of Psychology, 57, 87-115. (Pubitemid 43237228)
    • (2006) Annual Review of Psychology , vol.57 , pp. 87-115
    • Schultz, W.1
  • 44
    • 0030896968 scopus 로고    scopus 로고
    • A neural substrate of prediction and reward
    • DOI 10.1126/science.275.5306.1593
    • Schultz,W., Dayan, P., & Montague, P. R. (1997). A neural substrate of prediction and reward. Science, 275, 1593-1599. (Pubitemid 27120526)
    • (1997) Science , vol.275 , Issue.5306 , pp. 1593-1599
    • Schultz, W.1    Dayan, P.2    Montague, P.R.3
  • 45
    • 0014397999 scopus 로고
    • Cs-us interval and us intensity in classical conditioning of the rabbit's nictitating membrane response
    • Smith, M. C. (1968). CS-US interval and US intensity in classical conditioning of the rabbit's nictitating membrane response. Journal of Comparative and Physiological Psychology, 66, 679-687.
    • (1968) Journal of Comparative and Physiological Psychology , vol.66 , pp. 679-687
    • Smith, M.C.1
  • 46
    • 0014587943 scopus 로고
    • Classical conditioning of the rabbit's nictitating membrane response at backward, simultaneous, and forward cs-us intervals
    • Smith, M. C., Coleman, S. R., & Gormezano, I. (1969). Classical conditioning of the rabbit's nictitating membrane response at backward, simultaneous, and forward CS-US intervals. Journal of Comparative and Physiological Psychology, 69, 226-231.
    • (1969) Journal of Comparative and Physiological Psychology , vol.69 , pp. 226-231
    • Smith, M.C.1    Coleman, S.R.2    Gormezano, I.3
  • 47
    • 0032930935 scopus 로고    scopus 로고
    • A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task
    • DOI 10.1016/S0306-4522(98)00697-6, PII S0306452298006976
    • Suri, R. E., & Schultz, W. (1999). A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task. Neuroscience, 91, 871-890. (Pubitemid 29225621)
    • (1999) Neuroscience , vol.91 , Issue.3 , pp. 871-890
    • Suri, R.E.1    Schultz, W.2
  • 48
    • 33847202724 scopus 로고
    • Learning to predict by the methods of temporal differences
    • Sutton, R. S. (1988). Learning to predict by the methods of temporal differences. Machine Learning, 3, 9-44.
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.S.1
  • 49
    • 85132026293 scopus 로고
    • Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
    • Sutton, R. S. (1990). Integrated architectures for learning, planning, and reacting based on approximating dynamic programming. International Conference on Machine Learning (ICML), 7, 216-224.
    • (1990) International Conference on Machine Learning (ICML , vol.7 , pp. 216-224
    • Sutton, R.S.1
  • 50
    • 0026971570 scopus 로고
    • Adapting bias by gradient descent: An incremental version of delta-bar-delta
    • Sutton, R. S. (1992). Adapting bias by gradient descent: An incremental version of delta-bar-delta. National Conference on Artificial Intelligence, 10, 171-176.
    • (1992) National Conference on Artificial Intelligence , vol.10 , pp. 171-176
    • Sutton, R.S.1
  • 51
    • 0019537951 scopus 로고
    • Toward a modern theory of adaptive networks: Expectation and prediction
    • Sutton, R. S., & Barto, A. G. (1981). Toward a modern theory of adaptive networks: Expectation and prediction. Psychological Review, 88, 135-171.
    • (1981) Psychological Review , vol.88 , pp. 135-171
    • Sutton, R.S.1    Barto, A.G.2
  • 52
    • 0003066891 scopus 로고
    • Time-derivative models of pavlovian reinforcement
    • In M. Gabriel & J. W. Moore (Eds.). Cambridge, MA: MIT Press
    • Sutton, R. S., & Barto, A. G. (1990). Time-derivative models of Pavlovian reinforcement. In M. Gabriel & J. W. Moore (Eds.), Learning and computational neuroscience (pp. 497-537). Cambridge, MA: MIT Press.
    • (1990) Learning and Computational Neuroscience , pp. 497-537
    • Sutton, R.S.1    Barto, A.G.2
  • 54
    • 0037471531 scopus 로고    scopus 로고
    • Stimulus representation in SOP: II. An application to inhibition of delay
    • DOI 10.1016/S0376-6357(03)00050-0
    • Vogel, E. H., Brandon, S. E., & Wagner, A. R. (2003). Stimulus representation in SOP: II. An application to inhibition of delay. Behavioural Processes, 62, 27-48. (Pubitemid 36511421)
    • (2003) Behavioural Processes , vol.62 , Issue.1-3 , pp. 27-48
    • Vogel, E.H.1    Brandon, S.E.2    Wagner, A.R.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.