메뉴 건너뛰기




Volumn 10, Issue 2, 1997, Pages 201-229

A dynamic theory of acquisition and extinction in operant learning

Author keywords

assignment of credit; contingency; expectancy; long term memory; operant conditioning; recurrent choice; reinforcement learning; short term memory

Indexed keywords

ASSOCIATIVE STORAGE; LEARNING SYSTEMS; MATHEMATICAL MODELS; PROBABILITY;

EID: 0031104847     PISSN: 08936080     EISSN: None     Source Type: Journal    
DOI: 10.1016/S0893-6080(96)00067-6     Document Type: Article
Times cited : (14)

References (78)
  • 1
    • 84946268134 scopus 로고
    • Variations in the sensitivity of instrumental responding to reinforcer devaluation
    • Adams, C. D. (1982). Variations in the sensitivity of instrumental responding to reinforcer devaluation. Quarterly Journal of Experimental Psychology, 34B, 77-98.
    • (1982) Quarterly Journal of Experimental Psychology , vol.34 B , pp. 77-98
    • Adams, C.D.1
  • 2
    • 0001316548 scopus 로고
    • Frustrative nonreward in partial reinforcement and discrimination learning
    • Amsel, A. (1962). Frustrative nonreward in partial reinforcement and discrimination learning. Psychological Review, 69, 306-328.
    • (1962) Psychological Review , vol.69 , pp. 306-328
    • Amsel, A.1
  • 3
    • 0041197227 scopus 로고
    • Partial-reinforcement extinction effect following different amounts of training
    • Bacon, W. E. (1962). Partial-reinforcement extinction effect following different amounts of training. Journal of Comparative and Physiological Psychology, 55, 998-1003.
    • (1962) Journal of Comparative and Physiological Psychology , vol.55 , pp. 998-1003
    • Bacon, W.E.1
  • 4
    • 0025423150 scopus 로고
    • Choice behavior in transition: Development of preference for the higher probability of reinforcement
    • Bailey, J. T., & Mazur, J. E. (1990). Choice behavior in transition: development of preference for the higher probability of reinforcement. Journal of the Experimental Analysis of Behavior, 53, 409-422.
    • (1990) Journal of the Experimental Analysis of Behavior , vol.53 , pp. 409-422
    • Bailey, J.T.1    Mazur, J.E.2
  • 7
    • 0014262701 scopus 로고
    • Shifts in magnitude of reward and contrast effects in instrumental conditioning
    • Black, R. W. (1968). Shifts in magnitude of reward and contrast effects in instrumental conditioning. Psychological Review, 75, 114-126.
    • (1968) Psychological Review , vol.75 , pp. 114-126
    • Black, R.W.1
  • 8
    • 0001147768 scopus 로고
    • A contrast effect in differential conditioning
    • Bower, G. H. (1961). A contrast effect in differential conditioning. Journal of Experimental Psychology, 62, 196-199.
    • (1961) Journal of Experimental Psychology , vol.62 , pp. 196-199
    • Bower, G.H.1
  • 9
    • 0342427333 scopus 로고
    • Spontaneous recovery: A (Hullian) noninhibition interpretation
    • Burstein, K. R. (1967). Spontaneous recovery: A (Hullian) noninhibition interpretation. Psychonomic Science, 7, 389-390.
    • (1967) Psychonomic Science , vol.7 , pp. 389-390
    • Burstein, K.R.1
  • 10
    • 0014094874 scopus 로고
    • Sequential versus nonsequential variables in partial delay of reward
    • Capaldi, E. J. (1967). Sequential versus nonsequential variables in partial delay of reward. Journal of Experimental Psychology, 74, 161-166.
    • (1967) Journal of Experimental Psychology , vol.74 , pp. 161-166
    • Capaldi, E.J.1
  • 11
    • 0002511024 scopus 로고
    • Memory and learning: A sequential viewpoint
    • W. K. Honig & P.H.R. James (Eds.). New York: Academic Press
    • Capaldi, E. J. (1971). Memory and learning: a sequential viewpoint. In W. K. Honig & P.H.R. James (Eds.). Animal memory (pp. 111-154). New York: Academic Press.
    • (1971) Animal Memory , pp. 111-154
    • Capaldi, E.J.1
  • 12
    • 0343296868 scopus 로고
    • Reward schedule effects at a relatively long intertrial interval
    • Capaldi, E. J., & Minkoff, R. (1967). Reward schedule effects at a relatively long intertrial interval. Psychonomic Science, 9, 169-170.
    • (1967) Psychonomic Science , vol.9 , pp. 169-170
    • Capaldi, E.J.1    Minkoff, R.2
  • 16
    • 0000145916 scopus 로고
    • A review of recent incentive contrast studies involving discrete trial procedures
    • Cox, W. M. (1975). A review of recent incentive contrast studies involving discrete trial procedures. The Psychological Record, 25, 373-393.
    • (1975) The Psychological Record , vol.25 , pp. 373-393
    • Cox, W.M.1
  • 17
    • 0000928662 scopus 로고
    • Quantitative variation of incentive and performance in the white rat
    • Crespi, L. P. (1952). Quantitative variation of incentive and performance in the white rat. American Journal of Psychology, 55, 467-517.
    • (1952) American Journal of Psychology , vol.55 , pp. 467-517
    • Crespi, L.P.1
  • 18
    • 0001655080 scopus 로고
    • A mathematical model of reward and aversive nonreward: Its application in over 30 appetitive learning situations
    • Daly, H. B., & Daly, J. T. (1982). A mathematical model of reward and aversive nonreward: its application in over 30 appetitive learning situations. Journal of Experimental Psychology: General, 111, 441-480.
    • (1982) Journal of Experimental Psychology: General , vol.111 , pp. 441-480
    • Daly, H.B.1    Daly, J.T.2
  • 20
    • 0002550972 scopus 로고
    • Negative contrast effect as a function of magnitude of reward decrement
    • Di Lollo, V., & Beez, V. (1966). Negative contrast effect as a function of magnitude of reward decrement. Psychonomic Science, 5, 99-100.
    • (1966) Psychonomic Science , vol.5 , pp. 99-100
    • Di Lollo, V.1    Beez, V.2
  • 21
    • 0040009877 scopus 로고
    • A competitive neural network model for the process of recurrent choice
    • M. C. Mozer, P. Smolensky, D. S. Touretzky, J. L. Elman, & A. S. Weigend (Eds.), Hillsdale, N.J.: Lawrence Erlbaum Associates
    • Dragoi, V., & Staddon, J. E. R. (1993). A competitive neural network model for the process of recurrent choice. In M. C. Mozer, P. Smolensky, D. S. Touretzky, J. L. Elman, & A. S. Weigend (Eds.), Proceedings of the 1993 Connectionist Models Summer School (pp. 65-73). Hillsdale, N.J.: Lawrence Erlbaum Associates.
    • (1993) Proceedings of the 1993 Connectionist Models Summer School , pp. 65-73
    • Dragoi, V.1    Staddon, J.E.R.2
  • 22
    • 0000581049 scopus 로고
    • Statistical theory of spontaneous recovery and regression
    • Estes, W. K. (1955). Statistical theory of spontaneous recovery and regression. Psychological Review, 62, 145-154.
    • (1955) Psychological Review , vol.62 , pp. 145-154
    • Estes, W.K.1
  • 26
    • 0015486047 scopus 로고
    • A neural theory of punishment and avoidance, II: Quantitative theory
    • Grossberg, S. (1972). A neural theory of punishment and avoidance, II: Quantitative theory. Mathematical Biosciences, 15, 39-67.
    • (1972) Mathematical Biosciences , vol.15 , pp. 39-67
    • Grossberg, S.1
  • 27
    • 0016589560 scopus 로고
    • A neural model of attention, reinforcement, and discrimination learning
    • Grossberg, S. (1975). A neural model of attention, reinforcement, and discrimination learning. International Review of Neurobiology, 18, 263-325.
    • (1975) International Review of Neurobiology , vol.18 , pp. 263-325
    • Grossberg, S.1
  • 28
    • 84893616503 scopus 로고
    • Psychophysiological substrates of schedule interactions and behavioral contrast
    • Grossberg, S. (1981). Psychophysiological substrates of schedule interactions and behavioral contrast. SIAM-AMS Proceedings, 13, 157-186.
    • (1981) SIAM-AMS Proceedings , vol.13 , pp. 157-186
    • Grossberg, S.1
  • 29
    • 0020187167 scopus 로고
    • Processing of expected and unexpected events during conditioning and attention: A psychophysiological theory
    • Grossberg, S. (1982). Processing of expected and unexpected events during conditioning and attention: a psychophysiological theory. Psychological Review, 89, 529-572.
    • (1982) Psychological Review , vol.89 , pp. 529-572
    • Grossberg, S.1
  • 30
    • 0017352369 scopus 로고
    • Positive contrast, negative induction, and inhibitory stimulus control in rat
    • Gutman, A. (1977). Positive contrast, negative induction, and inhibitory stimulus control in rat. Journal of the Experimental Analysis of Behavior, 27, 219-233.
    • (1977) Journal of the Experimental Analysis of Behavior , vol.27 , pp. 219-233
    • Gutman, A.1
  • 31
    • 0002833950 scopus 로고
    • The formation of learning sets
    • Harlow, H. F. (1949). The formation of learning sets. Psychological Review, 56, 51-65.
    • (1949) Psychological Review , vol.56 , pp. 51-65
    • Harlow, H.F.1
  • 32
    • 27844539379 scopus 로고
    • Relative and absolute strength of response as a function of frequency of reinforcement
    • Herrnstein, R. J. (1961). Relative and absolute strength of response as a function of frequency of reinforcement. Journal of Experimental Analysis and Behavior, 4, 267-272.
    • (1961) Journal of Experimental Analysis and Behavior , vol.4 , pp. 267-272
    • Herrnstein, R.J.1
  • 34
  • 35
    • 0003093362 scopus 로고
    • Foraging in a changing environment: An experiment with starlings (sturnus vulgaris)
    • M. L. Commons, A. Kacelnik & S. J. Shettleworth (Eds.), Hillsdale, NJ: Laurence Erlbaum
    • Kacelnik, A., Krebs, J. R. & Ens, B. (1987). Foraging in a changing environment: an experiment with starlings (sturnus vulgaris). In M. L. Commons, A. Kacelnik & S. J. Shettleworth (Eds.), Quantitative analyses of behavior VI: foraging (pp. 63-87). Hillsdale, NJ: Laurence Erlbaum.
    • (1987) Quantitative Analyses of Behavior VI: Foraging , pp. 63-87
    • Kacelnik, A.1    Krebs, J.R.2    Ens, B.3
  • 36
    • 0014281345 scopus 로고
    • On the measurement of reinforcement frequency in the study of preference
    • Killeen, P. (1968). on the measurement of reinforcement frequency in the study of preference. Journal of the Experimental Analysis of Behavior, 11, 263-269.
    • (1968) Journal of the Experimental Analysis of Behavior , vol.11 , pp. 263-269
    • Killeen, P.1
  • 37
  • 38
    • 0023878618 scopus 로고
    • A neuronal model of classical conditioning
    • Klopf, A. H. (1988). A neuronal model of classical conditioning. Psychobiology, 16, 85-125.
    • (1988) Psychobiology , vol.16 , pp. 85-125
    • Klopf, A.H.1
  • 41
    • 0017048367 scopus 로고
    • Positive and negative successive contrast effects following multiple shifts in reward magnitude under high drive and immediate reinforcement
    • Maxwell, F. R., Calef, R. S., Murray, D. W., Shephard, D. C. & Norville, R. A. (1976). Positive and negative successive contrast effects following multiple shifts in reward magnitude under high drive and immediate reinforcement. Animal Learning and Behavior, 4, 480-484.
    • (1976) Animal Learning and Behavior , vol.4 , pp. 480-484
    • Maxwell, F.R.1    Calef, R.S.2    Murray, D.W.3    Shephard, D.C.4    Norville, R.A.5
  • 42
    • 0026932735 scopus 로고
    • Choice behavior in transition: Development of preference with ratio and interval schedules
    • Mazur, J. E. (1992). Choice behavior in transition: development of preference with ratio and interval schedules. Journal of Experimental Psychology: Animal Behavior Processes, 18, 364-378.
    • (1992) Journal of Experimental Psychology: Animal Behavior Processes , vol.18 , pp. 364-378
    • Mazur, J.E.1
  • 43
    • 0028899605 scopus 로고
    • Development of preference and spontaneous recovery in choice behavior with concurrent variable-interval schedules
    • Mazur, J. E. (1995). Development of preference and spontaneous recovery in choice behavior with concurrent variable-interval schedules. Animal Learning and Behavior, 23, 93-103.
    • (1995) Animal Learning and Behavior , vol.23 , pp. 93-103
    • Mazur, J.E.1
  • 44
    • 84986534149 scopus 로고
    • The effects of terminal-link fixed-interval and variable-interval schedules on responding under concurrent chained schedules
    • McEwen, D. (1972). The effects of terminal-link fixed-interval and variable-interval schedules on responding under concurrent chained schedules. Journal of the Experimental Analysis of Behavior, 18, 253-261.
    • (1972) Journal of the Experimental Analysis of Behavior , vol.18 , pp. 253-261
    • McEwen, D.1
  • 46
    • 0040009870 scopus 로고
    • The effects of differential rewards on discrimination reversal learning by monkeys
    • Meyer, D. R. (1951). The effects of differential rewards on discrimination reversal learning by monkeys. Journal of Experimental Psychology, 41, 268-274.
    • (1951) Journal of Experimental Psychology , vol.41 , pp. 268-274
    • Meyer, D.R.1
  • 48
    • 84937350040 scopus 로고
    • Steps toward artificial intelligence
    • Minsky, M. L. (1963). Steps toward artificial intelligence. Proceedings of the Institute of Radio Engineers, 49, 8-30, 1961. Reprinted in E. A. Feigenbaum & J. Feldman (Eds.), Computers and thought (pp. 406-450). New York: MacGraw- Hill.
    • (1963) Proceedings of the Institute of Radio Engineers , vol.49 , pp. 8-30
    • Minsky, M.L.1
  • 49
    • 0004242550 scopus 로고
    • Reprinted New York: MacGraw-Hill
    • Minsky, M. L. (1963). Steps toward artificial intelligence. Proceedings of the Institute of Radio Engineers, 49, 8-30, 1961. Reprinted in E. A. Feigenbaum & J. Feldman (Eds.), Computers and thought (pp. 406-450). New York: MacGraw-Hill.
    • (1961) Computers and Thought , pp. 406-450
    • Feigenbaum, E.A.1    Feldman, J.2
  • 51
    • 0002158173 scopus 로고
    • Behavioral momentum and the partial reinforcement effect
    • Nevin, J. A. (1988). Behavioral momentum and the partial reinforcement effect. Psychological Bulletin, 103, 44-56.
    • (1988) Psychological Bulletin , vol.103 , pp. 44-56
    • Nevin, J.A.1
  • 54
    • 0019089514 scopus 로고
    • A model for Pavlovian learning: Variations in the effectiveness of conditioned but not of unconditioned stimuli
    • Pearce, J. M., & Hall, G. (1980). A model for Pavlovian learning: Variations in the effectiveness of conditioned but not of unconditioned stimuli. Psychological Review, 87, 532-552.
    • (1980) Psychological Review , vol.87 , pp. 532-552
    • Pearce, J.M.1    Hall, G.2
  • 55
    • 0002109138 scopus 로고
    • A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement
    • A. H. Black & W. F. Prokasy (Eds.), New York: Appleton-Century-Crofts
    • Rescorla, R. A., & Wanger, A. R. (1972). A theory of Pavlovian conditioning: variations in the effectiveness of reinforcement and nonreinforcement. In A. H. Black & W. F. Prokasy (Eds.), Classical conditioning II: current research and theory. New York: Appleton-Century-Crofts.
    • (1972) Classical Conditioning II: Current Research and Theory
    • Rescorla, R.A.1    Wanger, A.R.2
  • 58
    • 0342861925 scopus 로고
    • Sequential variables as determiners of the rat's discrimination of reinforcement events: Effects of extinction performance
    • Rudy, J. W. (1971). Sequential variables as determiners of the rat's discrimination of reinforcement events: effects of extinction performance. Journal of Comparative Physiological Psychology, 77, 476-481.
    • (1971) Journal of Comparative Physiological Psychology , vol.77 , pp. 476-481
    • Rudy, J.W.1
  • 59
    • 0024392989 scopus 로고
    • The hippocampal formation is necessary for rats to learn and remember configural discriminations
    • Rudy, J. W., & Sutherland, R. J. (1989). The hippocampal formation is necessary for rats to learn and remember configural discriminations. Behavioral Brain Research, 34, 97-109.
    • (1989) Behavioral Brain Research , vol.34 , pp. 97-109
    • Rudy, J.W.1    Sutherland, R.J.2
  • 60
    • 0003297918 scopus 로고
    • Some studies in machine learning using the game of checkers
    • E. A. Feigenbaum & J. Feldman (Eds.), New York: McGraw-Hill
    • Samuel, A. L. (1963). Some studies in machine learning using the game of checkers. In E. A. Feigenbaum & J. Feldman (Eds.), Computers and thought. New York: McGraw-Hill. (Reprinted from IBM Journal on Research & Development, 1959, 3, 210- 229.)
    • (1963) Computers and Thought
    • Samuel, A.L.1
  • 61
    • 0001201756 scopus 로고
    • Reprinted
    • Samuel, A. L. (1963). Some studies in machine learning using the game of checkers. In E. A. Feigenbaum & J. Feldman (Eds.), Computers and thought. New York: McGraw-Hill. (Reprinted from IBM Journal on Research & Development, 1959, 3, 210-229.)
    • (1959) IBM Journal on Research & Development , vol.3 , pp. 210-229
  • 63
    • 0026844787 scopus 로고
    • Stimulus configuration, classical conditioning, and hippocampal function
    • Schmajuk, N. A., & DiCarlo, J. J. (1992). Stimulus configuration, classical conditioning, and hippocampal function. Psychological Review, 99, 268-305.
    • (1992) Psychological Review , vol.99 , pp. 268-305
    • Schmajuk, N.A.1    DiCarlo, J.J.2
  • 64
    • 0001938320 scopus 로고
    • Pavlovian control of operant behavior: An analysis of autoshaping and its implication for operant conditioning
    • W. K. Honig & J. E. R. Staddon (Eds.), Englewood Cliffs, NJ: Prentice Hall
    • Schwartz, B., & Gamzu, E. (1977). Pavlovian control of operant behavior: an analysis of autoshaping and its implication for operant conditioning. In W. K. Honig & J. E. R. Staddon (Eds.), Handbook of operant behavior. Englewood Cliffs, NJ: Prentice Hall.
    • (1977) Handbook of Operant Behavior
    • Schwartz, B.1    Gamzu, E.2
  • 66
    • 0000644781 scopus 로고
    • Are theories of learning necessary?
    • Skinner, B. F. (1950). Are theories of learning necessary? Psychological Review, 57, 193-216.
    • (1950) Psychological Review , vol.57 , pp. 193-216
    • Skinner, B.F.1
  • 67
    • 0001949203 scopus 로고
    • Neuronal models and the orienting reflex
    • M. A. B. Brazier (Ed.), New York: Josiah Macy Jr Foundation
    • Sokolov, E. N. (1960). Neuronal models and the orienting reflex. In M. A. B. Brazier (Ed.), The central nervous system and behavior, 3rd Conference (pp. 187-276). New York: Josiah Macy Jr Foundation.
    • (1960) The Central Nervous System and Behavior, 3rd Conference , pp. 187-276
    • Sokolov, E.N.1
  • 70
    • 0018120309 scopus 로고
    • Behavioral competition: A mechanism for schedule interactions
    • Staddon, J. E. R., & Hinson, J. M. (1978). Behavioral competition: a mechanism for schedule interactions. Science, 202, 432-434.
    • (1978) Science , vol.202 , pp. 432-434
    • Staddon, J.E.R.1    Hinson, J.M.2
  • 71
    • 0003227456 scopus 로고
    • On the assignment-of-credit problem in operant learning
    • M. L. Commons, S. Grossberg, & J. E. R. Staddon (Eds.), Hillsdale, N.J.: Lawrence Erlbaum
    • Staddon, J. E. R., & Zhang, Y. (1991). On the assignment-of-credit problem in operant learning. In M. L. Commons, S. Grossberg, & J. E. R. Staddon (Eds.), Neural network models of conditioning and action (pp. 279-393). Hillsdale, N.J.: Lawrence Erlbaum.
    • (1991) Neural Network Models of Conditioning and Action , pp. 279-393
    • Staddon, J.E.R.1    Zhang, Y.2
  • 72
    • 0019537951 scopus 로고
    • Toward a modern theory of adaptive networks: Expectation and prediction
    • Sutton, R. S., & Barto, A. G. (1981). Toward a modern theory of adaptive networks: expectation and prediction. Psychological Review, 88, 135-170.
    • (1981) Psychological Review , vol.88 , pp. 135-170
    • Sutton, R.S.1    Barto, A.G.2
  • 74
    • 0011488865 scopus 로고
    • The anatomical organization of septohippocampal projections
    • K. Elliot & J. Whelan (Eds.), Ciba Foundation Symposium 58 (New Series)
    • Swanson, L. W. (1978). The anatomical organization of septohippocampal projections. In K. Elliot & J. Whelan (Eds.), Functions of the septo-hippocampal system (pp. 25-43). Ciba Foundation Symposium 58 (New Series).
    • (1978) Functions of the Septo-hippocampal System , pp. 25-43
    • Swanson, L.W.1
  • 77
    • 0002534681 scopus 로고
    • Functional organization of the limbic system in the process of registration of information: Facts and hypotheses
    • R. L. Isaacson & K. H. Pribram (Eds.), New York: Plenum Press
    • Vinogradova, O. S. (1975). Functional organization of the limbic system in the process of registration of information: facts and hypotheses. In R. L. Isaacson & K. H. Pribram (Eds.), The hippocampus, v. 2, Neurophysiology and behavior (pp. 1-70). New York: Plenum Press.
    • (1975) The Hippocampus, V. 2, Neurophysiology and Behavior , vol.2 , pp. 1-70
    • Vinogradova, O.S.1
  • 78
    • 0009831594 scopus 로고
    • Acquisition and extinction of a partially reinforced running response at a 24-hour intertrial interval
    • Weinstock, S. (1958). Acquisition and extinction of a partially reinforced running response at a 24-hour intertrial interval. Journal of Experimental Psychology, 47, 151-158.
    • (1958) Journal of Experimental Psychology , vol.47 , pp. 151-158
    • Weinstock, S.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.