메뉴 건너뛰기




Volumn 4, Issue DEC, 2010, Pages

A reinforcement learning model of precommitment in decision making

Author keywords

Addiction; Decision making; Delay discounting; Hyperbolic discounting; Impulsivity; Precommitment; Reinforcement learning

Indexed keywords


EID: 84856116217     PISSN: 16625153     EISSN: None     Source Type: Journal    
DOI: 10.3389/fnbeh.2010.00184     Document Type: Article
Times cited : (24)

References (58)
  • 1
    • 84987341352 scopus 로고
    • Impulse control in pigeons
    • Ainslie, G. (1974). Impulse control in pigeons. J. Exp. Anal. Behav. 21, 485-489.
    • (1974) J. Exp. Anal. Behav. , vol.21 , pp. 485-489
    • Ainslie, G.1
  • 2
    • 0016525445 scopus 로고
    • Specious reward: A behavioral theory of impulsiveness and impulse control
    • Ainslie, G. (1975). Specious reward: a behavioral theory of impulsiveness and impulse control. Psychol. Bull. 82, 463-496.
    • (1975) Psychol. Bull. , vol.82 , pp. 463-496
    • Ainslie, G.1
  • 3
    • 0003934109 scopus 로고
    • Cambridge: Cambridge University Press
    • Ainslie, G. (1992). Picoeconomics. Cambridge: Cambridge University Press.
    • (1992) Picoeconomics
    • Ainslie, G.1
  • 4
    • 0004245883 scopus 로고    scopus 로고
    • Cambridge: Cambridge University Press
    • Ainslie, G. (2001). Breakdown of Will. Cambridge: Cambridge University Press.
    • (2001) Breakdown of Will
    • Ainslie, G.1
  • 5
    • 77953497419 scopus 로고    scopus 로고
    • Hyperbolically discounted temporal difference learning
    • Alexander, W. H., and Brown, J. W. (2010). Hyperbolically discounted temporal difference learning. Neural. Comput. 22, 1511-1527.
    • (2010) Neural. Comput. , vol.22 , pp. 1511-1527
    • Alexander, W.H.1    Brown, J.W.2
  • 6
    • 0002610737 scopus 로고
    • On a routing problem
    • Bellman, R. (1958). On a routing problem. Q. J. Appl. Math. 16, 87-90.
    • (1958) Q. J. Appl. Math. , vol.16 , pp. 87-90
    • Bellman, R.1
  • 7
    • 16244384595 scopus 로고    scopus 로고
    • Addiction and cue-triggered decision processes
    • Bernheim, B. D., and Rangel, A. (2004). Addiction and cue-triggered decision processes. Am. Econ. Rev. 94, 1558-1590.
    • (2004) Am. Econ. Rev. , vol.94 , pp. 1558-1590
    • Bernheim, B.D.1    Rangel, A.2
  • 8
    • 34249890522 scopus 로고    scopus 로고
    • Behavioral and neuroeconomics of drug addiction: Competing neural systems and temporal discounting processes
    • Bickel, W. K., Miller, M. L., Yi, R., Kowal, B. P., Lindquist, D. M., and Pitcock, J. A. (2007). Behavioral and neuroeconomics of drug addiction: competing neural systems and temporal discounting processes. Drug Alcohol Depend. 90, S85-S91.
    • (2007) Drug Alcohol Depend. , vol.90
    • Bickel, W.K.1    Miller, M.L.2    Yi, R.3    Kowal, B.P.4    Lindquist, D.M.5    Pitcock, J.A.6
  • 9
    • 0032743148 scopus 로고    scopus 로고
    • Impulsivity and cigarette smoking: Delay discounting in current, never, and ex-smokers
    • Bickel, W. K., Odum, A. L., and Madden, G. J. (1999). Impulsivity and cigarette smoking: delay discounting in current, never, and ex-smokers. Psychopharmacology (Berl.) 146, 447-454.
    • (1999) Psychopharmacology (Berl.) , vol.146 , pp. 447-454
    • Bickel, W.K.1    Odum, A.L.2    Madden, G.J.3
  • 10
    • 0037328955 scopus 로고    scopus 로고
    • Impulsivity and rapid discounting of delayed hypothetical rewards in cocainedependent individuals
    • Coffey, S. F., Gudleski, G. D., Saladin, M. E., and Brady, K. T. (2003). Impulsivity and rapid discounting of delayed hypothetical rewards in cocainedependent individuals. Exp. Clin. Psychopharmacol. 11, 18-25.
    • (2003) Exp. Clin. Psychopharmacol. , vol.11 , pp. 18-25
    • Coffey, S.F.1    Gudleski, G.D.2    Saladin, M.E.3    Brady, K.T.4
  • 12
    • 0033722074 scopus 로고    scopus 로고
    • Behavioral considerations suggest an average reward TD model of the dopamine system
    • Daw, N. D., and Touretzky, D. S. (2000). Behavioral considerations suggest an average reward TD model of the dopamine system. Neurocomputing 32-33, 679-684.
    • (2000) Neurocomputing , vol.32-33 , pp. 679-684
    • Daw, N.D.1    Touretzky, D.S.2
  • 13
    • 0036849790 scopus 로고    scopus 로고
    • Acute administration of d-amphetamine decreases impulsivity in healthy volunteers
    • de Wit, H., Enggasser, J. L., and Richards, J. B. (2002). Acute administration of d-amphetamine decreases impulsivity in healthy volunteers. Neuropsychopharmacology 27, 813-825.
    • (2002) Neuropsychopharmacology , vol.27 , pp. 813-825
    • de Wit, H.1    Enggasser, J.L.2    Richards, J.B.3
  • 15
    • 33947219085 scopus 로고    scopus 로고
    • Contextual control of delay discounting by pathological gamblers
    • Dixon, M. R., Jacobs, E. A., and Sanders, S. (2006). Contextual control of delay discounting by pathological gamblers. J. Appl. Behav. Anal. 39, 413-422.
    • (2006) J. Appl. Behav. Anal. , vol.39 , pp. 413-422
    • Dixon, M.R.1    Jacobs, E.A.2    Sanders, S.3
  • 16
    • 33644787305 scopus 로고    scopus 로고
    • Impulsivity in abstinent early-and late-onset alcoholics: Differences in self-report measures and a discounting task
    • Dom, G., D'haene, P., Hulstijn, W., and Sabbe, B. (2006). Impulsivity in abstinent early-and late-onset alcoholics: differences in self-report measures and a discounting task. Addiction 101, 50-59.
    • (2006) Addiction , vol.101 , pp. 50-59
    • Dom, G.1    D'haene, P.2    Hulstijn, W.3    Sabbe, B.4
  • 17
    • 0033213819 scopus 로고    scopus 로고
    • What are the computations of the cerebellum, the basal ganglia, and the cerebral cortex?
    • Doya, K. (1999). What are the computations of the cerebellum, the basal ganglia, and the cerebral cortex? Neural Netw. 12, 961-974.
    • (1999) Neural Netw. , vol.12 , pp. 961-974
    • Doya, K.1
  • 18
    • 84892580762 scopus 로고
    • Precommitment, prohibition, and the problem of dissent
    • Dripps, D. A. (1993). Precommitment, prohibition, and the problem of dissent. J. Legal Stud. 22, 255-263.
    • (1993) J. Legal Stud. , vol.22 , pp. 255-263
    • Dripps, D.A.1
  • 19
    • 0032694028 scopus 로고    scopus 로고
    • Varieties of impulsivity
    • Evenden, J. L. (1999). Varieties of impulsivity. Psychopharmacology 146, 348-361.
    • (1999) Psychopharmacology , vol.146 , pp. 348-361
    • Evenden, J.L.1
  • 21
    • 0041109884 scopus 로고    scopus 로고
    • Time discounting and time preference: A critical review
    • Frederick, S., Loewenstein, G., and O'Donoghue, T. (2002). Time discounting and time preference: a critical review. J. Econ. Lit. 40, 351-401.
    • (2002) J. Econ. Lit. , vol.40 , pp. 351-401
    • Frederick, S.1    Loewenstein, G.2    O'Donoghue, T.3
  • 22
    • 77953260848 scopus 로고    scopus 로고
    • States versus rewards: Dissociable neural prediction error signals underlying model-based and model-free reinforcement learning
    • Gläscher, J., Daw, N., Dayan, P., and O'Doherty, J. P. (2010). States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning. Neuron 66, 585-595.
    • (2010) Neuron , vol.66 , pp. 585-595
    • Gläscher, J.1    Daw, N.2    Dayan, P.3    O'Doherty, J.P.4
  • 24
    • 33644688754 scopus 로고    scopus 로고
    • Dopamine neurons report an error in the temporal prediction of reward during learning
    • Hollerman, J. R., and Schultz, W. (1998). Dopamine neurons report an error in the temporal prediction of reward during learning. Nat. Neurosci. 1, 304-309.
    • (1998) Nat. Neurosci. , vol.1 , pp. 304-309
    • Hollerman, J.R.1    Schultz, W.2
  • 25
    • 33846626965 scopus 로고    scopus 로고
    • Switching from automatic to controlled action by monkey medial frontal cortex
    • Isoda, M., and Hikosaka, O. (2007). Switching from automatic to controlled action by monkey medial frontal cortex. Nat. Neurosci. 10, 240-248.
    • (2007) Nat. Neurosci. , vol.10 , pp. 240-248
    • Isoda, M.1    Hikosaka, O.2
  • 26
    • 36048937548 scopus 로고    scopus 로고
    • Neural ensembles in CA3 transiently encode paths forward of the animal at a decision point
    • Johnson, A., and Redish, A. D. (2007). Neural ensembles in CA3 transiently encode paths forward of the animal at a decision point. J. Neurosci. 27, 12176-12189.
    • (2007) J. Neurosci. , vol.27 , pp. 12176-12189
    • Johnson, A.1    Redish, A.D.2
  • 28
    • 36448986837 scopus 로고    scopus 로고
    • The neural correlates of subjective value during intertemporal choice
    • Kable, J. W., and Glimcher, P. W. (2007). The neural correlates of subjective value during intertemporal choice. Nat. Neurosci. 10, 1625-1633.
    • (2007) Nat. Neurosci , vol.10 , pp. 1625-1633
    • Kable, J.W.1    Glimcher, P.W.2
  • 29
    • 70349138795 scopus 로고    scopus 로고
    • The neurobiology of decision: Consensus and controversy
    • Kable, J. W., and Glimcher, P. W. (2009). The neurobiology of decision: consensus and controversy. Neuron 63, 733-745.
    • (2009) Neuron , vol.63 , pp. 733-745
    • Kable, J.W.1    Glimcher, P.W.2
  • 30
    • 0018896989 scopus 로고
    • Cognitive-behavioral treatment for impulsivity: Concrete versus conceptual training in non-self-controlled problem children
    • Kendall, P. C., and Wilcox, L. E. (1980). Cognitive-behavioral treatment for impulsivity: concrete versus conceptual training in non-self-controlled problem children. J. Consult. Clin. Psychol. 48, 80-91.
    • (1980) J. Consult. Clin. Psychol. , vol.48 , pp. 80-91
    • Kendall, P.C.1    Wilcox, L.E.2
  • 31
    • 0000737101 scopus 로고
    • Stationary ordinal utility and impatience
    • Koopmans, T. C. (1960). Stationary ordinal utility and impatience. Econometrica 28, 287-309.
    • (1960) Econometrica , vol.28 , pp. 287-309
    • Koopmans, T.C.1
  • 32
    • 70449382577 scopus 로고    scopus 로고
    • Temporal-difference reinforcement learning with distributed representations
    • doi:10.1371/ journal.pone.0007362
    • Kurth-Nelson, Z., and Redish, A. D. (2009). Temporal-difference reinforcement learning with distributed representations. PLoS One 4:e7362. doi:10.1371/ journal.pone.0007362.
    • (2009) PLoS One , vol.4
    • Kurth-Nelson, Z.1    Redish, A.D.2
  • 33
    • 0026505520 scopus 로고
    • Responses of monkey dopamine neurons during learning of behavioral reactions
    • Ljungberg, T., Apicella, P., and Schultz, W. (1992). Responses of monkey dopamine neurons during learning of behavioral reactions. J. Neurophysiol. 67, 145-163.
    • (1992) J. Neurophysiol. , vol.67 , pp. 145-163
    • Ljungberg, T.1    Apicella, P.2    Schultz, W.3
  • 35
    • 0030789031 scopus 로고    scopus 로고
    • Impulsive and self-control choices in opioid-dependent patients and non-drug-using control patients: Drug and monetary rewards
    • Madden, G. J., Petry, N. M., Badger, G. J., and Bickford, W. K. (1997). Impulsive and self-control choices in opioid-dependent patients and non-drug-using control patients: drug and monetary rewards. Exp. Clin. Psychopharmacol. 5, 256-262.
    • (1997) Exp. Clin. Psychopharmacol. , vol.5 , pp. 256-262
    • Madden, G.J.1    Petry, N.M.2    Badger, G.J.3    Bickford, W.K.4
  • 37
    • 0034059348 scopus 로고    scopus 로고
    • Effect of central 5-hydroxytryptamine depletion on inter-temporal choice: A quantitative analysis
    • Mobini, S., Chiang, T. J., Al-Ruwaitea, A. S., Ho, M. Y., Bradshaw, C. M., and Szabadi, E. (2000). Effect of central 5-hydroxytryptamine depletion on inter-temporal choice: a quantitative analysis. Psychopharmacology 149, 313-318.
    • (2000) Psychopharmacology , vol.149 , pp. 313-318
    • Mobini, S.1    Chiang, T.J.2    Al-Ruwaitea, A.S.3    Ho, M.Y.4    Bradshaw, C.M.5    Szabadi, E.6
  • 38
    • 0029981543 scopus 로고    scopus 로고
    • A framework for mesencephalic dopamine systems based on predictive Hebbian learning
    • Montague, P. R., Dayan, P., and Sejnowski, T. J. (1996). A framework for mesencephalic dopamine systems based on predictive Hebbian learning. J. Neurosci. 16, 1936-1947.
    • (1996) J. Neurosci. , vol.16 , pp. 1936-1947
    • Montague, P.R.1    Dayan, P.2    Sejnowski, T.J.3
  • 39
    • 0024947710 scopus 로고
    • Cognitive impulsivity training: The effects of peer teaching
    • Nelson, W. M., and Behler, J. J. (1989). Cognitive impulsivity training: the effects of peer teaching. J. Behav. Ther. Exp. Psychiatry 20, 303-309.
    • (1989) J. Behav. Ther. Exp. Psychiatry , vol.20 , pp. 303-309
    • Nelson, W.M.1    Behler, J.J.2
  • 40
    • 0038957677 scopus 로고    scopus 로고
    • Doing it now or later
    • O'Donoghue, T., and Rabin, M. (1999). Doing it now or later. Am. Econ. Rev. 89, 103-124.
    • (1999) Am. Econ. Rev. , vol.89 , pp. 103-124
    • O'Donoghue, T.1    Rabin, M.2
  • 41
    • 77951929653 scopus 로고    scopus 로고
    • Episodic future thinking reduces reward delay discounting through an enhancement of prefrontal-mediotemporal interactions
    • Peters, J., and Büchel, C. (2010). Episodic future thinking reduces reward delay discounting through an enhancement of prefrontal-mediotemporal interactions. Neuron 66, 138-148.
    • (2010) Neuron , vol.66 , pp. 138-148
    • Peters, J.1    Büchel, C.2
  • 42
    • 84986470330 scopus 로고
    • Commitment, choice, and self-control
    • Rachlin, H., and Green, L. (1972). Commitment, choice, and self-control. J. Exp. Anal. Behav. 17, 15-22.
    • (1972) J. Exp. Anal. Behav. , vol.17 , pp. 15-22
    • Rachlin, H.1    Green, L.2
  • 43
    • 48349092693 scopus 로고    scopus 로고
    • A unified framework for addiction: Vulnerabilities in the decision process
    • Redish, A. D., Jensen, S., and Johnson, A. (2008). A unified framework for addiction: vulnerabilities in the decision process. Behav. Brain Sci. 31, 415-437.
    • (2008) Behav. Brain Sci. , vol.31 , pp. 415-437
    • Redish, A.D.1    Jensen, S.2    Johnson, A.3
  • 44
    • 30344431832 scopus 로고    scopus 로고
    • Dimensions of impulsive behavior: Personality and behavioral measures
    • Reynolds, B., Ortengren, A., Richards, J. B., and Wit, H. D. (2006). Dimensions of impulsive behavior: Personality and behavioral measures. Pers. Individ. Dif. 40, 305-315.
    • (2006) Pers. Individ. Dif. , vol.40 , pp. 305-315
    • Reynolds, B.1    Ortengren, A.2    Richards, J.B.3    Wit, H.D.4
  • 46
    • 0346706306 scopus 로고    scopus 로고
    • One hundred years of forgetting: A quantitative description of retention
    • Rubin, D. C., and Wenzel, A. E. (1996). One hundred years of forgetting: a quantitative description of retention. Psychol. Rev. 103, 734-760.
    • (1996) Psychol. Rev. , vol.103 , pp. 734-760
    • Rubin, D.C.1    Wenzel, A.E.2
  • 47
    • 28144449057 scopus 로고    scopus 로고
    • Representation of action-specific reward values in the striatum
    • Samejima, K., Ueda, Y., Doya, K., and Kimura, M. (2005). Representation of action-specific reward values in the striatum. Science 310, 1337-1340.
    • (2005) Science , vol.310 , pp. 1337-1340
    • Samejima, K.1    Ueda, Y.2    Doya, K.3    Kimura, M.4
  • 48
    • 84963107372 scopus 로고
    • A note on measurement of utility
    • Samuelson, P. A. (1937). A note on measurement of utility. Rev. Econ. Stud. 4, 155-161.
    • (1937) Rev. Econ. Stud. , vol.4 , pp. 155-161
    • Samuelson, P.A.1
  • 50
    • 0032558817 scopus 로고    scopus 로고
    • On hyperbolic discounting and uncertain hazard rates
    • Sozou, P. D. (1998). On hyperbolic discounting and uncertain hazard rates. Proc. Biol. Sci. 265, 2015-2020.
    • (1998) Proc. Biol. Sci. , vol.265 , pp. 2015-2020
    • Sozou, P.D.1
  • 51
    • 84963071606 scopus 로고
    • Myopia and inconsistency in dynamic utility maximization
    • Strotz, R. H. (1955). Myopia and inconsistency in dynamic utility maximization. Rev. Econ. Stud. 23, 165-180.
    • (1955) Rev. Econ. Stud. , vol.23 , pp. 165-180
    • Strotz, R.H.1
  • 53
    • 3343026029 scopus 로고    scopus 로고
    • Prediction of immediate and future rewards differentially recruits cortico-basal ganglia loops
    • Tanaka, S. C., Doya, K., Okada, G., Ueda, K., Okamoto, Y., and Yamawaki, S. (2004). Prediction of immediate and future rewards differentially recruits cortico-basal ganglia loops. Nat. Neurosci. 7, 887-893.
    • (2004) Nat. Neurosci. , vol.7 , pp. 887-893
    • Tanaka, S.C.1    Doya, K.2    Okada, G.3    Ueda, K.4    Okamoto, Y.5    Yamawaki, S.6
  • 54
    • 41149171114 scopus 로고    scopus 로고
    • Serotonin differentially regulates short-and long-term prediction of rewards in the ventral and dorsal striatum
    • doi: 10.1371/journal. pone.0001333
    • Tanaka, S. C., Schweighofer, N., Asahi, S., Shishida, K., Okamoto, Y., Yamawaki, S., and Doya, K. (2007). Serotonin differentially regulates short-and long-term prediction of rewards in the ventral and dorsal striatum. PLoS One 2:e1333. doi: 10.1371/journal. pone.0001333.
    • (2007) PLoS One , vol.2
    • Tanaka, S.C.1    Schweighofer, N.2    Asahi, S.3    Shishida, K.4    Okamoto, Y.5    Yamawaki, S.6    Doya, K.7
  • 55
    • 0025412207 scopus 로고
    • A cognitive model of drug urges and drug-use behavior: Role of automatic and nonautomatic processes
    • Tiffany, S. T. (1990). A cognitive model of drug urges and drug-use behavior: role of automatic and nonautomatic processes. Psychol. Rev. 97, 147-168.
    • (1990) Psychol. Rev. , vol.97 , pp. 147-168
    • Tiffany, S.T.1
  • 56
    • 33846933461 scopus 로고    scopus 로고
    • Reward value coding distinct from risk attitude-related uncertainty coding in human reward systems
    • Tobler, P. N., O'Doherty, J. P., Dolan, R. J., and Schultz, W. (2007). Reward value coding distinct from risk attitude-related uncertainty coding in human reward systems. J. Neurophysiol. 97, 1621-1632.
    • (2007) J. Neurophysiol , vol.97 , pp. 1621-1632
    • Tobler, P.N.1    O'Doherty, J.P.2    Dolan, R.J.3    Schultz, W.4
  • 57
    • 0033221519 scopus 로고    scopus 로고
    • Average cost temporal-difference learning
    • Tsitsiklis, J. N., and Van Roy, B. (1999). Average cost temporal-difference learning. Automatica 35, 1799-1808.
    • (1999) Automatica , vol.35 , pp. 1799-1808
    • Tsitsiklis, J.N.1    Van Roy, B.2
  • 58
    • 43249121628 scopus 로고    scopus 로고
    • Making choices impairs subsequent self-control: A limited-resource account of decision making, self-regulation, and active initiative
    • Vohs, K. D., Baumeister, R. F., Schmeichel, B. J., Twenge, J. M., Nelson, N. M., and Tice, D. M. (2008). Making choices impairs subsequent self-control: a limited-resource account of decision making, self-regulation, and active initiative. J. Pers. Soc. Psychol. 94, 883-898.
    • (2008) J. Pers. Soc. Psychol. , vol.94 , pp. 883-898
    • Vohs, K.D.1    Baumeister, R.F.2    Schmeichel, B.J.3    Twenge, J.M.4    Nelson, N.M.5    Tice, D.M.6


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.