메뉴 건너뛰기




Volumn 5, Issue , 2015, Pages 43-50

Discovering latent causes in reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

ASSOCIATION; COGNITION; CONDITIONING; LATENT PERIOD; LEARNING; REINFORCEMENT; REVIEW; STIMULUS RESPONSE; THEORETICAL STUDY;

EID: 84938863250     PISSN: None     EISSN: 23521546     Source Type: Journal    
DOI: 10.1016/j.cobeha.2015.07.007     Document Type: Review
Times cited : (108)

References (62)
  • 3
    • 77952541839 scopus 로고    scopus 로고
    • Learning latent structure: carving nature at its joints
    • Gershman S.J., Niv Y. Learning latent structure: carving nature at its joints. Curr Opin Neurobiol 2010, 20:251-256.
    • (2010) Curr Opin Neurobiol , vol.20 , pp. 251-256
    • Gershman, S.J.1    Niv, Y.2
  • 4
    • 0030168518 scopus 로고    scopus 로고
    • Hidden state and reinforcement learning with instance-based state identification
    • McCallum R.A. Hidden state and reinforcement learning with instance-based state identification. IEEE Trans Syst Man Cybernet, Part B 1996, 26:464-473.
    • (1996) IEEE Trans Syst Man Cybernet, Part B , vol.26 , pp. 464-473
    • McCallum, R.A.1
  • 5
    • 0038517214 scopus 로고    scopus 로고
    • Equivalence notions and model minimization in Markov decision processes
    • Givan R., Dean T., Greig M. Equivalence notions and model minimization in Markov decision processes. Artif Intell 2003, 147:163-223.
    • (2003) Artif Intell , vol.147 , pp. 163-223
    • Givan, R.1    Dean, T.2    Greig, M.3
  • 6
    • 27344443125 scopus 로고    scopus 로고
    • Finding approximate POMDP solutions through belief compression
    • Roy N., Gordon G.J., Thrun S. Finding approximate POMDP solutions through belief compression. J Artif Intell Res 2005, 23:1-40.
    • (2005) J Artif Intell Res , vol.23 , pp. 1-40
    • Roy, N.1    Gordon, G.J.2    Thrun, S.3
  • 7
    • 33746365099 scopus 로고    scopus 로고
    • Bayesian theories of conditioning in a changing world
    • Courville A.C., Daw N.D., Touretzky D.S. Bayesian theories of conditioning in a changing world. Trends Cogn Sci 2006, 10:294-300.
    • (2006) Trends Cogn Sci , vol.10 , pp. 294-300
    • Courville, A.C.1    Daw, N.D.2    Touretzky, D.S.3
  • 8
    • 74049117596 scopus 로고    scopus 로고
    • Context, learning, and extinction
    • Gershman D.J., Blei D.M., Niv Y. Context, learning, and extinction. Psychol Rev 2010, 117:197-209.
    • (2010) Psychol Rev , vol.117 , pp. 197-209
    • Gershman, D.J.1    Blei, D.M.2    Niv, Y.3
  • 9
    • 84870955069 scopus 로고    scopus 로고
    • Exploring a latent cause theory of classical conditioning
    • Gershman S.J., Niv Y. Exploring a latent cause theory of classical conditioning. Learn Behav 2012, 40:255-268.
    • (2012) Learn Behav , vol.40 , pp. 255-268
    • Gershman, S.J.1    Niv, Y.2
  • 10
    • 84912099627 scopus 로고    scopus 로고
    • Statistical computations underlying the dynamics of memory updating
    • Gershman D.J., Radulescu A., Norman K.A., Niv Y. Statistical computations underlying the dynamics of memory updating. PLOS Comput Biol 2014, 10:e1003939.
    • (2014) PLOS Comput Biol , vol.10 , pp. e1003939
    • Gershman, D.J.1    Radulescu, A.2    Norman, K.A.3    Niv, Y.4
  • 11
    • 84938207414 scopus 로고    scopus 로고
    • Individual differences in learning predict the return of fear
    • Gershman S.J., Hartley C. Individual differences in learning predict the return of fear. Learn Behav 2015, 43:243-250.
    • (2015) Learn Behav , vol.43 , pp. 243-250
    • Gershman, S.J.1    Hartley, C.2
  • 12
    • 84881274323 scopus 로고    scopus 로고
    • Context-dependent decision-making: a simple Bayesian model
    • Lloyd K., Leslie D.S. Context-dependent decision-making: a simple Bayesian model. J R Soc Interface 2013, 10:20130069.
    • (2013) J R Soc Interface , vol.10 , pp. 20130069
    • Lloyd, K.1    Leslie, D.S.2
  • 13
    • 84905482314 scopus 로고    scopus 로고
    • Explaining compound generalization in associative and causal learning through rational principles of dimensional generalization
    • Soto F.A., Gershman S.J., Niv Y. Explaining compound generalization in associative and causal learning through rational principles of dimensional generalization. Psychol Rev 2014, 121:526-558.
    • (2014) Psychol Rev , vol.121 , pp. 526-558
    • Soto, F.A.1    Gershman, S.J.2    Niv, Y.3
  • 15
    • 0002109138 scopus 로고
    • A theory of Pavlovian conditioning: variations in the effectiveness of reinforcement and nonreinforcement
    • Appleton-Century-Crofts, New York, NY, A.H. Black, W.F. Prokasy (Eds.)
    • Rescorla R.A., Wagner A.R. A theory of Pavlovian conditioning: variations in the effectiveness of reinforcement and nonreinforcement. Classical Conditioning II: Current Research and Theory 1972, 64-99. Appleton-Century-Crofts, New York, NY. A.H. Black, W.F. Prokasy (Eds.).
    • (1972) Classical Conditioning II: Current Research and Theory , pp. 64-99
    • Rescorla, R.A.1    Wagner, A.R.2
  • 16
    • 4644334535 scopus 로고    scopus 로고
    • Context and behavioral processes in extinction
    • Bouton M.E. Context and behavioral processes in extinction. Learn Memory 2004, 11:485-494.
    • (2004) Learn Memory , vol.11 , pp. 485-494
    • Bouton, M.E.1
  • 17
    • 0037219380 scopus 로고    scopus 로고
    • Simplicity: a unifying principle in cognitive science?
    • Chater N., Vitányi P. Simplicity: a unifying principle in cognitive science?. Trends Cogn Sci 2003, 7:19-22.
    • (2003) Trends Cogn Sci , vol.7 , pp. 19-22
    • Chater, N.1    Vitányi, P.2
  • 18
    • 84885362378 scopus 로고    scopus 로고
    • Perceptual estimation obeys Occam's razor
    • Gershman S.J., Niv Y. Perceptual estimation obeys Occam's razor. Front Psychol 2013, 4.
    • (2013) Front Psychol , vol.4
    • Gershman, S.J.1    Niv, Y.2
  • 19
    • 84857235576 scopus 로고    scopus 로고
    • A tutorial on Bayesian nonparametric models
    • Gershman S.J., Blei D.M. A tutorial on Bayesian nonparametric models. J Math Psychol 2012, 56:1-12.
    • (2012) J Math Psychol , vol.56 , pp. 1-12
    • Gershman, S.J.1    Blei, D.M.2
  • 21
    • 4644282212 scopus 로고    scopus 로고
    • Spontaneous recovery
    • Rescorla R.A. Spontaneous recovery. Learn Memory 2004, 11:501-509.
    • (2004) Learn Memory , vol.11 , pp. 501-509
    • Rescorla, R.A.1
  • 22
    • 0016417721 scopus 로고
    • Reinstatement of fear to an extinguished conditioned stimulus
    • Rescorla R.A., Heth C.D. Reinstatement of fear to an extinguished conditioned stimulus. J Exp Psychol: Anim Behav Process 1975, 1:88-96.
    • (1975) J Exp Psychol: Anim Behav Process , vol.1 , pp. 88-96
    • Rescorla, R.A.1    Heth, C.D.2
  • 23
    • 0018526860 scopus 로고
    • Role of conditioned contextual stimuli in reinstatement of extinguished fear
    • Bouton M.E., Bolles R.C. Role of conditioned contextual stimuli in reinstatement of extinguished fear. J Exp Psychol: Anim Behav Process 1979, 5:368-378.
    • (1979) J Exp Psychol: Anim Behav Process , vol.5 , pp. 368-378
    • Bouton, M.E.1    Bolles, R.C.2
  • 24
    • 33749343760 scopus 로고    scopus 로고
    • Retrieval failure versus memory loss in experimental amnesia: definitions and processes
    • Miller R.R., Matzel L.D. Retrieval failure versus memory loss in experimental amnesia: definitions and processes. Learn Memory 2006, 13:491-497.
    • (2006) Learn Memory , vol.13 , pp. 491-497
    • Miller, R.R.1    Matzel, L.D.2
  • 25
    • 0030798078 scopus 로고    scopus 로고
    • Adaptation to gradual as compared with sudden visuo-motor distortions
    • Kagerer F.A., Contreras-Vidal J.L., Stelmach G.E. Adaptation to gradual as compared with sudden visuo-motor distortions. Exp Brain Res 1997, 115:557-561.
    • (1997) Exp Brain Res , vol.115 , pp. 557-561
    • Kagerer, F.A.1    Contreras-Vidal, J.L.2    Stelmach, G.E.3
  • 27
    • 83055179235 scopus 로고    scopus 로고
    • Trial-by-trial analysis of intermanual transfer during visuomotor adaptation
    • Taylor J.A., Wojaczynski G.J., Ivry R.B. Trial-by-trial analysis of intermanual transfer during visuomotor adaptation. J Neurophysiol 2011, 106:3157-3172.
    • (2011) J Neurophysiol , vol.106 , pp. 3157-3172
    • Taylor, J.A.1    Wojaczynski, G.J.2    Ivry, R.B.3
  • 28
    • 0035836774 scopus 로고    scopus 로고
    • Effects of temporal association on recognition memory
    • Wallis G., Bülthoff H.H. Effects of temporal association on recognition memory. Proc Natl Acad Sci U S A 2001, 98:4800-4804.
    • (2001) Proc Natl Acad Sci U S A , vol.98 , pp. 4800-4804
    • Wallis, G.1    Bülthoff, H.H.2
  • 29
    • 33847607971 scopus 로고    scopus 로고
    • The effects of perceptual history on memory of visual objects
    • Preminger S., Sagi D., Tsodyks M. The effects of perceptual history on memory of visual objects. Vis Res 2007, 47:965-973.
    • (2007) Vis Res , vol.47 , pp. 965-973
    • Preminger, S.1    Sagi, D.2    Tsodyks, M.3
  • 31
    • 0023233591 scopus 로고
    • A model for stimulus generalization in Pavlovian conditioning
    • Pearce J.M. A model for stimulus generalization in Pavlovian conditioning. Psychol Rev 1987, 94:61-73.
    • (1987) Psychol Rev , vol.94 , pp. 61-73
    • Pearce, J.M.1
  • 32
    • 0037316490 scopus 로고    scopus 로고
    • Context-sensitive elemental theory
    • Wagner A.R. Context-sensitive elemental theory. Q J Exp Psychol: Sect B 2003, 56:7-29.
    • (2003) Q J Exp Psychol: Sect B , vol.56 , pp. 7-29
    • Wagner, A.R.1
  • 33
    • 33746335663 scopus 로고    scopus 로고
    • Elemental representations of stimuli in associative learning
    • Harris J.A. Elemental representations of stimuli in associative learning. Psychol. Rev. 2006, 113:584-605.
    • (2006) Psychol. Rev. , vol.113 , pp. 584-605
    • Harris, J.A.1
  • 34
    • 39449115677 scopus 로고    scopus 로고
    • Stimulus coding in human associative learning: flexible representations of parts and wholes
    • Melchers K.G., Shanks D.R., Lachnit H. Stimulus coding in human associative learning: flexible representations of parts and wholes. Behav Process 2008, 77:413-427.
    • (2008) Behav Process , vol.77 , pp. 413-427
    • Melchers, K.G.1    Shanks, D.R.2    Lachnit, H.3
  • 35
    • 0028526748 scopus 로고
    • Similarity and discrimination: a selective review and a connectionist model
    • Pearce J.M. Similarity and discrimination: a selective review and a connectionist model. Psychol Rev 1994, 101:587-607.
    • (1994) Psychol Rev , vol.101 , pp. 587-607
    • Pearce, J.M.1
  • 36
    • 0028009937 scopus 로고
    • Summation and configuration between and within sensory modalities in classical conditioning of the rabbit
    • Kehoe E.J., Horne A.J., Horne P.S., Macrae M. Summation and configuration between and within sensory modalities in classical conditioning of the rabbit. Anim Learn Behav 1994, 22:19-26.
    • (1994) Anim Learn Behav , vol.22 , pp. 19-26
    • Kehoe, E.J.1    Horne, A.J.2    Horne, P.S.3    Macrae, M.4
  • 38
    • 0036527899 scopus 로고    scopus 로고
    • Spatial separation of target and competitor cues enhances blocking of human causality judgements
    • Glautier S. Spatial separation of target and competitor cues enhances blocking of human causality judgements. Q J Exp Psychol: Sect B 2002, 55:121-135.
    • (2002) Q J Exp Psychol: Sect B , vol.55 , pp. 121-135
    • Glautier, S.1
  • 39
    • 7544244209 scopus 로고    scopus 로고
    • Outcome additivity, elemental processing and blocking in human causality judgements
    • Livesey E.J., Boakes R.A. Outcome additivity, elemental processing and blocking in human causality judgements. Q J Exp Psychol: Sect B 2004, 57:361-379.
    • (2004) Q J Exp Psychol: Sect B , vol.57 , pp. 361-379
    • Livesey, E.J.1    Boakes, R.A.2
  • 40
    • 0023223978 scopus 로고
    • Toward a universal law of generalization for psychological science
    • Shepard R.N. Toward a universal law of generalization for psychological science. Science 1987, 237:1317-1323.
    • (1987) Science , vol.237 , pp. 1317-1323
    • Shepard, R.N.1
  • 41
    • 0035735751 scopus 로고    scopus 로고
    • Generalization, similarity, and Bayesian inference
    • Tenenbaum J.B., Griffiths T.L. Generalization, similarity, and Bayesian inference. Behav Brain Sci 2001, 24:629-640.
    • (2001) Behav Brain Sci , vol.24 , pp. 629-640
    • Tenenbaum, J.B.1    Griffiths, T.L.2
  • 42
    • 0000705894 scopus 로고
    • The adaptive nature of human categorization
    • Anderson J.R. The adaptive nature of human categorization. Psychol Rev 1991, 98:409-429.
    • (1991) Psychol Rev , vol.98 , pp. 409-429
    • Anderson, J.R.1
  • 43
    • 67349278780 scopus 로고    scopus 로고
    • A Bayesian framework for word segmentation: exploring the effects of context
    • Goldwater S., Griffiths T.L., Johnson M. A Bayesian framework for word segmentation: exploring the effects of context. Cognition 2009, 112:21-54.
    • (2009) Cognition , vol.112 , pp. 21-54
    • Goldwater, S.1    Griffiths, T.L.2    Johnson, M.3
  • 44
    • 78249247078 scopus 로고    scopus 로고
    • Rational approximations to rational models: alternative algorithms for category learning
    • Sanborn A.N., Griffiths T.L., Navarro D.J. Rational approximations to rational models: alternative algorithms for category learning. Psychol Rev 2010, 117:1144-1167.
    • (2010) Psychol Rev , vol.117 , pp. 1144-1167
    • Sanborn, A.N.1    Griffiths, T.L.2    Navarro, D.J.3
  • 46
    • 84881089217 scopus 로고    scopus 로고
    • Cognitive control over learning: creating, clustering, and generalizing task-set structure
    • Collins A.G.E., Frank M.J. Cognitive control over learning: creating, clustering, and generalizing task-set structure. Psychol Rev 2013, 120:190-229.
    • (2013) Psychol Rev , vol.120 , pp. 190-229
    • Collins, A.G.E.1    Frank, M.J.2
  • 47
    • 84897881519 scopus 로고    scopus 로고
    • Human EEG uncovers latent generalizable rule structure during learning
    • Collins A.G.E., Cavanagh J.F., Frank M.J. Human EEG uncovers latent generalizable rule structure during learning. J Neurosci 2014, 34:4677-4685.
    • (2014) J Neurosci , vol.34 , pp. 4677-4685
    • Collins, A.G.E.1    Cavanagh, J.F.2    Frank, M.J.3
  • 48
    • 36849079932 scopus 로고    scopus 로고
    • Context learning in the rodent hippocampus
    • Fuhs M.C., Touretzky D.S. Context learning in the rodent hippocampus. Neural Comput 2007, 19:3173-3215.
    • (2007) Neural Comput , vol.19 , pp. 3173-3215
    • Fuhs, M.C.1    Touretzky, D.S.2
  • 50
    • 84892714870 scopus 로고    scopus 로고
    • Orbitofrontal cortex as a cognitive map of task space
    • Wilson R.C., Takahashi Y.K., Schoenbaum G., Niv Y. Orbitofrontal cortex as a cognitive map of task space. Neuron 2014, 81:267-279.
    • (2014) Neuron , vol.81 , pp. 267-279
    • Wilson, R.C.1    Takahashi, Y.K.2    Schoenbaum, G.3    Niv, Y.4
  • 52
    • 34848885477 scopus 로고    scopus 로고
    • Hippocampal involvement in contextual modulation of fear extinction
    • Ji J., Maren S. Hippocampal involvement in contextual modulation of fear extinction. Hippocampus 2007, 17:749-758.
    • (2007) Hippocampus , vol.17 , pp. 749-758
    • Ji, J.1    Maren, S.2
  • 53
    • 84888031491 scopus 로고    scopus 로고
    • Erasing the engram: the unlearning of procedural skills
    • Crossley M.J., Ashby F.G., Maddox W.T. Erasing the engram: the unlearning of procedural skills. J Exp Psychol: Gen 2013, 142:710-741.
    • (2013) J Exp Psychol: Gen , vol.142 , pp. 710-741
    • Crossley, M.J.1    Ashby, F.G.2    Maddox, W.T.3
  • 54
    • 84880031738 scopus 로고    scopus 로고
    • The thalamostriatal pathway and cholinergic control of goal-directed action: interlacing new with existing learning in the striatum
    • Bradfield L.A., Bertran-Gonzalez J., Chieng B., Balleine B.W. The thalamostriatal pathway and cholinergic control of goal-directed action: interlacing new with existing learning in the striatum. Neuron 2013, 79:153-166.
    • (2013) Neuron , vol.79 , pp. 153-166
    • Bradfield, L.A.1    Bertran-Gonzalez, J.2    Chieng, B.3    Balleine, B.W.4
  • 55
    • 84880008689 scopus 로고    scopus 로고
    • How did the chicken cross the road? With her striatal cholinergic interneurons, of course
    • Schoenbaum G., Stalnaker T.A., Niv Y. How did the chicken cross the road? With her striatal cholinergic interneurons, of course. Neuron 2013, 79:3-6.
    • (2013) Neuron , vol.79 , pp. 3-6
    • Schoenbaum, G.1    Stalnaker, T.A.2    Niv, Y.3
  • 57
    • 79955846155 scopus 로고    scopus 로고
    • The Indian buffet process: an introduction and review
    • Griffiths T.L., Ghahramani Z. The Indian buffet process: an introduction and review. J Mach Learn Res 2011, 12:1185-1224.
    • (2011) J Mach Learn Res , vol.12 , pp. 1185-1224
    • Griffiths, T.L.1    Ghahramani, Z.2
  • 58
    • 80052852204 scopus 로고    scopus 로고
    • A rational model of the effects of distributional information on feature learning
    • Austerweil J.L., Griffiths T.L. A rational model of the effects of distributional information on feature learning. Cogn Psychol 2011, 63:173-209.
    • (2011) Cogn Psychol , vol.63 , pp. 173-209
    • Austerweil, J.L.1    Griffiths, T.L.2
  • 59
    • 84887557472 scopus 로고    scopus 로고
    • A nonparametric Bayesian framework for constructing flexible feature representations
    • Austerweil J.L., Griffiths T.L. A nonparametric Bayesian framework for constructing flexible feature representations. Psychol Rev 2013, 120:817-851.
    • (2013) Psychol Rev , vol.120 , pp. 817-851
    • Austerweil, J.L.1    Griffiths, T.L.2
  • 60
    • 84857965362 scopus 로고    scopus 로고
    • A rational analysis of the acquisition of multisensory representations
    • Yildirim I., Jacobs R.A. A rational analysis of the acquisition of multisensory representations. Cogn Sci 2012, 36:305-332.
    • (2012) Cogn Sci , vol.36 , pp. 305-332
    • Yildirim, I.1    Jacobs, R.A.2
  • 62
    • 70449640182 scopus 로고    scopus 로고
    • Uncovering mental representations with Markov chain Monte Carlo
    • Sanborn A.N., Griffiths T.L., Shiffrin R.M. Uncovering mental representations with Markov chain Monte Carlo. Cogn Psychol 2010, 60:63-106.
    • (2010) Cogn Psychol , vol.60 , pp. 63-106
    • Sanborn, A.N.1    Griffiths, T.L.2    Shiffrin, R.M.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.