메뉴 건너뛰기




Volumn , Issue , 2016, Pages

Prioritized experience replay

Author keywords

[No Author keywords available]

Indexed keywords

INTELLIGENT AGENTS; LEARNING ALGORITHMS; MACHINE LEARNING;

EID: 85083953310     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (1478)

References (37)
  • 2
    • 84940795121 scopus 로고    scopus 로고
    • Memory trace replay: The shaping of memory consolidation by neuromodulation
    • Atherton, Laura A, Dupret, David, and Mellor, Jack R. Memory trace replay: the shaping of memory consolidation by neuromodulation. Trends in neurosciences, 38(9):560–570, 2015.
    • (2015) Trends in Neurosciences , vol.38 , Issue.9 , pp. 560-570
    • Atherton, L.A.1    Dupret, D.2    Mellor, J.R.3
  • 5
    • 84888340666 scopus 로고    scopus 로고
    • Torch7: A matlab-like environment for machine learning
    • number EPFL-CONF-192376
    • Collobert, Ronan, Kavukcuoglu, Koray, and Farabet, Clément. Torch7: A matlab-like environment for machine learning. In BigLearn, NIPS Workshop, number EPFL-CONF-192376, 2011.
    • (2011) BigLearn, NIPS Workshop
    • Collobert, R.1    Kavukcuoglu, K.2    Farabet, C.3
  • 6
    • 21844491206 scopus 로고
    • Zebras and the Anna Karenina principle
    • Diamond, Jared. Zebras and the Anna Karenina principle. Natural History, 103:4–4, 1994.
    • (1994) Natural History , vol.103 , pp. 4
    • Diamond, J.1
  • 8
    • 33645458694 scopus 로고    scopus 로고
    • Reverse replay of behavioural sequences in hippocampal place cells during the awake state
    • Foster, David J and Wilson, Matthew A. Reverse replay of behavioural sequences in hippocampal place cells during the awake state. Nature, 440(7084):680–683, 2006.
    • (2006) Nature , vol.440 , Issue.7084 , pp. 680-683
    • Foster, D.J.1    Wilson, M.A.2
  • 11
    • 84937779024 scopus 로고    scopus 로고
    • Deep learning for real-time atari game play using offline Monte-carlo tree search planning
    • Ghahra-mani, Z., Welling, M., Cortes, C., Lawrence, and Weinberger, K.Q. (eds), Curran Associates, Inc
    • Guo, Xiaoxiao, Singh, Satinder, Lee, Honglak, Lewis, Richard L, and Wang, Xiaoshi. Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning. In Ghahra-mani, Z., Welling, M., Cortes, C., Lawrence, N.D., and Weinberger, K.Q. (eds.), Advances in Neural Information Processing Systems 27, pp. 3338–3346. Curran Associates, Inc., 2014.
    • (2014) Advances in Neural Information Processing Systems , vol.27 , pp. 3338-3346
    • Guo, X.1    Singh, S.2    Lee, H.3    Lewis, R.L.4    Wang, X.5
  • 12
    • 34848816179 scopus 로고    scopus 로고
    • To recognize shapes, first learn to generate images
    • Hinton, Geoffrey E. To recognize shapes, first learn to generate images. Progress in brain research, 165:535–547, 2007.
    • (2007) Progress in Brain Research , vol.165 , pp. 535-547
    • Hinton, G.E.1
  • 13
    • 85083951076 scopus 로고    scopus 로고
    • ADaM: A method for stochastic optimization
    • Kingma, Diederik P. and Ba, Jimmy. Adam: A method for stochastic optimization. CoRR, abs/1412.6980, 2014.
    • (2014) CoRR
    • Kingma, D.P.1    Ba, J.2
  • 14
    • 0032203257 scopus 로고    scopus 로고
    • Gradient-based learning applied to document recognition
    • Nov
    • Lecun, Y., Bottou, L., Bengio, Y., and Haffner, P. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278–2324, Nov 1998. ISSN 0018-9219. doi: 10.1109/5.726791.
    • (1998) Proceedings of the IEEE , vol.86 , Issue.11 , pp. 2278-2324
    • Lecun, Y.1    Bottou, L.2    Bengio, Y.3    Haffner, P.4
  • 16
    • 0000123778 scopus 로고
    • Self-improving reactive agents based on reinforcement learning, planning and teaching
    • Lin, Long-Ji. Self-improving reactive agents based on reinforcement learning, planning and teaching. Machine learning, 8(3-4):293–321, 1992.
    • (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 293-321
    • Lin, L.-J.1
  • 21
    • 0027684215 scopus 로고
    • Prioritized sweeping: Reinforcement learning with less data and less time
    • Moore, Andrew W and Atkeson, Christopher G. Prioritized sweeping: Reinforcement learning with less data and less time. Machine Learning, 13(1):103–130, 1993.
    • (1993) Machine Learning , vol.13 , Issue.1 , pp. 103-130
    • Moore, A.W.1    Atkeson, C.G.2
  • 24
    • 84937060789 scopus 로고    scopus 로고
    • Hippocampal place cells construct reward related sequences through unexplored space
    • Ólafsdóttir, H Freyja, Barry, Caswell, Saleem, Aman B, Hassabis, Demis, and Spiers, Hugo J. Hippocampal place cells construct reward related sequences through unexplored space. Elife, 4: e06063, 2015.
    • (2015) Elife , vol.4
    • Ólafsdóttir, H.F.1    Barry, C.2    Saleem, A.B.3    Hassabis, D.4    Spiers, H.J.5
  • 26
    • 0031082536 scopus 로고    scopus 로고
    • New methods for competitive coevolution
    • Rosin, Christopher D and Belew, Richard K. New methods for competitive coevolution. Evolutionary Computation, 5(1):1–29, 1997.
    • (1997) Evolutionary Computation , vol.5 , Issue.1 , pp. 1-29
    • Rosin, C.D.1    Belew, R.K.2
  • 29
    • 72149101860 scopus 로고    scopus 로고
    • Rewarded outcomes enhance reactivation of experience in the hippocampus
    • Singer, Annabelle C and Frank, Loren M. Rewarded outcomes enhance reactivation of experience in the hippocampus. Neuron, 64(6):910–921, 2009.
    • (2009) Neuron , vol.64 , Issue.6 , pp. 910-921
    • Singer, A.C.1    Frank, L.M.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.