Volume 2015-January, 2015, Pages 3528-3536

Gradient estimation using stochastic computation graphs

Author keywords

[No Author keywords available]

Indexed keywords

BACKPROPAGATION ALGORITHMS; DIRECTED GRAPHS; ESTIMATION; INFORMATION SCIENCE; LEARNING ALGORITHMS; PROBABILITY DISTRIBUTIONS; REINFORCEMENT LEARNING; STOCHASTIC MODELS

EID: 84965157716     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 421

References (29)
  • 5
    • 84976859194
    • Likelihood ratio gradient estimation for stochastic systems
    • P. W. Glynn. Likelihood ratio gradient estimation for stochastic systems. Communications of the ACM, 33(10):75-84, 1990.
    • (1990) Communications of the ACM , vol.33 , Issue.10 , pp. 75-84
    • Glynn, P.W.1
  • 12
    • 0032203257
    • Gradient-based learning applied to document recognition
    • Y. Le Cun, L. Bottou, Y. Bengio, and P. Haffner. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278-2324, 1998.
    • (1998) Proceedings of the IEEE , vol.86 , Issue.11 , pp. 2278-2324
    • Le Cun, Y.1    Bottou, L.2    Bengio, Y.3    Haffner, P.4
  • 18
    • 0002788893
    • A view of the EM algorithm that justifies incremental, sparse, and other variants
    • Springer
    • R. M. Neal and G. E. Hinton. A view of the EM algorithm that justifies incremental, sparse, and other variants. In Learning in Graphical Models, pages 355-368. Springer, 1998.
    • (1998) Learning in Graphical Models , pp. 355-368
    • Neal, R.M.1    Hinton, G.E.2
  • 23
    • 84898939480
    • Policy gradient methods for reinforcement learning with function approximation
    • Citeseer
    • R. S. Sutton, D. A. McAllester, S. P. Singh, Y. Mansour, et al. Policy gradient methods for reinforcement learning with function approximation. In NIPS, volume 99, pages 1057-1063. Citeseer, 1999.
    • (1999) NIPS , vol.99 , pp. 1057-1063
    • Sutton, R.S.1    McAllester, D.A.2    Singh, S.P.3    Mansour, Y.4
  • 24
    • 70349327392
    • Learning model-free robot control by a Monte Carlo EM algorithm
    • N. Vlassis, M. Toussaint, G. Kontes, and S. Piperidis. Learning model-free robot control by a Monte Carlo EM algorithm. Autonomous Robots, 27(2):123-130, 2009.
    • (2009) Autonomous Robots , vol.27 , Issue.2 , pp. 123-130
    • Vlassis, N.1    Toussaint, M.2    Kontes, G.3    Piperidis, S.4
  • 26
    • 0000337576
    • Simple statistical gradient-following algorithms for connectionist reinforcement learning
    • R. J. Williams. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning, 8(3-4):229-256, 1992.
    • (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 229-256
    • Williams, R.J.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.