메뉴 건너뛰기




Volumn , Issue , 2007, Pages 457-464

Bayesian policy gradient algorithms

Author keywords

[No Author keywords available]

Indexed keywords

BAYESIAN FRAMEWORKS; GAUSSIAN PROCESSES; GRADIENT ESTIMATES; MONTE CARLO; NATURAL GRADIENT; NUMBER OF SAMPLES; PARAMETERIZED; POLICY GRADIENT; POLICY GRADIENT METHODS;

EID: 84864065133     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (57)

References (16)
  • 1
    • 0000337576 scopus 로고
    • Simple statistical gradient-following algorithms for connectionist reinforcement learning
    • R. Williams. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning, 8:229-256, 1992.
    • (1992) Machine Learning , vol.8 , pp. 229-256
    • Williams, R.1
  • 3
    • 0013535965 scopus 로고    scopus 로고
    • Infinite-horizon policy-gradient estimation
    • J. Baxter and P. Bartlett. Infinite-horizon policy-gradient estimation. JAIR, 15:319-350, 2001.
    • (2001) JAIR , vol.15 , pp. 319-350
    • Baxter, J.1    Bartlett, P.2
  • 4
    • 84898939480 scopus 로고    scopus 로고
    • Policy gradient methods for reinforcement learning with function approximation
    • R. Sutton, D. McAllester, S. Singh, and Y. Mansour. Policy gradient methods for reinforcement learning with function approximation. In Proceedings of NIPS 12, pages 1057-1063, 2000.
    • (2000) Proceedings of NIPS , vol.12 , pp. 1057-1063
    • Sutton, R.1    McAllester, D.2    Singh, S.3    Mansour, Y.4
  • 5
    • 84898930479 scopus 로고    scopus 로고
    • A natural policy gradient
    • S. Kakade. A natural policy gradient. In Proceedings of NIPS 14, 2002.
    • (2002) Proceedings of NIPS , vol.14
    • Kakade, S.1
  • 9
    • 0013521749 scopus 로고
    • Monte Carlo is fundamentally unsound
    • A. O'Hagan. Monte Carlo is fundamentally unsound. The Statistician, 36:247-249, 1987.
    • (1987) The Statistician , vol.36 , pp. 247-249
    • O'Hagan, A.1
  • 13
    • 0002853450 scopus 로고    scopus 로고
    • Exploiting generative models in discriminative classifiers
    • MIT Press
    • T. Jaakkola and D. Haussler. Exploiting generative models in discriminative classifiers. In Proceedings of NIPS 11. MIT Press, 1998.
    • (1998) Proceedings of NIPS , vol.11
    • Jaakkola, T.1    Haussler, D.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.