SCOPUS Information Search Platform

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volume 6792 LNCS, Issue PART 2, 2011, Pages 221-228

Improving Gaussian process value function approximation in policy gradient algorithms

(2) Jakab, Hunor a,b Csató, Lehel a,b

a BABEȘ-BOLYAI UNIVERSITY (Romania)

b EÖTVÖS LORÁND UNIVERSITY (Hungary)

Author keywords

control problems; Gaussian processes; policy gradient methods; Reinforcement learning; value function estimation

Indexed keywords

BASIS VECTOR; CONTINUOUS DOMAIN; CONTINUOUS STATE-ACTION SPACES; CONTROL PROBLEMS; DISTANCE-BASED; DYNAMIC SYSTEM CONTROL; GAUSSIAN PROCESS REGRESSION; GAUSSIAN PROCESSES; GRADIENT BASED; GRADIENT ESTIMATES; GRADIENT VARIANCE; KULLBACK-LEIBLER DISTANCE; POLICY GRADIENT; POLICY GRADIENT METHODS; POLICY SEARCH; TIME-DEPENDENT FACTORS; VALUE FUNCTION APPROXIMATION; VALUE FUNCTIONS; VALUE-BASED; WILLIAMS;

APPROXIMATION THEORY; CONVERGENCE OF NUMERICAL METHODS; ESTIMATION; GAUSSIAN DISTRIBUTION; GAUSSIAN NOISE (ELECTRONIC); GRADIENT METHODS; LEARNING ALGORITHMS; NEURAL NETWORKS; PROCESS CONTROL; REINFORCEMENT LEARNING;

APPROXIMATION ALGORITHMS;

EID: 79959344344 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-642-21738-8_29 Document Type: Conference Paper

Times cited : (3)

References (15)

1
- 84898958374
- Gradient descent for general reinforcement learning
- Kearns, M.S., Solla, S.A., Cohn, D.A. (eds.) NIPS 1998. MIT Press, Cambridge
- Baird, L., Moore, A.: Gradient descent for general reinforcement learning. In: Kearns, M.S., Solla, S.A., Cohn, D.A. (eds.) NIPS 1998. Advances in Neural Information Processing Systems, vol. 11, pp. 968-974. MIT Press, Cambridge (1998)
- (1998) Advances in Neural Information Processing Systems , vol.11 , pp. 968-974
- Baird, L.¹ Moore, A.²

2
- 14344257510
- PhD thesis, Neural Computing Research Group
- Csató, L.: Gaussian Processes - Iterative Sparse Approximation. PhD thesis, Neural Computing Research Group (2002)
- (2002) Gaussian Processes - Iterative Sparse Approximation
- Csató, L.¹

3
- 84898947911
- Sparse representation for Gaussian process models
- Leen, T.K., Dietterich, T.G., Tresp, V. (eds.) MIT Press, Cambridge
- Csató, L., Opper, M.: Sparse representation for Gaussian process models. In: Leen, T.K., Dietterich, T.G., Tresp, V. (eds.) NIPS, vol. 13, pp. 444-450. MIT Press, Cambridge (2001)
- (2001) NIPS , vol.13 , pp. 444-450
- Csató, L.¹ Opper, M.²

4
- 61849173491
- Gaussian process dynamic programming
- Deisenroth, M.P., Rasmussen, C.E., Peters, J.: Gaussian process dynamic programming. Neurocomputing 72(7-9), 1508-1524 (2009)
- (2009) Neurocomputing , vol.72 , Issue.7-9 , pp. 1508-1524
- Deisenroth, M.P.¹ Rasmussen, C.E.² Peters, J.³

5
- 31844451013
- Reinforcement learning with Gaussian processes
- New York
- Engel, Y., Mannor, S., Meir, R.: Reinforcement learning with Gaussian processes. In: Proceedings of the 22nd International Conference on Machine Learning, pp. 201-208, New York (2005)
- (2005) Proceedings of the 22nd International Conference on Machine Learning , pp. 201-208
- Engel, Y.¹ Mannor, S.² Meir, R.³

6
- 77956930777
- Importance sampling for continuous time Bayesian networks
- Fan, Y., Xu, J., Shelton, C.R.: Importance sampling for continuous time Bayesian networks. Journal of Machine Learning Research 11, 2115-2140 (2010)
- (2010) Journal of Machine Learning Research , vol.11 , pp. 2115-2140
- Fan, Y.¹ Xu, J.² Shelton, C.R.³

7
- 84864065133
- Bayesian policy gradient algorithms
- Schölkopf, B., Platt, J., Hoffman, T. (eds.) NIPS 2007, MIT Press, Cambridge
- Ghavamzadeh, M., Engel, Y.: Bayesian policy gradient algorithms. In: Schölkopf, B., Platt, J., Hoffman, T. (eds.) NIPS 2007, Advances in Neural Information Processing Systems, vol. 19, pp. 457-464. MIT Press, Cambridge (2007)
- (2007) Advances in Neural Information Processing Systems , vol.19 , pp. 457-464
- Ghavamzadeh, M.¹ Engel, Y.²

8
- 79959332051
- Using Gaussian processes for variance reduction in policy gradient algorithms
- Jakab, H.S., Csató, L.: Using Gaussian processes for variance reduction in policy gradient algorithms. In: 8th International Conference on Applied Informatics, Eger, pp. 55-63 (2010)
- (2010) 8th International Conference on Applied Informatics, Eger , pp. 55-63
- Jakab, H.S.¹ Csató, L.²

9
- 44949241322
- Reinforcement learning of motor skills with policy gradients
- Peters, J., Schaal, S.: Reinforcement learning of motor skills with policy gradients. Neural Networks 21(4), 682-697 (2008)
- (2008) Neural Networks , vol.21 , Issue.4 , pp. 682-697
- Peters, J.¹ Schaal, S.²

10
- 85102627959
- John Wiley & Sons, New York
- Puterman, M.L.: Markov Decision Processes: Discrete Stochastic Dynamic Programming. John Wiley & Sons, New York (1994)
- (1994) Markov Decision Processes: Discrete Stochastic Dynamic Programming
- Puterman, M.L.¹

11
- 84899026055
- Gaussian processes in reinforcement learning
- Saul, L.K., Thrun, S., Schölkopf, B. (eds.) NIPS 2003, MIT Press, Cambridge
- Rasmussen, C.E., Kuss, M.: Gaussian processes in reinforcement learning. In: Saul, L.K., Thrun, S., Schölkopf, B. (eds.) NIPS 2003, Advances in Neural Information Processing Systems, pp. 751-759. MIT Press, Cambridge (2004)
- (2004) Advances in Neural Information Processing Systems , pp. 751-759
- Rasmussen, C.E.¹ Kuss, M.²

12
- 25444448065
- MIT Press, Cambridge
- Rasmussen, C.E., Williams, C.: Gaussian Processes for Machine Learning. MIT Press, Cambridge (2006)
- (2006) Gaussian Processes for Machine Learning
- Rasmussen, C.E.¹ Williams, C.²

13
- 52949143093
- Geodesic gaussian kernels for value function approximation
- Sugiyama, M., Hachiya, H., Towell, C., Vijayakumar, S.: Geodesic gaussian kernels for value function approximation. Auton. Robots 25, 287-304 (2008)
- (2008) Auton. Robots , vol.25 , pp. 287-304
- Sugiyama, M.¹ Hachiya, H.² Towell, C.³ Vijayakumar, S.⁴

14
- 84898939480
- Policy gradient methods for reinforcement learning with function approximation
- Solla, S.A., Leen, T.K., Müller, K.R. (eds.) NIPS 1999, MIT Press, Cambridge
- Sutton, R.S., McAllester, D.A., Singh, S.P., Mansour, Y.: Policy gradient methods for reinforcement learning with function approximation. In: Solla, S.A., Leen, T.K., Müller, K.R. (eds.) NIPS 1999, Advances in Neural Information Processing Systems, pp. 1057-1063. MIT Press, Cambridge (1999)
- (1999) Advances in Neural Information Processing Systems , pp. 1057-1063
- Sutton, R.S.¹ McAllester, D.A.² Singh, S.P.³ Mansour, Y.⁴

15
- 0000337576
- Simple statistical gradient-following algorithms for connectionist reinforcement learning
- Williams, R.J.: Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning 8, 229-256 (1992)
- (1992) Machine Learning , vol.8 , pp. 229-256
- Williams, R.J.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.