SCOPUS Information Retrieval Platform

Proceedings of the International Joint Conference on Neural Networks

Volume , Issue , 2010, Pages

Reinforcement learning with a Gaussian mixture model

(2) Agostini, Alejandro a Celaya, Enric a

a INSTITUT DE ROBÒTICA I INFORMÀTICA INDUSTRIAL CSIC UPC (Spain)

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATIONAL EFFICIENCY; GAUSSIAN DISTRIBUTION; GAUSSIAN NOISE (ELECTRONIC); ITERATIVE METHODS; PROBABILITY DENSITY FUNCTION;

FITTED VALUE ITERATION; FUNCTIONS APPROXIMATIONS; GAUSSIAN MIXTURE MODEL; GAUSSIAN PROCESSES; ITERATIVE PROCESS; NUMBER OF SAMPLES; REINFORCEMENT LEARNING WITH FUNCTION APPROXIMATIONS; REINFORCEMENT LEARNINGS; VALUE FUNCTIONS; VALUE ITERATION ALGORITHM;

REINFORCEMENT LEARNING;

EID: 79959391832 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/IJCNN.2010.5596306 Document Type: Conference Paper

Times cited : (31)

References (25)

1
- 0000439527
- Optimal global rates of convergence for nonparametric regression
- C. Stone, "Optimal global rates of convergence for nonparametric regression," The Annals of Statistics, vol. 10, no. 4, pp. 1040-1053, 1982.
- (1982) The Annals of Statistics , vol.10 , Issue.4 , pp. 1040-1053
- Stone, C.¹

2
- 84899026055
- Gaussian processes in reinforcement learning
- C. Rasmussen and M. Kuss, "Gaussian processes in reinforcement learning," Advances in Neural Information Processing Systems, vol. 16, pp. 751-759, 2004.
- (2004) Advances in Neural Information Processing Systems , vol.16 , pp. 751-759
- Rasmussen, C.¹ Kuss, M.²

3
- 61849173491
- Gaussian process dynamic programming
- M. Deisenroth, C. Rasmussen, and J. Peters, "Gaussian process dynamic programming," Neurocomputing, vol. 72, no. 7-9, pp. 1508-1524, 2009.
- (2009) Neurocomputing , vol.72 , Issue.7-9 , pp. 1508-1524
- Deisenroth, M.¹ Rasmussen, C.² Peters, J.³

4
- 31844451013
- Reinforcement learning with Gaussian processes
- New York, NY, USA: ACM
- Y. Engel, S. Mannor, and R. Meir, "Reinforcement learning with Gaussian processes," in ICML '05: Proceedings of the 22nd international conference on Machine learning. New York, NY, USA: ACM, 2005, pp. 201-208.
- (2005) ICML '05: Proceedings of the 22nd International Conference on Machine Learning , pp. 201-208
- Engel, Y.¹ Mannor, S.² Meir, R.³

5
- 1942421151
- Bayes meets Bellman: The Gaussian process approach to temporal difference learning
- Y. Engel, S. Mannor, and R. Meir, "Bayes meets Bellman: The Gaussian process approach to temporal difference learning," in Proc. of the 20th International Conference on Machine Learning, 2003, pp. 154-161.
- Proc. of the 20th International Conference on Machine Learning, 2003 , pp. 154-161
- Engel, Y.¹ Mannor, S.² Meir, R.³

6
- 84880694195
- Stable function approximation in dynamic programming
- G. J. Gordon, "Stable function approximation in dynamic programming," in ICML, 1995, pp. 261-268.
- (1995) ICML , pp. 261-268
- Gordon, G.J.¹

7
- 21844465127
- Tree-based batch mode reinforcement learning
- D. Ernst, P. Geurts, and L. Wehenkel, "Tree-based batch mode reinforcement learning," J. Mach. Learn. Res., vol. 6, pp. 503-556, 2005.
- (2005) J. Mach. Learn. Res. , vol.6 , pp. 503-556
- Ernst, D.¹ Geurts, P.² Wehenkel, L.³

8
- 0036832956
- Kernel-based reinforcement learning
- D. Ormoneit and S. Sen, "Kernel-based reinforcement learning," Machine Learning, vol. 49, no. 2-3, pp. 161-178, 2002.
- (2002) Machine Learning , vol.49 , Issue.2-3 , pp. 161-178
- Ormoneit, D.¹ Sen, S.²

9
- 27944453854
- Neural Reinforcement Learning to Swing-up and Balance a Real Pole
- M. Riedmiller, "Neural Reinforcement Learning to Swing-up and Balance a Real Pole," in Proceedings of the 2005 IEEE International Conference on Systems, Man and Cybernetics, vol. 4, 2005, pp. 3191-3196.
- (2005) Proceedings of the 2005 IEEE International Conference on Systems, Man and Cybernetics , vol.4 , pp. 3191-3196
- Riedmiller, M.¹

10
- 33646398129
- Neural fitted Q iteration - first experiences with a data efficient neural reinforcement learning method
- M. Riedmiller, "Neural fitted Q iteration - first experiences with a data efficient neural reinforcement learning method," Lecture Notes in Computer Science, vol. 3720, pp. 317-328, 2005.
- (2005) Lecture Notes in Computer Science , vol.3720 , pp. 317-328
- Riedmiller, M.¹

11
- 0004102479
- Cambridge, MA: MIT Press
- R. Sutton and A. Barto, Reinforcement Learning: An Introduction. Cambridge, MA: MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, A.²

12
- 34249833101
- Q-learning
- C. Watkins and P. Dayan, "Q-learning," Machine Learning, vol. 8, no. 3-4, pp. 279-292, 1992. [Online]. Available: http://jmvidal.cse.sc.edu/library/watkins92a.pdf
- (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 279-292
- Watkins, C.¹ Dayan, P.²

13
- 84954213764
- Princeton, New Jersey: Princeton University Press
- R. Bellman and S. Dreyfus, Applied Dynamic Programming. Princeton, New Jersey: Princeton University Press, 1962.
- (1962) Applied Dynamic Programming
- Bellman, R.¹ Dreyfus, S.²

14
- 33846516584
- Secaucus, NJ, USA: Springer-Verlag New York, Inc.
- C. M. Bishop, Pattern Recognition and Machine Learning (Information Science and Statistics). Secaucus, NJ, USA: Springer-Verlag New York, Inc., 2006.
- (2006) Pattern Recognition and Machine Learning (Information Science and Statistics)
- Bishop, C.M.¹

15
- 0141571972
- On Gaussian radial basis function approximations: Interpretation, extensions, and learning strategies
- M. Figueiredo, "On Gaussian radial basis function approximations: Interpretation, extensions, and learning strategies," Pattern Recognition, International Conference on, vol. 2, pp. 618-621, 2000.
- (2000) Pattern Recognition, International Conference on , vol.2 , pp. 618-621
- Figueiredo, M.¹

16
- 0002629270
- Maximum likelihood from incomplete data via the EM algorithm
- A. Dempster, N. Laird, D. Rubin, et al., "Maximum likelihood from incomplete data via the EM algorithm," Journal of the Royal Statistical Society. Series B (Methodological), vol. 39, no. 1, pp. 1-38, 1977.
- (1977) Journal of the Royal Statistical Society. Series B (Methodological) , vol.39 , Issue.1 , pp. 1-38
- Dempster, A.¹ Laird, N.² Rubin, D.³

17
- 0003922190
- New York, USA: John Wiley and Sons, Inc
- R. O. Duda, P. E. Hart, and D. G. Stork, Pattern Classification. New York, USA: John Wiley and Sons, Inc, 2001.
- (2001) Pattern Classification
- Duda, R.O.¹ Hart, P.E.² Stork, D.G.³

18
- 27544498086
- Highly efficient incremental estimation of Gaussian mixture models for online data stream clustering
- M. Song and H. Wang, "Highly efficient incremental estimation of Gaussian mixture models for online data stream clustering," in Proceedings of SPIE: Intelligent Computing: Theory and Applications III, Orlando, FL, USA, 2005, pp. 174-183.
- Proceedings of SPIE: Intelligent Computing: Theory and Applications III, Orlando, FL, USA, 2005 , pp. 174-183
- Song, M.¹ Wang, H.²

19
- 78649651429
- Incremental learning of temporally-coherent Gaussian mixture models
- O. Arandjelovic and R. Cipolla, "Incremental learning of temporally-coherent Gaussian mixture models," in Technical Papers - Society of Manufacturing Engineers (SME), 2005.
- (2005) Technical Papers - Society of Manufacturing Engineers (SME)
- Arandjelovic, O.¹ Cipolla, R.²

20
- 0034131785
- On-line EM algorithm for the normalized Gaussian network
- M.-A. Sato and S. Ishii, "On-line EM algorithm for the normalized Gaussian network," Neural Comput., vol. 12, no. 2, pp. 407-432, 2000.
- (2000) Neural Comput. , vol.12 , Issue.2 , pp. 407-432
- Sato, M.-A.¹ Ishii, S.²

21
- 0003541323
- Ph.D. dissertation, Pittsburgh, PA, USA
- S. J. Nowlan, "Soft competitive adaptation: neural network learning algorithms based on fitting statistical mixtures," Ph.D. dissertation, Pittsburgh, PA, USA, 1991.
- (1991) Soft Competitive Adaptation: Neural Network Learning Algorithms Based on Fitting Statistical Mixtures
- Nowlan, S.J.¹

22
- 0002788893
- A view of the EM algorithm that justifies incremental, sparse, and other variants
- Norwell, MA, USA: Kluwer Academic Publishers
- R. Neal and G. Hinton, "A view of the EM algorithm that justifies incremental, sparse, and other variants," in Proceedings of the NATO Advanced Study Institute on Learning in Graphical Models. Norwell, MA, USA: Kluwer Academic Publishers, 1998, pp. 355-368.
- (1998) Proceedings of the NATO Advanced Study Institute on Learning in Graphical Models , pp. 355-368
- Neal, R.¹ Hinton, G.²

23
- 0002210775
- The role of exploration in learning control
- D. White and D. Sofge, Eds. Florence, Kentucky 41022: Van Nostrand Reinhold
- S. Thrun, "The role of exploration in learning control," in Handbook for Intelligent Control: Neural, Fuzzy and Adaptive Approaches, D. White and D. Sofge, Eds. Florence, Kentucky 41022: Van Nostrand Reinhold, 1992.
- (1992) Handbook for Intelligent Control: Neural, Fuzzy and Adaptive Approaches
- Thrun, S.¹

24
- 0033629916
- Reinforcement learning in continuous time and space
- K. Doya, "Reinforcement learning in continuous time and space," Neural Comput., vol. 12, no. 1, pp. 219-245, 2000.
- (2000) Neural Comput. , vol.12 , Issue.1 , pp. 219-245
- Doya, K.¹

25
- 0002997066
- Reinforcement learning based on on-line EM algorithm
- Cambridge, MA, USA: MIT Press
- M.-A. Sato and S. Ishii, "Reinforcement learning based on on-line EM algorithm," in Proceedings of the 1998 Conference on Advances in Neural Information Processing Systems (NIPS'99). Cambridge, MA, USA: MIT Press, 1999, pp. 1052-1058.
- (1999) Proceedings of the 1998 Conference on Advances in Neural Information Processing Systems (NIPS'99) , pp. 1052-1058
- Sato, M.-A.¹ Ishii, S.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.