-
1
-
-
0000439527
-
Optimal global rates of convergence for nonparametric regression
-
C. Stone, "Optimal global rates of convergence for nonparametric regression," The Annals of Statistics, vol. 10, no. 4, pp. 1040-1053, 1982.
-
(1982)
The Annals of Statistics
, vol.10
, Issue.4
, pp. 1040-1053
-
-
Stone, C.1
-
3
-
-
61849173491
-
Gaussian process dynamic programming
-
M. Diesenroth, C. Rasmussen, and J. Peters, "Gaussian process dynamic programming," Neurocomputing, vol. 72, no. 7-9, pp. 1508-1524, 2009.
-
(2009)
Neurocomputing
, vol.72
, Issue.7-9
, pp. 1508-1524
-
-
Diesenroth, M.1
Rasmussen, C.2
Peters, J.3
-
4
-
-
31844451013
-
Reinforcement learning with Gaussian processes
-
New York, NY, USA: ACM
-
Y. Engel, S. Mannor, and R. Meir, "Reinforcement learning with Gaussian processes," in ICML '05: Proceedings of the 22nd international conference on Machine learning. New York, NY, USA: ACM, 2005, pp. 201-208.
-
(2005)
ICML '05: Proceedings of the 22nd International Conference on Machine Learning
, pp. 201-208
-
-
Engel, Y.1
Mannor, S.2
Meir, R.3
-
6
-
-
84880694195
-
Stable function approximation in dynamic programming
-
G. J. Gordon, "Stable function approximation in dynamic programming," in ICML, 1995, pp. 261-268.
-
(1995)
ICML
, pp. 261-268
-
-
Gordon, G.J.1
-
7
-
-
21844465127
-
Tree-based batch mode reinforcement learning
-
D. Ernst, P. Geurts, and L. Wehenkel, "Tree-based batch mode reinforcement learning," J. Mach. Learn. Res., vol. 6, pp. 503-556, 2005.
-
(2005)
J. Mach. Learn. Res.
, vol.6
, pp. 503-556
-
-
Ernst, D.1
Geurts, P.2
Wehenkel, L.3
-
8
-
-
0036832956
-
Kernel-based reinforcement learning
-
D. Ormoneit and S. Sen, "Kernel-based reinforcement learning," Machine Learning, vol. 49, no. 2-3, pp. 161-178, 2002.
-
(2002)
Machine Learning
, vol.49
, Issue.2-3
, pp. 161-178
-
-
Ormoneit, D.1
Sen, S.2
-
9
-
-
27944453854
-
Neural Reinforcement Learning to Swing-up and Balance a Real Pole
-
M. Riedmiller, "Neural Reinforcement Learning to Swing-up and Balance a Real Pole," in Proceedings of the 2005 IEEE International Conference on Systems, Man and Cybernetics, vol. 4, 2005, pp. 3191-3196.
-
(2005)
Proceedings of the 2005 IEEE International Conference on Systems, Man and Cybernetics
, vol.4
, pp. 3191-3196
-
-
Riedmiller, M.1
-
10
-
-
33646398129
-
Neural fitted Q iteration-first experiences with a data efficient neural reinforcement learning method
-
-, "Neural fitted Q iteration-first experiences with a data efficient neural reinforcement learning method," Lecture notes in computer science, vol. 3720, pp. 317-328, 2005.
-
(2005)
Lecture Notes in Computer Science
, vol.3720
, pp. 317-328
-
-
Riedmiller, M.1
-
12
-
-
34249833101
-
Q-learning
-
[Online]. Available
-
C. Watkins and P. Dayan, "Q-learning," Machine Learning, vol. 8, no. 3-4, pp. 279-292, 1992. [Online]. Available: http://jmvidal.cse.sc.edu/ library/watkins92a.pdf
-
(1992)
Machine Learning
, vol.8
, Issue.3-4
, pp. 279-292
-
-
Watkins, C.1
Dayan, P.2
-
15
-
-
0141571972
-
On Gaussian radial basis function approximations: Interpretation, extensions, and learning strategies
-
M. Figueiredo, "On Gaussian radial basis function approximations: Interpretation, extensions, and learning strategies," Pattern Recognition, International Conference on, vol. 2, pp. 618-621, 2000.
-
(2000)
Pattern Recognition, International Conference on
, vol.2
, pp. 618-621
-
-
Figueiredo, M.1
-
16
-
-
0002629270
-
Maximum likelihood from incomplete data via the EM algorithm
-
A. Dempster, N. Laird, D. Rubin, et al., "Maximum likelihood from incomplete data via the EM algorithm," Journal of the Royal Statistical Society. Series B (Methodological), vol. 39, no. 1, pp. 1-38, 1977.
-
(1977)
Journal of the Royal Statistical Society. Series B (Methodological)
, vol.39
, Issue.1
, pp. 1-38
-
-
Dempster, A.1
Laird, N.2
Rubin, D.3
-
17
-
-
0003922190
-
-
New-York, USA: John Wiley and Sons, Inc
-
R. O. Duda, P. E. Hart, and D. G. Stork, Pattern classification. New-York, USA: John Wiley and Sons, Inc, 2001.
-
(2001)
Pattern Classification
-
-
Duda, R.O.1
Hart, P.E.2
Stork, D.G.3
-
18
-
-
27544498086
-
Highly efficient incremental estimation of Gaussian mixture models for online data stream clustering
-
M. Song and H. Wang, "Highly efficient incremental estimation of Gaussian mixture models for online data stream clustering," in Proceedings of SPIE: Intelligent Computing: Theory and Applications III, Orlando, FL, USA, 2005, pp. 174-183.
-
Proceedings of SPIE: Intelligent Computing: Theory and Applications III, Orlando, FL, USA, 2005
, pp. 174-183
-
-
Song, M.1
Wang, H.2
-
20
-
-
0034131785
-
On-line em algorithm for the normalized Gaussian network
-
M.-A. Sato and S. Ishii, "On-line em algorithm for the normalized Gaussian network," Neural Comput., vol. 12, no. 2, pp. 407-432, 2000.
-
(2000)
Neural Comput.
, vol.12
, Issue.2
, pp. 407-432
-
-
Sato, M.-A.1
Ishii, S.2
-
21
-
-
0003541323
-
-
Ph.D. dissertation, Pittsburgh, PA, USA
-
S. J. Nowlan, "Soft competitive adaptation: neural network learning algorithms based on fitting statistical mixtures," Ph.D. dissertation, Pittsburgh, PA, USA, 1991.
-
(1991)
Soft Competitive Adaptation: Neural Network Learning Algorithms Based on Fitting Statistical Mixtures
-
-
Nowlan, S.J.1
-
22
-
-
0002788893
-
A view of the em algorithm that justifies incremental, sparse, and other variants
-
Norwell, MA, USA: Kluwer Academic Publishers
-
R. Neal and G. Hinton, "A view of the em algorithm that justifies incremental, sparse, and other variants," in Proceedings of the NATO Advanced Study Institute on Learning in graphical models. Norwell, MA, USA: Kluwer Academic Publishers, 1998, pp. 355-368.
-
(1998)
Proceedings of the NATO Advanced Study Institute on Learning in Graphical Models
, pp. 355-368
-
-
Neal, R.1
Hinton, G.2
-
23
-
-
0002210775
-
The role of exploration in learning control
-
D.White and D. Sofge, Eds. Florence, Kentucky 41022: Van Nostrand Reinhold
-
S. Thrun, "The role of exploration in learning control," in Handbook for Intelligent Control: Neural, Fuzzy and Adaptive Approaches, D.White and D. Sofge, Eds. Florence, Kentucky 41022: Van Nostrand Reinhold, 1992.
-
(1992)
Handbook for Intelligent Control: Neural, Fuzzy and Adaptive Approaches
-
-
Thrun, S.1
-
24
-
-
0033629916
-
Reinforcement learning in continuous time and space
-
K. Doya, "Reinforcement learning in continuous time and space," Neural Comput., vol. 12, no. 1, pp. 219-245, 2000.
-
(2000)
Neural Comput.
, vol.12
, Issue.1
, pp. 219-245
-
-
Doya, K.1
-
25
-
-
0002997066
-
Reinforcement learning based on on-line em algorithm
-
Cambridge, MA, USA: MIT Press
-
M.-a. Sato and S. Ishii, "Reinforcement learning based on on-line em algorithm," in Proceedings of the 1998 conference on Advances in neural information processing systems (NIPS'99). Cambridge, MA, USA: MIT Press, 1999, pp. 1052-1058.
-
(1999)
Proceedings of the 1998 Conference on Advances in Neural Information Processing Systems (NIPS'99)
, pp. 1052-1058
-
-
Sato, M.-A.1
Ishii, S.2
|