-
7
-
-
84923295342
-
-
-
Sebastian Bubeck. The complexities of optimization, December 2013. URL https://blogs.princeton.edu/imabandit/2013/04/25/orf523-noisy-oracles/.
-
(2013)
The Complexities of Optimization
-
-
-
8
-
-
79955702502
-
LIBSVM: A library for support vector machines
-
-
Chih-Chung Chang and Chih-Jen Lin. LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology, 2:27:1-27:27, 2011. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm.
-
(2011)
ACM Transactions on Intelligent Systems and Technology
, vol.2
, pp. 27:1-27:27
-
-
Chang, C.1
Lin, C.2
-
10
-
-
0020203191
-
Optimal control and nonlinear filtering for nondegenerate diffusion processes
-
W. Fleming and S. Mitter. Optimal control and nonlinear filtering for nondegenerate diffusion processes. Stochastics, 8:226-261, 1982.
-
(1982)
Stochastics
, vol.8
, pp. 226-261
-
-
Fleming, W.1
Mitter, S.2
-
11
-
-
28844435646
-
Linear theory for control of nonlinear stochastic systems
-
H.J. Kappen. Linear theory for control of nonlinear stochastic systems. Physical Review Letters, 95(20):200201, 2005.
-
(2005)
Physical Review Letters
, vol.95
, Issue.20
, pp. 200201
-
-
Kappen, H.J.1
-
12
-
-
0032203257
-
Gradient-based learning applied to document recognition
-
Yann LeCun, Léon Bottou, Yoshua Bengio, and Patrick Haffner. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278-2324, 1998.
-
(1998)
Proceedings of the IEEE
, vol.86
, Issue.11
, pp. 2278-2324
-
-
LeCun, Y.1
Bottou, L.2
Bengio, Y.3
Haffner, P.4
-
13
-
-
0001916840
-
Risk sensitive Markov decision processes
-
S.I. Marcus, E. Fernández-Gaucherand, D. Hernández-Hernández, S. Coraluppi, and P. Fard. Risk sensitive Markov decision processes. Systems and Control in the Twenty-First Century, 29, 1997.
-
(1997)
Systems and Control in the Twenty-First Century
, vol.29
-
-
Marcus, S.I.1
Fernández-Gaucherand, E.2
Hernández-Hernández, D.3
Coraluppi, S.4
Fard, P.5
-
15
-
-
0025536653
-
Generalised graduated non-convexity algorithm for maximum a posteriori image estimation
-
A. Rangarajan. Generalised graduated non-convexity algorithm for maximum a posteriori image estimation. In Proc. ICPR, pages 127-133, 1990.
-
(1990)
Proc. ICPR
, pp. 127-133
-
-
Rangarajan, A.1
-
16
-
-
0016094244
-
Optimization of stochastic linear systems with additive measurement and process noise using exponential performance criteria
-
J Speyer, John Deyst, and D Jacobson. Optimization of stochastic linear systems with additive measurement and process noise using exponential performance criteria. Automatic Control, IEEE Transactions on, 19(4):358-366, 1974.
-
(1974)
Automatic Control, IEEE Transactions on
, vol.19
, Issue.4
, pp. 358-366
-
-
Speyer, J.1
Deyst, J.2
Jacobson, D.3
-
17
-
-
77955836276
-
Reinforcement learning of motor skills in high dimensions: A path integral approach
-
IEEE
-
E. Theodorou, J. Buchli, and S. Schaal. Reinforcement learning of motor skills in high dimensions: A path integral approach. In Robotics and Automation (ICRA), 2010 IEEE International Conference on, pages 2397-2403. IEEE, 2010a.
-
(2010)
Robotics and Automation (ICRA), 2010 IEEE International Conference on
, pp. 2397-2403
-
-
Theodorou, E.1
Buchli, J.2
Schaal, S.3
-
18
-
-
79551503171
-
A generalized path integral control approach to reinforcement learning
-
Evangelos Theodorou, Jonas Buchli, and Stefan Schaal. A generalized path integral control approach to reinforcement learning. The Journal of Machine Learning Research, 11:3137-3181, 2010b.
-
(2010)
The Journal of Machine Learning Research
, vol.11
, pp. 3137-3181
-
-
Theodorou, E.1
Buchli, J.2
Schaal, S.3
-
20
-
-
71149083296
-
Robot trajectory optimization using approximate inference
-
M. Toussaint. Robot trajectory optimization using approximate inference. International Conference on Machine Learning, 26:1049-1056, 2009.
-
(2009)
International Conference on Machine Learning
, vol.26
, pp. 1049-1056
-
-
Toussaint, M.1