-
1
-
-
85158005713
-
An application of reinforcement learning to aerobatic helicopter flight
-
P. Abbeel, A. Coates, M. Quigley, and A. Ng. An application of reinforcement learning to aerobatic helicopter flight. NIPS, 2006.
-
(2006)
NIPS
-
-
Abbeel, P.1
Coates, A.2
Quigley, M.3
Ng, A.4
-
3
-
-
84898945898
-
A message-passing algorithm for multi-agent trajectory planning
-
José Bento, Nate Derbinsky, Javier Alonso-Mora, and Jonathan S. Yedidia. A message-passing algorithm for multi-agent trajectory planning. In NIPS, pages 521–529, 2013.
-
(2013)
NIPS
, pp. 521-529
-
-
Bento, José1
Derbinsky, Nate2
Alonso-Mora, Javier3
Yedidia, Jonathan S.4
-
5
-
-
79957583390
-
Distributed optimization and statistical learning via the alternating direction method of multipliers
-
S. Boyd, N. Parikh, E. Chu, B. Peleato, and J. Eckstein. Distributed optimization and statistical learning via the alternating direction method of multipliers. Foundations and Trends in Machine Learning, 2011.
-
(2011)
Foundations and Trends in Machine Learning
-
-
Boyd, S.1
Parikh, N.2
Chu, E.3
Peleato, B.4
Eckstein, J.5
-
6
-
-
84890526837
-
New types of deep neural network learning for speech recognition and related applications: An overview
-
L. Deng, G. Hinton, and B. Kingsbury. New types of deep neural network learning for speech recognition and related applications: An overview. ICASSP, 2013.
-
(2013)
ICASSP
-
-
Deng, L.1
Hinton, G.2
Kingsbury, B.3
-
7
-
-
77953234105
-
A muscle-reflex model that encodes principles of legged mechanics produces human walking dynamics and muscle activities
-
Hartmut Geyer and Hugh Herr. A muscle-reflex model that encodes principles of legged mechanics produces human walking dynamics and muscle activities. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 18(3), 2010.
-
(2010)
IEEE Transactions on Neural Systems and Rehabilitation Engineering
, vol.18
, Issue.3
-
-
Geyer, Hartmut1
Herr, Hugh2
-
8
-
-
37249089713
-
Quadrotor helicopter flight dynamics and control: Theory and experiment
-
G. Hoffmann, H. Huang, S. Waslander, and C. Tomlin. Quadrotor helicopter flight dynamics and control: Theory and experiment. Proc AIAA, 2007.
-
(2007)
Proc AIAA
-
-
Hoffmann, G.1
Huang, H.2
Waslander, S.3
Tomlin, C.4
-
9
-
-
40049092971
-
Central pattern generators for locomotion control in animals and robots: a review
-
A. Ijspeert. Central pattern generators for locomotion control in animals and robots: a review. Neural Networks, 2008.
-
(2008)
Neural Networks
-
-
Ijspeert, A.1
-
10
-
-
84876231242
-
Imagenet classification with deep convolutional neural networks
-
A. Krizhevsky, I. Sutskever, and G. Hinton. Imagenet classification with deep convolutional neural networks. NIPS, 2012.
-
(2012)
NIPS
-
-
Krizhevsky, A.1
Sutskever, I.2
Hinton, G.3
-
13
-
-
84898932265
-
Variational policy search via trajectory optimization
-
S. Levine and V. Koltun. Variational policy search via trajectory optimization. NIPS, 2013.
-
(2013)
NIPS
-
-
Levine, S.1
Koltun, V.2
-
19
-
-
84872224046
-
Discovery of complex behaviors through contact-invariant optimization
-
I. Mordatch, E. Todorov, and Z. Popovic. Discovery of complex behaviors through contact-invariant optimization. SIGGRAPH, 2012.
-
(2012)
SIGGRAPH
-
-
Mordatch, I.1
Todorov, E.2
Popovic, Z.3
-
21
-
-
84891310987
-
Stochastic alternating direction method of multipliers
-
JMLR.org
-
Hua Ouyang, Niao He, Long Tran, and Alexander G. Gray. Stochastic alternating direction method of multipliers. In ICML (1), volume 28 of JMLR Proceedings, pages 80–88. JMLR.org, 2013.
-
(2013)
ICML (1), volume 28 of JMLR Proceedings
, pp. 80-88
-
-
Ouyang, Hua1
He, Niao2
Tran, Long3
Gray, Alexander G.4
-
22
-
-
40649106649
-
Natural actor-critic
-
J. Peters and S. Schaal. Natural actor-critic. Neurocomputing, 71:1180–1190, 2008.
-
(2008)
Neurocomputing
, vol.71
, pp. 1180-1190
-
-
Peters, J.1
Schaal, S.2
-
24
-
-
80053460450
-
Contractive auto-encoders: Explicit invariance during feature extraction
-
Salah Rifai, Pascal Vincent, Xavier Muller, Xavier Glorot, and Yoshua Bengio. Contractive auto-encoders: Explicit invariance during feature extraction. In ICML, pages 833–840, 2011.
-
(2011)
ICML
, pp. 833-840
-
-
Rifai, Salah1
Vincent, Pascal2
Muller, Xavier3
Glorot, Xavier4
Bengio, Yoshua5
-
25
-
-
84862273266
-
A reduction of imitation learning and structured prediction to no-regret online learning
-
JMLR.org
-
Stéphane Ross, Geoffrey J. Gordon, and Drew Bagnell. A reduction of imitation learning and structured prediction to no-regret online learning. In AISTATS, volume 15 of JMLR Proceedings, pages 627–635. JMLR.org, 2011.
-
(2011)
AISTATS, volume 15 of JMLR Proceedings
, pp. 627-635
-
-
Ross, Stéphane1
Gordon, Geoffrey J.2
Bagnell, Drew3
-
27
-
-
85083951635
-
-
arXiv preprint
-
P. Sermanet, D. Eigen, X. Zhang, M. Mathieu, R. Fergus, and Y. LeCun. Overfeat: Integrated recognition, localization and detection using convolutional networks. arXiv preprint, 2014.
-
(2014)
Overfeat: Integrated recognition, localization and detection using convolutional networks
-
-
Sermanet, P.1
Eigen, D.2
Zhang, X.3
Mathieu, M.4
Fergus, R.5
LeCun, Y.6
-
33
-
-
84872363924
-
Synthesis and stabilization of complex behaviors through online trajectory optimization
-
Y. Tassa, T. Erez, and E. Todorov. Synthesis and stabilization of complex behaviors through online trajectory optimization. IROS, 2012.
-
(2012)
IROS
-
-
Tassa, Y.1
Erez, T.2
Todorov, E.3
-
34
-
-
84862011769
-
Learning policy improvements with path integrals
-
E. Theodorou, J. Buchli, and S. Schaal. Learning policy improvements with path integrals. AISTATS, 2010.
-
(2010)
AISTATS
-
-
Theodorou, E.1
Buchli, J.2
Schaal, S.3
-
35
-
-
23944452693
-
A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems
-
E. Todorov and W. Li. A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems. American Control Conference, pages 300–306, 2005.
-
(2005)
American Control Conference
, pp. 300-306
-
-
Todorov, E.1
Li, W.2
-
36
-
-
84872292044
-
MuJoCo: A physics engine for model-based control
-
E. Todorov, T. Erez, and Y. Tassa. MuJoCo: A physics engine for model-based control. IROS, 2012.
-
(2012)
IROS
-
-
Todorov, E.1
Erez, T.2
Tassa, Y.3
-
37
-
-
71149083296
-
Robot trajectory optimization using approximate inference
-
M. Toussaint. Robot trajectory optimization using approximate inference. International Conference on Machine Learning, 26:1049–1056, 2009.
-
(2009)
International Conference on Machine Learning
, vol.26
, pp. 1049-1056
-
-
Toussaint, M.1
-
39
-
-
85131217346
-
Value function approximation and model-predictive control
-
M. Zhong, M. Johnson, Y. Tassa, T. Erez, and E. Todorov. Value function approximation and model-predictive control. IEEE ADPRL, 2013.
-
(2013)
IEEE ADPRL
-
-
Zhong, M.1
Johnson, M.2
Tassa, Y.3
Erez, T.4
Todorov, E.5