-
1
-
-
84873747053
-
A survey of point-based pomdp solvers
-
G. Shani, J. Pineau, and R. Kaplow, "A survey of point-based pomdp solvers, " Autonomous Agents and Multi-Agent Systems, vol. 27, no. 1, pp. 1-51, 2013.
-
(2013)
Autonomous Agents and Multi-Agent Systems
, vol.27
, Issue.1
, pp. 1-51
-
-
Shani, G.1
Pineau, J.2
Kaplow, R.3
-
2
-
-
0002103968
-
Learning finite-state controllers for partially observable environments
-
N. Meuleau, L. Peshkin, K.-E. Kim, and L. P. Kaelbling, "Learning finite-state controllers for partially observable environments, " in Proceedings of the Fifteenth conference on Uncertainty in Artificial Intelligence (UAI), 1999, pp. 427-436.
-
(1999)
Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence (UAI)
, pp. 427-436
-
-
Meuleau, N.1
Peshkin, L.2
Kim, K.-E.3
Kaelbling, L.P.4
-
3
-
-
0001998385
-
Learning policies with external memory
-
L. Peshkin, N. Meuleau, and L. Kaelbling, "Learning policies with external memory, " in Proceedings of the Sixteenth International Conference on Machine Learning (ICML), 2001, pp. 307-314.
-
(2001)
Proceedings of the Sixteenth International Conference on Machine Learning (ICML)
, pp. 307-314
-
-
Peshkin, L.1
Meuleau, N.2
Kaelbling, L.3
-
6
-
-
84965129327
-
Embed to control: A locally linear latent dynamics model for control from raw images
-
M. Watter, J. T. Springenberg, J. Boedecker, and M. Riedmiller, "Embed to control: A locally linear latent dynamics model for control from raw images, " in Advances in Neural Information Processing Systems (NIPS), 2015.
-
(2015)
Advances in Neural Information Processing Systems (NIPS)
-
-
Watter, M.1
Springenberg, J.T.2
Boedecker, J.3
Riedmiller, M.4
-
7
-
-
84903590417
-
A survey on policy search for robotics
-
M. Deisenroth, G. Neumann, and J. Peters, "A survey on policy search for robotics, " Foundations and Trends in Robotics, vol. 2, no. 1-2, pp. 1-142, 2013.
-
(2013)
Foundations and Trends in Robotics
, vol.2
, Issue.1-2
, pp. 1-142
-
-
Deisenroth, M.1
Neumann, G.2
Peters, J.3
-
8
-
-
38149018611
-
Solving deep memory pomdps with recurrent policy gradients
-
Springer
-
D. Wierstra, A. Foerster, J. Peters, and J. Schmidhuber, "Solving deep memory pomdps with recurrent policy gradients, " in Artificial Neural Networks-ICANN 2007. Springer, 2007, pp. 697-706.
-
(2007)
Artificial Neural Networks-ICANN 2007
, pp. 697-706
-
-
Wierstra, D.1
Foerster, A.2
Peters, J.3
Schmidhuber, J.4
-
10
-
-
84943767635
-
-
arXiv preprint arXiv:1504. 00702
-
S. Levine, C. Finn, T. Darrell, and P. Abbeel, "End-to-end training of deep visuomotor policies, " arXiv preprint arXiv:1504. 00702, 2015.
-
(2015)
End-to-end Training of Deep Visuomotor Policies
-
-
Levine, S.1
Finn, C.2
Darrell, T.3
Abbeel, P.4
-
11
-
-
52249086942
-
Online planning algorithms for pomdps
-
S. Ross, J. Pineau, S. Paquet, and B. Chaib-Draa, "Online planning algorithms for pomdps, " Journal of Artificial Intelligence Research, pp. 663-704, 2008.
-
(2008)
Journal of Artificial Intelligence Research
, pp. 663-704
-
-
Ross, S.1
Pineau, J.2
Paquet, S.3
Chaib-Draa, B.4
-
14
-
-
0031573117
-
Long short-term memory
-
S. Hochreiter and J. Schmidhuber, "Long short-term memory, " Neural computation, vol. 9, no. 8, pp. 1735-1780, 1997.
-
(1997)
Neural Computation
, vol.9
, Issue.8
, pp. 1735-1780
-
-
Hochreiter, S.1
Schmidhuber, J.2
-
15
-
-
84919728106
-
-
arXiv preprint arXiv:1406. 1078
-
K. Cho, B. van Merrienboer, C. Gulcehre, F. Bougares, H. Schwenk, and Y. Bengio, "Learning phrase representations using RNN encoder-decoder for statistical machine translation, " arXiv preprint arXiv:1406. 1078, 2014.
-
(2014)
Learning Phrase Representations Using RNN Encoder-decoder for Statistical Machine Translation
-
-
Cho, K.1
Van Merrienboer, B.2
Gulcehre, C.3
Bougares, F.4
Schwenk, H.5
Bengio, Y.6
-
17
-
-
84862273266
-
A reduction of imitation learning and structured prediction to no-regret online learning
-
S. Ross, G. Gordon, and A. Bagnell, "A reduction of imitation learning and structured prediction to no-regret online learning, " Journal of Machine Learning Research, vol. 15, pp. 627-635, 2011.
-
(2011)
Journal of Machine Learning Research
, vol.15
, pp. 627-635
-
-
Ross, S.1
Gordon, G.2
Bagnell, A.3
-
20
-
-
44949241322
-
Reinforcement learning of motor skills with policy gradients
-
J. Peters and S. Schaal, "Reinforcement learning of motor skills with policy gradients, " Neural Networks, vol. 21, no. 4, pp. 682-697, 2008.
-
(2008)
Neural Networks
, vol.21
, Issue.4
, pp. 682-697
-
-
Peters, J.1
Schaal, S.2
-
22
-
-
84928547704
-
Sequence to sequence learning with neural networks
-
I. Sutskever, O. Vinyals, and Q. V. Le, "Sequence to sequence learning with neural networks, " in Advances in Neural Information Processing Systems (NIPS), 2014, pp. 3104-3112.
-
(2014)
Advances in Neural Information Processing Systems (NIPS)
, pp. 3104-3112
-
-
Sutskever, I.1
Vinyals, O.2
Le, Q.V.3
|