SCOPUS Information Search Platform

Proceedings - IEEE International Conference on Robotics and Automation

Volume 2016-June, 2016, Pages 520-527

Learning deep neural network policies with continuous memory states

(5) Zhang, Marvin (a); McCarthy, Zoe (a); Finn, Chelsea (a); Levine, Sergey (a); Abbeel, Pieter (a)

(a) UNIVERSITY OF CALIFORNIA (United States)

Author keywords

[No Author keywords available]

Indexed keywords

AERODYNAMICS; COMPLEX NETWORKS; MANIPULATORS; ROBOTS; SUPERVISED LEARNING;

CONTINUOUS CONTROL; CONTINUOUS SYSTEM; DEEP NEURAL NETWORKS; HIGH-DIMENSIONAL; LEARNING POLICY; POLICY LEARNING; ROBOTIC MANIPULATORS; TRAJECTORY OPTIMIZATION;

ROBOTICS;

EID: 84977555800 PISSN: 10504729 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICRA.2016.7487174 Document Type: Conference Paper

Times cited : (88)

References (24)

1
- 84873747053
- A survey of point-based POMDP solvers
- G. Shani, J. Pineau, and R. Kaplow, "A survey of point-based POMDP solvers," Autonomous Agents and Multi-Agent Systems, vol. 27, no. 1, pp. 1-51, 2013.
- (2013) Autonomous Agents and Multi-Agent Systems , vol.27 , Issue.1 , pp. 1-51
- Shani, G.¹ Pineau, J.² Kaplow, R.³

2
- 0002103968
- Learning finite-state controllers for partially observable environments
- N. Meuleau, L. Peshkin, K.-E. Kim, and L. P. Kaelbling, "Learning finite-state controllers for partially observable environments," in Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence (UAI), 1999, pp. 427-436.
- (1999) Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence (UAI) , pp. 427-436
- Meuleau, N.¹ Peshkin, L.² Kim, K.-E.³ Kaelbling, L.P.⁴

3
- 0001998385
- Learning policies with external memory
- L. Peshkin, N. Meuleau, and L. Kaelbling, "Learning policies with external memory," in Proceedings of the Sixteenth International Conference on Machine Learning (ICML), 2001, pp. 307-314.
- (2001) Proceedings of the Sixteenth International Conference on Machine Learning (ICML) , pp. 307-314
- Peshkin, L.¹ Meuleau, N.² Kaelbling, L.³

4
- 67349102783
- Hierarchical POMDP controller optimization by likelihood maximization
- M. Toussaint, L. Charlin, and P. Poupart, "Hierarchical POMDP controller optimization by likelihood maximization," in Proceedings of the 24th Conference on Uncertainty in Artificial Intelligence (UAI), 2008, pp. 562-570.
- (2008) Proceedings of the 24th Conference on Uncertainty in Artificial Intelligence (UAI) , pp. 562-570
- Toussaint, M.¹ Charlin, L.² Poupart, P.³

5
- 80053441894
- PILCO: A model-based and data-efficient approach to policy search
- M. Deisenroth and C. E. Rasmussen, "PILCO: A model-based and data-efficient approach to policy search," in Proceedings of the 28th International Conference on Machine Learning (ICML), 2011, pp. 465-472.
- (2011) Proceedings of the 28th International Conference on Machine Learning (ICML) , pp. 465-472
- Deisenroth, M.¹ Rasmussen, C.E.²

6
- 84965129327
- Embed to control: A locally linear latent dynamics model for control from raw images
- M. Watter, J. T. Springenberg, J. Boedecker, and M. Riedmiller, "Embed to control: A locally linear latent dynamics model for control from raw images," in Advances in Neural Information Processing Systems (NIPS), 2015.
- (2015) Advances in Neural Information Processing Systems (NIPS)
- Watter, M.¹ Springenberg, J.T.² Boedecker, J.³ Riedmiller, M.⁴

7
- 84903590417
- A survey on policy search for robotics
- M. Deisenroth, G. Neumann, and J. Peters, "A survey on policy search for robotics," Foundations and Trends in Robotics, vol. 2, no. 1-2, pp. 1-142, 2013.
- (2013) Foundations and Trends in Robotics , vol.2 , Issue.1-2 , pp. 1-142
- Deisenroth, M.¹ Neumann, G.² Peters, J.³

8
- 38149018611
- Solving deep memory POMDPs with recurrent policy gradients
- Springer
- D. Wierstra, A. Foerster, J. Peters, and J. Schmidhuber, "Solving deep memory POMDPs with recurrent policy gradients," in Artificial Neural Networks-ICANN 2007. Springer, 2007, pp. 697-706.
- (2007) Artificial Neural Networks-ICANN 2007 , pp. 697-706
- Wierstra, D.¹ Foerster, A.² Peters, J.³ Schmidhuber, J.⁴

9
- 84977523039
- S. Levine and P. Abbeel, "Learning neural network policies with guided policy search under unknown dynamics," 2014.
- (2014) Learning Neural Network Policies with Guided Policy Search under Unknown Dynamics
- Levine, S.¹ Abbeel, P.²

10
- 84943767635
- arXiv preprint arXiv:1504.00702
- S. Levine, C. Finn, T. Darrell, and P. Abbeel, "End-to-end training of deep visuomotor policies," arXiv preprint arXiv:1504.00702, 2015.
- (2015) End-to-end Training of Deep Visuomotor Policies
- Levine, S.¹ Finn, C.² Darrell, T.³ Abbeel, P.⁴

11
- 52249086942
- Online planning algorithms for POMDPs
- S. Ross, J. Pineau, S. Paquet, and B. Chaib-Draa, "Online planning algorithms for POMDPs," Journal of Artificial Intelligence Research, pp. 663-704, 2008.
- (2008) Journal of Artificial Intelligence Research , pp. 663-704
- Ross, S.¹ Pineau, J.² Paquet, S.³ Chaib-Draa, B.⁴

12
- 84864577176
- Continuous-state POMDPs with hybrid dynamics
- E. Brunskill, L. P. Kaelbling, T. Lozano-Perez, and N. Roy, "Continuous-state POMDPs with hybrid dynamics," in ISAIM, 2008.
- (2008) ISAIM
- Brunskill, E.¹ Kaelbling, L.P.² Lozano-Perez, T.³ Roy, N.⁴

13
- 84892982833
- arXiv preprint arXiv:1211.5063
- R. Pascanu, T. Mikolov, and Y. Bengio, "On the difficulty of training recurrent neural networks," arXiv preprint arXiv:1211.5063, 2012.
- (2012) On the Difficulty of Training Recurrent Neural Networks
- Pascanu, R.¹ Mikolov, T.² Bengio, Y.³

14
- 0031573117
- Long short-term memory
- S. Hochreiter and J. Schmidhuber, "Long short-term memory," Neural Computation, vol. 9, no. 8, pp. 1735-1780, 1997.
- (1997) Neural Computation , vol.9 , Issue.8 , pp. 1735-1780
- Hochreiter, S.¹ Schmidhuber, J.²

15
- 84919728106
- arXiv preprint arXiv:1406.1078
- K. Cho, B. van Merrienboer, C. Gulcehre, F. Bougares, H. Schwenk, and Y. Bengio, "Learning phrase representations using RNN encoder-decoder for statistical machine translation," arXiv preprint arXiv:1406.1078, 2014.
- (2014) Learning Phrase Representations Using RNN Encoder-decoder for Statistical Machine Translation
- Cho, K.¹ Van Merrienboer, B.² Gulcehre, C.³ Bougares, F.⁴ Schwenk, H.⁵ Bengio, Y.⁶

16
- 84938277141
- Learning contact-rich manipulation skills with guided policy search
- S. Levine, N. Wagener, and P. Abbeel, "Learning contact-rich manipulation skills with guided policy search," in International Conference on Robotics and Automation (ICRA), 2015.
- (2015) International Conference on Robotics and Automation (ICRA)
- Levine, S.¹ Wagener, N.² Abbeel, P.³

17
- 84862273266
- A reduction of imitation learning and structured prediction to no-regret online learning
- S. Ross, G. Gordon, and A. Bagnell, "A reduction of imitation learning and structured prediction to no-regret online learning," Journal of Machine Learning Research, vol. 15, pp. 627-635, 2011.
- (2011) Journal of Machine Learning Research , vol.15 , pp. 627-635
- Ross, S.¹ Gordon, G.² Bagnell, A.³

18
- 84919913608
- Learning complex neural network policies with trajectory optimization
- S. Levine and V. Koltun, "Learning complex neural network policies with trajectory optimization," in International Conference on Machine Learning (ICML), 2014.
- (2014) International Conference on Machine Learning (ICML)
- Levine, S.¹ Koltun, V.²

19
- 84858765598
- Covariant policy search
- J. A. Bagnell and J. Schneider, "Covariant policy search," in International Joint Conference on Artificial Intelligence (IJCAI), 2003.
- (2003) International Joint Conference on Artificial Intelligence (IJCAI)
- Bagnell, J.A.¹ Schneider, J.²

20
- 44949241322
- Reinforcement learning of motor skills with policy gradients
- J. Peters and S. Schaal, "Reinforcement learning of motor skills with policy gradients," Neural Networks, vol. 21, no. 4, pp. 682-697, 2008.
- (2008) Neural Networks , vol.21 , Issue.4 , pp. 682-697
- Peters, J.¹ Schaal, S.²

21
- 77958569725
- Relative entropy policy search
- J. Peters, K. Mülling, and Y. Altün, "Relative entropy policy search," in AAAI Conference on Artificial Intelligence, 2010.
- (2010) AAAI Conference on Artificial Intelligence
- Peters, J.¹ Mülling, K.² Altün, Y.³

22
- 84928547704
- Sequence to sequence learning with neural networks
- I. Sutskever, O. Vinyals, and Q. V. Le, "Sequence to sequence learning with neural networks," in Advances in Neural Information Processing Systems (NIPS), 2014, pp. 3104-3112.
- (2014) Advances in Neural Information Processing Systems (NIPS) , pp. 3104-3112
- Sutskever, I.¹ Vinyals, O.² Le, Q.V.³

23
- 84886998125
- Applying the episodic natural actor-critic architecture to motor primitive learning
- J. Peters and S. Schaal, "Applying the episodic natural actor-critic architecture to motor primitive learning," in European Symposium on Artificial Neural Networks (ESANN), 2007.
- (2007) European Symposium on Artificial Neural Networks (ESANN)
- Peters, J.¹ Schaal, S.²

24
- 85060321083
- Learning motor primitives for robotics
- J. Kober and J. Peters, "Learning motor primitives for robotics," in International Conference on Robotics and Automation (ICRA), 2009.
- (2009) International Conference on Robotics and Automation (ICRA)
- Kober, J.¹ Peters, J.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.