SCOPUS 정보 검색 플랫폼

IEEE International Conference on Intelligent Robots and Systems

Volumn 2016-November, Issue , 2016, Pages 3928-3934

Stable reinforcement learning with autoencoders for tactile and visual data

(5) Van Hoof, Herke a Chen, Nutan b Karl, Maximilian b Van Der Smagt, Patrick b,c Peters, Jan a,d

a DARMSTADT UNIVERSITY OF TECHNOLOGY (Germany)

b TECHNICAL UNIVERSITY OF MUNICH (Germany)

c Fortiss (Germany)

d MAX PLANCK INSTITUTE FOR INTELLIGENT SYSTEMS (Germany)

Author keywords

[No Author keywords available]

Indexed keywords

FEEDBACK; INTELLIGENT ROBOTS; LEARNING SYSTEMS; REINFORCEMENT LEARNING; ROBOTS; VISUAL COMMUNICATION; VISUAL SERVOING;

AUTO ENCODERS; COMPACT REPRESENTATION; CONTINUOUS STATE SPACE; HIGH-DIMENSIONAL; NON-TRIVIAL; TRIAL AND ERROR; VISUAL DATA; VISUAL FEEDBACK;

LEARNING ALGORITHMS;

EID: 85006416116 PISSN: 21530858 EISSN: 21530866 Source Type: Conference Proceeding
DOI: 10.1109/IROS.2016.7759578 Document Type: Conference Paper

Times cited : (158)

References (36)

1
- 84884276459
- Reinforcement learning in robotics: A survey
- J. Kober, J. A. Bagnell, and J. Peters, "Reinforcement learning in robotics: A survey", Int. J. Robotics Research, vol. 11, no. 32, pp. 1238-1274, 2013.
- (2013) Int. J. Robotics Research , vol.11 , Issue.32 , pp. 1238-1274
- Kober, J.¹ Bagnell, J.A.² Peters, J.³

2
- 0029679044
- Reinforcement learning: A survey
- L. P. Kaelbling, M. L. Littman, and A. W. Moore, "Reinforcement learning: A survey", J. Artificial Intell. Research, vol. 4, 1996.
- (1996) J. Artificial Intell. Research , vol.4
- Kaelbling, L.P.¹ Littman, M.L.² Moore, A.W.³

3
- 84893561637
- Acquiring visual servoing reaching and grasping skills using neural reinforcement learning
- T. Lampe and M. Riedmiller, "Acquiring visual servoing reaching and grasping skills using neural reinforcement learning", in IJCNN, 2013.
- (2013) IJCNN
- Lampe, T.¹ Riedmiller, M.²

4
- 84938227861
- Towards learning hierarchical skills for multi-phase manipulation tasks
- O. Kroemer, C. Daniel, G. Neumann, H. van Hoof, and J. Peters, "Towards learning hierarchical skills for multi-phase manipulation tasks", in ICRA, 2015.
- (2015) ICRA
- Kroemer, O.¹ Daniel, C.² Neumann, G.³ Van Hoof, H.⁴ Peters, J.⁵

5
- 84455188451
- Learning force control policies for compliant manipulation
- M. Kalakrishnan, L. Righetti, P. Pastor, and S. Schaal, "Learning force control policies for compliant manipulation", in IROS, 2011.
- (2011) IROS
- Kalakrishnan, M.¹ Righetti, L.² Pastor, P.³ Schaal, S.⁴

6
- 0034291488
- Control of contact via tactile sensing
- H. Zhang and N. Chen, "Control of contact via tactile sensing", Trans. Robotics and Automation, vol. 16, no. 5, pp. 482-495, 2000.
- (2000) Trans. Robotics and Automation , vol.16 , Issue.5 , pp. 482-495
- Zhang, H.¹ Chen, N.²

7
- 84929223521
- Limits to compliance and the role of tactile sensing in grasping
- L. Jentoft, Q. Wan, and R. Howe, "Limits to compliance and the role of tactile sensing in grasping", in ICRA, 2014, pp. 6394-6399.
- (2014) ICRA , pp. 6394-6399
- Jentoft, L.¹ Wan, Q.² Howe, R.³

8
- 78651516759
- Contact-reactive grasping of objects with partial shape information
- K. Hsiao, S. Chitta, M. Ciocarlie, and E. Jones, "Contact-reactive grasping of objects with partial shape information", in IROS, 2010.
- (2010) IROS
- Hsiao, K.¹ Chitta, S.² Ciocarlie, M.³ Jones, E.⁴

9
- 84891103646
- Towards associative skill memories
- P. Pastor, M. Kalakrishnan, L. Righetti, and S. Schaal, "Towards associative skill memories", in Humanoids, 2012.
- (2012) Humanoids
- Pastor, P.¹ Kalakrishnan, M.² Righetti, L.³ Schaal, S.⁴

10
- 35248866146
- An introduction to reinforcement learning theory: Value function methods
- Springer Berlin Heidelberg
- P. L. Bartlett, "An introduction to reinforcement learning theory: Value function methods", in Advanced lectures on Machine Learning. Springer Berlin Heidelberg, 2003, pp. 184-202.
- (2003) Advanced Lectures on Machine Learning , pp. 184-202
- Bartlett, P.L.¹

11
- 84911470116
- Learning robot tactile sensing for object manipulation
- Y. Chebotar, O. Kroemer, and J. Peters, "Learning robot tactile sensing for object manipulation", in IROS, 2014, pp. 3368-3375.
- (2014) IROS , pp. 3368-3375
- Chebotar, Y.¹ Kroemer, O.² Peters, J.³

12
- 77955426970
- Combining active learning and reactive control for robot grasping
- O. Kroemer, R. Detry, J. Piater, and J. Peters, "Combining active learning and reactive control for robot grasping", Robotics and Autonomous Syst., vol. 58, no. 9, pp. 1105-1116, 2010.
- (2010) Robotics and Autonomous Syst. , vol.58 , Issue.9 , pp. 1105-1116
- Kroemer, O.¹ Detry, R.² Piater, J.³ Peters, J.⁴

13
- 84868008342
- Skill learning and task outcome prediction for manipulation
- P. Pastor, M. Kalakrishnan, S. Chitta, E. Theodorou, and S. Schaal, "Skill learning and task outcome prediction for manipulation", in ICRA, 2011.
- (2011) ICRA
- Pastor, P.¹ Kalakrishnan, M.² Chitta, S.³ Theodorou, E.⁴ Schaal, S.⁵

14
- 67650835709
- Learning perceptual coupling for motor primitives
- J. Kober, B. Mohler, and J. Peters, "Learning perceptual coupling for motor primitives", in IROS, 2008, pp. 834-839.
- (2008) IROS , pp. 834-839
- Kober, J.¹ Mohler, B.² Peters, J.³

15
- 84962230791
- Learning of non-parametric control policies with high-dimensional state features
- H. van Hoof, J. Peters, and G. Neumann, "Learning of non-parametric control policies with high-dimensional state features", in AISTATS, 2015.
- (2015) AISTATS
- Van Hoof, H.¹ Peters, J.² Neumann, G.³

16
- 84962314631
- Learning robot in-hand manipulation with tactile features
- H. van Hoof, T. Hermans, G. Neumann, and J. Peters, "Learning robot in-hand manipulation with tactile features", in Humanoids, 2015.
- (2015) Humanoids
- Van Hoof, H.¹ Hermans, T.² Neumann, G.³ Peters, J.⁴

17
- 56449089103
- Extracting and composing robust features with denoising autoencoders
- P. Vincent, H. Larochelle, Y. Bengio, and P.-A. Manzagol, "Extracting and composing robust features with denoising autoencoders", in ICML, 2008, pp. 1096-1103.
- (2008) ICML , pp. 1096-1103
- Vincent, P.¹ Larochelle, H.² Bengio, Y.³ Manzagol, P.-A.⁴

18
- 69349090197
- Learning deep architectures for AI
- Y. Bengio, "Learning deep architectures for AI", Found. Trends Mach. Learn., vol. 2, no. 1, pp. 1-127, 2009.
- (2009) Found. Trends Mach. Learn. , vol.2 , Issue.1 , pp. 1-127
- Bengio, Y.¹

19
- 85083952489
- Auto-encoding variational Bayes
- D. P. Kingma and M. Welling, "Auto-encoding variational Bayes", in ICLR, 2014.
- (2014) ICLR
- Kingma, D.P.¹ Welling, M.²

20
- 84962243554
- Efficient movement representation by embedding dynamic movement primitives in deep autoencoders
- N. Chen, J. Bayer, S. Urban, and P. van der Smagt, "Efficient movement representation by embedding dynamic movement primitives in deep autoencoders", in Humanoids, 2015.
- (2015) Humanoids
- Chen, N.¹ Bayer, J.² Urban, S.³ Van Der Smagt, P.⁴

21
- 85006369951
- Learn to swing up and balance a real pole based on raw visual input data
- J. Mattner, S. Lange, and M. Riedmiller, "Learn to swing up and balance a real pole based on raw visual input data", in ICONIP, 2012.
- (2012) ICONIP
- Mattner, J.¹ Lange, S.² Riedmiller, M.³

22
- 84977501896
- Deep spatial autoencoders for visuomotor learning
- C. Finn, X. Y. Tan, Y. Duan, T. Darrell, S. Levine, and P. Abbeel, "Deep spatial autoencoders for visuomotor learning", in ICRA, 2016.
- (2016) ICRA
- Finn, C.¹ Tan, X.Y.² Duan, Y.³ Darrell, T.⁴ Levine, S.⁵ Abbeel, P.⁶

23
- 84980051362
- ArXiv, Tech. Rep.
- J.-A. M. Assael, N. Wahlström, T. B. Schön, and M. P. Deisenroth, "Data-efficient learning of feedback policies from image pixels using deep dynamical models", ArXiv, Tech. Rep., 2015.
- (2015) Data-efficient Learning of Feedback Policies from Image Pixels Using Deep Dynamical Models
- Assael, M.J.-A.¹ Wahlström, N.² Schön, T.B.³ Deisenroth, M.P.⁴

24
- 84965104233
- ArXiv, Tech. Rep
- N. Wahlström, T. B. Schön, and M. P. Deisenroth, "From pixels to torques: Policy learning with deep dynamical models", ArXiv, Tech. Rep. 1502.02251, 2015.
- (2015) From Pixels to Torques: Policy Learning with Deep Dynamical Models
- Wahlström, N.¹ Schön, T.B.² Deisenroth, M.P.³

25
- 84865083902
- Autonomous reinforcement learning on raw visual input data in a real world application
- S. Lange, M. Riedmiller, and A. Voigtländer, "Autonomous reinforcement learning on raw visual input data in a real world application", in IJCNN, 2012.
- (2012) IJCNN
- Lange, S.¹ Riedmiller, M.² Voigtländer, A.³

26
- 33646687423
- Neural fitted Q iteration-first experiences with a data efficient neural reinforcement learning method
- M. Riedmiller, "Neural fitted Q iteration-first experiences with a data efficient neural reinforcement learning method", in ECML, 2005.
- (2005) ECML
- Riedmiller, M.¹

27
- 84924051598
- Human-level control through deep reinforcement learning
- V. Mnih, K. Kavukcuoglu, D. Silver, A. A. Rusu, J. Veness, M. G. Bellemare, A. Graves, M. Riedmiller, A. K. Fidjeland, G. Ostrovski, S. Petersen, C. Beattie, A. Sadik, I. Antonoglou, H. King, D. Kumaran, D. Wierstra, S. Legg, and D. Hassabis, "Human-level control through deep reinforcement learning", Nature, vol. 518, no. 7540, 2015.
- (2015) Nature , vol.518 , Issue.7540
- Mnih, V.¹ Kavukcuoglu, K.² Silver, D.³ Rusu, A.A.⁴ Veness, J.⁵ Bellemare, M.G.⁶ Graves, A.⁷ Riedmiller, M.⁸ Fidjeland, A.K.⁹ Ostrovski, G.¹⁰ Petersen, S.¹¹ Beattie, C.¹² Sadik, A.¹³ Antonoglou, I.¹⁴ King, H.¹⁵ Kumaran, D.¹⁶ Wierstra, D.¹⁷ Legg, S.¹⁸ Hassabis, D.¹⁹

28
- 84883060087
- Evolving largescale neural networks for vision-based reinforcement learning
- J. Koutník, G. Cuccu, J. Schmidhuber, and F. Gomez, "Evolving largescale neural networks for vision-based reinforcement learning", in Annu. Conf. Genetic and Evolutionary Computation, 2013.
- (2013) Annu. Conf. Genetic and Evolutionary Computation
- Koutník, J.¹ Cuccu, G.² Schmidhuber, J.³ Gomez, F.⁴

29
- 84965135289
- arXiv, Tech. Rep
- T. P. Lillicrap, J. J. Hunt, A. Pritzel, N. Heess, T. Erez, Y. Tassa, D. Silver, and D. Wierstra, "Continuous control with deep reinforcement learning", arXiv, Tech. Rep. 1509.02971, 2015.
- (2015) Continuous Control with Deep Reinforcement Learning
- Lillicrap, T.P.¹ Hunt, J.J.² Pritzel, A.³ Heess, N.⁴ Erez, T.⁵ Tassa, Y.⁶ Silver, D.⁷ Wierstra, D.⁸

30
- 84969963490
- Trust region policy optimization
- J. Schulman, S. Levine, P. Moritz, M. Jordan, and P. Abbeel, "Trust region policy optimization", in ICML, 2015.
- (2015) ICML
- Schulman, J.¹ Levine, S.² Moritz, P.³ Jordan, M.⁴ Abbeel, P.⁵

31
- 84979924150
- End-to-end training of deep visuomotor policies
- S. Levine, C. Finn, T. Darrell, and P. Abbeel, "End-to-end training of deep visuomotor policies", Journal of Machine Learning Research, vol. 17, no. 39, pp. 1-40, 2016.
- (2016) Journal of Machine Learning Research , vol.17 , Issue.39 , pp. 1-40
- Levine, S.¹ Finn, C.² Darrell, T.³ Abbeel, P.⁴

32
- 84965129327
- Embed to control: A locally linear latent dynamics model for control from raw images
- M. Watter, J. T. Springenberg, J. Boedecker, and M. Riedmiller, "Embed to control: A locally linear latent dynamics model for control from raw images", in NIPS, 2015, pp. 2728-2736.
- (2015) NIPS , pp. 2728-2736
- Watter, M.¹ Springenberg, J.T.² Boedecker, J.³ Riedmiller, M.⁴

33
- 85010325694
- Deep Kalman filters
- R. G. Krishnan, U. Shalit, and D. Sontag, "Deep Kalman filters", in NIPS Advances in Variational Inference Workshop, 2015.
- (2015) NIPS Advances in Variational Inference Workshop
- Krishnan, R.G.¹ Shalit, U.² Sontag, D.³

34
- 77958569725
- Relative entropy policy search
- J. Peters, K. Mülling, and Y. Altün, "Relative entropy policy search", in AAAI, 2010, pp. 1607-1612.
- (2010) AAAI , pp. 1607-1612
- Peters, J.¹ Mülling, K.² Altün, Y.³

35
- 84858765598
- Covariant policy search
- J. Bagnell and J. Schneider, "Covariant policy search", in International Joint Conference on Artificial Intelligence, 2003.
- (2003) International Joint Conference on Artificial Intelligence
- Bagnell, J.¹ Schneider, J.²

36
- 77953218689
- Random features for large-scale kernel machines
- A. Rahimi and B. Recht, "Random features for large-scale kernel machines", in NIPS, 2007, pp. 1177-1184.
- (2007) NIPS , pp. 1177-1184
- Rahimi, A.¹ Recht, B.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.