SCOPUS Information Retrieval Platform

IEEE International Conference on Intelligent Robots and Systems

Volume 2016-November, 2016, Pages 3947-3952

Improved deep reinforcement learning for robotics through distribution-based experience retention

(4) De Bruin, Tim (a); Kober, Jens (a); Tuyls, Karl (a, b); Babuška, Robert (a)

a DELFT UNIVERSITY OF TECHNOLOGY (Netherlands)

b UNIVERSITY OF LIVERPOOL (United Kingdom)

Author keywords

[No Author keywords available]

Indexed keywords

INTELLIGENT ROBOTS; MAGNETIC LEVITATION VEHICLES; ROBOTS;

DEEP NEURAL NETWORKS; FUNCTION APPROXIMATORS; GENERALIZATION PERFORMANCE; LIFE LONG LEARNING; MAGNETIC MANIPULATION; SAMPLE DISTRIBUTIONS; TEMPORAL DIFFERENCE ERRORS; UNIFORM DISTRIBUTION;

REINFORCEMENT LEARNING;

EID: 85006341752 PISSN: 21530858 EISSN: 21530866 Source Type: Conference Proceeding
DOI: 10.1109/IROS.2016.7759581 Document Type: Conference Paper

Times cited: 43

References (14)

1
- 84884276459
- Reinforcement learning in robotics: A survey
- J. Kober, J. A. Bagnell, and J. Peters, "Reinforcement learning in robotics: A survey", International Journal of Robotics Research, vol. 32, no. 11, pp. 1238-1274, 2013.
- (2013) International Journal of Robotics Research , vol.32 , Issue.11 , pp. 1238-1274
- Kober, J.¹ Bagnell, J.A.² Peters, J.³

2
- 84943767635
- arXiv:1504.00702 [cs.LG]
- S. Levine, C. Finn, T. Darrell, and P. Abbeel, "End-to-end training of deep visuomotor policies", 2015, arXiv:1504.00702 [cs.LG].
- (2015) End-to-end Training of Deep Visuomotor Policies
- Levine, S.¹ Finn, C.² Darrell, T.³ Abbeel, P.⁴

3
- 84971448181
- arXiv:1602.01783 [cs.LG]
- V. Mnih, A. P. Badia, M. Mirza, A. Graves, T. P. Lillicrap, T. Harley, D. Silver, and K. Kavukcuoglu, "Asynchronous methods for deep reinforcement learning", 2016, arXiv:1602.01783 [cs.LG].
- (2016) Asynchronous Methods for Deep Reinforcement Learning
- Mnih, V.¹ Badia, A.P.² Mirza, M.³ Graves, A.⁴ Lillicrap, T.P.⁵ Harley, T.⁶ Silver, D.⁷ Kavukcuoglu, K.⁸

4
- 84963949906
- Mastering the game of Go with deep neural networks and tree search
- D. Silver, A. Huang, C. J. Maddison, A. Guez, L. Sifre, G. V. D. Driessche, J. Schrittwieser, I. Antonoglou, V. Panneershelvam, M. Lanctot, S. Dieleman, D. Grewe, J. Nham, N. Kalchbrenner, I. Sutskever, T. Lillicrap, M. Leach, K. Kavukcuoglu, T. Graepel, and D. Hassabis, "Mastering the game of Go with deep neural networks and tree search", Nature, vol. 529, no. 7585, pp. 484-489, 2016.
- (2016) Nature , vol.529 , Issue.7585 , pp. 484-489
- Silver, D.¹ Huang, A.² Maddison, C.J.³ Guez, A.⁴ Sifre, L.⁵ Driessche, G.V.D.⁶ Schrittwieser, J.⁷ Antonoglou, I.⁸ Panneershelvam, V.⁹ Lanctot, M.¹⁰ Dieleman, S.¹¹ Grewe, D.¹² Nham, J.¹³ Kalchbrenner, N.¹⁴ Sutskever, I.¹⁵ Lillicrap, T.¹⁶ Leach, M.¹⁷ Kavukcuoglu, K.¹⁸ Graepel, T.¹⁹ Hassabis, D.²⁰

5
- 0000123778
- Self-improving reactive agents based on reinforcement learning, planning and teaching
- L.-J. Lin, "Self-improving reactive agents based on reinforcement learning, planning and teaching", Machine Learning, vol. 8, no. 3-4, pp. 293-321, 1992.
- (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 293-321
- Lin, L.-J.¹

6
- 85083953657
- Continuous control with deep reinforcement learning
- T. P. Lillicrap, J. J. Hunt, A. Pritzel, N. Heess, T. Erez, Y. Tassa, D. Silver, and D. Wierstra, "Continuous control with deep reinforcement learning", in International Conference on Learning Representations (ICLR), 2016.
- (2016) International Conference on Learning Representations (ICLR)
- Lillicrap, T.P.¹ Hunt, J.J.² Pritzel, A.³ Heess, N.⁴ Erez, T.⁵ Tassa, Y.⁶ Silver, D.⁷ Wierstra, D.⁸

7
- 84924051598
- Human-level control through deep reinforcement learning
- V. Mnih, K. Kavukcuoglu, D. Silver, A. A. Rusu, J. Veness, M. G. Bellemare, A. Graves, M. Riedmiller, A. K. Fidjeland, G. Ostrovski, S. Petersen, C. Beattie, A. Sadik, I. Antonoglou, H. King, D. Kumaran, D. Wierstra, S. Legg, and D. Hassabis, "Human-level control through deep reinforcement learning", Nature, vol. 518, no. 7540, pp. 529-533, 2015.
- (2015) Nature , vol.518 , Issue.7540 , pp. 529-533
- Mnih, V.¹ Kavukcuoglu, K.² Silver, D.³ Rusu, A.A.⁴ Veness, J.⁵ Bellemare, M.G.⁶ Graves, A.⁷ Riedmiller, M.⁸ Fidjeland, A.K.⁹ Ostrovski, G.¹⁰ Petersen, S.¹¹ Beattie, C.¹² Sadik, A.¹³ Antonoglou, I.¹⁴ King, H.¹⁵ Kumaran, D.¹⁶ Wierstra, D.¹⁷ Legg, S.¹⁸ Hassabis, D.¹⁹

8
- 85006438211
- Deep Reinforcement Learning Workshop, Advances in Neural Information Processing Systems (NIPS)
- T. de Bruin, J. Kober, K. Tuyls, and R. Babuška, "The importance of experience replay database composition in deep reinforcement learning", 2015, Deep Reinforcement Learning Workshop, Advances in Neural Information Processing Systems (NIPS).
- (2015) The Importance of Experience Replay Database Composition in Deep Reinforcement Learning
- De Bruin, T.¹ Kober, J.² Tuyls, K.³ Babuška, R.⁴

9
- 84980041049
- arXiv:1511.05952 [cs.LG]
- T. Schaul, J. Quan, I. Antonoglou, and D. Silver, "Prioritized experience replay", 2015, arXiv:1511.05952 [cs.LG].
- (2015) Prioritized Experience Replay
- Schaul, T.¹ Quan, J.² Antonoglou, I.³ Silver, D.⁴

10
- 0004135065
- Springer
- G. Montavon, G. B. Orr, and K.-R. Müller, Eds., Neural Networks: Tricks of the Trade, 2nd ed., ser. Lecture Notes in Computer Science (LNCS). Springer, 2012, vol. 7700.
- (2012) Neural Networks: Tricks of the Trade, 2nd Ed., Ser. Lecture Notes in Computer Science (LNCS) , vol.7700
- Montavon, G.¹ Orr, G.B.² Müller, K.-R.³

11
- 84919793697
- Deterministic policy gradient algorithms
- D. Silver, G. Lever, N. Heess, T. Degris, D. Wierstra, and M. Riedmiller, "Deterministic policy gradient algorithms", in International Conference on Machine Learning (ICML), 2014, pp. 387-395.
- (2014) International Conference on Machine Learning (ICML) , pp. 387-395
- Silver, D.¹ Lever, G.² Heess, N.³ Degris, T.⁴ Wierstra, D.⁵ Riedmiller, M.⁶

12
- 84908477926
- arXiv:1312.6211 [stat.ML]
- I. Goodfellow, M. Mirza, D. Xiao, A. Courville, and Y. Bengio, "An empirical investigation of catastrophic forgetting in gradient-based neural networks", 2013, arXiv:1312.6211 [stat.ML].
- (2013) An Empirical Investigation of Catastrophic Forgetting in Gradient-based Neural Networks
- Goodfellow, I.¹ Mirza, M.² Xiao, D.³ Courville, A.⁴ Bengio, Y.⁵

13
- 0001473437
- On estimation of a probability density function and mode
- E. Parzen, "On estimation of a probability density function and mode", The Annals of Mathematical Statistics, vol. 33, no. 3, pp. 1065-1076, 1962.
- (1962) The Annals of Mathematical Statistics , vol.33 , Issue.3 , pp. 1065-1076
- Parzen, E.¹

14
- 77949422045
- Robotic magnetic steering and locomotion of capsule endoscope for diagnostic and surgical endoluminal procedures
- G. Ciuti, P. Valdastri, A. Menciassi, and P. Dario, "Robotic magnetic steering and locomotion of capsule endoscope for diagnostic and surgical endoluminal procedures", Robotica, vol. 28, no. 2, pp. 199-207, 2010.
- (2010) Robotica , vol.28 , Issue.2 , pp. 199-207
- Ciuti, G.¹ Valdastri, P.² Menciassi, A.³ Dario, P.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.