Volume 7700, Lecture Notes in Computer Science, 2012, Pages 735-757

10 Steps and some tricks to set up neural reinforcement controllers

Author keywords

Batch reinforcement learning; Fitted Q; Learning control; Neural reinforcement learning

Indexed keywords

BENCHMARKING; CONTROLLERS; LEARNING SYSTEMS;

EID: 84872531075     Print ISSN: 0302-9743     Electronic ISSN: 1611-3349     Source Type: Book Series
DOI: 10.1007/978-3-642-35289-8_39     Document Type: Article
Times cited: 27

References (32)
  • 4
  • 6
    • Gabel, T., Lutz, C., Riedmiller, M.: Improved Neural Fitted Q Iteration Applied to a Novel Computer Gaming and Learning Benchmark. In: Proceedings of the IEEE Symposium on Approximate Dynamic Programming and Reinforcement Learning (ADPRL 2011), Paris, France. IEEE Press (April 2011)
  • 11
    • Hafner, R., Riedmiller, M.: Reinforcement learning in feedback control. Machine Learning 27(1), 55-74 (2011). DOI: 10.1007/s10994-011-5235-x
  • 12
    • Hans, A., Schneegass, D., Schafer, A.M., Udluft, S.: Safe exploration for reinforcement learning. In: ESANN, pp. 143-148 (2008)
  • 14
    • LeCun, Y., Bottou, L., Orr, G.B., Muller, K.-R.: Efficient BackProp. In: Orr, G.B., Muller, K.-R. (eds.) NIPS-WS 1996. LNCS, vol. 1524, pp. 9-50. Springer, Heidelberg (1998)
  • 15
  • 17
    • Riedmiller, M., Braun, H.: A direct adaptive method for faster backpropagation learning: The RPROP algorithm. In: Ruspini, H. (ed.) Proceedings of the IEEE International Conference on Neural Networks (ICNN), San Francisco, pp. 586-591 (1993)
  • 18
    • Riedmiller, M., Gabel, T.: Distributed Policy Search Reinforcement Learning for Job-Shop Scheduling Tasks. International Journal of Production Research 50(1) (2012); available online since May 2011
  • 22
    • Riedmiller, M.: Generating continuous control signals for reinforcement controllers using dynamic output elements. In: European Symposium on Artificial Neural Networks, ESANN 1997, Bruges (1997)
  • 23
    • Riedmiller, M.: Neural Fitted Q Iteration - First Experiences with a Data Efficient Neural Reinforcement Learning Method. In: Gama, J., Camacho, R., Brazdil, P.B., Jorge, A.M., Torgo, L. (eds.) ECML 2005. LNCS (LNAI), vol. 3720, pp. 317-328. Springer, Heidelberg (2005)
  • 24
    • Riedmiller, M.: Neural reinforcement learning to swing-up and balance a real pole. In: Proc. of the Int. Conference on Systems, Man and Cybernetics, Big Island, USA (October 2005)
  • 32
    • Walsh, T.J., Nouri, A., Li, L., Littman, M.L.: Planning and Learning in Environments with Delayed Feedback. In: Kok, J.N., Koronacki, J., Lopez de Mantaras, R., Matwin, S., Mladenič, D., Skowron, A. (eds.) ECML 2007. LNCS (LNAI), vol. 4701, pp. 442-453. Springer, Heidelberg (2007)


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.