SCOPUS 정보 검색 플랫폼

Proceedings - 9th International Conference on Machine Learning and Applications, ICMLA 2010

Volumn , Issue , 2010, Pages 401-406

Ensembles of neural networks for robust reinforcement learning

(2) Hans, Alexander a Udluft, Steffen b

a ILMENAU UNIVERSITY OF TECHNOLOGY (Germany)

b SIEMENS AG (Germany)

Author keywords

[No Author keywords available]

Indexed keywords

BENCHMARK APPLICATIONS; FUNCTION APPROXIMATORS; LEARNING PROCESS; LOCAL MINIMUMS; MAJORITY VOTING; NEAR-OPTIMAL POLICIES; NETWORK TOPOLOGY; OPTIMAL CONTROL PROBLEM; OVERFITTING; SINGLE NETWORKS; TRAINING PROCESS;

BENCHMARKING; ELECTRIC NETWORK TOPOLOGY; ITERATIVE METHODS; LEARNING ALGORITHMS; OPTIMIZATION; REINFORCEMENT LEARNING;

NEURAL NETWORKS;

EID: 79952394156 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICMLA.2010.66 Document Type: Conference Paper

Times cited : (35)

References (19)

1
- 0004102479
- MIT Press
- R. Sutton and A. Barto, Reinforcement Learning: An Introduction. MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, A.²

2
- 84880694195
- Stable function approximation in dynamic programming
- G. J. Gordon, "Stable function approximation in dynamic programming," in Proc. of the Int. Conf. on Machine Learning, 1995.
- Proc. of the Int. Conf. on Machine Learning, 1995
- Gordon, G.J.¹

3
- 21844465127
- Tree-based batch mode reinforcement learning
- D. Ernst, P. Geurts, and L. Wehenkel, "Tree-based batch mode reinforcement learning," Journal of Machine Learning Research, vol. 6, pp. 503-556, 2005.
- (2005) Journal of Machine Learning Research , vol.6 , pp. 503-556
- Ernst, D.¹ Geurts, P.² Wehenkel, L.³

4
- 4644323293
- Least-squares policy iteration
- M. G. Lagoudakis and R. Parr, "Least-squares policy iteration," Journal of Machine Learning Research, pp. 1107-1149, 2003.
- (2003) Journal of Machine Learning Research , pp. 1107-1149
- Lagoudakis, M.G.¹ Parr, R.²

5
- 33646398129
- Neural fitted Q-iteration - First experiences with a data efficient neural reinforcement learning method
- M. Riedmiller, "Neural fitted Q-iteration - first experiences with a data efficient neural reinforcement learning method," in Proc. of the 16th European Conf. on Machine Learning, 2005, pp. 317-328.
- Proc. of the 16th European Conf. on Machine Learning, 2005 , pp. 317-328
- Riedmiller, M.¹

6
- 80053403826
- Ensemble methods in machine learning
- T. Dietterich, "Ensemble methods in machine learning," Multiple classifier systems, pp. 1-15, 2000.
- (2000) Multiple Classifier Systems , pp. 1-15
- Dietterich, T.¹

7
- 49049105169
- Ensemble algorithms in reinforcement learning
- M. Wiering and H. van Hasselt, "Ensemble algorithms in reinforcement learning." IEEE transactions on systems, man, and cybernetics, vol. 38, no. 4, 2008.
- (2008) IEEE Transactions on Systems, Man, and Cybernetics , vol.38 , Issue.4
- Wiering, M.¹ Van Hasselt, H.²

8
- 0035478854
- Random forests
- L. Breiman, "Random forests," Machine learning, vol. 45, no. 1, pp. 5-32, 2001.
- (2001) Machine Learning , vol.45 , Issue.1 , pp. 5-32
- Breiman, L.¹

9
- 84887011356
- Neural rewards regression for near-optimal policy identification in Markovian and partial observable environments
- D. Schneegass, S. Udluft, and T. Martinetz, "Neural rewards regression for near-optimal policy identification in Markovian and partial observable environments," in Proc. of the European Symposium on Artificial Neural Networks, 2007.
- Proc. of the European Symposium on Artificial Neural Networks, 2007
- Schneegass, D.¹ Udluft, S.² Martinetz, T.³

10
- 84898995808
- Reinforcement learning with function approximation converges to a region
- G. J. Gordon, "Reinforcement learning with function approximation converges to a region," Advances in neural information processing systems, pp. 1040-1046, 2001.
- (2001) Advances in Neural Information Processing Systems , pp. 1040-1046
- Gordon, G.J.¹

11
- 79952421861
- Reducing policy degradation in neuro-dynamic programming
- T. Gabel and M. Riedmiller, "Reducing policy degradation in neuro-dynamic programming," Proc. of the European Symposium on Artificial Neural Networks, 2006.
- Proc. of the European Symposium on Artificial Neural Networks, 2006
- Gabel, T.¹ Riedmiller, M.²

12
- 0003270924
- Issues in using function approximation for reinforcement learning
- S. Thrun and A. Schwartz, "Issues in using function approximation for reinforcement learning," in Proc. of the 1993 Connectionist Models Summer School, Hillsdale, NJ, 1993.
- (1993) Proc. of the 1993 Connectionist Models Summer School, Hillsdale, NJ
- Thrun, S.¹ Schwartz, A.²

13
- 0346242001
- Ph.D. dissertation, The Australian National University
- C. Gaskett, "Q-learning for robot control," Ph.D. dissertation, The Australian National University, 2002.
- (2002) Q-learning for Robot Control
- Gaskett, C.¹

14
- 0030211964
- Bagging predictors
- L. Breiman, "Bagging predictors," Machine learning, vol. 24, no. 2, pp. 123-140, 1996.
- (1996) Machine Learning , vol.24 , Issue.2 , pp. 123-140
- Breiman, L.¹

15
- 0001963082
- A short introduction to boosting
- Y. Freund, R. Schapire, and N. Abe, "A short introduction to boosting," Journal of the Japanese Society for Artificial Intelligence, vol. 14, pp. 771-780, 1999.
- (1999) Journal of the Japanese Society for Artificial Intelligence , vol.14 , pp. 771-780
- Freund, Y.¹ Schapire, R.² Abe, N.³

16
- 33749841590
- Modeling large dynamical systems with dynamical consistent neural networks
- S. Haykin, J. Principe, T. Sejnowski, and J. McWhirter, Eds. MIT Press
- H.-G. Zimmermann, R. Grothmann, A. M. Schaefer, and C. Tietz, "Modeling large dynamical systems with dynamical consistent neural networks," in New Directions in Statistical Signal Processing: From Systems to Brain, S. Haykin, J. Principe, T. Sejnowski, and J. McWhirter, Eds. MIT Press, 2006, pp. 203-242.
- (2006) New Directions in Statistical Signal Processing: From Systems to Brain , pp. 203-242
- Zimmermann, H.-G.¹ Grothmann, R.² Schaefer, A.M.³ Tietz, C.⁴

17
- 34548763441
- A recurrent control neural network for data efficient reinforcement learning
- A. M. Schaefer, S. Udluft, and H.-G. Zimmermann, "A recurrent control neural network for data efficient reinforcement learning," in Proc. of the IEEE Int. Symposium on Approximate Dynamic Programming and Reinforcement Learning, Honolulu, HI, 2007.
- (2007) Proc. of the IEEE Int. Symposium on Approximate Dynamic Programming and Reinforcement Learning, Honolulu, HI
- Schaefer, A.M.¹ Udluft, S.² Zimmermann, H.-G.³

18
- 79952408018
- personal communication
- H. van Hasselt, personal communication, 2010.
- (2010)
- Van Hasselt, H.¹

19
- 0009589301
- How to train neural networks
- G. B. Orr and K.-R. Müller, Eds.
- R. Neuneier and H.-G. Zimmermann, "How to train neural networks," in Neural Networks: Tricks of the Trade, G. B. Orr and K.-R. Müller, Eds., 1996, pp. 373-423.
- (1996) Neural Networks: Tricks of the Trade , pp. 373-423
- Neuneier, R.¹ Zimmermann, H.-G.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.