메뉴 건너뛰기




Volumn , Issue , 2010, Pages 401-406

Ensembles of neural networks for robust reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

BENCHMARK APPLICATIONS; FUNCTION APPROXIMATORS; LEARNING PROCESS; LOCAL MINIMUMS; MAJORITY VOTING; NEAR-OPTIMAL POLICIES; NETWORK TOPOLOGY; OPTIMAL CONTROL PROBLEM; OVERFITTING; SINGLE NETWORKS; TRAINING PROCESS;

EID: 79952394156     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICMLA.2010.66     Document Type: Conference Paper
Times cited : (35)

References (19)
  • 5
    • 33646398129 scopus 로고    scopus 로고
    • Neural fitted Q-iteration - First experiences with a data efficient neural reinforcement learning method
    • M. Riedmiller, "Neural fitted Q-iteration - first experiences with a data efficient neural reinforcement learning method," in Proc. of the 16th European Conf. on Machine Learning, 2005, pp. 317-328.
    • Proc. of the 16th European Conf. on Machine Learning, 2005 , pp. 317-328
    • Riedmiller, M.1
  • 6
  • 8
    • 0035478854 scopus 로고    scopus 로고
    • Random forests
    • L. Breiman, "Random forests," Machine learning, vol. 45, no. 1, pp. 5-32, 2001.
    • (2001) Machine Learning , vol.45 , Issue.1 , pp. 5-32
    • Breiman, L.1
  • 10
    • 84898995808 scopus 로고    scopus 로고
    • Reinforcement learning with function approximation converges to a region
    • G. J. Gordon, "Reinforcement learning with function approximation converges to a region," Advances in neural information processing systems, pp. 1040-1046, 2001.
    • (2001) Advances in Neural Information Processing Systems , pp. 1040-1046
    • Gordon, G.J.1
  • 13
    • 0346242001 scopus 로고    scopus 로고
    • Ph.D. dissertation, The Australian National University
    • C. Gaskett, "Q-learning for robot control," Ph.D. dissertation, The Australian National University, 2002.
    • (2002) Q-learning for Robot Control
    • Gaskett, C.1
  • 14
    • 0030211964 scopus 로고    scopus 로고
    • Bagging predictors
    • L. Breiman, "Bagging predictors," Machine learning, vol. 24, no. 2, pp. 123-140, 1996.
    • (1996) Machine Learning , vol.24 , Issue.2 , pp. 123-140
    • Breiman, L.1
  • 18
    • 79952408018 scopus 로고    scopus 로고
    • personal communication
    • H. van Hasselt, personal communication, 2010.
    • (2010)
    • Van Hasselt, H.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.