메뉴 건너뛰기




Volumn , Issue , 2008, Pages 143-148

Safe exploration for reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

CONTROLLED SYSTEM; CRITICAL STATE; SAFETY CONSTRAINT; SAFETY DEGREE; SAFETY FUNCTIONS;

EID: 79956136559     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (134)

References (7)
  • 2
    • 51749100839 scopus 로고    scopus 로고
    • A neural reinforcement learning approach to gas turbine control
    • Joint Conf. on Neural Networks, Orlando, MIT Press
    • A.M. Schaefer, D. Schneegass, V. Sterzing, and S. Udluft. A neural reinforcement learning approach to gas turbine control. In Proc. of the 20th Int. Joint Conf. on Neural Networks, Orlando, 2007. MIT Press.
    • (2007) In Proc. of the 20th Int
    • Schaefer, A.M.1    Schneegass, D.2    Sterzing, V.3    Udluft, S.4
  • 3
    • 85120861483 scopus 로고
    • Consideration of risk in reinforcement learning
    • Morgan Kaufmann
    • M. Heger. Consideration of risk in reinforcement learning. In Proc. of the 11th Int. Conf. on Machine Learning, pages 105-111. Morgan Kaufmann, 1994.
    • (1994) Proc. of the 11th Int. Conf. On Machine Learning , pp. 105-111
    • Heger, M.1
  • 5
    • 13444290317 scopus 로고    scopus 로고
    • Reinforcement learning with bounded risk
    • Morgan Kaufmann, San Francisco, CA
    • P. Geibel. Reinforcement learning with bounded risk. In Proc. of the 18th Int. Conf. on Machine Learning, pages 162-169. Morgan Kaufmann, San Francisco, CA, 2001.
    • (2001) Proc. of the 18th Int. Conf. On Machine Learning , pp. 162-169
    • Geibel, P.1
  • 7
    • 33646398129 scopus 로고    scopus 로고
    • Neural fitted q-iteration - first experiences with a data efficient neural reinforcement learning method
    • M. Riedmiller. Neural fitted q-iteration - first experiences with a data efficient neural reinforcement learning method. In Proc. of the 16th European Conf. on Machine Learning, pages 317-328, 2005.
    • (2005) Proc. of the 16th European Conf. On Machine Learning , pp. 317-328
    • Riedmiller, M.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.