2007, Pages 42-48

FF+FPG: Guiding a Policy-Gradient planner

Author keywords

[No Author keywords available]

Indexed keywords

IMPORTANCE SAMPLING; STOCHASTIC SYSTEMS; TEACHING

EID: 57749179024     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 17

References (20)
  • 5
    • 0001240715
    • Importance sampling for stochastic simulations
    • Glynn, P., and Iglehart, D. 1989. Importance sampling for stochastic simulations. Management Science 35(11):1367-1392.
    • (1989) Management Science , vol.35 , Issue.11 , pp. 1367-1392
    • Glynn, P.1    Iglehart, D.2
  • 6
    • 0036377352
    • The FF planning system: Fast plan generation through heuristic search
    • Hoffmann, J., and Nebel, B. 2001. The FF planning system: Fast plan generation through heuristic search. Journal of Artificial Intelligence Research 14:253-302.
    • (2001) Journal of Artificial Intelligence Research , vol.14 , pp. 253-302
    • Hoffmann, J.1    Nebel, B.2
  • 7
    • 0035441926
    • FF: The fast-forward planning system
    • Hoffmann, J. 2001. FF: The fast-forward planning system. AI Magazine 22(3):57-62.
    • (2001) AI Magazine , vol.22 , Issue.3 , pp. 57-62
    • Hoffmann, J.1
  • 10
    • 33746878798
    • Exploration in gradient-based reinforcement learning
    • Memo 2001-003, MIT, AI lab
    • Meuleau, N.; Peshkin, L.; and Kim, K. 2001. Exploration in gradient-based reinforcement learning. Technical Report AI Memo 2001-003, MIT - AI lab.
    • (2001) Technical Report AI
    • Meuleau, N.1    Peshkin, L.2    Kim, K.3
  • 14
    • 0005942760
    • Importance sampling for reinforcement learning with multiple objectives
    • Memo 2001-003, MIT AI Lab
    • Shelton, C. 2001. Importance sampling for reinforcement learning with multiple objectives. Technical Report AI Memo 2001-003, MIT AI Lab.
    • (2001) Technical Report AI
    • Shelton, C.1
  • 16
    • 0000337576
    • Simple statistical gradient-following algorithms for connectionist reinforcement learning
    • Williams, R. 1992. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning 8(3):229-256.
    • (1992) Machine Learning , vol.8 , Issue.3 , pp. 229-256
    • Williams, R.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.