메뉴 건너뛰기




Volumn 1, Issue , 2003, Pages 131-138

Design for an Optimal Probe

Author keywords

[No Author keywords available]

Indexed keywords

LEARNING ALGORITHMS; LEARNING SYSTEMS; MARKOV PROCESSES; OPTIMIZATION; PROBABILISTIC LOGICS; PROCESS CONTROL; UNCERTAIN SYSTEMS;

EID: 1942421168     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (42)

References (7)
  • 7
    • 84898939480 scopus 로고    scopus 로고
    • Policy gradient methods for reinforcement learning with function approximation
    • S.A. Solla, T. K. Leen, & K. R. Muller (Eds.). MIT Press, Cambridge, MA
    • Sutton, R. S., McAllester, D., Singh, S., & Mansour, Y. (2000). Policy gradient methods for reinforcement learning with function approximation. In S.A. Solla, T. K. Leen, & K. R. Muller (Eds.), Advances in Neural Information Processing Systems-12 (pp. 1057-1063). MIT Press, Cambridge, MA.
    • (2000) Advances in Neural Information Processing Systems-12 , pp. 1057-1063
    • Sutton, R.S.1    McAllester, D.2    Singh, S.3    Mansour, Y.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.