Volume 64, Issue 1, 2002, Pages 133-150

Estimation and approximation bounds for gradient-based reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; APPROXIMATION THEORY; ARTIFICIAL INTELLIGENCE; CONVERGENCE OF NUMERICAL METHODS; MARKOV PROCESSES; MATHEMATICAL MODELS; REINFORCEMENT; SET THEORY

EID: 0036477347     PISSN: 00220000     EISSN: None     Source Type: Journal    
DOI: 10.1006/jcss.2001.1793     Document Type: Article
Times cited: 13

References (20)
  • 15
    • 0033904367
    • Nonparametric time series prediction through adaptive model selection
    • (2000) Mach. Learning , vol.39 , pp. 5-34
    • Meir, R.1
  • 20
    • 0000337576
    • Simple statistical gradient-following algorithms for connectionist reinforcement learning
    • (1992) Mach. Learning , vol.8 , pp. 229-256
    • Williams, R.J.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.