SCOPUS Information Retrieval Platform

Volume , Issue , 1997, Pages 1068-1074

On-line policy improvement using Monte-Carlo search

Author keywords

[No Author keywords available]

Indexed keywords

ADAPTIVE CONTROL SYSTEMS; INTELLIGENT SYSTEMS; SUPERCOMPUTERS;

ADAPTIVE CONTROL; ADAPTIVE CONTROLLERS; ERROR RATE; MONTE CARLO ALGORITHMS; MONTE-CARLO SIMULATIONS; SUBSTANTIAL REDUCTION;

ALGORITHMS;

EID: 84898992015 PISSN: 10495258 EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (170)

References (8)

1
- 0003565783
- Athena Scientific, Belmont MA
- D. P. Bertsekas, Dynamic Programming and Optimal Control. Athena Scientific, Belmont, MA (1995).
- (1995) Dynamic Programming and Optimal Control
- Bertsekas, D.P.¹

3
- 0000218399
- Programming a computer for playing chess
- C. E. Shannon, "Programming a computer for playing chess." Philosophical Magazine 41, 265-275 (1950).
- (1950) Philosophical Magazine , vol.41 , pp. 265-275
- Shannon, C.E.¹

4
- 33847202724
- Learning to predict by the methods of temporal differences
- R. S. Sutton, "Learning to predict by the methods of temporal differences." Machine Learning 3, 9-44 (1988).
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, R.S.¹

6
- 0001046225
- Practical issues in temporal difference learning
- G. Tesauro, "Practical issues in temporal difference learning." Machine Learning 8, 257-277 (1992).
- (1992) Machine Learning , vol.8 , pp. 257-277
- Tesauro, G.¹

7
- 0029276036
- Temporal difference learning and TD-Gammon
- G. Tesauro, "Temporal difference learning and TD-Gammon." Comm. of the ACM, 38:3, 58-67 (1995).
- (1995) Comm. of the ACM , vol.38 , Issue.3 , pp. 58-67
- Tesauro, G.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.