|
Volumn 42, Issue 3, 2001, Pages 241-267
|
On the convergence of temporal-difference learning with linear function approximation
|
Author keywords
[No Author keywords available]
|
Indexed keywords
APPROXIMATION THEORY;
ASYMPTOTIC STABILITY;
CONVERGENCE OF NUMERICAL METHODS;
DYNAMIC PROGRAMMING;
ERROR ANALYSIS;
FUNCTION EVALUATION;
MARKOV PROCESSES;
STATE SPACE METHODS;
LINEAR FUNCTION APPROXIMATION;
NEURODYNAMIC PROGRAMMING;
POSITIVE HARRIS RECURRENCE;
REINFORCEMENT LEARNING;
TEMPORAL DIFFERENCE LEARNING;
LEARNING ALGORITHMS;
|
EID: 0035283402
PISSN: 08856125
EISSN: None
Source Type: Journal
DOI: 10.1023/A:1007609817671 Document Type: Article |
Times cited : (52)
|
References (21)
|