Volume 105, Issue 3, 2000, Pages 589-608
|
On the existence of fixed points for approximate value iteration and temporal-difference learning
Author keywords
Dynamic programming; Neurodynamic programming; Reinforcement learning; Temporal difference learning; Value iteration
|
Indexed keywords
DYNAMIC PROGRAMMING;
ITERATIVE METHODS;
ORDINARY DIFFERENTIAL EQUATIONS;
CURSE OF DIMENSIONALITY;
DYNAMIC PROGRAMS;
FIXED POINTS;
NEURO-DYNAMIC PROGRAMMING;
REINFORCEMENT LEARNINGS;
SIMPLE ALGORITHM;
SIMPLER ALGORITHMS;
TEMPORAL DIFFERENCE LEARNING;
VALUE ITERATION;
VALUE ITERATION ALGORITHM;
REINFORCEMENT LEARNING;
|
EID: 0034342516
PISSN: 00223239
EISSN: None
Source Type: Journal
DOI: 10.1023/A:1004641123405
Document Type: Article
Times cited: 67
|
References (11)