|
Volume 35, Issue 11, 1999, Pages 1799-1808
|
Average cost temporal-difference learning
|
Author keywords
[No Author keywords available]
|
Indexed keywords
APPROXIMATION THEORY;
BOUNDARY CONDITIONS;
COMPUTER SIMULATION;
CONVERGENCE OF NUMERICAL METHODS;
MARKOV PROCESSES;
TABLE LOOKUP;
APPROXIMATION ERROR;
AVERAGE COST;
NEURODYNAMIC PROGRAMMING;
REINFORCEMENT LEARNING;
TEMPORAL DIFFERENCE;
DYNAMIC PROGRAMMING;
|
EID: 0033221519
PISSN: 00051098
EISSN: None
Source Type: Journal
DOI: 10.1016/S0005-1098(99)00099-0
Document Type: Article
|
Times cited: (144)
|
References (11)
|