|
Volumn , Issue , 2008, Pages 560-567
|
A worst-case comparison between temporal difference and residual gradient with linear function approximation
|
Author keywords
[No Author keywords available]
|
Indexed keywords
LEARNING ALGORITHMS;
MACHINE LEARNING;
MARKOV PROCESSES;
FUNCTIONS;
INTERNET;
LEARNING SYSTEMS;
PROBABILITY DENSITY FUNCTION;
ROBOT LEARNING;
FORMAL ANALYSIS;
FUNCTION APPROXIMATION;
LINEAR FUNCTIONS;
NON-PROBABILISTIC;
POLICY EVALUATION;
PREDICTION ERRORS;
RESIDUAL GRADIENT;
TEMPORAL DIFFERENCES;
APPROXIMATION ALGORITHMS;
FORMAL ANALYSIS;
FUNCTION APPROXIMATIONS;
LINEAR FUNCTIONS;
MARKOV CHAINS;
MARKOVIAN;
NON-PROBABILISTIC;
ON-LINE LEARNING;
POLICY EVALUATIONS;
PREDICTION ERRORS;
RESIDUAL GRADIENTS;
TEMPORAL DIFFERENCES;
|
EID: 56449125197
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: 10.1145/1390156.1390227 Document Type: Conference Paper |
Times cited : (22)
|
References (15)
|