|
Volumn , Issue , 2002, Pages
|
Model-free least squares policy iteration
|
Author keywords
[No Author keywords available]
|
Indexed keywords
ALGORITHMS;
ITERATIVE METHODS;
REINFORCEMENT LEARNING;
CONTROL PROBLEMS;
FUNCTION APPROXIMATION;
LEAST SQUARES POLICY ITERATIONS;
LEAST-SQUARES TEMPORAL DIFFERENCES;
NEW APPROACHES;
POLICY ITERATION;
PREDICTION PROBLEM;
TEMPORAL-DIFFERENCE ALGORITHM;
LEAST SQUARES APPROXIMATIONS;
|
EID: 84898963274
PISSN: 10495258
EISSN: None
Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper |
Times cited : (28)
|
References (13)
|