|
Volumn 7188 LNAI, Issue , 2012, Pages 102-114
|
Regularized least squares temporal difference learning with nested ℓ 2 and ℓ 1 penalization
|
Author keywords
[No Author keywords available]
|
Indexed keywords
APPROXIMATE VALUE FUNCTION;
APPROXIMATION SPACES;
CENTRAL PROBLEMS;
HIGH-DIMENSIONAL FEATURE SPACE;
LEAST SQUARE;
NUMBER OF SAMPLES;
OVERFITTING;
POLICY EVALUATION;
PREDICTION PERFORMANCE;
PROJECTION OPERATOR;
REGULARIZED LEAST SQUARES;
REGULARIZED METHOD;
TEMPORAL DIFFERENCE LEARNING;
ARTIFICIAL INTELLIGENCE;
REINFORCEMENT LEARNING;
|
EID: 84861687861
PISSN: 03029743
EISSN: 16113349
Source Type: Book Series
DOI: 10.1007/978-3-642-29946-9_13 Document Type: Conference Paper |
Times cited : (19)
|
References (15)
|