|
Volumn 76, Issue 2-3, 2009, Pages 243-256
|
Hybrid least-squares algorithms for approximate policy evaluation
|
Author keywords
Markov decision processes; Reinforcement learning
|
Indexed keywords
FIXED POINT METHODS;
FIXED-POINT ALGORITHMS;
GEOMETRIC INTERPRETATION;
HYBRID ALGORITHMS;
LARGE DOMAIN;
LEAST-SQUARES ALGORITHMS;
MARKOV DECISION PROCESSES;
OPTIMIZATION CRITERIA;
POLICY EVALUATION;
POLICY ITERATION;
RESIDUAL METHOD;
TARGET VALUES;
CIRCUIT THEORY;
MARKOV PROCESSES;
OPTIMIZATION;
REINFORCEMENT;
REINFORCEMENT LEARNING;
ALGORITHMS;
|
EID: 68949099445
PISSN: 08856125
EISSN: 15730565
Source Type: Journal
DOI: 10.1007/s10994-009-5128-4 Document Type: Conference Paper |
Times cited : (12)
|
References (15)
|