Volume 22, Issue 10, 2009, Pages 1399-1410
|
Adaptive importance sampling for value function approximation in off-policy reinforcement learning
|
Author keywords
Adaptive importance sampling; Efficient sample reuse; Importance weighted cross validation; Off policy reinforcement learning; Policy iteration; Value function approximation
|
Indexed keywords
ADAPTIVE IMPORTANCE SAMPLING;
BIAS AND VARIANCE;
CROSS VALIDATION;
DATA SAMPLE;
EFFICIENT SAMPLE REUSE;
IMPORTANCE SAMPLING;
VALUE FUNCTION APPROXIMATION;
VALUE FUNCTIONS;
EDUCATION;
REINFORCEMENT LEARNING;
REINFORCEMENT;
ALGORITHM;
ARTICLE;
LEARNING;
MATHEMATICAL ANALYSIS;
POLICY;
PRIORITY JOURNAL;
PROBABILITY;
SAMPLING;
SIMULATION;
VALIDATION PROCESS;
ALGORITHMS;
ARTIFICIAL INTELLIGENCE;
DATA INTERPRETATION, STATISTICAL;
MARKOV CHAINS;
MODELS, NEUROLOGICAL;
MODELS, STATISTICAL;
NEURAL NETWORKS (COMPUTER);
PUBLIC POLICY;
REINFORCEMENT (PSYCHOLOGY);
REPRODUCIBILITY OF RESULTS;
|
EID: 70549113878
PISSN: 0893-6080
EISSN: None
Source Type: Journal
DOI: 10.1016/j.neunet.2009.01.002
Document Type: Article
|
Times cited: 45
|
References (19)
|