Volume 5212 LNAI, Issue PART 2, 2008, Pages 234-249
|
State-dependent exploration for policy gradient methods
|
Author keywords
[No Author keywords available]
|
Indexed keywords
DATABASE SYSTEMS;
GRADIENT METHODS;
LEARNING SYSTEMS;
MAXIMUM LIKELIHOOD ESTIMATION;
PROBLEM SOLVING;
REINFORCEMENT;
REINFORCEMENT LEARNING;
ROBOT LEARNING;
SOLUTIONS;
FASTER CONVERGENCES;
GRADIENT ESTIMATORS;
LIKELIHOOD RATIOS;
POLICY GRADIENT METHODS;
REINFORCEMENT LEARNING ALGORITHMS;
TIME STEPS;
TO MANY;
LEARNING ALGORITHMS;
|
EID: 56049089041
PISSN: 03029743
EISSN: 16113349
Source Type: Book Series
DOI: 10.1007/978-3-540-87481-2_16
Document Type: Conference Paper
|
Times cited: 61
|
References (15)
|