|
Volumn 3, Issue , 2002, Pages 3367-3371
|
Gradient-based policy iteration: An example
|
Author keywords
Discrete event dynamic systems; Markov decision processes; Perturbation analysis; Poisson equations; Potentials; Q learning; W factors
|
Indexed keywords
GRADIENT METHODS;
ITERATIVE METHODS;
MARKOV PROCESSES;
OPTIMIZATION;
PERTURBATION TECHNIQUES;
POISSON EQUATION;
SENSITIVITY ANALYSIS;
DISCRETE EVENT DYNAMIC SYSTEMS;
REINFORCEMENT LEARNING;
DISCRETE TIME CONTROL SYSTEMS;
|
EID: 0036992818
PISSN: 01912216
EISSN: None
Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper |
Times cited : (11)
|
References (12)
|