|
Volumn 54, Issue 3, 2005, Pages 207-213
|
An actor-critic algorithm for constrained Markov decision processes
|
Author keywords
Actor critic algorithms; Constrained Markov decision processes; Envelope theorem; Reinforcement learning; Stochastic approximation
|
Indexed keywords
ALGORITHMS;
APPROXIMATION THEORY;
DECISION THEORY;
DYNAMIC PROGRAMMING;
LEARNING SYSTEMS;
THEOREM PROVING;
ACTOR-CRITIC ALGORITHMS;
CONSTRAINED MARKOV DECISION PROCESSES;
ENVELOPE THEOREM;
REINFORCEMENT LEARNING;
STOCHASTIC APPROXIMATION;
MARKOV PROCESSES;
|
EID: 13244278201
PISSN: 01676911
EISSN: None
Source Type: Journal
DOI: 10.1016/j.sysconle.2004.08.007 Document Type: Article |
Times cited : (211)
|
References (17)
|