|
Volume 44, Issue 4, 2008, Pages 1111-1119
|
New algorithms of the Q-learning type
|
Author keywords
Markov decision processes; Q learning; Reinforcement learning; SPSA; Two timescale stochastic approximation
|
Indexed keywords
APPROXIMATION ALGORITHMS;
LEARNING ALGORITHMS;
MARKOV PROCESSES;
ROUTING ALGORITHMS;
TELECOMMUNICATION NETWORKS;
MARKOV DECISION PROCESSES;
Q-LEARNING;
TWO TIMESCALE STOCHASTIC APPROXIMATION;
REINFORCEMENT LEARNING;
|
EID: 41049095293
PISSN: 00051098
EISSN: None
Source Type: Journal
DOI: 10.1016/j.automatica.2007.09.009
Document Type: Article
|
Times cited: 24
|
References (10)
|