Volume 40, Issue 3, 2002, Pages 681-698
|
Learning algorithms for Markov decision processes with average cost
|
Author keywords
Average cost control; Controlled Markov chains; Dynamic programming; Q learning; Simulation based algorithms; Stochastic approximation
|
Indexed keywords
COMPUTER SIMULATION;
COSTS;
DECISION THEORY;
DYNAMIC PROGRAMMING;
LEARNING ALGORITHMS;
OPTIMAL CONTROL SYSTEMS;
AVERAGE COST CONTROL;
MARKOV PROCESSES;
|
EID: 0036287773
PISSN: 03630129
EISSN: None
Source Type: Journal
DOI: 10.1137/S0363012999361974
Document Type: Article
|
Times cited: (190)
|
References (29)
|