|
Volumn 27, Issue 2, 2002, Pages 294-311
|
Q-learning for risk-sensitive control
|
Author keywords
Dynamic programming; Markov decision processes; Q learning; Reinforcement learning; Risk sensitive control; Stochastic approximation
|
Indexed keywords
BOUNDARY CONDITIONS;
COMPUTER SIMULATION;
CONVERGENCE OF NUMERICAL METHODS;
DECISION THEORY;
DYNAMIC PROGRAMMING;
LEARNING ALGORITHMS;
LEARNING SYSTEMS;
MATRIX ALGEBRA;
ORDINARY DIFFERENTIAL EQUATIONS;
RISK ASSESSMENT;
THEOREM PROVING;
MARKOV DECISION PROCESSES;
Q-LEARNING ALGORITHM;
REINFORCEMENT LEARNING;
RISK SENSITIVE CONTROL;
STOCHASTIC APPROXIMATION ALGORITHM;
MARKOV PROCESSES;
|
EID: 0036577013
PISSN: 0364765X
EISSN: None
Source Type: Journal
DOI: 10.1287/moor.27.2.294.324 Document Type: Article |
Times cited : (156)
|
References (35)
|