|
Volume 22, Issues 1-3, 1996, Pages 159-195
|
Average reward reinforcement learning: foundations, algorithms, and empirical results
|
Author keywords
Markov decision processes; Reinforcement learning
|
Indexed keywords
AUTOMATA THEORY;
DYNAMIC PROGRAMMING;
LEARNING ALGORITHMS;
MARKOV PROCESSES;
OPTIMAL CONTROL SYSTEMS;
PERFORMANCE;
SENSITIVITY ANALYSIS;
BIAS OPTIMAL;
GAIN OPTIMAL;
LEARNING AUTOMATA;
MARKOV DECISION PROCESSES;
REINFORCEMENT LEARNING;
LEARNING SYSTEMS;
|
EID: 0029752592
PISSN: 08856125
EISSN: None
Source Type: Journal
DOI: 10.1007/BF00114727
Document Type: Article
|
Times cited: 356
|