|
Volumn , Issue , 2008, Pages
|
Optimistic Linear Programming gives logarithmic regret for irreducible MDPs
|
Author keywords
[No Author keywords available]
|
Indexed keywords
ARTIFICIAL INTELLIGENCE;
MARKOV PROCESSES;
AVERAGE REWARD;
LINEAR PROGRAMS;
LINEAR-PROGRAMMING;
MARKOV DECISION PROCESSES;
OPTIMAL POLICIES;
OPTIMISTICS;
REGRET BOUNDS;
SIMPLE++;
STATE TRANSITION PROBABILITIES;
TRANSITION PROBABILITIES;
LINEAR PROGRAMMING;
|
EID: 85162041468
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper |
Times cited : (84)
|
References (9)
|