|
Volumn WS-06-11, Issue , 2006, Pages 50-56
|
PAC reinforcement learning bounds for RTDP and Rand-RTDP
|
Author keywords
[No Author keywords available]
|
Indexed keywords
CONVERGENCE OF NUMERICAL METHODS;
DECISION MAKING;
LEARNING ALGORITHMS;
MARKOV PROCESSES;
POLYNOMIAL APPROXIMATION;
PROBABILITY DISTRIBUTIONS;
REAL TIME SYSTEMS;
PROBABLY APPROXIMATELY CORRECT (PAC);
REAL-TIME DYNAMIC PROGRAMMING (RTDP);
DYNAMIC PROGRAMMING;
|
EID: 33845972675
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper |
Times cited : (7)
|
References (14)
|