|
Volume 47, Issue 2-3, 2002, Pages 235-256
|
Finite-time analysis of the multiarmed bandit problem
|
Author keywords
Adaptive allocation rules; Bandit problems; Finite horizon regret
|
Indexed keywords
COMPUTATION THEORY;
PROBLEM SOLVING;
STATISTICS;
THEOREM PROVING;
FINITE-TIME ANALYSIS;
LEARNING SYSTEMS;
|
EID: 0036568025
PISSN: 08856125
EISSN: None
Source Type: Journal
DOI: 10.1023/A:1013689704352
Document Type: Article
|
Times cited: 6240
|
References (12)
|