|
Volumn , Issue , 2014, Pages 459-467
|
Bandits with switching costs: T2/3 regret
|
Author keywords
Lower bounds; Multi armed Bandit; Online learning; Switching costs
|
Indexed keywords
COSTS;
LEARNING ALGORITHMS;
MARKOV PROCESSES;
STATISTICS;
TIME SWITCHES;
ADAPTIVE ADVERSARY;
BANDIT FEEDBACKS;
LOWER BOUNDS;
MARKOV DECISION PROCESSES;
MULTI ARMED BANDIT;
MULTI-ARMED BANDIT PROBLEM;
ONLINE LEARNING;
SWITCHING COSTS;
E-LEARNING;
|
EID: 84904307224
PISSN: 07378017
EISSN: None
Source Type: Conference Proceeding
DOI: 10.1145/2591796.2591868 Document Type: Conference Paper |
Times cited : (113)
|
References (17)
|