SCOPUS 정보 검색 플랫폼

Proceedings of the Annual ACM Symposium on Theory of Computing

Volumn , Issue , 2014, Pages 459-467

Bandits with switching costs: T2/3 regret

(4) Dekel, Ofer a Ding, Jian b Koren, Tomer c Peres, Yuval a

a MICROSOFT RESEARCH (United States)

b UNIVERSITY OF CHICAGO (United States)

c TECHNION ISRAEL INSTITUTE OF TECHNOLOGY (Israel)

Author keywords

Lower bounds; Multi armed Bandit; Online learning; Switching costs

Indexed keywords

COSTS; LEARNING ALGORITHMS; MARKOV PROCESSES; STATISTICS; TIME SWITCHES;

ADAPTIVE ADVERSARY; BANDIT FEEDBACKS; LOWER BOUNDS; MARKOV DECISION PROCESSES; MULTI ARMED BANDIT; MULTI-ARMED BANDIT PROBLEM; ONLINE LEARNING; SWITCHING COSTS;

E-LEARNING;

EID: 84904307224 PISSN: 07378017 EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/2591796.2591868 Document Type: Conference Paper

Times cited : (113)

References (17)

1
- 84867129684
- Online bandit learning against an adaptive adversary: From regret to policy regret
- R. Arora, O. Dekel, and A. Tewari. Online bandit learning against an adaptive adversary: from regret to policy regret. In Proceedings of the Twenty-Ninth International Conference on Machine Learning, 2012.
- Proceedings of the Twenty-Ninth International Conference on Machine Learning , vol.2012
- Arora, R.¹ Dekel, O.² Tewari, A.³

2
- 84898079018
- Minimax policies for adversarial and stochastic bandits
- J.-Y. Audibert, S. Bubeck, et al. Minimax policies for adversarial and stochastic bandits. In Proceedings of the 22th annual conference on learning theory (COLT), pages 217-226, 2009.
- (2009) Proceedings of the 22th Annual Conference on Learning Theory (COLT) , pp. 217-226
- Audibert, J.-Y.¹ Bubeck, S.²

3
- 0037709910
- The nonstochastic multiarmed bandit problem
- P. Auer, N. Cesa-Bianchi, Y. Freund, and R. Schapire. The nonstochastic multiarmed bandit problem. SIAM Journal on Computing, 32(1):48-77, 2002.
- (2002) SIAM Journal on Computing , vol.32 , Issue.1 , pp. 48-77
- Auer, P.¹ Cesa-Bianchi, N.² Freund, Y.³ Schapire, R.⁴

4
- 84926078662
- Cambridge University Press
- N. Cesa-Bianchi and G. Lugosi. Prediction, learning, and games. Cambridge University Press, 2006.
- (2006) Prediction, Learning, and Games
- Cesa-Bianchi, N.¹ Lugosi, G.²

5
- 0031140246
- How to use expert advice
- May
- N. Cesa-Bianchi, Y. Freund, D. Haussler, D. P. Helmbold, R. E. Schapire, and M. K. Warmuth. How to use expert advice. Journal of the ACM, 44(3):427-485, May 1997.
- (1997) Journal of the ACM , vol.44 , Issue.3 , pp. 427-485
- Cesa-Bianchi, N.¹ Freund, Y.² Haussler, D.³ Helmbold, D.P.⁴ Schapire, R.E.⁵ Warmuth, M.K.⁶

6
- 84894413813
- Online learning with switching costs and other adaptive adversaries
- N. Cesa-Bianchi, O. Dekel, and O. Shamir. Online learning with switching costs and other adaptive adversaries. In Advances in Neural Information Processing Systems 26, 2013.
- (2013) Advances in Neural Information Processing Systems , vol.26
- Cesa-Bianchi, N.¹ Dekel, O.² Shamir, O.³

7
- 38249021538
- Graphs with small bandwidth and cutwidth
- F. R. K. Chung and P. D. Seymour. Graphs with small bandwidth and cutwidth. Discrete Mathematics, 75(1- 3):113-119, 1989.
- (1989) Discrete Mathematics , vol.75 , Issue.1-3 , pp. 113-119
- Chung, F.R.K.¹ Seymour, P.D.²

8
- 84889281816
- John Wiley & Sons
- T. Cover and J. Thomas. Elements of information theory. John Wiley & Sons, 2006.
- (2006) Elements of Information Theory
- Cover, T.¹ Thomas, J.²

9
- 84897554269
- Better rates for any adversarial deterministic MDP
- O. Dekel and E. Hazan. Better rates for any adversarial deterministic MDP. In Proceedings of the Thirtieth International Conference on Machine Learning, 2013.
- (2013) Proceedings of the Thirtieth International Conference on Machine Learning
- Dekel, O.¹ Hazan, E.²

10
- 0031211090
- A decision-theoretic generalization of on-line learning and an application to boosting
- Y. Freund and R. Schapire. A decision-theoretic generalization of on-line learning and an application to boosting. Journal of computer and System Sciences, 55(1): 119-139, 1997.
- (1997) Journal of Computer and System Sciences , vol.55 , Issue.1 , pp. 119-139
- Freund, Y.¹ Schapire, R.²

11
- 80054819291
- Regret minimization for online buffering problems using the weighted majority algorithm
- S. Geulen, B. Vöcking, and M. Winkler. Regret minimization for online buffering problems using the weighted majority algorithm. In Proceedings of the 23rd International Conference on Learning Theory, pages 132-143, 2010.
- (2010) Proceedings of the 23rd International Conference on Learning Theory , pp. 132-143
- Geulen, S.¹ Vöcking, B.² Winkler, M.³

12
- 80054798353
- Near-optimal rates for limiteddelay universal lossy source coding
- IEEE
- A. Gyorgy and G. Neu. Near-optimal rates for limiteddelay universal lossy source coding. In Information Theory Proceedings (ISIT), 2011 IEEE International Symposium on, pages 2218-2222. IEEE, 2011.
- (2011) Information Theory Proceedings (ISIT), 2011 IEEE International Symposium on , pp. 2218-2222
- Gyorgy, A.¹ Neu, G.²

13
- 24644463787
- Efficient algorithms for online decision problems
- A. Kalai and S. Vempala. Efficient algorithms for online decision problems. Journal of Computer and System Sciences, 71:291-307, 2005.
- (2005) Journal of Computer and System Sciences , vol.71 , pp. 291-307
- Kalai, A.¹ Vempala, S.²

14
- 35148838877
- The weighted majority algorithm
- N. Littlestone and M.Warmuth. The weighted majority algorithm. Information and Computation, 108:212-261, 1994.
- (1994) Information and Computation , vol.108 , pp. 212-261
- Littlestone, N.¹ Warmuth, M.²

15
- 85162052729
- Online Markov decision processes under bandit feedback
- G. Neu, A. György, C. Szepesvári, and A. Antos. Online Markov decision processes under bandit feedback. In Advances in Neural Information Processing Systems 23, pages 1804-1812, 2010.
- (2010) Advances in Neural Information Processing Systems , vol.23 , pp. 1804-1812
- Neu, G.¹ György, A.² Szepesvári, C.³ Antos, A.⁴

16
- 85026748110
- Probabilistic computations: Toward a unified measure of complexity
- A. Yao. Probabilistic computations: Toward a unified measure of complexity. In Proceedings of the 18th IEEE Symposium on Foundations of Computer Science (FOCS), pages 222-227, 1977.
- (1977) Proceedings of the 18th IEEE Symposium on Foundations of Computer Science (FOCS) , pp. 222-227
- Yao, A.¹

17
- 70349280578
- Markov decision processes with arbitrary reward processes
- J. Y. Yu, S. Mannor, and N. Shimkin. Markov decision processes with arbitrary reward processes. Mathematics of Operations Research, 34(3):737-757, 2009.
- (2009) Mathematics of Operations Research , vol.34 , Issue.3 , pp. 737-757
- Yu, J.Y.¹ Mannor, S.² Shimkin, N.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.