SCOPUS 정보 검색 플랫폼

Machine Learning

Volumn 80, Issue 2-3, 2010, Pages 245-272

Regret bounds for sleeping experts and bandits

(3) Kleinberg, Robert a Niculescu Mizil, Alexandru a,b Sharma, Yogeshwer a

a Department of Computer Science and School of Operations Research and Information Engineering (United States)

b IBM T J WATSON RESEARCH CENTER (United States)

Author keywords

Computational learning theory; Online algorithms; Regret

Indexed keywords

COMPUTATIONAL LEARNING THEORY; DECISION ALGORITHMS; MULTI ARMED BANDIT; ON-LINE ALGORITHMS; ONLINE DECISIONS; OPTIMAL REGRET; PRACTICAL PROBLEMS; REGRET;

COMPUTATION THEORY; STOCHASTIC MODELS;

LEARNING ALGORITHMS;

EID: 77955660815 PISSN: 08856125 EISSN: 15730565 Source Type: Journal
DOI: 10.1007/s10994-010-5178-7 Document Type: Article

Times cited : (179)

References (23)

1
- 84898079018
- Minimax policies for adversarial and stochastic bandits
- Audibert, J.-Y., & Bubeck, S. (2009). Minimax policies for adversarial and stochastic bandits. In Proceedings of the 22nd conference on learning theory (COLT).
- (2009) Proceedings of the 22nd Conference on Learning Theory (COLT)
- Audibert, J.-Y.¹ Bubeck, S.²

2
- 0036568025
- Finite-time analysis of the multiarmed bandit problem
- DOI 10.1023/A:1013689704352, Computational Learning Theory
- P. Auer N. Cesa-Bianchi P. Fischer 2002 Finite-time analysis of the multiarmed bandit problem Machine Learning 47 235 256 1012.68093 10.1023/A:1013689704352 (Pubitemid 34126111)
- (2002) Machine Learning , vol.47 , Issue.2-3 , pp. 235-256
- Auer, P.¹ Cesa-Bianchi, N.² Fischer, P.³

3
- 0037709910
- The nonstochastic multiarmed bandit problem
- 1029.68087 10.1137/S0097539701398375 1954855
- P. Auer N. Cesa-Bianchi Y. Freund R. E. Schapire 2002 The nonstochastic multiarmed bandit problem SIAM Journal on Computing 32 48 77 1029.68087 10.1137/S0097539701398375 1954855
- (2002) SIAM Journal on Computing , vol.32 , pp. 48-77
- Auer, P.¹ Cesa-Bianchi, N.² Freund, Y.³ Schapire, R.E.⁴

4
- 84972574511
- Weighted sums of certain dependent random variables
- 0178.21103 10.2748/tmj/1178243286 221571
- K. Azuma 1967 Weighted sums of certain dependent random variables Tohoku Mathematical Journal 19 357 367 0178.21103 10.2748/tmj/1178243286 221571
- (1967) Tohoku Mathematical Journal , vol.19 , pp. 357-367
- Azuma, K.¹

5
- 26944476270
- From external to internal regret
- Blum, A., & Mansour, Y. (2005). From external to internal regret. In Proceedings of the 18th conference on learning theory (COLT) (pp. 621-636).
- (2005) Proceedings of the 18th Conference on Learning Theory (COLT) , pp. 621-636
- Blum, A.¹ Mansour, Y.²

6
- 0031140246
- How to use expert advice
- 0890.68066 10.1145/258128.258179 1470152
- N. Cesa-Bianchi Y. Freund D. Haussler D. P. Helmbold R. E. Schapire M. K. Warmuth 1997 How to use expert advice Journal of ACM 44 427 485 0890.68066 10.1145/258128.258179 1470152
- (1997) Journal of ACM , vol.44 , pp. 427-485
- Cesa-Bianchi, N.¹ Freund, Y.² Haussler, D.³ Helmbold, D.P.⁴ Schapire, R.E.⁵ Warmuth, M.K.⁶

7
- 84889281816
- Wiley New York
- Cover, T. M., & Thomas, J. A. (1999). Elements of information theory. New York: Wiley.
- (1999) Elements of Information Theory
- Cover, T.M.¹ Thomas, J.A.²

8
- 0002267135
- Adaptive game playing using multiplicative weights
- 0964.91007 10.1006/game.1999.0738 1729311
- Y. Freund R. E. Schapire 1999 Adaptive game playing using multiplicative weights Games and Economic Behavior 29 79 103 0964.91007 10.1006/game.1999.0738 1729311
- (1999) Games and Economic Behavior , vol.29 , pp. 79-103
- Freund, Y.¹ Schapire, R.E.²

9
- 0030643068
- Using and combining predictors that specialize
- Freund, Y., Schapire, R. E., Singer, Y., & Warmuth, M. K. (1997). Using and combining predictors that specialize. In Proceedings of the 29th ACM symp. on theory of computing (STOC) (pp. 334-343).
- (1997) Proceedings of the 29th ACM Symp. on Theory of Computing (STOC) , pp. 334-343
- Freund, Y.¹ Schapire, R.E.² Singer, Y.³ Warmuth, M.K.⁴

10
- 0003603813
- Freeman New York 0411.68039
- Garey, M. R., & Johnson, D. S. (1979). Computers and intractability: a guide to the theory of NP-completeness. New York: Freeman.
- (1979) Computers and Intractability: A Guide to the Theory of NP-completeness
- Garey, M.R.¹ Johnson, D.S.²

11
- 0000169010
- Bandit processes and dynamic allocation indices
- 0411.62055 547241
- J. C. Gittins 1979 Bandit processes and dynamic allocation indices Journal of the Royal Statistical Society, Series B 41 148 177 0411.62055 547241
- (1979) Journal of the Royal Statistical Society, Series B , vol.41 , pp. 148-177
- Gittins, J.C.¹

12
- 0018709825
- A dynamic allocation index for the discounted multiarmed bandit problem
- 10.1093/biomet/66.3.561
- J. C. Gittins D. M. Jones 1979 A dynamic allocation index for the discounted multiarmed bandit problem Biometrika 66 561 565 10.1093/biomet/66.3. 561
- (1979) Biometrika , vol.66 , pp. 561-565
- Gittins, J.C.¹ Jones, D.M.²

13
- 0001976283
- Approximation to Bayes risk in repeated plays
- M. Dresher A. Tucker P. Wolfe (eds). Princeton University Press Princeton
- Hannan, J. (1957). Approximation to Bayes risk in repeated plays. In M. Dresher, A. Tucker, & P. Wolfe (Eds.), Contributions to the theory of games (pp. 97-139). Princeton: Princeton University Press.
- (1957) Contributions to the Theory of Games , pp. 97-139
- Hannan, J.¹

14
- 84947403595
- Probability inequalities for sums of bounded random variables
- 0127.10602 10.2307/2282952 144363
- W. Hoeffding 1963 Probability inequalities for sums of bounded random variables Journal of the American Statistical Association 58 13 30 0127.10602 10.2307/2282952 144363
- (1963) Journal of the American Statistical Association , vol.58 , pp. 13-30
- Hoeffding, W.¹

15
- 24644463787
- Efficient algorithms for online decision problems
- DOI 10.1016/j.jcss.2004.10.016, PII S0022000004001394
- A. T. Kalai S. Vempala 2005 Efficient algorithms for on-line optimization Journal of Computer and System Sciences 71 291 307 1094.68112 10.1016/j.jcss.2004.10.016 2168355 (Pubitemid 41278182)
- (2005) Journal of Computer and System Sciences , vol.71 , Issue.3 , pp. 291-307
- Kalai, A.¹ Vempala, S.²

16
- 84969199624
- Noisy binary search and its applications
- Karp, R. M., & Kleinberg, R. (2007). Noisy binary search and its applications. In Proceedings of the 18th ACM-SIAM symp. discrete algorithms (SODA) (pp. 881-890).
- (2007) Proceedings of the 18th ACM-SIAM Symp. Discrete Algorithms (SODA) , pp. 881-890
- Karp, R.M.¹ Kleinberg, R.²

17
- 0001280583
- Über dyadische Brüche
- 10.1007/BF01192399 1544623
- A. Khintchine 1923 Über dyadische Brüche Mathematische Zeitschsift 18 109 116 10.1007/BF01192399 1544623
- (1923) Mathematische Zeitschsift , vol.18 , pp. 109-116
- Khintchine, A.¹

18
- 0002899547
- Asymptotically efficient adaptive allocations rules
- 0568.62074 10.1016/0196-8858(85)90002-8 776826
- T. L. Lai H. Robbins 1985 Asymptotically efficient adaptive allocations rules Advances in Applied Mathematics 6 4 22 0568.62074 10.1016/0196-8858(85) 90002-8 776826
- (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
- Lai, T.L.¹ Robbins, H.²

19
- 77956144722
- The Epoch-Greedy algorithm for multiarmed bandits with side information
- Langford, J., & Zhang, T. (2007). The Epoch-Greedy algorithm for multiarmed bandits with side information. In Proceedings of the 21st conference on neural information processing systems (NIPS).
- (2007) Proceedings of the 21st Conference on Neural Information Processing Systems (NIPS)
- Langford, J.¹ Zhang, T.²

20
- 35148838877
- The weighted majority algorithm
- 0804.68121 10.1006/inco.1994.1009 1265851 An extended abstract appeared in IEEE symposium on foundations of computer science, 1989 (pp. 256-261)
- N. Littlestone M. K. Warmuth 1994 The weighted majority algorithm Information and Computation 108 212 261 0804.68121 10.1006/inco.1994.1009 1265851 An extended abstract appeared in IEEE symposium on foundations of computer science, 1989 (pp. 256-261)
- (1994) Information and Computation , vol.108 , pp. 212-261
- Littlestone, N.¹ Warmuth, M.K.²

21
- 84966203785
- Some aspects of the sequential design of experiments
- 0049.37009 10.1090/S0002-9904-1952-09620-8 50246
- H. Robbins 1952 Some aspects of the sequential design of experiments Bulletin of the American Mathematical Society 58 527 535 0049.37009 10.1090/S0002-9904-1952-09620-8 50246
- (1952) Bulletin of the American Mathematical Society , vol.58 , pp. 527-535
- Robbins, H.¹

22
- 85048665932
- Aggregating strategies
- Vovk, V. G. (1990). Aggregating strategies. In Proceedings of the 3rd conference on learning theory (COLT) (pp. 371-386).
- (1990) Proceedings of the 3rd Conference on Learning Theory (COLT) , pp. 371-386
- Vovk, V.G.¹

23
- 0032047115
- A game of prediction with expert advice
- 0945.68528 10.1006/jcss.1997.1556 1629690 An extended abstract appeared in COLT, 1995 (pp. 51-60)
- V. G. Vovk 1998 A game of prediction with expert advice Journal of Computer and System Sciences 56 153 173 0945.68528 10.1006/jcss.1997.1556 1629690 An extended abstract appeared in COLT, 1995 (pp. 51-60)
- (1998) Journal of Computer and System Sciences , vol.56 , pp. 153-173
- Vovk, V.G.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.