SCOPUS 정보 검색 플랫폼

Journal of Computer and System Sciences

Volumn 78, Issue 5, 2012, Pages 1538-1556

The K-armed dueling bandits problem

(4) Yue, Yisong a Broder, Josef b Kleinberg, Robert c Joachims, Thorsten c

a CARNEGIE MELLON UNIVERSITY (United States)

b Department of Obstetrics Gynecology (United States)

c School of Operations Research and Information Engineering (United States)

Author keywords

Multi armed bandits; Online learning; Preference elicitation

Indexed keywords

INFORMATION THEORY;

CONVENTIONAL APPROACH; MULTI ARMED BANDIT; ONLINE LEARNING; PAIR-WISE COMPARISON; PARTIAL INFORMATION; PERCEIVED QUALITY; PREFERENCE ELICITATION; PRODUCT ATTRACTIVENESS;

E-LEARNING;

EID: 84861586270 PISSN: 00220000 EISSN: 10902724 Source Type: Journal
DOI: 10.1016/j.jcss.2011.12.028 Document Type: Conference Paper

Times cited : (327)

References (29)

1
- 84898079018
- Minimax policies for adversarial and stochastic bandits
- Jean-Yves Audibert, Sébastien Bubeck, Minimax policies for adversarial and stochastic bandits, in: Conference on Learning Theory (COLT), 2009.
- (2009) Conference on Learning Theory (COLT)
- Jean-Yves Audibert, S.¹

2
- 0036568025
- Finite-time analysis of the multiarmed bandit problem
- Peter Auer, Nicolò Cesa-Bianchi, and Paul Fischer Finite-time analysis of the multiarmed bandit problem Mach. Learn. 47 2 2002 235 256
- (2002) Mach. Learn. , vol.47 , Issue.2 , pp. 235-256
- Auer, P.¹ Cesa-Bianchi, N.² Fischer, P.³

3
- 0037709910
- The nonstochastic multiarmed bandit problem
- Peter Auer, Nicolò Cesa-Bianchi, Yoav Freund, and Robert Schapire The nonstochastic multiarmed bandit problem SIAM J. Comput. 32 1 2002 48 77
- (2002) SIAM J. Comput. , vol.32 , Issue.1 , pp. 48-77
- Auer, P.¹ Cesa-Bianchi, N.² Freund, Y.³ Schapire, R.⁴

4
- 0028317505
- Selection in the presence of noise: The design of playoff systems
- Micah Adler, Peter Gemmell, Mor Harchol-Balter, Richard Karp, Claire Kenyon, Selection in the presence of noise: The design of playoff systems, in: ACM-SIAM Symposium on Discrete Algorithms (SODA), 1994.
- (1994) ACM-SIAM Symposium on Discrete Algorithms (SODA)
- Adler, M.¹ Gemmell, P.² Harchol-Balter, M.³ Karp, R.⁴ Kenyon, C.⁵

5
- 84898065480
- An efficient reduction of ranking to classification
- Nir Ailon, Mehryar Mohri, An efficient reduction of ranking to classification, in: Conference on Learning Theory (COLT), 2008.
- (2008) Conference on Learning Theory (COLT)
- Ailon, N.¹ Mohri, M.²

6
- 0041966002
- Using confidence bounds for exploitation-exploration trade
- Peter Auer Using confidence bounds for exploitation-exploration trade J. Mach. Learn. Res. 3 2003 397 422
- (2003) J. Mach. Learn. Res. , vol.3 , pp. 397-422
- Auer, P.¹

7
- 84861596367
- Robust reductions from ranking to classification
- Maria-Florina Balcan, Nikhil Bansal, Alina Beygelzimer, Don Coppersmith, John Langford, Gregory Sorkin, Robust reductions from ranking to classification, in: Conference on Learning Theory (COLT), 2007.
- (2007) Conference on Learning Theory (COLT)
- Balcan, M.¹ Bansal, N.² Beygelzimer, A.³ Coppersmith, D.⁴ Langford, J.⁵ Sorkin, G.⁶

8
- 57949112800
- The bayesian learner is optimal for noisy binary search (and pretty good for quantum as well)
- Michael Ben-Or, Avinatan Hassidim, The bayesian learner is optimal for noisy binary search (and pretty good for quantum as well), in: IEEE Symposium on Foundations of Computer Science (FOCS), 2008.
- (2008) IEEE Symposium on Foundations of Computer Science (FOCS)
- Ben-Or, M.¹ Hassidim, A.²

9
- 33748442333
- Regret minimization under partial monitoring
- Nicolò Cesa-Bianchi, Gábor Lugosi, and Gilles Stoltz Regret minimization under partial monitoring Math. Oper. Res. 31 3 2006 562 580
- (2006) Math. Oper. Res. , vol.31 , Issue.3 , pp. 562-580
- Cesa-Bianchi, N.¹ Lugosi, G.² Stoltz, G.³

10
- 0033314011
- Learning to order things
- William Cohen, Robert Schapire, and Yoram Singer Learning to order things J. Artificial Intelligence Res. 10 1999 243 270
- (1999) J. Artificial Intelligence Res. , vol.10 , pp. 243-270
- Cohen, W.¹ Schapire, R.² Singer, Y.³

11
- 84889281816
- J. Wiley
- Thomas M. Cover, and Joy A. Thomas Elements of Information Theory 1999 J. Wiley
- (1999) Elements of Information Theory
- Cover, T.M.¹ Thomas, J.A.²

12
- 33745295134
- Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems
- Eyal Even-Dar, Shie Mannor, and Yishay Mansour Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems J. Mach. Learn. Res. 7 2006 1079 1105
- (2006) J. Mach. Learn. Res. , vol.7 , pp. 1079-1105
- Even-Dar, E.¹ Mannor, S.² Mansour, Y.³

13
- 4644367942
- An efficient boosting algorithm for combining preferences
- Yoav Freund, Raj Iyer, Robert Schapire, and Yoram Singer An efficient boosting algorithm for combining preferences J. Mach. Learn. Res. 4 2003 933 969
- (2003) J. Mach. Learn. Res. , vol.4 , pp. 933-969
- Freund, Y.¹ Iyer, R.² Schapire, R.³ Singer, Y.⁴

14
- 0028516898
- Computing with noisy information
- Uriel Feige, Prabhakar Raghavan, David Peleg, and Eli Upfal Computing with noisy information SIAM J. Comput. 23 5 1994
- (1994) SIAM J. Comput. , vol.23 , Issue.5
- Feige, U.¹ Raghavan, P.² Peleg, D.³ Upfal, E.⁴

15
- 0033322991
- Support vector learning for ordinal regression
- Ralf Herbrich, Thore Graepel, Klaus Obermayer, Support vector learning for ordinal regression, in: International Conference on Artificial Neural Networks (ICANN), 1999.
- (1999) International Conference on Artificial Neural Networks (ICANN)
- Herbrich, R.¹ Graepel, T.² Obermayer, K.³

16
- 84947403595
- Probability inequalities for sums of bounded random variables
- Wassily Hoeffding Probability inequalities for sums of bounded random variables J. Amer. Statist. Assoc. 58 1963 13 30
- (1963) J. Amer. Statist. Assoc. , vol.58 , pp. 13-30
- Hoeffding, W.¹

17
- 31844446804
- A support vector method for multivariate performance measures
- Thorsten Joachims, A support vector method for multivariate performance measures, in: International Conference on Machine Learning (ICML), 2005.
- (2005) International Conference on Machine Learning (ICML)
- Joachims, T.¹

18
- 84969199624
- Noisy binary search and its applications
- Richard M. Karp, Robert Kleinberg, Noisy binary search and its applications, in: ACM-SIAM Symposium on Discrete Algorithms (SODA), 2007.
- (2007) ACM-SIAM Symposium on Discrete Algorithms (SODA)
- Richard, M.K.¹ Kleinberg, R.²

19
- 84862291603
- Regret bounds for sleeping experts and bandits
- Robert Kleinberg, Alexandru Niculescu-Mizil, Yogeshwer Sharma, Regret bounds for sleeping experts and bandits, in: Conference on Learning Theory (COLT), 2008.
- (2008) Conference on Learning Theory (COLT)
- Kleinberg, R.¹ Niculescu-Mizil, A.² Sharma, Y.³

20
- 0002899547
- Asymptotically efficient adaptive allocation rules
- T.L. Lai, and Herbert Robbins Asymptotically efficient adaptive allocation rules Adv. in Appl. Math. 6 1985 4 22
- (1985) Adv. in Appl. Math. , vol.6 , pp. 4-22
- Lai, T.L.¹ Robbins, H.²

21
- 85029368260
- Boosting the area under the ROC curve
- Phil Long, Rocco Servedio, Boosting the area under the ROC curve, in: Proceedings of Neural Information Processing Systems (NIPS), 2007.
- (2007) Proceedings of Neural Information Processing Systems (NIPS)
- Long, P.¹ Servedio, R.²

22
- 77956144722
- The epoch-greedy algorithm for contextual multi-armed bandits
- John Langford, Tong Zhang, The epoch-greedy algorithm for contextual multi-armed bandits, in: Proceedings of Neural Information Processing Systems (NIPS), 2007.
- (2007) Proceedings of Neural Information Processing Systems (NIPS)
- Langford, J.¹ Zhang, T.²

23
- 0004168557
- Cambridge University Press
- Rajeev Motwani, and Prabhakar Raghavan Randomized Algorithms 1995 Cambridge University Press
- (1995) Randomized Algorithms
- Motwani, R.¹ Raghavan, P.²

24
- 30044441333
- The sample complexity of exploration in the multi-armed bandit problem
- Shie Mannor, and John N. Tsitsiklis The sample complexity of exploration in the multi-armed bandit problem J. Mach. Learn. Res. 5 2004 623 648
- (2004) J. Mach. Learn. Res. , vol.5 , pp. 623-648
- Mannor, S.¹ Tsitsiklis, J.N.²

25
- 70049106076
- Bandits for taxonomies: A model-based approach
- Sandeep Pandey, Deepak Agarwal, Deepayan Chakrabarti, Vanja Josifovski, Bandits for taxonomies: A model-based approach, in: SIAM Conference on Data Mining (SDM), 2007.
- (2007) SIAM Conference on Data Mining (SDM)
- Pandey, S.¹ Agarwal, D.² Chakrabarti, D.³ Josifovski, V.⁴

26
- 67650085898
- How does clickthrough data reflect retrieval quality?
- Filip Radlinski, Madhu Kurup, Thorsten Joachims, How does clickthrough data reflect retrieval quality?, in: ACM Conference on Information and Knowledge Management (CIKM), 2008.
- (2008) ACM Conference on Information and Knowledge Management (CIKM)
- Radlinski, F.¹ Kurup, M.² Joachims, T.³

27
- 84966203785
- Some aspects of the sequential design of experiments
- Herbert Robbins Some aspects of the sequential design of experiments Bull. Amer. Math. Soc. 58 1952 527 535
- (1952) Bull. Amer. Math. Soc. , vol.58 , pp. 527-535
- Robbins, H.¹

28
- 84898077397
- The k-armed dueling bandits problem
- Yisong Yue, Josef Broder, Robert Kleinberg, Thorsten Joachims, The k-armed dueling bandits problem, in: Conference on Learning Theory (COLT), 2009.
- (2009) Conference on Learning Theory (COLT)
- Yue, Y.¹ Broder, J.² Kleinberg, R.³ Joachims, T.⁴

29
- 71149114227
- Interactively optimizing information retrieval systems as a dueling bandits problem
- Yisong Yue, Thorsten Joachims, Interactively optimizing information retrieval systems as a dueling bandits problem, in: International Conference on Machine Learning (ICML), 2009.
- (2009) International Conference on Machine Learning (ICML)
- Yue, Y.¹ Joachims, T.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.