SCOPUS Information Search Platform

COLT 2009 - The 22nd Conference on Learning Theory

Volume , Issue , 2009, Pages

The K-armed dueling bandits problem

(4) Yue, Yisong a; Broder, Josef b; Kleinberg, Robert a; Joachims, Thorsten a

a Department of Computer Science and School of Operations Research and Information Engineering (United States)

b Department of Obstetrics Gynecology (United States)

Author keywords

[No Author keywords available]

Indexed keywords

BINARY FEEDBACK; CONSTANT FACTORS; CONVENTIONAL APPROACH; OPTIMAL REGRET; PAIR-WISE COMPARISON; PRODUCT ATTRACTIVENESS;

EID: 84898077397
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: None
Document Type: Conference Paper

Times cited : (28)

References (28)

1
- 0036568025
- Finite-time analysis of the multiarmed bandit problem
- Peter Auer, Nicolò Cesa-Bianchi, and Paul Fischer. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47(2): 235-256, 2002.
- (2002) Machine Learning , vol.47 , Issue.2 , pp. 235-256
- Auer, P.¹ Cesa-Bianchi, N.² Fischer, P.³

2
- 0037709910
- The nonstochastic multiarmed bandit problem
- Peter Auer, Nicolò Cesa-Bianchi, Yoav Freund, and Robert Schapire. The nonstochastic multiarmed bandit problem. SIAM Journal on Computing, 32(1): 48-77, 2002.
- (2002) SIAM Journal on Computing , vol.32 , Issue.1 , pp. 48-77
- Auer, P.¹ Cesa-Bianchi, N.² Freund, Y.³ Schapire, R.⁴

3
- 0028317505
- Selection in the presence of noise: The design of playoff systems
- Micah Adler, Peter Gemmell, Mor Harchol-Balter, Richard Karp, and Claire Kenyon. Selection in the presence of noise: The design of playoff systems. In ACM-SIAM Symposium on Discrete Algorithms (SODA), 1994.
- (1994) ACM-SIAM Symposium on Discrete Algorithms (SODA)
- Adler, M.¹ Gemmell, P.² Harchol-Balter, M.³ Karp, R.⁴ Kenyon, C.⁵

4
- 84898065480
- An efficient reduction of ranking to classification
- Nir Ailon and Mehryar Mohri. An efficient reduction of ranking to classification. In Conference on Learning Theory (COLT), 2008.
- (2008) Conference on Learning Theory (COLT)
- Ailon, N.¹ Mohri, M.²

5
- 0041966002
- Using confidence bounds for exploitation-exploration trade-offs
- Peter Auer. Using confidence bounds for exploitation-exploration trade-offs. Journal of Machine Learning Research (JMLR), 3: 397-422, 2003.
- (2003) Journal of Machine Learning Research (JMLR) , vol.3 , pp. 397-422
- Auer, P.¹

6
- 84861596367
- Robust reductions from ranking to classification
- Maria-Florina Balcan, Nikhil Bansal, Alina Beygelzimer, Don Coppersmith, John Langford, and Gregory Sorkin. Robust reductions from ranking to classification. In Conference on Learning Theory (COLT), 2007.
- (2007) Conference on Learning Theory (COLT)
- Balcan, M.-F.¹ Bansal, N.² Beygelzimer, A.³ Coppersmith, D.⁴ Langford, J.⁵ Sorkin, G.⁶

7
- 57949112800
- The Bayesian learner is optimal for noisy binary search (and pretty good for quantum as well)
- Michael Ben-Or and Avinatan Hassidim. The Bayesian learner is optimal for noisy binary search (and pretty good for quantum as well). In IEEE Symposium on Foundations of Computer Science (FOCS), 2008.
- (2008) IEEE Symposium on Foundations of Computer Science (FOCS)
- Ben-Or, M.¹ Hassidim, A.²

8
- 33748442333
- Regret minimization under partial monitoring
- Nicolò Cesa-Bianchi, Gábor Lugosi, and Gilles Stoltz. Regret minimization under partial monitoring. Mathematics of Operations Research, 31(3): 562-580, 2006.
- (2006) Mathematics of Operations Research , vol.31 , Issue.3 , pp. 562-580
- Cesa-Bianchi, N.¹ Lugosi, G.² Stoltz, G.³

9
- 0033314011
- Learning to order things
- William Cohen, Robert Schapire, and Yoram Singer. Learning to order things. Journal of Artificial Intelligence Research (JAIR), 10: 243-270, 1999.
- (1999) Journal of Artificial Intelligence Research (JAIR) , vol.10 , pp. 243-270
- Cohen, W.¹ Schapire, R.² Singer, Y.³

10
- 84889281816
- Elements of Information Theory
- Thomas M. Cover and Joy A. Thomas. Elements of Information Theory. J. Wiley, 1999.
- (1999) Elements of Information Theory
- Cover, T.M.¹ Thomas, J.A.²

11
- 33745295134
- Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems
- Eyal Even-Dar, Shie Mannor, and Yishay Mansour. Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems. Journal of Machine Learning Research (JMLR), 7: 1079-1105, 2006.
- (2006) Journal of Machine Learning Research (JMLR) , vol.7 , pp. 1079-1105
- Even-Dar, E.¹ Mannor, S.² Mansour, Y.³

12
- 4644367942
- An efficient boosting algorithm for combining preferences
- Yoav Freund, Raj Iyer, Robert Schapire, and Yoram Singer. An efficient boosting algorithm for combining preferences. Journal of Machine Learning Research (JMLR), 4: 933-969, 2003.
- (2003) Journal of Machine Learning Research (JMLR) , vol.4 , pp. 933-969
- Freund, Y.¹ Iyer, R.² Schapire, R.³ Singer, Y.⁴

13
- 0028516898
- Computing with noisy information
- Uriel Feige, Prabhakar Raghavan, David Peleg, and Eli Upfal. Computing with noisy information. SIAM Journal on Computing, 23(5), 1994.
- (1994) SIAM Journal on Computing , vol.23 , Issue.5
- Feige, U.¹ Raghavan, P.² Peleg, D.³ Upfal, E.⁴

14
- 0033322991
- Support vector learning for ordinal regression
- Ralf Herbrich, Thore Graepel, and Klaus Obermayer. Support vector learning for ordinal regression. In International Conference on Artificial Neural Networks (ICANN), 1999.
- (1999) International Conference on Artificial Neural Networks (ICANN)
- Herbrich, R.¹ Graepel, T.² Obermayer, K.³

15
- 84947403595
- Probability inequalities for sums of bounded random variables
- Wassily Hoeffding. Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association, 58: 13-30, 1963.
- (1963) Journal of the American Statistical Association , vol.58 , pp. 13-30
- Hoeffding, W.¹

16
- 31844446804
- A support vector method for multivariate performance measures
- Thorsten Joachims. A support vector method for multivariate performance measures. In International Conference on Machine Learning (ICML), 2005.
- (2005) International Conference on Machine Learning (ICML)
- Joachims, T.¹

17
- 84969199624
- Noisy binary search and its applications
- Richard M. Karp and Robert Kleinberg. Noisy binary search and its applications. In ACM-SIAM Symposium on Discrete Algorithms (SODA), 2007.
- (2007) ACM-SIAM Symposium on Discrete Algorithms (SODA)
- Karp, R.M.¹ Kleinberg, R.²

18
- 84862291603
- Regret bounds for sleeping experts and bandits
- Robert Kleinberg, Alexandru Niculescu-Mizil, and Yogeshwer Sharma. Regret bounds for sleeping experts and bandits. In Conference on Learning Theory (COLT), 2008.
- (2008) Conference on Learning Theory (COLT)
- Kleinberg, R.¹ Niculescu-Mizil, A.² Sharma, Y.³

19
- 84862283462
- How do we get weak action dependence for learning with partial observations?
- John Langford. How do we get weak action dependence for learning with partial observations? http://hunch.net/?p=421, September 2008. Blog entry at Machine Learning (Theory).
- (2008) How do We Get Weak Action Dependence for Learning with Partial Observations?
- Langford, J.¹

20
- 0002899547
- Asymptotically efficient adaptive allocation rules
- T. L. Lai and Herbert Robbins. Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 6: 4-22, 1985.
- (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
- Lai, T.L.¹ Robbins, H.²

21
- 85029368260
- Boosting the area under the ROC curve
- Phil Long and Rocco Servedio. Boosting the area under the ROC curve. In Proceedings of Neural Information Processing Systems (NIPS), 2007.
- (2007) Proceedings of Neural Information Processing Systems (NIPS)
- Long, P.¹ Servedio, R.²

22
- 77956144722
- The epoch-greedy algorithm for contextual multi-armed bandits
- John Langford and Tong Zhang. The epoch-greedy algorithm for contextual multi-armed bandits. In Proceedings of Neural Information Processing Systems (NIPS), 2007.
- (2007) Proceedings of Neural Information Processing Systems (NIPS)
- Langford, J.¹ Zhang, T.²

23
- 0004168557
- Randomized Algorithms
- Rajeev Motwani and Prabhakar Raghavan. Randomized Algorithms. Cambridge University Press, 1995.
- (1995) Randomized Algorithms
- Motwani, R.¹ Raghavan, P.²

24
- 30044441333
- The sample complexity of exploration in the multi-armed bandit problem
- Shie Mannor and John N. Tsitsiklis. The sample complexity of exploration in the multi-armed bandit problem. Journal of Machine Learning Research (JMLR), 5: 623-648, 2004.
- (2004) Journal of Machine Learning Research (JMLR) , vol.5 , pp. 623-648
- Mannor, S.¹ Tsitsiklis, J.N.²

25
- 70049106076
- Bandits for taxonomies: A model-based approach
- Sandeep Pandey, Deepak Agarwal, Deepayan Chakrabarti, and Vanja Josifovski. Bandits for taxonomies: A model-based approach. In SIAM Conference on Data Mining (SDM), 2007.
- (2007) SIAM Conference on Data Mining (SDM)
- Pandey, S.¹ Agarwal, D.² Chakrabarti, D.³ Josifovski, V.⁴

26
- 67650085898
- How does clickthrough data reflect retrieval quality?
- Filip Radlinski, Madhu Kurup, and Thorsten Joachims. How does clickthrough data reflect retrieval quality? In ACM Conference on Information and Knowledge Management (CIKM), 2008.
- (2008) ACM Conference on Information and Knowledge Management (CIKM)
- Radlinski, F.¹ Kurup, M.² Joachims, T.³

27
- 84966203785
- Some aspects of the sequential design of experiments
- Herbert Robbins. Some Aspects of the Sequential Design of Experiments. Bull. Amer. Math. Soc., 58: 527-535, 1952.
- (1952) Bull. Amer. Math. Soc. , vol.58 , pp. 527-535
- Robbins, H.¹

28
- 71149114227
- Interactively optimizing information retrieval systems as a dueling bandits problem
- Yisong Yue and Thorsten Joachims. Interactively optimizing information retrieval systems as a dueling bandits problem. In International Conference on Machine Learning (ICML), 2009.
- (2009) International Conference on Machine Learning (ICML)
- Yue, Y.¹ Joachims, T.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.