-
1
-
-
0036568025
-
Finite-time analysis of the multiarmed bandit problem
-
DOI 10.1023/A:1013689704352, Computational Learning Theory
-
Auer, P., Cesa-Bianchi, N., and Fischer, P. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47(2-3):235-256, 2002a. (Pubitemid 34126111)
-
(2002)
Machine Learning
, vol.47
, Issue.2-3
, pp. 235-256
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
2
-
-
0037709910
-
The non-stochastic multi-armed bandit problem
-
Auer, P., Cesa-Bianchi, N., Freund, Y., and Schapire, R. The non-stochastic multi-armed bandit problem. SIAM Journal on Computing, 32(1):48-77, 2002b.
-
(2002)
SIAM Journal on Computing
, vol.32
, Issue.1
, pp. 48-77
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.4
-
3
-
-
78249286512
-
Toward a classification of finite partial-monitoring games
-
Bartók, Gábor, Pál, Dávid, and Szepesvári, Csaba. Toward a classification of finite partial-monitoring games. In ALT, pp. 224-238, 2010.
-
(2010)
ALT
, pp. 224-238
-
-
Bartók, G.1
Pál, D.2
Szepesvári, C.3
-
5
-
-
80053460837
-
Yahoo! learning to rank challenge overview
-
Chapelle, O. and Chang, Y. Yahoo! learning to rank challenge overview. JMLR - Proceedings Track, 14:1-24, 2011.
-
(2011)
JMLR - Proceedings Track
, vol.14
, pp. 1-24
-
-
Chapelle, O.1
Chang, Y.2
-
6
-
-
84859070397
-
Large-scale validation and analysis of interleaved search evaluation
-
Chapelle, O., Joachims, T., Radlinski, F., and Yue, Yisong. Large-scale validation and analysis of interleaved search evaluation. ACM Transactions on Information Systems (TOIS), 30(1):6:1-6:41, 2012.
-
(2012)
ACM Transactions on Information Systems (TOIS)
, vol.30
, Issue.1
-
-
Chapelle, O.1
Joachims, T.2
Radlinski, F.3
Yue, Y.4
-
7
-
-
33645782058
-
The consistency of multicategory support vector machines
-
DOI 10.1007/s10444-004-7207-1
-
Chen, D. and Xiang, D. The consistency of multicategory support vector machines. Adv. Comput. Math., 24(1-4):155-169, 2006. (Pubitemid 43563917)
-
(2006)
Advances in Computational Mathematics
, vol.24
, Issue.1-4
, pp. 155-169
-
-
Chen, D.-R.1
Xiang, D.-H.2
-
8
-
-
31844453751
-
Preference learning with Gaussian processes
-
Chu, W. and Ghahramani, Z. Preference learning with Gaussian processes. In ICML, 2005.
-
(2005)
ICML
-
-
Chu, W.1
Ghahramani, Z.2
-
9
-
-
85127836544
-
Discriminative training methods for hidden Markov models: Theory and experiments with perceptron algorithms
-
Collins, M. Discriminative training methods for hidden Markov models: theory and experiments with perceptron algorithms. In EMNLP, 2002.
-
(2002)
EMNLP
-
-
Collins, M.1
-
11
-
-
20744454447
-
Online convex optimization in the bandit setting: Gradient descent without a gradient
-
Flaxman, A., Kalai, A. T., and McMahan, H. B. Online convex optimization in the bandit setting: gradient descent without a gradient. In SODA, 2005.
-
(2005)
SODA
-
-
Flaxman, A.1
Kalai, A.T.2
McMahan, H.B.3
-
12
-
-
4644367942
-
An efficient boosting algorithm for combining preferences
-
Freund, Y., Iyer, R. D., Schapire, R. E., and Singer, Y. An efficient boosting algorithm for combining preferences. Journal of Machine Learning Research, 4:933-969, 2003.
-
(2003)
Journal of Machine Learning Research
, vol.4
, pp. 933-969
-
-
Freund, Y.1
Iyer, R.D.2
Schapire, R.E.3
Singer, Y.4
-
14
-
-
0242456822
-
Optimizing search engines using clickthrough data
-
Joachims, T. Optimizing search engines using clickthrough data. In KDD, 2002.
-
(2002)
KDD
-
-
Joachims, T.1
-
15
-
-
2142775432
-
Multicategory support vector machines
-
Lee, Yoonkyung, Lin, Yi, and Wahba, Grace. Multicategory support vector machines. Journal of the American Statistical Association, 99(465):67-81, 2004.
-
(2004)
Journal of the American Statistical Association
, vol.99
, Issue.465
, pp. 67-81
-
-
Lee, Y.1
Lin, Y.2
Wahba, G.3
-
16
-
-
84876811202
-
RCV1: A new benchmark collection for text categorization research
-
Lewis, D. D., Yang, Y., Rose, T. G., and Li, F. RCV1: A new benchmark collection for text categorization research. JMLR, 5:361-397, 2004.
-
(2004)
JMLR
, vol.5
, pp. 361-397
-
-
Lewis, D.D.1
Yang, Y.2
Rose, T.G.3
Li, F.4
-
19
-
-
33750724722
-
Minimally invasive randomization for collecting unbiased preferences from clickthrough logs
-
Radlinski, F. and Joachims, T. Minimally invasive randomization for collecting unbiased preferences from clickthrough logs. In AAAI, pp. 1406-1412, 2006.
-
(2006)
AAAI
, pp. 1406-1412
-
-
Radlinski, F.1
Joachims, T.2
-
20
-
-
84866033717
-
Online learning to diversify from implicit feedback
-
Raman, K., Shivaswamy, P., and Joachims, T. Online learning to diversify from implicit feedback. In KDD, 2012.
-
(2012)
KDD
-
-
Raman, K.1
Shivaswamy, P.2
Joachims, T.3
-
21
-
-
84867138308
-
Online structured prediction via coactive learning
-
Shivaswamy, P. and Joachims, T. Online structured prediction via coactive learning. In ICML, 2012.
-
(2012)
ICML
-
-
Shivaswamy, P.1
Joachims, T.2
-
22
-
-
71149114227
-
Interactively optimizing information retrieval systems as a dueling bandits problem
-
Yue, Y. and Joachims, T. Interactively optimizing information retrieval systems as a dueling bandits problem. In ICML, 2009.
-
(2009)
ICML
-
-
Yue, Y.1
Joachims, T.2
-
23
-
-
84898077397
-
The k-armed dueling bandits problem
-
Yue, Y., Broder, J., Kleinberg, R., and Joachims, T. The k-armed dueling bandits problem. In COLT, 2009.
-
(2009)
COLT
-
-
Yue, Y.1
Broder, J.2
Kleinberg, R.3
Joachims, T.4
-
24
-
-
1942484421
-
Online convex programming and generalized infinitesimal gradient ascent
-
Zinkevich, M. Online convex programming and generalized infinitesimal gradient ascent. In ICML, 2003.
-
(2003)
ICML
-
-
Zinkevich, M.1