-
1
-
-
77950884255
-
Spatio-temporal models for estimating click- through rate
-
New York, NY, USA, ACM
-
Agarwal, Deepak, Chen, Bee-Chung, and Elango, Pradheep. Spatio-temporal models for estimating click- through rate. In Proceedings of the 18th international conference on World wide web(WWW), pp. 21-30, New York, NY, USA, 2009. ACM. ISBN 978-1-60558-487-4. doi: 10.1145/1526709.1526713.
-
(2009)
Proceedings of the 18th International Conference on World Wide Web(WWW)
, pp. 21-30
-
-
Agarwal, D.1
Chen, B.-C.2
Elango, P.3
-
2
-
-
0036568025
-
Finite-time analysis of the multiarmed bandit problem
-
May
-
Auer, Peter, Cesa-Bianchi, Nicolo, and Fischer, Paul. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47:235-256, May 2002. ISSN 0885- 6125.
-
(2002)
Machine Learning
, vol.47
, pp. 235-256
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
3
-
-
0037709910
-
The nonstochastic multiarmed bandit problem
-
January
-
Auer, Peter, Cesa-Bianchi, Nicolo, Freund, Yoav, and Schapire, Robert E. The nonstochastic multiarmed bandit problem. SIAM J. Comput., 32(1):48-77, January 2003. ISSN 0097-5397. doi: 10.1137/S0097539701398375.
-
(2003)
SIAM J. Comput.
, vol.32
, Issue.1
, pp. 48-77
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.E.4
-
4
-
-
84919925761
-
-
Technical report, arXiv: 1209.2355, September
-
Bottou, Leon, Peters, Jonas, Quinonero Candela, Joaquin, Charles, Denis X., Chickering, D. Max, Portugualy, Elon, Ray, Dipankar, Simard, Patrice, and Snelson, Ed. Counterfactual reasoning and learning systems. Technical report, arXiv: 1209.2355, September 2012.
-
(2012)
Counterfactual Reasoning and Learning Systems
-
-
Bottou, L.1
Peters, J.2
Quinonero Candela, J.3
Charles, D.X.4
Chickering, D.M.5
Portugualy, E.6
Ray, D.7
Simard, P.8
Snelson, E.9
-
5
-
-
84919925760
-
Doubly robust policy evaluation and learning
-
abs/1103, 4601
-
Dudik, Miroslav, Langford, John, and Li, Lihong. Doubly robust policy evaluation and learning. CoRR, abs/1103, 4601, 2011.
-
(2011)
CoRR
-
-
Dudik, M.1
Langford, J.2
Li, L.3
-
6
-
-
0002344794
-
Bootstrap methods: Another look at the jack- knife
-
Efron, B. Bootstrap methods: Another look at the jack- knife. The Annals of Statistics, 7(1): 1-26, 1979. ISSN 00905364. doi: 10.2307/2958830.
-
(1979)
The Annals of Statistics
, vol.7
, Issue.1
, pp. 1-26
-
-
Efron, B.1
-
8
-
-
84867129586
-
The big data bootstrap
-
Langford, John and Pineau, Joelle (eds.), New York, NY, USA, July, Omnipress
-
Kleiner, Ariel, Talwalkar, Ameet, Sarkar, Puraamrita, and Jordan, Michael. The big data bootstrap. In Langford, John and Pineau, Joelle (eds.), Proceedings of the 29th International Conference on Machine Learning (ICML-12), ICML '12, pp. 1759-1766, New York, NY, USA, July 2012. Omnipress. ISBN 978-1-4503-1285-1.
-
(2012)
Proceedings of the 29th International Conference on Machine Learning (ICML-12), ICML '12
, pp. 1759-1766
-
-
Kleiner, A.1
Talwalkar, A.2
Sarkar, P.3
Jordan, M.4
-
9
-
-
57849115716
-
Controlled experiments on the web: Survey and practical guide
-
Kohavi, Ron, Longbotham, Roger, Sommerfield, Dan, and Henne, Randal M. Controlled experiments on the web: Survey and practical guide. Journal of Data Mining and Knowledge Discovery, 18:140-181, 2009.
-
(2009)
Journal of Data Mining and Knowledge Discovery
, vol.18
, pp. 140-181
-
-
Kohavi, R.1
Longbotham, R.2
Sommerfield, D.3
Henne, R.M.4
-
10
-
-
0001334793
-
Kernel regression and backpropagation training with noise
-
Moody, John E. Hanson, Steve J. and Lippmann, Richard P. (eds.), San Francisco, CA: Morgan Kaufmann
-
Koistinen, Petri and Holmstrom, Lassc. Kernel regression and backpropagation training with noise. In Moody, John E., Hanson, Steve J., and Lippmann, Richard P. (eds.), Advances in Neural Information Processing Systems 4, pp. 1033-1039. San Francisco, CA: Morgan Kaufmann, 1992.
-
(1992)
Advances in Neural Information Processing Systems
, vol.4
, pp. 1033-1039
-
-
Koistinen, P.1
Holmstrom, L.2
-
11
-
-
77956144722
-
The epoch-greedy algorithm for multi-armed bandits with side information
-
Langford, John and Zhang, Tong. The epoch-greedy algorithm for multi-armed bandits with side information. In Proc. NIPS, 2007.
-
(2007)
Proc. NIPS
-
-
Langford, J.1
Zhang, T.2
-
12
-
-
56449124046
-
Exploration scavenging
-
Langford, John, Strehl, Alexander, and Wortman, Jennifer. Exploration scavenging. In Proceedings of the International Conference on Machine Learning (ICML), pp. 528-535, 2008.
-
(2008)
Proceedings of the International Conference on Machine Learning (ICML)
, pp. 528-535
-
-
Langford, J.1
Strehl, A.2
Wortman, J.3
-
13
-
-
77954641643
-
A contextual-bandit approach to personalized news article recommendation
-
New York, NY, USA, ACM
-
Li, Lihong, Chu, Wei, Langford, John, and Schapire, Robert E. A contextual-bandit approach to personalized news article recommendation. In Proceedings of the 19th international conference on World wide web (WWW), pp. 661-670, New York, NY, USA, 2010. ACM. ISBN 978-1-60558-799-8. doi: 10.1145/1772690.1772758.
-
(2010)
Proceedings of the 19th International Conference on World Wide Web (WWW)
, pp. 661-670
-
-
Li, L.1
Chu, W.2
Langford, J.3
Schapire, R.E.4
-
14
-
-
79952384747
-
Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms
-
King, Irwin, Nejdl, Wolfgang, and Li, Hang (eds.), ACM
-
Li, Lihong, Chu, Wei, Langford, John, and Wang, Xuanhui. Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms. In King, Irwin, Nejdl, Wolfgang, and Li, Hang (eds.), Proc. Web Search and Data Mining (WSDM), pp. 297-306. ACM, 2011. ISBN 978-1-4503-0493-1.
-
(2011)
Proc. Web Search and Data Mining (WSDM)
, pp. 297-306
-
-
Li, L.1
Chu, W.2
Langford, J.3
Wang, X.4
-
16
-
-
84954310738
-
-
PhD thesis, Universite Lille 1, Cite Scientifique, Villeneuve d'Ascq, France
-
Nicol, Olivier. Data-driven evaluation of Contextual Bandit algorithms and applications to Dynamic Recommendation. PhD thesis, Universite Lille 1, Cite Scientifique, Villeneuve d'Ascq, France, 2014.
-
(2014)
Data-driven Evaluation of Contextual Bandit Algorithms and Applications to Dynamic Recommendation
-
-
Nicol, O.1
-
17
-
-
84966203785
-
Some aspects of the sequential design of experiments
-
Robbins, Herbert. Some aspects of the sequential design of experiments. Bulletin of the American Mathematical Society, 58(5):527-535, 1952.
-
(1952)
Bulletin of the American Mathematical Society
, vol.58
, Issue.5
, pp. 527-535
-
-
Robbins, H.1
-
19
-
-
0000521133
-
The bootstrap: To smooth or not to smooth?
-
Silverman, BW and Young, GA. The bootstrap: To smooth or not to smooth? Biometrika, 74(3):469-479, 1987.
-
(1987)
Biometrika
, vol.74
, Issue.3
, pp. 469-479
-
-
Silverman, B.W.1
Young, G.A.2
-
20
-
-
85162031443
-
Learning from logged implicit exploration data
-
Strehl, Alexander L., Langford, John, Li, Lihong, and Kakade, Sham. Learning from logged implicit exploration data. In Proc. NIPS, pp. 2217-2225, 2010.
-
(2010)
Proc. NIPS
, pp. 2217-2225
-
-
Strehl, A.L.1
Langford, J.2
Li, L.3
Kakade, S.4
-
22
-
-
0001395850
-
On the likelihood that one unknown probability exceeds another in view of the evidence of two samples
-
Thompson, W.R. On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. Biometrika, 25(3-4):285-294, 1933.
-
(1933)
Biometrika
, vol.25
, Issue.3-4
, pp. 285-294
-
-
Thompson, W.R.1
-
23
-
-
84919925758
-
-
Yahoo! Research. R6B - Yahoo! frontpage today module user click log dataset, publicly released via the Yahoo! webscope program, 2012
-
Yahoo! Research. R6B - Yahoo! frontpage today module user click log dataset, publicly released via the Yahoo! webscope program, 2012.
-
-
-
|