SCOPUS 정보 검색 플랫폼

31st International Conference on Machine Learning, ICML 2014

Volumn 2, Issue , 2014, Pages 1390-1398

Improving offline evaluation of contextual bandit algorithms via bootstrapping techniques

(3) Nicol, Olivier a Mary, Jérémie a Preux, Philippe a

a UNIV LILLE (France)

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; ARTIFICIAL INTELLIGENCE; E-LEARNING; LEARNING ALGORITHMS; LEARNING SYSTEMS; SOCIAL NETWORKING (ONLINE);

CONTEXTUAL BANDITS; CRITICAL ISSUES; DATA-DRIVEN METHODS; MODEL-BASED METHOD; NEWS RECOMMENDATION; OFFLINE EVALUATION; ONLINE LEARNING ALGORITHMS; STATE-OF-THE-ART METHODS;

QUALITY CONTROL;

EID: 84919909115 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (6)

References (23)

1
- 77950884255
- Spatio-temporal models for estimating click- through rate
- New York, NY, USA, ACM
- Agarwal, Deepak, Chen, Bee-Chung, and Elango, Pradheep. Spatio-temporal models for estimating click- through rate. In Proceedings of the 18th international conference on World wide web(WWW), pp. 21-30, New York, NY, USA, 2009. ACM. ISBN 978-1-60558-487-4. doi: 10.1145/1526709.1526713.
- (2009) Proceedings of the 18th International Conference on World Wide Web(WWW) , pp. 21-30
- Agarwal, D.¹ Chen, B.-C.² Elango, P.³

2
- 0036568025
- Finite-time analysis of the multiarmed bandit problem
- May
- Auer, Peter, Cesa-Bianchi, Nicolo, and Fischer, Paul. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47:235-256, May 2002. ISSN 0885- 6125.
- (2002) Machine Learning , vol.47 , pp. 235-256
- Auer, P.¹ Cesa-Bianchi, N.² Fischer, P.³

3
- 0037709910
- The nonstochastic multiarmed bandit problem
- January
- Auer, Peter, Cesa-Bianchi, Nicolo, Freund, Yoav, and Schapire, Robert E. The nonstochastic multiarmed bandit problem. SIAM J. Comput., 32(1):48-77, January 2003. ISSN 0097-5397. doi: 10.1137/S0097539701398375.
- (2003) SIAM J. Comput. , vol.32 , Issue.1 , pp. 48-77
- Auer, P.¹ Cesa-Bianchi, N.² Freund, Y.³ Schapire, R.E.⁴

4
- 84919925761
- Technical report, arXiv: 1209.2355, September
- Bottou, Leon, Peters, Jonas, Quinonero Candela, Joaquin, Charles, Denis X., Chickering, D. Max, Portugualy, Elon, Ray, Dipankar, Simard, Patrice, and Snelson, Ed. Counterfactual reasoning and learning systems. Technical report, arXiv: 1209.2355, September 2012.
- (2012) Counterfactual Reasoning and Learning Systems
- Bottou, L.¹ Peters, J.² Quinonero Candela, J.³ Charles, D.X.⁴ Chickering, D.M.⁵ Portugualy, E.⁶ Ray, D.⁷ Simard, P.⁸ Snelson, E.⁹

5
- 84919925760
- Doubly robust policy evaluation and learning
- abs/1103, 4601
- Dudik, Miroslav, Langford, John, and Li, Lihong. Doubly robust policy evaluation and learning. CoRR, abs/1103, 4601, 2011.
- (2011) CoRR
- Dudik, M.¹ Langford, J.² Li, L.³

6
- 0002344794
- Bootstrap methods: Another look at the jack- knife
- Efron, B. Bootstrap methods: Another look at the jack- knife. The Annals of Statistics, 7(1): 1-26, 1979. ISSN 00905364. doi: 10.2307/2958830.
- (1979) The Annals of Statistics , vol.7 , Issue.1 , pp. 1-26
- Efron, B.¹

7
- 70350120268
- The bootstrap
- Horowitz, Joel L. The bootstrap. Handbook of econometrics, 5:3159-3228, 2001.
- (2001) Handbook of Econometrics , vol.5 , pp. 3159-3228
- Horowitz, J.L.¹

8
- 84867129586
- The big data bootstrap
- Langford, John and Pineau, Joelle (eds.), New York, NY, USA, July, Omnipress
- Kleiner, Ariel, Talwalkar, Ameet, Sarkar, Puraamrita, and Jordan, Michael. The big data bootstrap. In Langford, John and Pineau, Joelle (eds.), Proceedings of the 29th International Conference on Machine Learning (ICML-12), ICML '12, pp. 1759-1766, New York, NY, USA, July 2012. Omnipress. ISBN 978-1-4503-1285-1.
- (2012) Proceedings of the 29th International Conference on Machine Learning (ICML-12), ICML '12 , pp. 1759-1766
- Kleiner, A.¹ Talwalkar, A.² Sarkar, P.³ Jordan, M.⁴

9
- 57849115716
- Controlled experiments on the web: Survey and practical guide
- Kohavi, Ron, Longbotham, Roger, Sommerfield, Dan, and Henne, Randal M. Controlled experiments on the web: Survey and practical guide. Journal of Data Mining and Knowledge Discovery, 18:140-181, 2009.
- (2009) Journal of Data Mining and Knowledge Discovery , vol.18 , pp. 140-181
- Kohavi, R.¹ Longbotham, R.² Sommerfield, D.³ Henne, R.M.⁴

10
- 0001334793
- Kernel regression and backpropagation training with noise
- Moody, John E. Hanson, Steve J. and Lippmann, Richard P. (eds.), San Francisco, CA: Morgan Kaufmann
- Koistinen, Petri and Holmstrom, Lassc. Kernel regression and backpropagation training with noise. In Moody, John E., Hanson, Steve J., and Lippmann, Richard P. (eds.), Advances in Neural Information Processing Systems 4, pp. 1033-1039. San Francisco, CA: Morgan Kaufmann, 1992.
- (1992) Advances in Neural Information Processing Systems , vol.4 , pp. 1033-1039
- Koistinen, P.¹ Holmstrom, L.²

11
- 77956144722
- The epoch-greedy algorithm for multi-armed bandits with side information
- Langford, John and Zhang, Tong. The epoch-greedy algorithm for multi-armed bandits with side information. In Proc. NIPS, 2007.
- (2007) Proc. NIPS
- Langford, J.¹ Zhang, T.²

12
- 56449124046
- Exploration scavenging
- Langford, John, Strehl, Alexander, and Wortman, Jennifer. Exploration scavenging. In Proceedings of the International Conference on Machine Learning (ICML), pp. 528-535, 2008.
- (2008) Proceedings of the International Conference on Machine Learning (ICML) , pp. 528-535
- Langford, J.¹ Strehl, A.² Wortman, J.³

13
- 77954641643
- A contextual-bandit approach to personalized news article recommendation
- New York, NY, USA, ACM
- Li, Lihong, Chu, Wei, Langford, John, and Schapire, Robert E. A contextual-bandit approach to personalized news article recommendation. In Proceedings of the 19th international conference on World wide web (WWW), pp. 661-670, New York, NY, USA, 2010. ACM. ISBN 978-1-60558-799-8. doi: 10.1145/1772690.1772758.
- (2010) Proceedings of the 19th International Conference on World Wide Web (WWW) , pp. 661-670
- Li, L.¹ Chu, W.² Langford, J.³ Schapire, R.E.⁴

14
- 79952384747
- Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms
- King, Irwin, Nejdl, Wolfgang, and Li, Hang (eds.), ACM
- Li, Lihong, Chu, Wei, Langford, John, and Wang, Xuanhui. Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms. In King, Irwin, Nejdl, Wolfgang, and Li, Hang (eds.), Proc. Web Search and Data Mining (WSDM), pp. 297-306. ACM, 2011. ISBN 978-1-4503-0493-1.
- (2011) Proc. Web Search and Data Mining (WSDM) , pp. 297-306
- Li, L.¹ Chu, W.² Langford, J.³ Wang, X.⁴

15
- 84862301554
- Contextual multi-armed bandits
- May 13-15
- th Artificial Intelligence and Statistics (AI & Stats), JMLR: W&CP 9, May 13-15 2010.
- (2010) th Artificial Intelligence and Statistics (AI & Stats), JMLR: W&CP , vol.9
- Lu, T.¹ Pal, D.² Pal, M.³

16
- 84954310738
- PhD thesis, Universite Lille 1, Cite Scientifique, Villeneuve d'Ascq, France
- Nicol, Olivier. Data-driven evaluation of Contextual Bandit algorithms and applications to Dynamic Recommendation. PhD thesis, Universite Lille 1, Cite Scientifique, Villeneuve d'Ascq, France, 2014.
- (2014) Data-driven Evaluation of Contextual Bandit Algorithms and Applications to Dynamic Recommendation
- Nicol, O.¹

17
- 84966203785
- Some aspects of the sequential design of experiments
- Robbins, Herbert. Some aspects of the sequential design of experiments. Bulletin of the American Mathematical Society, 58(5):527-535, 1952.
- (1952) Bulletin of the American Mathematical Society , vol.58 , Issue.5 , pp. 527-535
- Robbins, H.¹

18
- 0003545910
- Wiley Series in Probability and Statistics. Wiley
- Scott, D.W. Multivariate Density Estimation: Theory, Practice, and Visualization. Wiley Series in Probability and Statistics. Wiley, 1992. ISBN 9780471547709.
- (1992) Multivariate Density Estimation: Theory, Practice, and Visualization
- Scott, D.W.¹

19
- 0000521133
- The bootstrap: To smooth or not to smooth?
- Silverman, BW and Young, GA. The bootstrap: To smooth or not to smooth? Biometrika, 74(3):469-479, 1987.
- (1987) Biometrika , vol.74 , Issue.3 , pp. 469-479
- Silverman, B.W.¹ Young, G.A.²

20
- 85162031443
- Learning from logged implicit exploration data
- Strehl, Alexander L., Langford, John, Li, Lihong, and Kakade, Sham. Learning from logged implicit exploration data. In Proc. NIPS, pp. 2217-2225, 2010.
- (2010) Proc. NIPS , pp. 2217-2225
- Strehl, A.L.¹ Langford, J.² Li, L.³ Kakade, S.⁴

21
- 0004102479
- Reinforcement learning: An introduction
- MIT Press
- Sutton, R.S. and Barto, A.G. Reinforcement Learning: An Introduction. Adaptive Computation and Machine Learning Series. MIT Press, 1998. ISBN 9780262193986.
- (1998) Adaptive Computation and Machine Learning Series
- Sutton, R.S.¹ Barto, A.G.²

22
- 0001395850
- On the likelihood that one unknown probability exceeds another in view of the evidence of two samples
- Thompson, W.R. On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. Biometrika, 25(3-4):285-294, 1933.
- (1933) Biometrika , vol.25 , Issue.3-4 , pp. 285-294
- Thompson, W.R.¹

23
- 84919925758
- Yahoo! Research. R6B - Yahoo! frontpage today module user click log dataset, publicly released via the Yahoo! webscope program, 2012
- Yahoo! Research. R6B - Yahoo! frontpage today module user click log dataset, publicly released via the Yahoo! webscope program, 2012.

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.