메뉴 건너뛰기




Volumn 2, Issue , 2014, Pages 1390-1398

Improving offline evaluation of contextual bandit algorithms via bootstrapping techniques

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; ARTIFICIAL INTELLIGENCE; E-LEARNING; LEARNING ALGORITHMS; LEARNING SYSTEMS; SOCIAL NETWORKING (ONLINE);

EID: 84919909115     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (6)

References (23)
  • 2
    • 0036568025 scopus 로고    scopus 로고
    • Finite-time analysis of the multiarmed bandit problem
    • May
    • Auer, Peter, Cesa-Bianchi, Nicolo, and Fischer, Paul. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47:235-256, May 2002. ISSN 0885- 6125.
    • (2002) Machine Learning , vol.47 , pp. 235-256
    • Auer, P.1    Cesa-Bianchi, N.2    Fischer, P.3
  • 3
    • 0037709910 scopus 로고    scopus 로고
    • The nonstochastic multiarmed bandit problem
    • January
    • Auer, Peter, Cesa-Bianchi, Nicolo, Freund, Yoav, and Schapire, Robert E. The nonstochastic multiarmed bandit problem. SIAM J. Comput., 32(1):48-77, January 2003. ISSN 0097-5397. doi: 10.1137/S0097539701398375.
    • (2003) SIAM J. Comput. , vol.32 , Issue.1 , pp. 48-77
    • Auer, P.1    Cesa-Bianchi, N.2    Freund, Y.3    Schapire, R.E.4
  • 5
    • 84919925760 scopus 로고    scopus 로고
    • Doubly robust policy evaluation and learning
    • abs/1103, 4601
    • Dudik, Miroslav, Langford, John, and Li, Lihong. Doubly robust policy evaluation and learning. CoRR, abs/1103, 4601, 2011.
    • (2011) CoRR
    • Dudik, M.1    Langford, J.2    Li, L.3
  • 6
    • 0002344794 scopus 로고
    • Bootstrap methods: Another look at the jack- knife
    • Efron, B. Bootstrap methods: Another look at the jack- knife. The Annals of Statistics, 7(1): 1-26, 1979. ISSN 00905364. doi: 10.2307/2958830.
    • (1979) The Annals of Statistics , vol.7 , Issue.1 , pp. 1-26
    • Efron, B.1
  • 10
    • 0001334793 scopus 로고
    • Kernel regression and backpropagation training with noise
    • Moody, John E. Hanson, Steve J. and Lippmann, Richard P. (eds.), San Francisco, CA: Morgan Kaufmann
    • Koistinen, Petri and Holmstrom, Lassc. Kernel regression and backpropagation training with noise. In Moody, John E., Hanson, Steve J., and Lippmann, Richard P. (eds.), Advances in Neural Information Processing Systems 4, pp. 1033-1039. San Francisco, CA: Morgan Kaufmann, 1992.
    • (1992) Advances in Neural Information Processing Systems , vol.4 , pp. 1033-1039
    • Koistinen, P.1    Holmstrom, L.2
  • 11
    • 77956144722 scopus 로고    scopus 로고
    • The epoch-greedy algorithm for multi-armed bandits with side information
    • Langford, John and Zhang, Tong. The epoch-greedy algorithm for multi-armed bandits with side information. In Proc. NIPS, 2007.
    • (2007) Proc. NIPS
    • Langford, J.1    Zhang, T.2
  • 14
    • 79952384747 scopus 로고    scopus 로고
    • Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms
    • King, Irwin, Nejdl, Wolfgang, and Li, Hang (eds.), ACM
    • Li, Lihong, Chu, Wei, Langford, John, and Wang, Xuanhui. Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms. In King, Irwin, Nejdl, Wolfgang, and Li, Hang (eds.), Proc. Web Search and Data Mining (WSDM), pp. 297-306. ACM, 2011. ISBN 978-1-4503-0493-1.
    • (2011) Proc. Web Search and Data Mining (WSDM) , pp. 297-306
    • Li, L.1    Chu, W.2    Langford, J.3    Wang, X.4
  • 17
    • 84966203785 scopus 로고
    • Some aspects of the sequential design of experiments
    • Robbins, Herbert. Some aspects of the sequential design of experiments. Bulletin of the American Mathematical Society, 58(5):527-535, 1952.
    • (1952) Bulletin of the American Mathematical Society , vol.58 , Issue.5 , pp. 527-535
    • Robbins, H.1
  • 19
    • 0000521133 scopus 로고
    • The bootstrap: To smooth or not to smooth?
    • Silverman, BW and Young, GA. The bootstrap: To smooth or not to smooth? Biometrika, 74(3):469-479, 1987.
    • (1987) Biometrika , vol.74 , Issue.3 , pp. 469-479
    • Silverman, B.W.1    Young, G.A.2
  • 20
    • 85162031443 scopus 로고    scopus 로고
    • Learning from logged implicit exploration data
    • Strehl, Alexander L., Langford, John, Li, Lihong, and Kakade, Sham. Learning from logged implicit exploration data. In Proc. NIPS, pp. 2217-2225, 2010.
    • (2010) Proc. NIPS , pp. 2217-2225
    • Strehl, A.L.1    Langford, J.2    Li, L.3    Kakade, S.4
  • 22
    • 0001395850 scopus 로고
    • On the likelihood that one unknown probability exceeds another in view of the evidence of two samples
    • Thompson, W.R. On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. Biometrika, 25(3-4):285-294, 1933.
    • (1933) Biometrika , vol.25 , Issue.3-4 , pp. 285-294
    • Thompson, W.R.1
  • 23
    • 84919925758 scopus 로고    scopus 로고
    • Yahoo! Research. R6B - Yahoo! frontpage today module user click log dataset, publicly released via the Yahoo! webscope program, 2012
    • Yahoo! Research. R6B - Yahoo! frontpage today module user click log dataset, publicly released via the Yahoo! webscope program, 2012.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.