2011, Pages 297-306

Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms

Author keywords

Benchmark dataset; Contextual bandit; Multi-armed bandit; Offline evaluation; Recommendation

Indexed keywords

BENCHMARK DATASETS; CONTEXTUAL BANDIT; MULTI-ARMED BANDIT; OFFLINE EVALUATION; RECOMMENDATION

EID: 79952384747     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1935826.1935878     Document Type: Conference Paper
Times cited: 527

References (22)
  • 1
    • 0344118814
    • Reinforcement learning with immediate rewards and linear hypotheses
    • Naoki Abe, Alan W. Biermann, and Philip M. Long. Reinforcement learning with immediate rewards and linear hypotheses. Algorithmica, 37(4): 263-293, 2003.
    • (2003) Algorithmica , vol.37 , Issue.4 , pp. 263-293
    • Abe, N.1    Biermann, A.W.2    Long, P.M.3
  • 4
    • 0041966002
    • Using confidence bounds for exploitation-exploration trade-offs
    • Peter Auer. Using confidence bounds for exploitation-exploration trade-offs. Journal of Machine Learning Research, 3: 397-422, 2002.
    • (2002) Journal of Machine Learning Research , vol.3 , pp. 397-422
    • Auer, P.1
  • 5
    • 0036568025
    • Finite-time analysis of the multiarmed bandit problem
    • Peter Auer, Nicolò Cesa-Bianchi, and Paul Fischer. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47(2-3): 235-256, 2002.
    • (2002) Machine Learning , vol.47 , Issue.2-3 , pp. 235-256
    • Auer, P.1    Cesa-Bianchi, N.2    Fischer, P.3
  • 9
    • 0028442413
    • Associative reinforcement learning: Functions in k-DNF
    • Leslie Pack Kaelbling. Associative reinforcement learning: Functions in k-DNF. Machine Learning, 15(3): 279-298, 1994.
    • (1994) Machine Learning , vol.15 , Issue.3 , pp. 279-298
    • Kaelbling, L.P.1
  • 11
    • 0002899547
    • Asymptotically efficient adaptive allocation rules
    • Tze Leung Lai and Herbert Robbins. Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 6(1): 4-22, 1985.
    • (1985) Advances in Applied Mathematics , vol.6 , Issue.1 , pp. 4-22
    • Lai, T.L.1    Robbins, H.2
  • 22
    • 0001631327
    • A one-armed bandit problem with a concomitant variable
    • Michael Woodroofe. A one-armed bandit problem with a concomitant variable. Journal of the American Statistical Association, 74(368): 799-806, 1979.
    • (1979) Journal of the American Statistical Association , vol.74 , Issue.368 , pp. 799-806
    • Woodroofe, M.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.