메뉴 건너뛰기




Volumn 9, Issue , 2010, Pages 485-492

Contextual multi-armed bandits

Author keywords

[No Author keywords available]

Indexed keywords

BOUNDED SUBSET; CLICK-THROUGH RATE; CONTEXT-FREE; EUCLIDEAN SPACES; INTERNET SEARCH ENGINE; LIPSCHITZ; LIPSCHITZ CONDITIONS; LOWER BOUNDS; METRIC SPACES; MULTI ARMED BANDIT; MULTI-ARMED BANDIT PROBLEM; ON-LINE ALGORITHMS; PACKING DIMENSION; PAYOFF FUNCTION; SEARCH QUERIES; SIDE INFORMATION; UPPER AND LOWER BOUNDS; WEB SEARCHES;

EID: 84862301554     PISSN: 15324435     EISSN: 15337928     Source Type: Journal    
DOI: None     Document Type: Conference Paper
Times cited : (192)

References (20)
  • 1
    • 0345224411 scopus 로고
    • The continuum-armed bandit problem
    • R. Agrawal. The continuum-armed bandit problem. SIAM J. Control and Optimization, 33:1926-1951, 1995.
    • (1995) SIAM J. Control and Optimization , vol.33 , pp. 1926-1951
    • Agrawal, R.1
  • 3
    • 0036568025 scopus 로고    scopus 로고
    • Finitetime analysis of the multiarmed bandit problem
    • Peter Auer, Nicolò Cesa-Bianchi, and Paul Fischer. Finitetime analysis of the multiarmed bandit problem. Machine Learning, 47(2-3):235-256, 2002.
    • (2002) Machine Learning , vol.47 , Issue.2-3 , pp. 235-256
    • Auer, P.1    Cesa-Bianchi, N.2    Fischer, P.3
  • 6
    • 77952027689 scopus 로고    scopus 로고
    • Online optimization in x-armed bandits
    • Sébastien Bubeck, Rémi Munos, Gilles Stoltz, and Csaba Szepesvári. Online optimization in x-armed bandits. In NIPS, pages 201-208, 2008.
    • (2008) NIPS , pp. 201-208
    • Bubeck, S.1    Munos, R.2    Stoltz, G.3    Szepesvári, C.4
  • 9
    • 33745295134 scopus 로고    scopus 로고
    • Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems
    • Eyal Even-Dar, Shie Mannor, and Yishay Mansour. Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems. Journal of Machine Learning Research, 7:1079-1105, 2006.
    • (2006) Journal of Machine Learning Research , vol.7 , pp. 1079-1105
    • Even-Dar, E.1    Mannor, S.2    Mansour, Y.3
  • 10
    • 20744454447 scopus 로고    scopus 로고
    • Online convex optimization in the bandit setting: Gradient descent without a gradient
    • Society for Industrial and Applied Mathematics Philadelphia, PA, USA
    • Abraham D. Flaxman, Adam T. Kalai, and H. Brendan McMahan. Online convex optimization in the bandit setting: gradient descent without a gradient. In Proceedings of the sixteenth annual ACM-SIAM symposium on Discrete algorithms (SODA 2005), pages 385-394. Society for Industrial and Applied Mathematics Philadelphia, PA, USA, 2005.
    • (2005) Proceedings of the Sixteenth Annual ACM-SIAM Symposium on Discrete Algorithms (SODA 2005) , pp. 385-394
    • Flaxman, A.D.1    Kalai, A.T.2    McMahan, H.B.3
  • 12
    • 84947403595 scopus 로고
    • Probability inequalities for sums of bounded random variables
    • Wassily Hoeffding. Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association, 58(301):13-30, 1963.
    • (1963) Journal of the American Statistical Association , vol.58 , Issue.301 , pp. 13-30
    • Hoeffding, W.1
  • 13
    • 84898981061 scopus 로고    scopus 로고
    • Nearly tight bounds for the continuum-armed bandit problem
    • Lawrence K. Saul, Yair Weiss, and Léon Bottou, editors. MIT Press
    • Robert D. Kleinberg. Nearly tight bounds for the continuum-armed bandit problem. In Lawrence K. Saul, Yair Weiss, and Léon Bottou, editors, Advances in Neural Information Processing Systems 17, (NIPS 2005), pages 697-704. MIT Press, 2005a.
    • (2005) Advances in Neural Information Processing Systems 17, (NIPS 2005) , pp. 697-704
    • Kleinberg, R.D.1
  • 16
    • 0002899547 scopus 로고
    • Asymptotically efficient adaptive allocation rules
    • T. L. Lai and Herbert Robbins. Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 6(1):4-22, 1985.
    • (1985) Advances in Applied Mathematics , vol.6 , Issue.1 , pp. 4-22
    • Lai, T.L.1    Robbins, H.2
  • 18
    • 83055177001 scopus 로고    scopus 로고
    • The epoch-greedy algorithm for multi-armed bandits with side information
    • John Langford and Tong Zhang. The epoch-greedy algorithm for multi-armed bandits with side information. In NIPS, 2007.
    • (2007) NIPS
    • Langford, J.1    Zhang, T.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.