메뉴 건너뛰기




Volumn 8139 LNAI, Issue , 2013, Pages 234-248

An efficient algorithm for learning with semi-bandit feedback

Author keywords

bandit problems; combinatorial optimization; Follow the perturbed leader; online learning

Indexed keywords

BANDIT PROBLEMS; BINARY VECTORS; FOLLOW-THE-PERTURBED-LEADER; FULL INFORMATIONS; LOSS ESTIMATION; ONLINE LEARNING; PREDICTION METHODS; REGRET BOUNDS;

EID: 84887500930     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-642-40935-6_17     Document Type: Conference Paper
Times cited : (71)

References (17)
  • 1
    • 33750733956 scopus 로고    scopus 로고
    • Hannan consistency in on-line learning in case of unbounded losses under partial monitoring
    • Balcázar, J.L., Long, P.M., Stephan, F. (eds.) ALT 2006. Springer, Heidelberg
    • Allenberg, C., Auer, P., Györfi, L., Ottucsák, G.: Hannan consistency in on-line learning in case of unbounded losses under partial monitoring. In: Balcázar, J.L., Long, P.M., Stephan, F. (eds.) ALT 2006. LNCS (LNAI), vol. 4264, pp. 229-243. Springer, Heidelberg (2006)
    • (2006) LNCS (LNAI) , vol.4264 , pp. 229-243
    • Allenberg, C.1    Auer, P.2    Györfi, L.3    Ottucsák, G.4
  • 2
    • 78649420293 scopus 로고    scopus 로고
    • Regret bounds and minimax policies under partial monitoring
    • Audibert, J.-Y., Bubeck, S.: Regret bounds and minimax policies under partial monitoring. Journal of Machine Learning Research 11, 2635-2686 (2010)
    • (2010) Journal of Machine Learning Research , vol.11 , pp. 2635-2686
    • Audibert, J.-Y.1    Bubeck, S.2
  • 11
    • 0001976283 scopus 로고
    • Approximation to Bayes risk in repeated play
    • Hannan, J.: Approximation to Bayes risk in repeated play. Contributions to the Theory of Games 3, 97-139 (1957)
    • (1957) Contributions to the Theory of Games , vol.3 , pp. 97-139
    • Hannan, J.1
  • 14
    • 9444257628 scopus 로고    scopus 로고
    • Online geometric optimization in the bandit setting against an adaptive adversary
    • Shawe-Taylor, J., Singer, Y. (eds.) COLT 2004. Springer, Heidelberg
    • McMahan, H.B., Blum, A.: Online geometric optimization in the bandit setting against an adaptive adversary. In: Shawe-Taylor, J., Singer, Y. (eds.) COLT 2004. LNCS (LNAI), vol. 3120, pp. 109-123. Springer, Heidelberg (2004)
    • (2004) LNCS (LNAI) , vol.3120 , pp. 109-123
    • McMahan, H.B.1    Blum, A.2
  • 15
    • 33646753171 scopus 로고    scopus 로고
    • FPL analysis for adaptive bandits
    • Lupanov, O.B., Kasim-Zade, O.M., Chaskin, A.V., Steinhöfel, K. (eds.) SAGA 2005. Springer, Heidelberg
    • Poland, J.: FPL analysis for adaptive bandits. In: Lupanov, O.B., Kasim-Zade, O.M., Chaskin, A.V., Steinhöfel, K. (eds.) SAGA 2005. LNCS, vol. 3777, pp. 58-69. Springer, Heidelberg (2005)
    • (2005) LNCS , vol.3777 , pp. 58-69
    • Poland, J.1
  • 16
    • 84867856605 scopus 로고    scopus 로고
    • Online prediction under submodular constraints
    • Bshouty, N.H., Stoltz, G., Vayatis, N., Zeugmann, T. (eds.) ALT 2012. Springer, Heidelberg
    • Suehiro, D., Hatano, K., Kijima, S., Takimoto, E., Nagano, K.: Online prediction under submodular constraints. In: Bshouty, N.H., Stoltz, G., Vayatis, N., Zeugmann, T. (eds.) ALT 2012. LNCS, vol. 7568, pp. 260-274. Springer, Heidelberg (2012)
    • (2012) LNCS , vol.7568 , pp. 260-274
    • Suehiro, D.1    Hatano, K.2    Kijima, S.3    Takimoto, E.4    Nagano, K.5


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.