



Volume , Issue , 2011, Pages 204-212

Learning to trade off between exploration and exploitation in multiclass bandit prediction

Author keywords

Bandit feedback; Exploration vs. exploitation; Multi class classification; Online learning

Indexed keywords

DATA MINING; E-LEARNING; LEARNING ALGORITHMS; LEARNING SYSTEMS; PARAMETER ESTIMATION

EID: 80052674910     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2020408.2020445     Document Type: Conference Paper
Times cited: 12

References (19)
  • 1
    • 0036568025
    • Finite-time analysis of the multiarmed bandit problem
    • DOI 10.1023/A:1013689704352, Computational Learning Theory
    • Peter Auer, Nicolò Cesa-Bianchi, and Paul Fischer. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47(2-3):235-256, 2002. (Pubitemid 34126111)
    • (2002) Machine Learning , vol.47 , Issue.2-3 , pp. 235-256
    • Auer, P.1    Cesa-Bianchi, N.2    Fischer, P.3
  • 6
    • 33745295134
    • Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems
    • Eyal Even-Dar, Shie Mannor, and Yishay Mansour. Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems. Journal of Machine Learning Research, 7:1079-1105, 2006. (Pubitemid 43938989)
    • (2006) Journal of Machine Learning Research , vol.7 , pp. 1079-1105
    • Even-Dar, E.1    Mannor, S.2    Mansour, Y.3
  • 13
    • 30044441333
    • The sample complexity of exploration in the multi-armed bandit problem
    • Shie Mannor and John N. Tsitsiklis. The sample complexity of exploration in the multi-armed bandit problem. Journal of Machine Learning Research, 5:623-648, 2004.
    • (2004) Journal of Machine Learning Research , vol.5 , pp. 623-648
    • Mannor, S.1    Tsitsiklis, J.N.2
  • 14
    • 84966203785
    • Some aspects of the sequential design of experiments
    • Herbert Robbins. Some aspects of the sequential design of experiments. Bulletin of the American Mathematical Society, 58:527-535, 1952.
    • (1952) Bulletin of the American Mathematical Society , vol.58 , pp. 527-535
    • Robbins, H.1
  • 15
    • 84966203785
    • Some aspects of the sequential design of experiments
    • Herbert Robbins. Some aspects of the sequential design of experiments. Bull. Amer. Math. Soc., 58(5):527-535, 1952.
    • (1952) Bull. Amer. Math. Soc. , vol.58 , Issue.5 , pp. 527-535
    • Robbins, H.1
  • 16
    • 11144273669
    • The perceptron: A probabilistic model for information storage and organization in the brain
    • F. Rosenblatt. The perceptron: a probabilistic model for information storage and organization in the brain. Psychological Review, 65:386-408, 1958.
    • (1958) Psychological Review , vol.65 , pp. 386-408
    • Rosenblatt, F.1
  • 17
    • 33646406807
    • Multi-armed bandit algorithms and empirical evaluation
    • Springer
    • Joannès Vermorel and Mehryar Mohri. Multi-armed bandit algorithms and empirical evaluation. In European Conference on Machine Learning, pages 437-448. Springer, 2005.
    • (2005) European Conference on Machine Learning , pp. 437-448
    • Vermorel, J.1    Mohri, M.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.