Volume 5313 LNCS, 2008, Pages 56-68

Improving the exploration strategy in bandit algorithms

Author keywords

[No Author keywords available]

Indexed keywords

PROBABILITY; PROBABILITY DISTRIBUTIONS; RANDOM PROCESSES;

EID: 58349084664     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-540-92695-5_5     Document Type: Conference Paper
Times cited: 19

References (13)
  • 1
    • 0036568025
    • Finite-time analysis of the multiarmed bandit problem
    • Auer, P., Cesa-Bianchi, N., Fischer, P.: Finite-time analysis of the multiarmed bandit problem. Machine Learning 47(2/3), 235-256 (2002)
    • (2002) Machine Learning , vol.47 , Issue.2-3 , pp. 235-256
    • Auer, P.1    Cesa-Bianchi, N.2    Fischer, P.3
  • 2
    • 4243096065
    • Exploitation vs. exploration: Choosing a supplier in an environment of incomplete information
    • Azoulay-Schwartz, R., Kraus, S., Wilkenfeld, J.: Exploitation vs. exploration: choosing a supplier in an environment of incomplete information. Decision support systems 38(1), 1-18 (2004)
    • (2004) Decision support systems , vol.38 , Issue.1 , pp. 1-18
    • Azoulay-Schwartz, R.1    Kraus, S.2    Wilkenfeld, J.3
  • 4
    • 0001341675
    • Numerical computation of multivariate normal probabilities
    • Genz, A.: Numerical computation of multivariate normal probabilities. Journal of Computational and Graphical Statistics 1, 141-149 (1992)
    • (1992) Journal of Computational and Graphical Statistics , vol.1 , pp. 141-149
    • Genz, A.1
  • 6
    • 0343088226
    • Bandit strategies for ethical sequential allocation
    • Hardwick, J., Stout, Q.: Bandit strategies for ethical sequential allocation. Computing Science and Statistics 23, 421-424 (1991)
    • (1991) Computing Science and Statistics , vol.23 , pp. 421-424
    • Hardwick, J.1    Stout, Q.2
  • 13
    • 33646406807
    • Multi-armed bandit algorithms and empirical evaluation
    • Vermorel, J., Mohri, M.: Multi-armed bandit algorithms and empirical evaluation. In: Gama, J., Camacho, R., Brazdil, P.B., Jorge, A.M., Torgo, L. (eds.) ECML 2005. LNCS, vol. 3720, pp. 437-448. Springer, Heidelberg (2005)


* This information was extracted and analyzed by KISTI from Elsevier's SCOPUS database.