메뉴 건너뛰기




Volumn 227, Issue , 2007, Pages 721-728

Multi-armed bandit problems with dependent arms

Author keywords

[No Author keywords available]

Indexed keywords

APPROXIMATION THEORY; CLUSTER ANALYSIS; ERRORS; KNOWLEDGE ACQUISITION; PROBLEM SOLVING; REAL TIME SYSTEMS;

EID: 34547966991     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1273496.1273587     Document Type: Conference Paper
Times cited : (117)

References (15)
  • 1
    • 0000616723 scopus 로고
    • Sample mean based index policies with O(log n) regret for the multi-armed bandit problem
    • Agrawal, R. (1995). Sample mean based index policies with O(log n) regret for the multi-armed bandit problem. Advances in Applied Probability, 27, 1054-1078.
    • (1995) Advances in Applied Probability , vol.27 , pp. 1054-1078
    • Agrawal, R.1
  • 2
    • 0009953451 scopus 로고
    • Optimal stopping and dynamic allocation
    • Chang, F., & Lai, T. L. (1987). Optimal stopping and dynamic allocation. Advances in Applied Probability, 19, 829-853.
    • (1987) Advances in Applied Probability , vol.19 , pp. 829-853
    • Chang, F.1    Lai, T.L.2
  • 3
    • 71049162986 scopus 로고    scopus 로고
    • Coarse sample complexity bounds for active learning
    • Dasgupta, S. (2005). Coarse sample complexity bounds for active learning. NIPS.
    • (2005) NIPS
    • Dasgupta, S.1
  • 6
    • 34547975806 scopus 로고    scopus 로고
    • Bandit based monte-carlo planning
    • Kocsis, L., & Szepesvári, C (2006). Bandit based monte-carlo planning. ECML.
    • (2006) ECML
    • Kocsis, L.1    Szepesvári, C.2
  • 7
    • 0000854435 scopus 로고
    • Adaptive treatment allocation and multi-armed bandit problem
    • Lai, T. L. (1987). Adaptive treatment allocation and multi-armed bandit problem. Annals of Statistics, 15(3), 1091-1114.
    • (1987) Annals of Statistics , vol.15 , Issue.3 , pp. 1091-1114
    • Lai, T.L.1
  • 8
    • 0002899547 scopus 로고
    • Asymptotically efficient adaptive allocation rules
    • Lai, T. L., & Robbins, H. (1985). Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 6, 4-22.
    • (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
    • Lai, T.L.1    Robbins, H.2
  • 9
    • 0000695404 scopus 로고
    • Information-based objective functions for active data selection
    • MacKay, D. (1992). Information-based objective functions for active data selection. Neural Computation, 4, 590-604.
    • (1992) Neural Computation , vol.4 , pp. 590-604
    • MacKay, D.1
  • 11
    • 0036568025 scopus 로고    scopus 로고
    • Finitetime analysis of the multiarmed bandit problem
    • P.Auer, N.Cesa-Bianchi, & P.Fischer (2002). Finitetime analysis of the multiarmed bandit problem. Machine Learning, 47, 235-256.
    • (2002) Machine Learning , vol.47 , pp. 235-256
    • Auer, P.1    Cesa-Bianchi, N.2    Fischer, P.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.