



Volume 17, Issue 1, 2003, Pages 53-82

Asymptotic Bayes analysis for the finite-horizon one-armed-bandit problem

Author keywords

[No Author keywords available]

Indexed keywords

ECONOMIC AND SOCIAL EFFECTS

EID: 0037249547     PISSN: 02699648     EISSN: None     Source Type: Journal    
DOI: 10.1017/S0269964803171045     Document Type: Article
Times cited: 10

References (29)
  • 1
    • 0024089489
    • Asymptotically efficient adaptive allocation rules for the multi-armed bandit problem with switching cost
    • Agrawal, R., Hegde, M., & Teneketzis, D. (1988). Asymptotically efficient adaptive allocation rules for the multi-armed bandit problem with switching cost. IEEE Transactions on Automatic Control 33: 899-906.
    • (1988) IEEE Transactions on Automatic Control , vol.33 , pp. 899-906
    • Agrawal, R.1    Hegde, M.2    Teneketzis, D.3
  • 2
    • 0004870746
    • A problem in the sequential design of experiments
    • Bellman, R. (1956). A problem in the sequential design of experiments. Sankhyā 16: 221-229.
    • (1956) Sankhyā , vol.16 , pp. 221-229
    • Bellman, R.1
  • 6
    • 0030159874
    • Optimal adaptive policies for sequential allocation problems
    • Burnetas, A.N. & Katehakis, M.N. (1996). Optimal adaptive policies for sequential allocation problems. Advances in Applied Mathematics 17(2): 122-142.
    • (1996) Advances in Applied Mathematics , vol.17 , Issue.2 , pp. 122-142
    • Burnetas, A.N.1    Katehakis, M.N.2
  • 8
    • 0031070051
    • Optimal adaptive policies for Markov decision processes
    • Burnetas, A.N. & Katehakis, M.N. (1997). Optimal adaptive policies for Markov decision processes. Mathematics of Operations Research 22(1): 222-255.
    • (1997) Mathematics of Operations Research , vol.22 , Issue.1 , pp. 222-255
    • Burnetas, A.N.1    Katehakis, M.N.2
  • 10
    • 0038635077
    • A Bayes sequential sampling inspection problem
    • Chernoff, H. & Ray, S. (1965). A Bayes sequential sampling inspection problem. Annals of Mathematical Statistics 36: 1387-1407.
    • (1965) Annals of Mathematical Statistics , vol.36 , pp. 1387-1407
    • Chernoff, H.1    Ray, S.2
  • 16
    • 0036792912
    • An index policy for a stochastic scheduling model with improving/deteriorating jobs
    • Glazebrook, K.D. & Mitchell, H.M. (2002). An index policy for a stochastic scheduling model with improving/deteriorating jobs. Naval Research Logistics 49: 706-721.
    • (2002) Naval Research Logistics , vol.49 , pp. 706-721
    • Glazebrook, K.D.1    Mitchell, H.M.2
  • 19
    • 0023345261
    • The multi-armed bandit problem: Decomposition and computation
    • Katehakis, M.N. & Veinott, A.F., Jr. (1987). The multi-armed bandit problem: Decomposition and computation. Mathematics of Operations Research 12(2): 262-268.
    • (1987) Mathematics of Operations Research , vol.12 , Issue.2 , pp. 262-268
    • Katehakis, M.N.1    Veinott, A.F., Jr.2
  • 21
    • 0000854435
    • Adaptive treatment allocation and the multi-armed bandit problem
    • Lai, T.L. (1987). Adaptive treatment allocation and the multi-armed bandit problem. Annals of Statistics 15(3): 1091-1114.
    • (1987) Annals of Statistics , vol.15 , Issue.3 , pp. 1091-1114
    • Lai, T.L.1
  • 22
    • 0346405517
    • Sequential analysis: Some classical problems and new challenges
    • Lai, T.L. (2001). Sequential analysis: Some classical problems and new challenges. Statistica Sinica 11(2): 303-352.
    • (2001) Statistica Sinica , vol.11 , Issue.2 , pp. 303-352
    • Lai, T.L.1
  • 23
    • 0002899547
    • Asymptotically efficient adaptive allocation rules
    • Lai, T.L. & Robbins, H. (1985). Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics 6: 4-22.
    • (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
    • Lai, T.L.1    Robbins, H.2
  • 24
    • 84966203785
    • Some aspects of the sequential design of experiments
    • Robbins, H. (1952). Some aspects of the sequential design of experiments. Bulletin of the American Mathematical Society 58: 527-535.
    • (1952) Bulletin of the American Mathematical Society , vol.58 , pp. 527-535
    • Robbins, H.1
  • 25
  • 26
    • 0038161359
    • Asymptotically efficient adaptive strategies in repeated games, Part I: Certainty equivalence strategies
    • Shimkin, N. & Shwartz, A. (1995). Asymptotically efficient adaptive strategies in repeated games, Part I: Certainty equivalence strategies. Mathematics of Operations Research 20: 743-767.
    • (1995) Mathematics of Operations Research , vol.20 , pp. 743-767
    • Shimkin, N.1    Shwartz, A.2
  • 27
    • 0030134723
    • Asymptotically efficient adaptive strategies in repeated games, Part II: Asymptotic optimality
    • Shimkin, N. & Shwartz, A. (1996). Asymptotically efficient adaptive strategies in repeated games, Part II: Asymptotic optimality. Mathematics of Operations Research 21: 487-512.
    • (1996) Mathematics of Operations Research , vol.21 , pp. 487-512
    • Shimkin, N.1    Shwartz, A.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.