메뉴 건너뛰기




Volumn 23, Issue 2, 2011, Pages 254-267

Computing a classic index for finite-horizon bandits

Author keywords

Analysis of algorithms; Bandits, finite horizon; Computational complexity; Dynamic programming, Markov; Index policies

Indexed keywords


EID: 79955755016     PISSN: 10919856     EISSN: 15265528     Source Type: Journal    
DOI: 10.1287/ijoc.1100.0398     Document Type: Article
Times cited : (35)

References (17)
  • 1
    • 0004870746 scopus 로고
    • A problem in the sequential design of experiments
    • Bellman, R. 1956. A problem in the sequential design of experiments. Sankhya 16 (3/4) 221-229.
    • (1956) Sankhya , vol.16 , Issue.3-4 , pp. 221-229
    • Bellman, R.1
  • 3
    • 0000964340 scopus 로고
    • On sequential designs for maximizing the sum of n observations
    • Bradt, R. N., S. M. Johnson, S. Karlin. 1956. On sequential designs for maximizing the sum of n observations. Ann. Math. Statist. 27 (4) 1060-1074.
    • (1956) Ann. Math. Statist. , vol.27 , Issue.4 , pp. 1060-1074
    • Bradt, R.N.1    Johnson, S.M.2    Karlin, S.3
  • 4
    • 33847255926 scopus 로고    scopus 로고
    • Dynamic assortment with demand learning for seasonal consumer goods
    • DOI 10.1287/mnsc.1060.0613
    • Caro, F., J. Gallien. 2007. Dynamic assortment with demand learning for seasonal consumer goods. Management Sci. 53 (2) 276-292. (Pubitemid 46326180)
    • (2007) Management Science , vol.53 , Issue.2 , pp. 276-292
    • Caro, F.1    Gallien, J.2
  • 6
    • 0346873971 scopus 로고    scopus 로고
    • Small-sample performance of Bernoulli two-armed bandit Bayesian strategies
    • Ginebra, J., M. K. Clayton. 1999. Small-sample performance of Bernoulli two-armed bandit Bayesian strategies. J. Statist. Plann. Inference 79 (1) 107-122.
    • (1999) J. Statist. Plann. Inference , vol.79 , Issue.1 , pp. 107-122
    • Ginebra, J.1    Clayton, M.K.2
  • 7
    • 0000169010 scopus 로고
    • Bandit processes and dynamic allocation indices
    • Gittins, J. C. 1979. Bandit processes and dynamic allocation indices. J. Roy. Statist. Soc. Ser. B 41 (2) 148-177.
    • (1979) J. Roy. Statist. Soc. Ser. B , vol.41 , Issue.2 , pp. 148-177
    • Gittins, J.C.1
  • 9
    • 0002955623 scopus 로고
    • A dynamic allocation index for the sequential design of experiments
    • J. Gani, K. Sarkadi, I. Vincze, eds, Budapest, 1972. North-Holland, Amsterdam
    • Gittins, J. C., D. M. Jones. 1974. A dynamic allocation index for the sequential design of experiments. J. Gani, K. Sarkadi, I. Vincze, eds. Progress in Statistics (European Meeting of Statisticians, Budapest, 1972). North-Holland, Amsterdam, 241-266.
    • (1974) Progress in Statistics (European Meeting of Statisticians , pp. 241-266
    • Gittins, J.C.1    Jones, D.M.2
  • 10
    • 17744388964 scopus 로고    scopus 로고
    • Restless bandits, partial conservation laws and indexability
    • DOI 10.1239/aap/999187898
    • Niño-Mora, J. 2001. Restless bandits, partial conservation laws and indexability Adv. Appl. Probab. 33 (1) 76-98. (Pubitemid 32443970)
    • (2001) Advances in Applied Probability , vol.33 , Issue.1 , pp. 76-98
    • Nino-Mora, J.1
  • 11
    • 2442577720 scopus 로고    scopus 로고
    • Dynamic allocation indices for restless projects and queueing admission control: A polyhedral approach
    • DOI 10.1007/s10107-002-0362-6
    • Niño-Mora, J. 2002. Dynamic allocation indices for restless projects and queueing admission control: A polyhedral approach. Math. Programming 93 (3) 361-413. (Pubitemid 44744737)
    • (2002) Mathematical Programming, Series B , vol.93 , Issue.3 , pp. 361-413
    • Nino-Mora, J.1
  • 12
    • 33847228067 scopus 로고    scopus 로고
    • A marginal productivity index policy for the finite-horizon multiarmed bandit problem
    • DOI 10.1109/CDC.2005.1582407, 1582407, Proceedings of the 44th IEEE Conference on Decision and Control, and the European Control Conference, CDC-ECC '05
    • Niño-Mora, J. 2005. A marginal productivity index policy for the finite-horizon multiarmed bandit problem. CDC-ECC'05: Proc. 44th IEEE Conf. Decision Control Eur. Control Conf. 2005, Seville, Spain. IEEE, Washington, DC, 1718-1722. (Pubitemid 46297216)
    • (2005) Proceedings of the 44th IEEE Conference on Decision and Control, and the European Control Conference, CDC-ECC '05 , vol.2005 , pp. 1718-1722
    • Nino-Mora, J.1
  • 13
    • 49349091331 scopus 로고    scopus 로고
    • 3 fast-pivoting algorithm for the Gittins index and optimal stopping of a Markov chain
    • 3 fast-pivoting algorithm for the Gittins index and optimal stopping of a Markov chain. INFORMS J. Comput. 19 (4) 596-606.
    • (2007) INFORMS J. Comput. , vol.19 , Issue.4 , pp. 596-606
    • Niño-Mora, J.1
  • 15
    • 84966203785 scopus 로고
    • Some aspects of the sequential design of experiments
    • Robbins, H. 1952. Some aspects of the sequential design of experiments. Bull. Amer. Math. Soc. 58 (5) 527-535.
    • (1952) Bull. Amer. Math. Soc. , vol.58 , Issue.5 , pp. 527-535
    • Robbins, H.1
  • 17
    • 79955773062 scopus 로고    scopus 로고
    • Error bounds for calculation of the Gittins indices
    • Wang, Y.-G. 1997. Error bounds for calculation of the Gittins indices. Austral. J. Statist. 39 (2) 225-233.
    • (1997) Austral. J. Statist. , vol.39 , Issue.2 , pp. 225-233
    • Wang, Y.-G.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.