메뉴 건너뛰기




Volumn 34, Issue 1, 2009, Pages 26-44

A generalized gittins index for a class of multiarmed bandits with general resource requirements

Author keywords

Asymptotic optimality; Bandit problems; Dynamic programming; Gittins index; Resource allocation

Indexed keywords

ASYMPTOTIC OPTIMALITY; BANDIT PROBLEMS; BANDIT PROCESS; CLASSICAL MODEL; DECISION POINTS; DYNAMIC ALLOCATIONS; GITTINS INDEX; INDEX HEURISTICS; MULTI ARMED BANDIT; NUMERICAL STUDIES; OPTIMALITY; PER UNIT; RESOURCE DISTRIBUTION; RESOURCE REQUIREMENTS; SIMPLE MODEL; WEIGHTED INDEX;

EID: 67649922844     PISSN: 0364765X     EISSN: 15265471     Source Type: Journal    
DOI: 10.1287/moor.1080.0342     Document Type: Article
Times cited : (12)

References (31)
  • 1
    • 0024089489 scopus 로고
    • Asymptotically efficient adaptive allocation rules for the multiarmed bandit problem with switching cost
    • Agrawal, R., M. Hedge, D. Teneketzis. 1988. Asymptotically efficient adaptive allocation rules for the multiarmed bandit problem with switching cost. IEEE Trans. Aut. Ctrl. 33 899-906.
    • (1988) IEEE Trans. Aut. Ctrl , vol.33 , pp. 899-906
    • Agrawal, R.1    Hedge, M.2    Teneketzis, D.3
  • 2
    • 2442525727 scopus 로고    scopus 로고
    • Whittle's index policy for a multi-class queueing system with convex holding costs
    • Ansell, P. S., K. D. Glazebrook, J. Niño-Mora, M. O'Keeffe. 2003. Whittle's index policy for a multi-class queueing system with convex holding costs. Math. Methods Oper. Res. 57 21-39.
    • (2003) Math. Methods Oper. Res , vol.57 , pp. 21-39
    • Ansell, P.S.1    Glazebrook, K.D.2    Niño-Mora, J.3    O'Keeffe, M.4
  • 3
    • 0000456128 scopus 로고
    • Switching costs and the Gittins index
    • Banks, J. S., D. Sundaram. 1994. Switching costs and the Gittins index. Econometrica 62 687-694.
    • (1994) Econometrica , vol.62 , pp. 687-694
    • Banks, J.S.1    Sundaram, D.2
  • 5
    • 0030134077 scopus 로고    scopus 로고
    • Conservation laws, extended polymatroids and multi-armed bandit problems: A polyhedral approach to indexable systems
    • Bertsimas, D., J. Niño-Mora. 1996. Conservation laws, extended polymatroids and multi-armed bandit problems: A polyhedral approach to indexable systems. Math. Oper. Res. 21 257-306.
    • (1996) Math. Oper. Res , vol.21 , pp. 257-306
    • Bertsimas, D.1    Niño-Mora, J.2
  • 6
    • 0034187039 scopus 로고    scopus 로고
    • Index policies and a novel performance space structure for a class of generalised branching bandit problems
    • Crosbie, J. H., K. D. Glazebrook. 2000. Index policies and a novel performance space structure for a class of generalised branching bandit problems. Math. Oper. Res. 25 281-297.
    • (2000) Math. Oper. Res , vol.25 , pp. 281-297
    • Crosbie, J.H.1    Glazebrook, K.D.2
  • 7
    • 4043169028 scopus 로고    scopus 로고
    • Denardo, E. V., U. G. Rothblum, L. van der Heyden. 2004. Index policies for stochastic search in a forest with an application to R. & D. project management. Math. Oper. Res. 29 162-181.
    • Denardo, E. V., U. G. Rothblum, L. van der Heyden. 2004. Index policies for stochastic search in a forest with an application to R. & D. project management. Math. Oper. Res. 29 162-181.
  • 8
    • 0000169010 scopus 로고
    • Bandit processes and dynamic allocation indices (with discussion)
    • Gittins, J. C. 1979. Bandit processes and dynamic allocation indices (with discussion). J. Roy. Statist. Soc. B41 148-177.
    • (1979) J. Roy. Statist. Soc , vol.B41 , pp. 148-177
    • Gittins, J.C.1
  • 10
    • 4344702543 scopus 로고
    • On stochastic scheduling with precedence relations and switching costs
    • Glazebrook, K. D. 1980. On stochastic scheduling with precedence relations and switching costs. J. Appl. Probab. 17 1016-1024.
    • (1980) J. Appl. Probab , vol.17 , pp. 1016-1024
    • Glazebrook, K.D.1
  • 11
    • 0033476090 scopus 로고    scopus 로고
    • Almost optimal policies for stochastic systems which almost satisfy conservation laws
    • Glazebrook, K. D., R. Garbe. 1999. Almost optimal policies for stochastic systems which almost satisfy conservation laws. Ann. Oper. Res. 99 19-43.
    • (1999) Ann. Oper. Res , vol.99 , pp. 19-43
    • Glazebrook, K.D.1    Garbe, R.2
  • 12
    • 67649940320 scopus 로고    scopus 로고
    • Index policies for the routing of background jobs
    • Glazebrook, K. D., C. Kirkbride. 2004. Index policies for the routing of background jobs. Naval Res. Logist. 52 381-398.
    • (2004) Naval Res. Logist , vol.52 , pp. 381-398
    • Glazebrook, K.D.1    Kirkbride, C.2
  • 13
    • 33846376782 scopus 로고    scopus 로고
    • Dynamic routing to heterogeneous collections of unreliable servers
    • Glazebrook, K. D., C. Kirkbride. 2007. Dynamic routing to heterogeneous collections of unreliable servers. Queueing Systems 55 9-25.
    • (2007) Queueing Systems , vol.55 , pp. 9-25
    • Glazebrook, K.D.1    Kirkbride, C.2
  • 14
    • 33646081779 scopus 로고    scopus 로고
    • Spinning plates and squad systems - Policies for bi-directional restless bandits
    • Glazebrook, K. D., C. Kirkbride, D. Ruiz-Hernandez. 2006a. Spinning plates and squad systems - Policies for bi-directional restless bandits. Adv. Appl. Probab. 38 95-115.
    • (2006) Adv. Appl. Probab , vol.38 , pp. 95-115
    • Glazebrook, K.D.1    Kirkbride, C.2    Ruiz-Hernandez, D.3
  • 16
    • 0036950786 scopus 로고    scopus 로고
    • Index policies for a class of discounted restless bandits
    • Glazebrook, K. D., J. Niño-Mora, P. S. Ansell. 2002. Index policies for a class of discounted restless bandits. Adv. Appl. Probab. 34 754-774.
    • (2002) Adv. Appl. Probab , vol.34 , pp. 754-774
    • Glazebrook, K.D.1    Niño-Mora, J.2    Ansell, P.S.3
  • 17
    • 0023345261 scopus 로고
    • The multi-armed bandit problem: Decomposition and computation
    • Katehakis, M. N., A. F. Veinott, Jr. 1987. The multi-armed bandit problem: Decomposition and computation. Math. Oper. Res. 12 262-268.
    • (1987) Math. Oper. Res , vol.12 , pp. 262-268
    • Katehakis, M.N.1    Veinott Jr., A.F.2
  • 19
    • 0000963016 scopus 로고
    • A generalized bandit problem
    • Nash, P. 1980. A generalized bandit problem. J. Roy. Statist. Soc. B42 165-169.
    • (1980) J. Roy. Statist. Soc , vol.B42 , pp. 165-169
    • Nash, P.1
  • 20
    • 17744388964 scopus 로고    scopus 로고
    • Restless bandits, partial conservation laws and indexability
    • Niño-Mora, J. 2001. Restless bandits, partial conservation laws and indexability. Adv. Appl. Probab. 33 76-98.
    • (2001) Adv. Appl. Probab , vol.33 , pp. 76-98
    • Niño-Mora, J.1
  • 21
    • 2442577720 scopus 로고    scopus 로고
    • Dynamic allocation indices for restless projects and queueing admission control: A polyhedral approach
    • Niño-Mora, J. 2002. Dynamic allocation indices for restless projects and queueing admission control: A polyhedral approach. Math. Program. 93 361-413.
    • (2002) Math. Program , vol.93 , pp. 361-413
    • Niño-Mora, J.1
  • 22
    • 49349091331 scopus 로고    scopus 로고
    • 3 fast-pivoting algorithm for the Gittins index and optimal stopping of a Markov chain
    • 3 fast-pivoting algorithm for the Gittins index and optimal stopping of a Markov chain. INFORMS J. Comput. 19 596-606.
    • (2007) INFORMS J. Comput , vol.19 , pp. 596-606
    • Niño-Mora, J.1
  • 23
    • 24144456244 scopus 로고    scopus 로고
    • Outsourcing warranty repairs - Dynamic allocation
    • Opp, M., K. D. Glazebrook, V. Kulkarni. 2005. Outsourcing warranty repairs - Dynamic allocation. Naval Res. Logist. 52 381-398.
    • (2005) Naval Res. Logist , vol.52 , pp. 381-398
    • Opp, M.1    Glazebrook, K.D.2    Kulkarni, V.3
  • 24
    • 0032628612 scopus 로고    scopus 로고
    • The complexity of optimal queueing network control
    • Papadimitriou, C. H., J. N. Tsitsiklis. 1999. The complexity of optimal queueing network control. Math. Oper. Res. 24 293-305.
    • (1999) Math. Oper. Res , vol.24 , pp. 293-305
    • Papadimitriou, C.H.1    Tsitsiklis, J.N.2
  • 26
    • 0032115326 scopus 로고    scopus 로고
    • Dynamic scheduling of a two-class queue with set-ups
    • Reiman, M. I., L. M. Wein. 1998. Dynamic scheduling of a two-class queue with set-ups. Oper. Res. 46 532-547.
    • (1998) Oper. Res , vol.46 , pp. 532-547
    • Reiman, M.I.1    Wein, L.M.2
  • 27
    • 0020116650 scopus 로고
    • Algorithms for evaluating the dynamic allocation index
    • Robinson, D. R. 1982. Algorithms for evaluating the dynamic allocation index. Oper. Res. Lett. 1 72-74.
    • (1982) Oper. Res. Lett , vol.1 , pp. 72-74
    • Robinson, D.R.1
  • 28
    • 0008550960 scopus 로고
    • Optimal stochastic scheduling of forest networks with switching penalties
    • Van Oyen, M. P., D. Teneketzis. 1994. Optimal stochastic scheduling of forest networks with switching penalties. Adv. Appl. Probab. 26 474-479.
    • (1994) Adv. Appl. Probab , vol.26 , pp. 474-479
    • Van Oyen, M.P.1    Teneketzis, D.2
  • 29
    • 0002327722 scopus 로고
    • On an index policy for restless bandits
    • Weber, R. R., G. Weiss. 1990. On an index policy for restless bandits. J. Appl. Probab. 27 637-648,
    • (1990) J. Appl. Probab , vol.27 , pp. 637-648
    • Weber, R.R.1    Weiss, G.2
  • 30
    • 67649917745 scopus 로고
    • Addendum:, 429-430
    • (Addendum: Adv. Appl. Probab., 23 1991, 429-430).
    • (1991) Adv. Appl. Probab , vol.23
  • 31
    • 0001043843 scopus 로고
    • Restless bandits: Activity allocation in a changing world
    • J. Gani, ed, 25A, Probability Trust, Sheffield, UK
    • Whittle, P. 1988. Restless bandits: Activity allocation in a changing world. J. Gani, ed. A Celebration of Applied Probability. J. Appl. Probab. Special Volume 25A 287-298. Probability Trust, Sheffield, UK.
    • (1988) A Celebration of Applied Probability. J. Appl. Probab , vol.Special Volume , pp. 287-298
    • Whittle, P.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.