메뉴 건너뛰기




Volumn , Issue , 2011, Pages

Multi-Armed Bandit Allocation Indices: 2nd Edition

Author keywords

[No Author keywords available]

Indexed keywords


EID: 84891584370     PISSN: None     EISSN: None     Source Type: Book    
DOI: 10.1002/9780470980033     Document Type: Book
Times cited : (1205)

References (225)
  • 1
    • 72249096472 scopus 로고    scopus 로고
    • On the Gittins index in the M/G/1 queue
    • Aalto, S., Ayesta, U. and Righter, R. (2009). On the Gittins index in the M/G/1 queue, Queueing Syst. 63, 437-58.
    • (2009) Queueing Syst. , vol.63 , pp. 437-458
    • Aalto, S.1    Ayesta, U.2    Righter, R.3
  • 2
    • 0024089489 scopus 로고
    • Asymptotically efficient adaptive allocation rules for the multiarmed bandit problem with switching costs
    • Agrawal, R., Hedge, M. and Teneketzis, D. (1988). Asymptotically efficient adaptive allocation rules for the multiarmed bandit problem with switching costs, IEEE T. Automat. Contr. 33, 899-906.
    • (1988) IEEE T. Automat. Contr. , vol.33 , pp. 899-906
    • Agrawal, R.1    Hedge, M.2    Teneketzis, D.3
  • 3
    • 0003426797 scopus 로고
    • Search Problems
    • John Wiley & Sons, Ltd
    • Ahlswede, R. and Wegener, I. (1987). Search Problems, John Wiley & Sons, Ltd.
    • (1987)
    • Ahlswede, R.1    Wegener, I.2
  • 5
    • 0038223016 scopus 로고    scopus 로고
    • The Theory of Search Games and Rendezvous
    • Springer
    • Alpern, S. and Gal, S. (2003). The Theory of Search Games and Rendezvous, Springer.
    • (2003)
    • Alpern, S.1    Gal, S.2
  • 6
    • 84886667945 scopus 로고
    • Aspects of Optimal Sequential Resource Allocation
    • do D.Phil. thesis, Oxford
    • Amaral, J.A.F.P., do (1985). Aspects of Optimal Sequential Resource Allocation, D.Phil. thesis, Oxford.
    • (1985)
    • Amaral, J.A.F.P.1
  • 7
    • 0023453059 scopus 로고
    • Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays -Part I: I.I.D. rewards
    • Anantharam, V., Varaiya, P. and Walrand, J. (1987a). Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays -Part I: I.I.D. rewards, IEEE T. Automat. Contr. 32, 968-76.
    • (1987) IEEE T. Automat. Contr. , vol.32 , pp. 968-976
    • Anantharam, V.1    Varaiya, P.2    Walrand, J.3
  • 8
    • 0023450663 scopus 로고
    • Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays -Part II: Markovian rewards
    • Anantharam, V., Varaiya, P. and Walrand, J. (1987b). Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays -Part II: Markovian rewards, IEEE T. Automat. Contr. 32, 977-82.
    • (1987) IEEE T. Automat. Contr. , vol.32 , pp. 977-982
    • Anantharam, V.1    Varaiya, P.2    Walrand, J.3
  • 9
    • 2442525727 scopus 로고    scopus 로고
    • Whittle's index policy for a multi-class queueing system with convex holding costs
    • Ansell, P.S., Glazebrook, K.D., Niño-Mora, J. and O'Keeffe, M. (2003). Whittle's index policy for a multi-class queueing system with convex holding costs, Math. Meth. Oper. Res. 57, 21-39.
    • (2003) Math. Meth. Oper. Res. , vol.57 , pp. 21-39
    • Ansell, P.S.1    Glazebrook, K.D.2    Niño-mora, J.3    O'Keeffe, M.4
  • 10
    • 10844240284 scopus 로고    scopus 로고
    • Evaluating the effectiveness of shoot-look-shoot tactics in the presence of incomplete damage information
    • Aviv, Y. and Kress, M. (1997). Evaluating the effectiveness of shoot-look-shoot tactics in the presence of incomplete damage information, Military Oper. Res. 3, 79-89.
    • (1997) Military Oper. Res. , vol.3 , pp. 79-89
    • Aviv, Y.1    Kress, M.2
  • 11
    • 77950572152 scopus 로고    scopus 로고
    • Characterizing truthful multi-armed bandit mechanisms: extended abstract
    • in EC '09: Proceedings of the 2009 ACM Conference Electronic Commerce, ACM
    • Babaioff, M., Sharma, Y. and Slivkins, A. (2009). Characterizing truthful multi-armed bandit mechanisms: extended abstract, in EC '09: Proceedings of the 2009 ACM Conference Electronic Commerce, ACM, pages 79-88.
    • (2009) , pp. 79-88
    • Babaioff, M.1    Sharma, Y.2    Slivkins, A.3
  • 12
    • 0004028632 scopus 로고
    • Introduction to Sequencing and Scheduling
    • John Wiley & Sons, Ltd
    • Baker, K.R. (1974). Introduction to Sequencing and Scheduling, John Wiley & Sons, Ltd.
    • (1974)
    • Baker, K.R.1
  • 13
    • 34447629932 scopus 로고    scopus 로고
    • On Gittins index theorem in continuous time
    • Bank, P. and Küchler, C. (2007). On Gittins index theorem in continuous time, Stoc. Proc. Appl. 117, 1357-71.
    • (2007) Stoc. Proc. Appl. , vol.117 , pp. 1357-1371
    • Bank, P.1    Küchler, C.2
  • 14
    • 0000456128 scopus 로고
    • Switching costs and the Gittins index
    • Banks, J.S. and Sundaram, R.K. (1994). Switching costs and the Gittins index, Econometrica 62, 687-94.
    • (1994) Econometrica , vol.62 , pp. 687-694
    • Banks, J.S.1    Sundaram, R.K.2
  • 16
    • 0004216171 scopus 로고
    • Mathematical Basis of Statistics
    • Academic Press
    • Barra, J.-R. (1981). Mathematical Basis of Statistics, Academic Press.
    • (1981)
    • Barra, J.-R.1
  • 17
    • 29144487977 scopus 로고    scopus 로고
    • Dynamic routing and admission control in high-volume service systems: asymptotic analysis via multi-scale fluid limits
    • Bassamboo, A., Harrison, J.M. and Zeevi, A. (2005). Dynamic routing and admission control in high-volume service systems: asymptotic analysis via multi-scale fluid limits, Queueing Syst. 51, 249-85.
    • (2005) Queueing Syst. , vol.51 , pp. 249-285
    • Bassamboo, A.1    Harrison, J.M.2    Zeevi, A.3
  • 18
    • 84971160355 scopus 로고
    • Bayesian procedures for deciding the sign of a normal mean
    • Bather, J.A. (1962). Bayesian procedures for deciding the sign of a normal mean, Proc. Cambridge Phil. Soc. 58, 599-620.
    • (1962) Proc. Cambridge Phil. Soc. , vol.58 , pp. 599-620
    • Bather, J.A.1
  • 19
    • 0008953703 scopus 로고
    • Optimal stopping problems for Brownian motion
    • Bather, J.A. (1970). Optimal stopping problems for Brownian motion, Adv. Appl. Prob. 2, 259-86.
    • (1970) Adv. Appl. Prob. , vol.2 , pp. 259-286
    • Bather, J.A.1
  • 20
    • 0011447471 scopus 로고
    • Optimal stopping of Browian motion: a comparison technique
    • in D. Siegmund, H. Chernoff, J.S. Rustagi and M.H. Rizvi, editors, Recent Advances in Statistics, Academic Press
    • Bather, J.A. (1983). Optimal stopping of Browian motion: a comparison technique, in D. Siegmund, H. Chernoff, J.S. Rustagi and M.H. Rizvi, editors, Recent Advances in Statistics, Academic Press, pages 19-50.
    • (1983) , pp. 19-50
    • Bather, J.A.1
  • 21
    • 0035536037 scopus 로고    scopus 로고
    • Conservation laws for single-server fluid networks
    • Bäuerle, N. and Stidham, S., Jr. (2001). Conservation laws for single-server fluid networks, Queueing Syst. 38, 185-94.
    • (2001) Queueing Syst. , vol.38 , pp. 185-194
    • Bäuerle, N.1    Stidham, S.2
  • 22
    • 84886703170 scopus 로고
    • Contribution to the discussion of Gittins
    • Beale, E.M.L. (1979). Contribution to the discussion of Gittins, J. R. Statist. Soc. B 41, 171-2.
    • (1979) J. R. Statist. Soc. B , vol.41 , pp. 171-172
    • Beale, E.M.L.1
  • 23
    • 0004870746 scopus 로고
    • A problem in the sequential design of experiments
    • Bellman, R.E. (1956). A problem in the sequential design of experiments, Sankhya A 30, 221-52.
    • (1956) Sankhya A , vol.30 , pp. 221-252
    • Bellman, R.E.1
  • 24
    • 0003787146 scopus 로고
    • Dynamic Programming
    • Princeton University Press
    • Bellman, R.E. (1957). Dynamic Programming, Princeton University Press.
    • (1957)
    • Bellman, R.E.1
  • 25
    • 34249963623 scopus 로고
    • A bisection/successive approximation method for computing Gittins indices
    • Ben-Israel, A. and Fl°am, S.D. (1990). A bisection/successive approximation method for computing Gittins indices, Math. Meth. Oper. Res. 34, 411-22.
    • (1990) Math. Meth. Oper. Res. , vol.34 , pp. 411-422
    • Ben-israel, A.1    Fl°am, S.D.2
  • 26
    • 0003612843 scopus 로고
    • Applications des Inequations Variationelles en Controle Stochastique
    • Dunod
    • Bensoussan, A. and Lions, J.L. (1978). Applications des Inequations Variationelles en Controle Stochastique, Dunod.
    • (1978)
    • Bensoussan, A.1    Lions, J.L.2
  • 27
  • 28
    • 77955239759 scopus 로고    scopus 로고
    • Bandit problems
    • in S.N. Durlauf and L.E. Blume, editors, The New Palgrave Dictionary of Economics, Palgrave Macmillan
    • Bergemann, D. and Välimäki, J. (2008). Bandit problems, in S.N. Durlauf and L.E. Blume, editors, The New Palgrave Dictionary of Economics, Palgrave Macmillan, pages 336-40.
    • (2008) , pp. 336-340
    • Bergemann, D.1    Välimäki, J.2
  • 29
    • 84886668493 scopus 로고
    • Acceptance Sampling: The Buyer's Problem
    • Ph.D. thesis, Yale
    • Bergman, S.W. (1981). Acceptance Sampling: The Buyer's Problem, Ph.D. thesis, Yale.
    • (1981)
    • Bergman, S.W.1
  • 30
    • 0005907012 scopus 로고
    • Statistical Methods for Planning Pharmaceutical Research
    • Marcel Dekker
    • Bergman, S.W. and Gittins, J.C. (1985). Statistical Methods for Planning Pharmaceutical Research, Marcel Dekker.
    • (1985)
    • Bergman, S.W.1    Gittins, J.C.2
  • 31
    • 84886713514 scopus 로고
    • Das 'Multi-Armed-Bandit' Paradigms
    • Verlagsgruppe Athenaum
    • Berninghaus, S. (1984). Das 'Multi-Armed-Bandit' Paradigms, Verlagsgruppe Athenaum.
    • (1984)
    • Berninghaus, S.1
  • 32
    • 0004218171 scopus 로고
    • Bandit Problems
    • Chapman Hall
    • Berry, D.A. and Fristedt, B. (1985). Bandit Problems, Chapman Hall.
    • (1985)
    • Berry, D.A.1    Fristedt, B.2
  • 33
    • 0002083213 scopus 로고
    • Worth of perfect information in Bernoulli bandits
    • Berry, D.A. and Kertz, R.P. (1991). Worth of perfect information in Bernoulli bandits, Adv. Appl. Prob. 23, 1-23.
    • (1991) Adv. Appl. Prob. , vol.23 , pp. 1-23
    • Berry, D.A.1    Kertz, R.P.2
  • 34
    • 0003713964 scopus 로고    scopus 로고
    • Nonlinear Programming
    • Athena Scientific
    • Bertsekas, D.P. (1999). Nonlinear Programming, Athena Scientific.
    • (1999)
    • Bertsekas, D.P.1
  • 35
    • 84886702296 scopus 로고
    • Conservation laws
    • extended polymatroids and multi-armed bandit problems; a polyhedral approach to indexable systems, in G. Rinaldi and L. Wolsey, editors, Proceedings of the Third International Symposium on Integer Programming and Combinatorial Optimization (IPCO 93), Erice, Italy, CIACO
    • Bertsimas, D. and Niño-Mora, J. (1993). Conservation laws, extended polymatroids and multi-armed bandit problems; a polyhedral approach to indexable systems, in G. Rinaldi and L. Wolsey, editors, Proceedings of the Third International Symposium on Integer Programming and Combinatorial Optimization (IPCO 93), Erice, Italy, CIACO, pages 355-84.
    • (1993) , pp. 355-384
    • Bertsimas, D.1    Niño-mora, J.2
  • 37
    • 0029504871 scopus 로고
    • Branching bandits and Klimov's problem: achievable region and side constraints
    • Bertsimas, D., Paschalidis, I. and Tsitsiklis, J.N. (1995). Branching bandits and Klimov's problem: achievable region and side constraints, IEEE T. Automat. Contr. 40, 2063-75.
    • (1995) IEEE T. Automat. Contr. , vol.40 , pp. 2063-2075
    • Bertsimas, D.1    Paschalidis, I.2    Tsitsiklis, J.N.3
  • 38
    • 0003850196 scopus 로고    scopus 로고
    • Introduction to Linear Optimization
    • Athena Scientific
    • Bertsimas, D.J. and Tsitsiklis, J.N. (1997). Introduction to Linear Optimization, Athena Scientific.
    • (1997)
    • Bertsimas, D.J.1    Tsitsiklis, J.N.2
  • 39
    • 11544307548 scopus 로고
    • Extended polymatroids: properties and optimization
    • in E. Balas, G. Cornnjelos and R. Kannan, editors, Proceedings of the 2nd Integer Programming and Combinatorial Optimization Conference, Pittsburgh, University Printing and Publications, Carnegie Mellon University
    • Bhattacharya, P.P., Georgiadis, L. and Tsoucas, P. (1992). Extended polymatroids: properties and optimization, in E. Balas, G. Cornnjelos and R. Kannan, editors, Proceedings of the 2nd Integer Programming and Combinatorial Optimization Conference, Pittsburgh, University Printing and Publications, Carnegie Mellon University, pages 298-315.
    • (1992) , pp. 298-315
    • Bhattacharya, P.P.1    Georgiadis, L.2    Tsoucas, P.3
  • 40
    • 0034316108 scopus 로고    scopus 로고
    • On the value of learning for Bernoulli bandits with unknown parameters
    • Bhulai, S. and Koole, G.M. (2000). On the value of learning for Bernoulli bandits with unknown parameters, IEEE T. Automat. Contr. 45, 213-40.
    • (2000) IEEE T. Automat. Contr. , vol.45 , pp. 213-240
    • Bhulai, S.1    Koole, G.M.2
  • 41
    • 0000615044 scopus 로고
    • Discrete dynamic programming
    • Blackwell, D. (1962). Discrete dynamic programming, Ann. Math. Stat. 39, 719-26.
    • (1962) Ann. Math. Stat. , vol.39 , pp. 719-726
    • Blackwell, D.1
  • 42
    • 0001432658 scopus 로고
    • Discounted dynamic programming
    • Blackwell, D. (1965). Discounted dynamic programming, Ann. Math. Stat. 36, 226-35.
    • (1965) Ann. Math. Stat. , vol.36 , pp. 226-235
    • Blackwell, D.1
  • 43
    • 0000964340 scopus 로고
    • On sequential designs for maximizing the sum of n observations
    • Bradt, R.N., Johnson, S.M. and Karlin, S. (1956). On sequential designs for maximizing the sum of n observations, Ann. Math. Stat. 27, 1060-74.
    • (1956) Ann. Math. Stat. , vol.27 , pp. 1060-1074
    • Bradt, R.N.1    Johnson, S.M.2    Karlin, S.3
  • 44
    • 76549121186 scopus 로고    scopus 로고
    • Efficiency and redistribution in dynamic mechanism design
    • in EC'08: Proceedings of the 2008 ACM conference on Electronic Commerce, ACM
    • Cavallo, R. (2008). Efficiency and redistribution in dynamic mechanism design, in EC'08: Proceedings of the 2008 ACM conference on Electronic Commerce, ACM, pages 220-229.
    • (2008) , pp. 220-229
    • Cavallo, R.1
  • 45
    • 80053178227 scopus 로고    scopus 로고
    • Optimal coordinated planning amongst self-interested agents with private state
    • in R. Dechter and T. Richardson, editors, Proceedings of the 22nd Conference on Uncertainty in Artificial Intelligence (UAI'06), Cambridge, MA, AVAI Press
    • Cavallo, R., Parkes, D.C. and Singh, S. (2006). Optimal coordinated planning amongst self-interested agents with private state, in R. Dechter and T. Richardson, editors, Proceedings of the 22nd Conference on Uncertainty in Artificial Intelligence (UAI'06), Cambridge, MA, AVAI Press, pages 55-62.
    • (2006) , pp. 55-62
    • Cavallo, R.1    Parkes, D.C.2    Singh, S.3
  • 46
    • 0009953451 scopus 로고
    • Optimal stopping and dynamic allocation
    • Chang, F. and Lai, T.L. (1987). Optimal stopping and dynamic allocation, Adv. Appl. Prob. 19, 829-53.
    • (1987) Adv. Appl. Prob. , vol.19 , pp. 829-853
    • Chang, F.1    Lai, T.L.2
  • 47
    • 0008635259 scopus 로고
    • Linear programming for finite state multi-armed bandit problems
    • Chen, Y.R. and Katehakis, M.N. (1986). Linear programming for finite state multi-armed bandit problems, Math. Oper. Res., 11, 83.
    • (1986) Math. Oper. Res , vol.11 , pp. 83
    • Chen, Y.R.1    Katehakis, M.N.2
  • 48
    • 0009890069 scopus 로고
    • Sequential tests for the mean of a normal distribution
    • in J. Neyman, editor, Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability, volume 1, University of California Press
    • Chernoff, H. (1961). Sequential tests for the mean of a normal distribution, in J. Neyman, editor, Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability, volume 1, University of California Press, pages 79-81.
    • (1961) , pp. 79-81
    • Chernoff, H.1
  • 49
    • 0007162159 scopus 로고
    • Optimal stochastic control
    • Chernoff, H. (1968). Optimal stochastic control, Sankhya A 30, 221-52.
    • (1968) Sankhya A , vol.30 , pp. 221-252
    • Chernoff, H.1
  • 50
    • 0003741921 scopus 로고
    • Great Expectations: The Theory of Optimal Stopping
    • Houghton Mifflin
    • Chow, Y.S., Robbins, H. and Siegmund, D. (1971). Great Expectations: The Theory of Optimal Stopping, Houghton Mifflin.
    • (1971)
    • Chow, Y.S.1    Robbins, H.2    Siegmund, D.3
  • 51
    • 0004063929 scopus 로고
    • Computer and Job-Shop Scheduling Theory
    • John Wiley & Sons, Inc
    • Coffman, E.G., Jr. (1976). Computer and Job-Shop Scheduling Theory, John Wiley & Sons, Inc.
    • (1976)
    • Coffman, E.G.1
  • 52
    • 0019013348 scopus 로고
    • A characterization of waiting time performance realizable by single-server queues
    • Coffman, E.G., Jr. and Mitrani, I. (1980). A characterization of waiting time performance realizable by single-server queues, Oper. Res. 28, 810-21.
    • (1980) Oper. Res. , vol.28 , pp. 810-821
    • Coffman, E.G.1    Mitrani, I.2
  • 54
    • 0003402343 scopus 로고
    • Mathematical Methods of Statistics
    • Princeton University Press
    • Cramér, H. (1946). Mathematical Methods of Statistics, Princeton University Press.
    • (1946)
    • Cramér, H.1
  • 55
    • 0036114602 scopus 로고    scopus 로고
    • The dependence of optimal returns from multiclass queueing systems on their customer base
    • Dacre, M.J. and Glazebrook, K.D. (2002). The dependence of optimal returns from multiclass queueing systems on their customer base, Queueing Syst. 40, 93-115.
    • (2002) Queueing Syst. , vol.40 , pp. 93-115
    • Dacre, M.J.1    Glazebrook, K.D.2
  • 56
    • 0033477162 scopus 로고    scopus 로고
    • The achievable region approach to the optimal control of stochastic systems
    • with discussion
    • Dacre, M.J., Glazebrook, K.D. and Niño-Mora, J. (1999). The achievable region approach to the optimal control of stochastic systems, J. R. Statist. Soc. B, 747-91 (with discussion).
    • (1999) J. R. Statist. Soc. B , pp. 747-791
    • Dacre, M.J.1    Glazebrook, K.D.2    Niño-mora, J.3
  • 57
    • 0009111952 scopus 로고
    • Deterministic and Stochastic Scheduling
    • editors, Reidel
    • Dempster, M.A.H., Lenstra, J.K. and Rinnooy Kan, A.H.G., editors (1982). Deterministic and Stochastic Scheduling, Reidel.
    • (1982)
    • Dempster, M.A.H.1    Lenstra, J.K.2    Rinnooy Kan, A.H.G.3
  • 58
    • 0000083216 scopus 로고
    • An optimality criterion for discrete dynamic programming with no discounting
    • Denardo, E.V. and Miller, B.L. (1968). An optimality criterion for discrete dynamic programming with no discounting, Ann. Math. Stat. 39, 1220-7.
    • (1968) Ann. Math. Stat. , vol.39 , pp. 1220-1227
    • Denardo, E.V.1    Miller, B.L.2
  • 59
    • 84956089551 scopus 로고
    • A probability approach to the heat equation
    • Doob, J.L. (1955). A probability approach to the heat equation, Trans. Amer. Math. Soc. 80, 216-80.
    • (1955) Trans. Amer. Math. Soc. , vol.80 , pp. 216-280
    • Doob, J.L.1
  • 61
    • 84886648230 scopus 로고    scopus 로고
    • Bandit Problems on Parallel Machines
    • Ph.D. thesis, University of Edinburgh
    • Dunn, R.T. (2005). Bandit Problems on Parallel Machines, Ph.D. thesis, University of Edinburgh.
    • (2005)
    • Dunn, R.T.1
  • 62
    • 0035359944 scopus 로고    scopus 로고
    • The performance of index-based policies for bandit problems with stochastic machine availability
    • Dunn, R.T. and Glazebrook, K.D. (2001). The performance of index-based policies for bandit problems with stochastic machine availability, Adv. Appl. Prob. 33, 365-90.
    • (2001) Adv. Appl. Prob. , vol.33 , pp. 365-390
    • Dunn, R.T.1    Glazebrook, K.D.2
  • 63
    • 4043159095 scopus 로고    scopus 로고
    • Discounted multiarmed bandit problems on a collection of machines with varying speeds
    • Dunn, R.T. and Glazebrook, K.D. (2004). Discounted multiarmed bandit problems on a collection of machines with varying speeds, Math. Oper. Res. 29, 266-79.
    • (2004) Math. Oper. Res. , vol.29 , pp. 266-279
    • Dunn, R.T.1    Glazebrook, K.D.2
  • 64
    • 21144442171 scopus 로고
    • Gittins procedures for bandits with delayed responses
    • Eick, S.G. (1988). Gittins procedures for bandits with delayed responses, J. R. Statist. Soc. B 50, 125-32.
    • (1988) J. R. Statist. Soc. B , vol.50 , pp. 125-132
    • Eick, S.G.1
  • 65
    • 0027394228 scopus 로고
    • General Gittins index processes in discrete time
    • El Karoui, N. and Karatzas, I. (1993). General Gittins index processes in discrete time, Proc. Natl. Acad. Sci. USA 90, 1232-6.
    • (1993) Proc. Natl. Acad. Sci. USA , vol.90 , pp. 1232-1236
    • El Karoui, N.1    Karatzas, I.2
  • 66
    • 0007527314 scopus 로고
    • Dynamic allocation problems in continuous time
    • El Karoui, N. and Karatzas, I. (1994). Dynamic allocation problems in continuous time, Adv. Appl. Prob. 4, 255-86.
    • (1994) Adv. Appl. Prob. , vol.4 , pp. 255-286
    • El Karoui, N.1    Karatzas, I.2
  • 67
    • 0000977276 scopus 로고
    • Continuous-time allocation indices and their discrete-time approximations
    • Eplett, W.J.R. (1986). Continuous-time allocation indices and their discrete-time approximations, Adv. Appl. Prob. 18, 724-46.
    • (1986) Adv. Appl. Prob. , vol.18 , pp. 724-746
    • Eplett, W.J.R.1
  • 68
    • 84971103626 scopus 로고
    • A general model for the scheduling of alternative stochastic jobs that may fail
    • Fay, N.A. and Glazebrook, K.D. (1989). A general model for the scheduling of alternative stochastic jobs that may fail, Prob. Eng. Inform. Sci. 3, 199-221.
    • (1989) Prob. Eng. Inform. Sci. , vol.3 , pp. 199-221
    • Fay, N.A.1    Glazebrook, K.D.2
  • 69
    • 0001970572 scopus 로고
    • On approximately optimal index strategies for generalised arm problems
    • Fay, N.A. and Walrand, J.C. (1991). On approximately optimal index strategies for generalised arm problems, J. Appl. Prob. 28, 602-12.
    • (1991) J. Appl. Prob. , vol.28 , pp. 602-612
    • Fay, N.A.1    Walrand, J.C.2
  • 70
    • 0024073120 scopus 로고
    • Characterization and optimization of achieveable performances in general queueing systems
    • Federgruen, A. and Groenvelt, H. (1988). Characterization and optimization of achieveable performances in general queueing systems, Oper. Res. 36, 733-41.
    • (1988) Oper. Res. , vol.36 , pp. 733-741
    • Federgruen, A.1    Groenvelt, H.2
  • 71
    • 0003604817 scopus 로고
    • Mathematical Statistics: A Decision Theoretic Approach
    • Academic Press
    • Ferguson, T.S. (1967). Mathematical Statistics: A Decision Theoretic Approach, Academic Press.
    • (1967)
    • Ferguson, T.S.1
  • 72
    • 34548043000 scopus 로고    scopus 로고
    • Explicit Gittins' indices for a class of superdiffusive processes
    • Filliger, R. and Hongler, M.-O. (2007). Explicit Gittins' indices for a class of superdiffusive processes, J. Appl. Prob. 44, 554-9.
    • (2007) J. Appl. Prob. , vol.44 , pp. 554-559
    • Filliger, R.1    Hongler, M.-O.2
  • 73
    • 0003732093 scopus 로고
    • Sequencing and Scheduling: An Introduction to the Mathematics of the Job-Shop
    • Ellis Norwood
    • French, S. (1982). Sequencing and Scheduling: An Introduction to the Mathematics of the Job-Shop, Ellis Norwood.
    • (1982)
    • French, S.1
  • 74
    • 22544444317 scopus 로고    scopus 로고
    • Four proofs of Gittins' multi-armed bandit theorem
    • private communication
    • Frostig, E. and Weiss, G. (1999). Four proofs of Gittins' multi-armed bandit theorem, private communication.
    • (1999)
    • Frostig, E.1    Weiss, G.2
  • 75
    • 0004194991 scopus 로고
    • Search Games
    • Academic Press
    • Gal, S. (1980). Search Games, Academic Press.
    • (1980)
    • Gal, S.1
  • 76
    • 0031999142 scopus 로고    scopus 로고
    • Stochastic scheduling with priority classes
    • Garbe, R. and Glazebrook, K.D. (1998a). Stochastic scheduling with priority classes, Math. Oper. Res. 23, 119-44.
    • (1998) Math. Oper. Res. , vol.23 , pp. 119-144
    • Garbe, R.1    Glazebrook, K.D.2
  • 77
    • 0002383153 scopus 로고    scopus 로고
    • Submodular returns and greedy heuristics for queueing scheduling problems
    • Garbe, R. and Glazebrook, K.D. (1998b). Submodular returns and greedy heuristics for queueing scheduling problems, Oper. Res. 46, 336-46.
    • (1998) Oper. Res. , vol.46 , pp. 336-346
    • Garbe, R.1    Glazebrook, K.D.2
  • 79
    • 0000169010 scopus 로고
    • Bandit processes and dynamic allocation indices (with discussion)
    • Gittins, J.C. (1979). Bandit processes and dynamic allocation indices (with discussion), J. R. Statist. Soc. B 41, 148-77.
    • (1979) J. R. Statist. Soc. B , vol.41 , pp. 148-177
    • Gittins, J.C.1
  • 80
    • 26444490022 scopus 로고
    • Multiserver scheduling of jobs with increasing completion rates
    • Gittins, J.C. (1981). Multiserver scheduling of jobs with increasing completion rates, J. Appl. Prob. 18, 321-4.
    • (1981) J. Appl. Prob. , vol.18 , pp. 321-324
    • Gittins, J.C.1
  • 81
    • 0003848944 scopus 로고
    • Multi-armed Bandit Allocation Indices
    • John Wiley & Sons, Inc
    • Gittins, J.C. (1989). Multi-armed Bandit Allocation Indices, John Wiley & Sons, Inc.
    • (1989)
    • Gittins, J.C.1
  • 82
    • 84875692383 scopus 로고    scopus 로고
    • Stochastic models for the planning of pharmaceutical research
    • Gittins, J.C. (2003). Stochastic models for the planning of pharmaceutical research, J. Statist. Theory Appl. 2, 198-214.
    • (2003) J. Statist. Theory Appl. , vol.2 , pp. 198-214
    • Gittins, J.C.1
  • 83
    • 0002955623 scopus 로고
    • A dynamic allocation index for the sequential design of experiments
    • in J. Gani, editor, Progress in Statistics, North-Holland, read at the 1972 European Meeting of Statisticians, Budapest
    • Gittins, J.C. and Jones, D.M. (1974a). A dynamic allocation index for the sequential design of experiments, in J. Gani, editor, Progress in Statistics, North-Holland, pages 241-66, read at the 1972 European Meeting of Statisticians, Budapest.
    • (1974) , pp. 241-266
    • Gittins, J.C.1    Jones, D.M.2
  • 84
    • 84999646754 scopus 로고
    • A dynamic allocation index for new-product chemical research
    • Technical report, Cambridge University Engineering Department Report CUED/A -Mgt Stud/TR13, Cambridge University
    • Gittins, J.C. and Jones, D.M. (1974b). A dynamic allocation index for new-product chemical research, Technical report, Cambridge University Engineering Department Report CUED/A -Mgt Stud/TR13, Cambridge University.
    • (1974)
    • Gittins, J.C.1    Jones, D.M.2
  • 85
    • 0018709825 scopus 로고
    • A dynamic allocation index for the discounted multi-armed bandit problem
    • Gittins, J.C. and Jones, D.M. (1979). A dynamic allocation index for the discounted multi-armed bandit problem, Biometrika 66, 561-5.
    • (1979) Biometrika , vol.66 , pp. 561-565
    • Gittins, J.C.1    Jones, D.M.2
  • 86
    • 21144463800 scopus 로고
    • The learning component of dynamic allocation indices
    • Gittins, J.C. and Wang, Y.G. (1992). The learning component of dynamic allocation indices, Ann. Stat. 20, 1625-36.
    • (1992) Ann. Stat. , vol.20 , pp. 1625-1636
    • Gittins, J.C.1    Wang, Y.G.2
  • 87
    • 0016964105 scopus 로고
    • Stochastic scheduling with order constraints
    • Glazebrook, K.D. (1976). Stochastic scheduling with order constraints, Int. J. Sys. Sci. 7, 657-66.
    • (1976) Int. J. Sys. Sci. , vol.7 , pp. 657-666
    • Glazebrook, K.D.1
  • 88
    • 0018767566 scopus 로고
    • Stoppable families of alternative bandit processes
    • Glazebrook, K.D. (1979). Stoppable families of alternative bandit processes, J. Appl. Prob. 16, 843-54.
    • (1979) J. Appl. Prob. , vol.16 , pp. 843-854
    • Glazebrook, K.D.1
  • 89
    • 0018997545 scopus 로고
    • On single-machine sequencing with order constraints
    • Glazebrook, K.D. (1980a). On single-machine sequencing with order constraints, Naval Res. Logist. Quart. 27, 123-30.
    • (1980) Naval Res. Logist. Quart. , vol.27 , pp. 123-130
    • Glazebrook, K.D.1
  • 90
    • 4344702543 scopus 로고
    • On stochastic scheduling with precedence relations and switching costs
    • Glazebrook, K.D. (1980b). On stochastic scheduling with precedence relations and switching costs, J. Appl. Prob. 17, 1016-24.
    • (1980) J. Appl. Prob. , vol.17 , pp. 1016-1024
    • Glazebrook, K.D.1
  • 91
    • 0019575201 scopus 로고
    • On non-preemptive strategies for stochastic scheduling problems in continuous time
    • Glazebrook, K.D. (1981a). On non-preemptive strategies for stochastic scheduling problems in continuous time, Int. J. Sys. Sci. 12, 771-82.
    • (1981) Int. J. Sys. Sci. , vol.12 , pp. 771-782
    • Glazebrook, K.D.1
  • 92
    • 0019576633 scopus 로고
    • On non-preemptive strategies in stochastic scheduling
    • Glazebrook, K.D. (1981b). On non-preemptive strategies in stochastic scheduling, Naval Res. Logist. Quart. 28, 289-300.
    • (1981) Naval Res. Logist. Quart. , vol.28 , pp. 289-300
    • Glazebrook, K.D.1
  • 93
    • 17144394325 scopus 로고
    • On the evaluation of fixed permutations as strategies in stochastic scheduling
    • Glazebrook, K.D. (1982a). On the evaluation of fixed permutations as strategies in stochastic scheduling, Stoc. Proc. Appl., 13, 87.
    • (1982) Stoc. Proc. Appl , vol.13 , pp. 87
    • Glazebrook, K.D.1
  • 94
    • 84886666100 scopus 로고
    • On the evaluation of non-preemptive strategies in stochastic scheduling
    • in M.A.H. Dempster, J.K. Lenstra and A.H.G. Rinooy Kan, editors, Deterministic and Stochastic Scheduling, NATO Advanced Study Institute Series, Reidel
    • Glazebrook, K.D. (1982b). On the evaluation of non-preemptive strategies in stochastic scheduling, in M.A.H. Dempster, J.K. Lenstra and A.H.G. Rinooy Kan, editors, Deterministic and Stochastic Scheduling, NATO Advanced Study Institute Series, Reidel, pages 375-84.
    • (1982) , pp. 375-384
    • Glazebrook, K.D.1
  • 95
    • 0002232604 scopus 로고
    • On the evaluation of suboptimal strategies for families of alternative bandit processes
    • Glazebrook, K.D. (1982c). On the evaluation of suboptimal strategies for families of alternative bandit processes, J. Appl. Prob. 19, 716-22.
    • (1982) J. Appl. Prob. , vol.19 , pp. 716-722
    • Glazebrook, K.D.1
  • 96
    • 0020833555 scopus 로고
    • Methods for the evaluation of permutations as strategies in stochastic scheduling
    • Glazebrook, K.D. (1983). Methods for the evaluation of permutations as strategies in stochastic scheduling, Manage. Sci. 29, 1142-55.
    • (1983) Manage. Sci. , vol.29 , pp. 1142-1155
    • Glazebrook, K.D.1
  • 97
    • 0002844482 scopus 로고
    • On nonpreemptive policies for stochastic single-machine scheduling with breakdowns
    • Glazebrook, K.D. (1991). On nonpreemptive policies for stochastic single-machine scheduling with breakdowns, Prob. Eng. Inform. Sci. 5, 77-87.
    • (1991) Prob. Eng. Inform. Sci. , vol.5 , pp. 77-87
    • Glazebrook, K.D.1
  • 98
    • 44249107936 scopus 로고
    • A class of Bayesian models for optimal exploration
    • Glazebrook, K.D. and Boys, R.J. (1995). A class of Bayesian models for optimal exploration, J. R. Statist. Soc. B 57, 705-20.
    • (1995) J. R. Statist. Soc. B , vol.57 , pp. 705-720
    • Glazebrook, K.D.1    Boys, R.J.2
  • 99
    • 0001979352 scopus 로고
    • On the scheduling of alternative stochastic jobs on a single machine
    • Glazebrook, K.D. and Fay, N.A. (1987). On the scheduling of alternative stochastic jobs on a single machine, Adv. Appl. Prob. 19, 955-73.
    • (1987) Adv. Appl. Prob. , vol.19 , pp. 955-973
    • Glazebrook, K.D.1    Fay, N.A.2
  • 100
    • 0030268872 scopus 로고    scopus 로고
    • Reflections on a new approach to Gittins indexation
    • Glazebrook, K.D. and Garbe, R. (1996). Reflections on a new approach to Gittins indexation, J. Oper. Res. Soc. 47, 1301-9.
    • (1996) J. Oper. Res. Soc. , vol.47 , pp. 1301-1309
    • Glazebrook, K.D.1    Garbe, R.2
  • 101
    • 0019437250 scopus 로고
    • On single-machine scheduling with precedence relations and linear or discounted costs
    • Glazebrook, K.D. and Gittins, J.C. (1981). On single-machine scheduling with precedence relations and linear or discounted costs, Oper. Res. 29, 289-300.
    • (1981) Oper. Res. , vol.29 , pp. 289-300
    • Glazebrook, K.D.1    Gittins, J.C.2
  • 102
    • 84886709157 scopus 로고    scopus 로고
    • Monotone policies and indexability for bi-directional restless bandits
    • under review
    • Glazebrook, K.D., Hodge, D.J. and Kirkbride, C. (2010a). Monotone policies and indexability for bi-directional restless bandits, under review.
    • (2010)
    • Glazebrook, K.D.1    Hodge, D.J.2    Kirkbride, C.3
  • 103
    • 79956217553 scopus 로고    scopus 로고
    • General notions of indexability for queueing control and asset management
    • press
    • Glazebrook, K.D., Hodge, D.J. and Kirkbride, C. (2010b). General notions of indexability for queueing control and asset management, Ann. Appl. Prob. (in press).
    • (2010) Ann. Appl. Prob
    • Glazebrook, K.D.1    Hodge, D.J.2    Kirkbride, C.3
  • 105
    • 68349144001 scopus 로고    scopus 로고
    • Index policies for the admission control and routing of impatient customers to heterogeneous service stations
    • Glazebrook, K.D., Kirkbride, C. and Ouenniche, J. (2009). Index policies for the admission control and routing of impatient customers to heterogeneous service stations, Oper. Res. 57, 975-89.
    • (2009) Oper. Res. , vol.57 , pp. 975-989
    • Glazebrook, K.D.1    Kirkbride, C.2    Ouenniche, J.3
  • 106
    • 33646081779 scopus 로고    scopus 로고
    • Spinning plates and squad systems: policies for bi-directional restless bandits
    • Glazebrook, K.D., Kirkbride, C. and Ruiz-Hernandez, D. (2006). Spinning plates and squad systems: policies for bi-directional restless bandits, Adv. Appl. Prob. 38, 95-115.
    • (2006) Adv. Appl. Prob. , vol.38 , pp. 95-115
    • Glazebrook, K.D.1    Kirkbride, C.2    Ruiz-hernandez, D.3
  • 107
    • 1642419524 scopus 로고    scopus 로고
    • Index heuristics for multiclass M/G/1 systems with nonpreemptive service and convex holding costs
    • Glazebrook, K.D., Lumley, R.R. and Ansell, P.S. (2003). Index heuristics for multiclass M/G/1 systems with nonpreemptive service and convex holding costs, Queueing Syst. 45, 81-111.
    • (2003) Queueing Syst. , vol.45 , pp. 81-111
    • Glazebrook, K.D.1    Lumley, R.R.2    Ansell, P.S.3
  • 108
    • 0034748422 scopus 로고    scopus 로고
    • Parallel scheduling of multiclass M/M/m queues: approximate and heavy-traffic optimization of achievable performance
    • Glazebrook, K.D. and Niño-Mora, J. (2001). Parallel scheduling of multiclass M/M/m queues: approximate and heavy-traffic optimization of achievable performance, Oper. Res. 49, 609-23.
    • (2001) Oper. Res. , vol.49 , pp. 609-623
    • Glazebrook, K.D.1    Niño-mora, J.2
  • 109
    • 0036950786 scopus 로고    scopus 로고
    • Index policies for a class of discounted restless bandits
    • Glazebrook, K.D., Niño-Mora, J. and Ansell, P.S. (2002). Index policies for a class of discounted restless bandits, Adv. Appl. Prob. 34, 754-74.
    • (2002) Adv. Appl. Prob. , vol.34 , pp. 754-774
    • Glazebrook, K.D.1    Niño-mora, J.2    Ansell, P.S.3
  • 110
    • 19744378295 scopus 로고    scopus 로고
    • Shoot-look-shoot: a review and extension
    • Glazebrook, K.D. and Washburn, A. (2004). Shoot-look-shoot: a review and extension, Oper. Res. 52, 454-63.
    • (2004) Oper. Res. , vol.52 , pp. 454-463
    • Glazebrook, K.D.1    Washburn, A.2
  • 111
    • 0034346711 scopus 로고    scopus 로고
    • Index-based policies for discounted multiarmed bandits on parallel machines
    • Glazebrook, K.D. and Wilkinson, D.J. (2000). Index-based policies for discounted multiarmed bandits on parallel machines, Ann. Appl. Prob. 10, 877-96.
    • (2000) Ann. Appl. Prob. , vol.10 , pp. 877-896
    • Glazebrook, K.D.1    Wilkinson, D.J.2
  • 112
    • 70449097675 scopus 로고    scopus 로고
    • Multi-armed bandits with metric switching costs
    • in S. Albers, A. Marchetti-Spaccamela, Y. Matias, S. Nikoletseas and W. Thomas, editors, Automata, Languages and Programming, 36th Internatilonal Colloquium, ICALP 2009, Rhodes, Greece, July 5-12, 2009, proceedings, part II, Springer
    • Guha, S. and Munagala, K. (2009). Multi-armed bandits with metric switching costs, in S. Albers, A. Marchetti-Spaccamela, Y. Matias, S. Nikoletseas and W. Thomas, editors, Automata, Languages and Programming, 36th Internatilonal Colloquium, ICALP 2009, Rhodes, Greece, July 5-12, 2009, proceedings, part II, Springer, pages 496-507.
    • (2009) , pp. 496-507
    • Guha, S.1    Munagala, K.2
  • 113
    • 69449097218 scopus 로고    scopus 로고
    • Approximation algorithms for restless bandit problems
    • in C. Mathieu, editor, Proceedings of the Twentieth Annual ACM-SIAM Symposium on Discrete Algorithms, Society for Industrial and Applied Mathematics
    • Guha, S., Munagala, K. and Shi, P. (2009). Approximation algorithms for restless bandit problems, in C. Mathieu, editor, Proceedings of the Twentieth Annual ACM-SIAM Symposium on Discrete Algorithms, Society for Industrial and Applied Mathematics, pages 28-37.
    • (2009) , pp. 28-37
    • Guha, S.1    Munagala, K.2    Shi, P.3
  • 114
    • 0016483099 scopus 로고
    • Dynamic scheduling of a multiclass queue: discount optimality
    • Harrison, J.M. (1975). Dynamic scheduling of a multiclass queue: discount optimality, Oper. Res. 23, 270-82.
    • (1975) Oper. Res. , vol.23 , pp. 270-282
    • Harrison, J.M.1
  • 116
    • 0004188473 scopus 로고
    • Elements of Numerical Analysis
    • John Wiley & Sons, Inc
    • Henrici, P. (1964). Elements of Numerical Analysis, John Wiley & Sons, Inc.
    • (1964)
    • Henrici, P.1
  • 117
    • 0003644124 scopus 로고
    • Dynamic Programming and Markov Processes
    • MIT Press
    • Howard, R. (1960). Dynamic Programming and Markov Processes, MIT Press.
    • (1960)
    • Howard, R.1
  • 118
    • 0003871605 scopus 로고
    • Dynamic Probabilistic Systems
    • volume II, Semi-Markov and Decision Processes, John Wiley & Sons, Inc
    • Howard, R. (1971). Dynamic Probabilistic Systems, volume II, Semi-Markov and Decision Processes, John Wiley & Sons, Inc.
    • (1971)
    • Howard, R.1
  • 119
    • 0028531055 scopus 로고
    • Multi-armed bandit problem revisited
    • Ishikida, T. and Varaiya, P. (1994).Multi-armed bandit problem revisited, J. Optim. Theory Appl., 83, 113-54.
    • (1994) J. Optim. Theory Appl , vol.83 , pp. 113-154
    • Ishikida, T.1    Varaiya, P.2
  • 120
    • 0031507661 scopus 로고    scopus 로고
    • Scheduling jobs that are subject to deterministic due dates and have deteriorating expected rewards
    • Ishikida, T. and Wan, Y.W. (1997). Scheduling jobs that are subject to deterministic due dates and have deteriorating expected rewards, Prob. Eng. Inform. Sci., 11, 65-78.
    • (1997) Prob. Eng. Inform. Sci , vol.11 , pp. 65-78
    • Ishikida, T.1    Wan, Y.W.2
  • 121
    • 0242674578 scopus 로고    scopus 로고
    • Hitting times for random walks with restarts
    • Technical report, Department of Statistics, U.C. Berkeley, preprint
    • Janson, S. and Peres, Y. (2001). Hitting times for random walks with restarts, Technical report, Department of Statistics, U.C. Berkeley, preprint.
    • (2001)
    • Janson, S.1    Peres, Y.2
  • 122
    • 84886713601 scopus 로고
    • A Sequential Method for Industrial Chemical Research
    • Master's thesis, U.C.W. Aberystwyth
    • Jones, D.M. (1970). A Sequential Method for Industrial Chemical Research, Master's thesis, U.C.W. Aberystwyth.
    • (1970)
    • Jones, D.M.1
  • 123
    • 84886691949 scopus 로고
    • Search Procedures for Industrial Chemical Research
    • Ph.D. thesis, Cambridge
    • Jones, D.M. (1975). Search Procedures for Industrial Chemical Research, Ph.D. thesis, Cambridge.
    • (1975)
    • Jones, D.M.1
  • 124
    • 38949106488 scopus 로고    scopus 로고
    • On the worth of perfect information in bandits with random discounting
    • Jones, M.L. and Koo, R. (2008). On the worth of perfect information in bandits with random discounting, Sequential Anal. 27, 58-67.
    • (2008) Sequential Anal. , vol.27 , pp. 58-67
    • Jones, M.L.1    Koo, R.2
  • 125
    • 10944236938 scopus 로고    scopus 로고
    • A survey on the bandit problem with switching costs
    • Jun, T. (2004). A survey on the bandit problem with switching costs, De Economist 152, 513-41.
    • (2004) De Economist , vol.152 , pp. 513-541
    • Jun, T.1
  • 126
    • 0001789120 scopus 로고
    • Optimal strategies for a class of constrained sequential problems
    • Kadane, J.B. and Simon, H.A. (1977). Optimal strategies for a class of constrained sequential problems, Ann. Stat. 5, 237-55.
    • (1977) Ann. Stat. , vol.5 , pp. 237-255
    • Kadane, J.B.1    Simon, H.A.2
  • 127
    • 0008640760 scopus 로고
    • A note on Katehakis and Y.-R. Chen's computation of the Gittins index
    • Kallenberg, L.C.M. (1986). A note on Katehakis and Y.-R. Chen's computation of the Gittins index, Math. Oper. Res. 11, 184-6.
    • (1986) Math. Oper. Res. , vol.11 , pp. 184-186
    • Kallenberg, L.C.M.1
  • 128
    • 0000978159 scopus 로고
    • Gittins indices in the dynamic allocation problem for diffusion processes
    • Karatzas, I. (1984). Gittins indices in the dynamic allocation problem for diffusion processes, Adv. Appl. Prob. 12, 173-92.
    • (1984) Adv. Appl. Prob. , vol.12 , pp. 173-192
    • Karatzas, I.1
  • 129
    • 0032271173 scopus 로고    scopus 로고
    • Multi-armed bandits in discrete and continuous time
    • Kaspi, H. and Mandelbaum, A. (1998). Multi-armed bandits in discrete and continuous time, Ann. Appl. Prob. 8, 270-90.
    • (1998) Ann. Appl. Prob. , vol.8 , pp. 270-290
    • Kaspi, H.1    Mandelbaum, A.2
  • 130
    • 0011511430 scopus 로고
    • Computing optimal sequential allocation rules in clinical trials
    • in J. Van Ryzin, editor, Adaptive Statistical Procedures and Related Topics, Institute of Mathematical Statistics
    • Katehakis, M.N. and Derman, C. (1986). Computing optimal sequential allocation rules in clinical trials, in J. Van Ryzin, editor, Adaptive Statistical Procedures and Related Topics, Institute of Mathematical Statistics, pages 29-39.
    • (1986) , pp. 29-39
    • Katehakis, M.N.1    Derman, C.2
  • 131
    • 0023345261 scopus 로고
    • The multi-armed bandit problem: decomposition and computation
    • Katehakis, M.N. and Veinott, A.F. (1987). The multi-armed bandit problem: decomposition and computation, Math. Oper. Res. 12, 262-8.
    • (1987) Math. Oper. Res. , vol.12 , pp. 262-268
    • Katehakis, M.N.1    Veinott, A.F.2
  • 132
    • 84886651641 scopus 로고
    • Contribution to the discussion of Gittins
    • Kelly, F.P. (1979). Contribution to the discussion of Gittins, J. R. Statist. Soc. B 41, 167-8.
    • (1979) J. R. Statist. Soc. B , vol.41 , pp. 167-168
    • Kelly, F.P.1
  • 133
    • 0039771701 scopus 로고
    • Multi-armed bandits with discount factor near one: the Bernoulli case
    • Kelly, F.P. (1981). Multi-armed bandits with discount factor near one: the Bernoulli case, Ann. Stat. 9, 987-1001.
    • (1981) Ann. Stat. , vol.9 , pp. 987-1001
    • Kelly, F.P.1
  • 134
    • 0000257712 scopus 로고
    • Some problems in the theory of queues
    • Kendall, D.G. (1951). Some problems in the theory of queues, J. R. Statist. Soc. B, 13, 85.
    • (1951) J. R. Statist. Soc. B , vol.13 , pp. 85
    • Kendall, D.G.1
  • 135
    • 84886701707 scopus 로고
    • Decision processes under total expected concomitant constraints with approximations to bandit processes
    • private communication
    • Kertz, R.P. (1986). Decision processes under total expected concomitant constraints with approximations to bandit processes, private communication.
    • (1986)
    • Kertz, R.P.1
  • 136
    • 0001035326 scopus 로고
    • Time-sharing service systems I
    • Klimov, G.P. (1974). Time-sharing service systems I, Theory Probab. Appl. 19, 532-51.
    • (1974) Theory Probab. Appl. , vol.19 , pp. 532-551
    • Klimov, G.P.1
  • 137
    • 0001327852 scopus 로고
    • Time-sharing service systems II
    • Klimov, G.P. (1978). Time-sharing service systems II, Theory Probab. Appl. 23, 314-21.
    • (1978) Theory Probab. Appl. , vol.23 , pp. 314-321
    • Klimov, G.P.1
  • 138
    • 0010358271 scopus 로고
    • An operational critique of detection laws
    • Koopman, B.O. (1979). An operational critique of detection laws, Oper. Res., 27, 33.
    • (1979) Oper. Res , vol.27 , pp. 33
    • Koopman, B.O.1
  • 139
    • 67649963279 scopus 로고    scopus 로고
    • Partially observed Markov decision process multiarmed bandits -structural results
    • Krishnamurthy, V. and Wahlberg, B. (2009). Partially observed Markov decision process multiarmed bandits -structural results, Math. Oper. Res. 34, 287-302.
    • (2009) Math. Oper. Res. , vol.34 , pp. 287-302
    • Krishnamurthy, V.1    Wahlberg, B.2
  • 140
    • 0002899547 scopus 로고
    • Asymptotically efficient adaptive allocation rules
    • Lai, T.L. and Robbins, H. (1985). Asymptotically efficient adaptive allocation rules, Adv. Appl. Math. 6, 4-22.
    • (1985) Adv. Appl. Math. , vol.6 , pp. 4-22
    • Lai, T.L.1    Robbins, H.2
  • 141
    • 0040958004 scopus 로고
    • Open bandit processes and scheduling of queueing networks
    • Lai, T.L. and Ying, Z. (1988). Open bandit processes and scheduling of queueing networks, Adv. Appl. Prob. 20, 447-72.
    • (1988) Adv. Appl. Prob. , vol.20 , pp. 447-472
    • Lai, T.L.1    Ying, Z.2
  • 142
    • 84886673798 scopus 로고
    • Theory of Sequencing & Scheduling
    • John Wiley & Sons, Ltd
    • Lawler, E.L., Lenstra, J.K. and Rinnooy Kan, A.H.G. (1989). Theory of Sequencing & Scheduling, John Wiley & Sons, Ltd.
    • (1989)
    • Lawler, E.L.1    Lenstra, J.K.2    Rinnooy Kan, A.H.G.3
  • 143
    • 0345901561 scopus 로고
    • Sequencing and scheduling: algorithms and complexity
    • in S.C. Graves, A.H.G. Rinnooy Kan and P.H. Zipkin, editors, Handbooks in Operations Research and Management Science, Elsevier
    • Lawler, E.L., Lenstra, J.K., Rinnooy Kan, A.H.G. and Shmoys, D.B. (1993). Sequencing and scheduling: algorithms and complexity, in S.C. Graves, A.H.G. Rinnooy Kan and P.H. Zipkin, editors, Handbooks in Operations Research and Management Science, Elsevier, pages 445-522.
    • (1993) , pp. 445-522
    • Lawler, E.L.1    Lenstra, J.K.2    Rinnooy Kan, A.H.G.3    Shmoys, D.B.4
  • 144
    • 5244315180 scopus 로고
    • On the structure of discrete sequential search problems and of their solutions
    • Lehnerdt, M. (1982). On the structure of discrete sequential search problems and of their solutions, Math Operationsforschung u. Statist., Ser. Optimization 13, 533-7.
    • (1982) Math Operationsforschung u. Statist., Ser. Optimization , vol.13 , pp. 533-537
    • Lehnerdt, M.1
  • 145
    • 52449090226 scopus 로고    scopus 로고
    • Multi-UAV dynamic routing with partial observations using restless bandit allocation indices
    • in American Control Conference, 2008
    • Le Ny, J., Dahleh, M. and Feron, E. (2008). Multi-UAV dynamic routing with partial observations using restless bandit allocation indices, in American Control Conference, 2008, pages 4220-5.
    • (2008) , pp. 4220-4225
    • Le Ny, J.1    Dahleh, M.2    Feron, E.3
  • 146
    • 34047223046 scopus 로고    scopus 로고
    • Restless bandits with switching costs: linear programming relaxations
    • performance bounds and limited lookahead policies, in Proceedings of the 2006 American Control Conference
    • Le Ny, J. and Feron, E. (2006). Restless bandits with switching costs: linear programming relaxations, performance bounds and limited lookahead policies, in Proceedings of the 2006 American Control Conference, pages 1587-92.
    • (2006) , pp. 1587-1592
    • Le Ny, J.1    Feron, E.2
  • 147
    • 84886687998 scopus 로고    scopus 로고
    • Chasing a Moving Target: Learning in Dynamic Environments, Management Science
    • Available at SSRN
    • Levinthal, D. A. and Posen, H. E. (2011). Chasing a Moving Target: Learning in Dynamic Environments, Management Science. Available at SSRN: http://ssrn.com/abstract=1540100.
    • (2011)
    • Levinthal, D.A.1    Posen, H.E.2
  • 148
    • 0001874715 scopus 로고
    • Dynamic programming and decision theory
    • Lindley, D.V. (1960). Dynamic programming and decision theory, Appl. Statist. 10, 39-51.
    • (1960) Appl. Statist. , vol.10 , pp. 39-51
    • Lindley, D.V.1
  • 149
    • 0040400560 scopus 로고
    • The economics of belated information
    • Lippman, S.A. and McCall, J.J. (1981). The economics of belated information, Internat. Econ. Rev., 22, 46.
    • (1981) Internat. Econ. Rev , vol.22 , pp. 46
    • Lippman, S.A.1    McCall, J.J.2
  • 150
    • 77958597180 scopus 로고    scopus 로고
    • Indexability of restless bandit problems and optimality of Whittle's index for dynamic multichannel access
    • IEEE T. Inf. Theor., preprint at
    • Liu, K. and Zhao, Q. (2010). Indexability of restless bandit problems and optimality of Whittle's index for dynamic multichannel access, IEEE T. Inf. Theor., preprint at http://arxiv.org/abs/0810.4658.
    • (2010)
    • Liu, K.1    Zhao, Q.2
  • 151
    • 61449109791 scopus 로고    scopus 로고
    • Multi-armed bandit problems
    • in A.O. Hero III, D.A. Castanon, D. Cochran and K. Kastella, editors, Foundations and Applications of Sensor Management, Springer
    • Mahajan, A. and Teneketzis, D. (2008). Multi-armed bandit problems, in A.O. Hero III, D.A. Castanon, D. Cochran and K. Kastella, editors, Foundations and Applications of Sensor Management, Springer, pages 121-51.
    • (2008) , pp. 121-151
    • Mahajan, A.1    Teneketzis, D.2
  • 152
    • 0038958280 scopus 로고
    • Discrete multi-armed bandits and multi-parameter processes
    • Mandelbaum, A. (1986). Discrete multi-armed bandits and multi-parameter processes, Prob. Theory Rel. Fields 71, 129-47.
    • (1986) Prob. Theory Rel. Fields , vol.71 , pp. 129-147
    • Mandelbaum, A.1
  • 153
    • 0001388964 scopus 로고
    • Continuous multi-armed bandits and multi-parameter processes
    • Mandelbaum, A. (1988). Continuous multi-armed bandits and multi-parameter processes, Ann. Prob. 15, 1527-56.
    • (1988) Ann. Prob. , vol.15 , pp. 1527-1556
    • Mandelbaum, A.1
  • 154
    • 9444259137 scopus 로고    scopus 로고
    • Lower bounds on the sample complexity of exploration in the multi-armed bandit problem
    • in B. Schölkopf and M.K. Warmuth, editors, Sixteenth Annual Conference on Computational Learning Theory, Springer
    • Mannor, S. and Tsitsiklis, J.N. (2003). Lower bounds on the sample complexity of exploration in the multi-armed bandit problem, in B. Schölkopf and M.K. Warmuth, editors, Sixteenth Annual Conference on Computational Learning Theory, Springer, pages 418-32.
    • (2003) , pp. 418-432
    • Mannor, S.1    Tsitsiklis, J.N.2
  • 155
    • 0031257027 scopus 로고    scopus 로고
    • Optimality of the greedy shooting strategy in the presence of incomplete damage information
    • Manor, G. and Kress, M. (1997). Optimality of the greedy shooting strategy in the presence of incomplete damage information, Naval Res. Logist. 44, 613-22.
    • (1997) Naval Res. Logist. , vol.44 , pp. 613-622
    • Manor, G.1    Kress, M.2
  • 156
    • 0001812752 scopus 로고
    • Exploration and exploitation in organisational learning
    • March, J.G. (1991). Exploration and exploitation in organisational learning, Organ. Sci. 2, 71-86.
    • (1991) Organ. Sci. , vol.2 , pp. 71-86
    • March, J.G.1
  • 157
    • 0004401043 scopus 로고
    • A periodic optimal search
    • Matula, D. (1964). A periodic optimal search, Amer. Math. Monthly 71, 15-21.
    • (1964) Amer. Math. Monthly , vol.71 , pp. 15-21
    • Matula, D.1
  • 159
    • 0000834346 scopus 로고
    • Dynamic scheduling with convex delay costs: the generalized cμ-rule
    • van Meighem, J.A. (1995). Dynamic scheduling with convex delay costs: the generalized cμ-rule, Adv. Appl. Prob. 5, 809-33.
    • (1995) Adv. Appl. Prob. , vol.5 , pp. 809-833
    • Van Meighem, J.A.1
  • 160
    • 5744221168 scopus 로고
    • Multiple feedback at a single-server station
    • Meilijson, I. and Weiss, G. (1977). Multiple feedback at a single-server station, Stoc. Proc. Appl. 5, 195-205.
    • (1977) Stoc. Proc. Appl. , vol.5 , pp. 195-205
    • Meilijson, I.1    Weiss, G.2
  • 161
    • 84934562022 scopus 로고
    • Job matching and occupational choice
    • Miller, R.A. (1984). Job matching and occupational choice, J. Political Econ. 92, 1086-120.
    • (1984) J. Political Econ. , vol.92 , pp. 1086-1120
    • Miller, R.A.1
  • 162
    • 0001970576 scopus 로고
    • Optimal Allocation of Resources Between Research Projects
    • Ph.D. thesis, Cambridge
    • Nash, P. (1973). Optimal Allocation of Resources Between Research Projects, Ph.D. thesis, Cambridge.
    • (1973)
    • Nash, P.1
  • 163
    • 0000963016 scopus 로고
    • A generalized bandit problem
    • Nash, P. (1980). A generalized bandit problem, J. R. Statist. Soc. B 42, 165-9.
    • (1980) J. R. Statist. Soc. B , vol.42 , pp. 165-169
    • Nash, P.1
  • 164
    • 84886670473 scopus 로고
    • A Hamiltonian approach to optimal stochastic resource allocation
    • Nash, P. and Gittins, J.C. (1977). A Hamiltonian approach to optimal stochastic resource allocation, Adv. Appl. Prob. 9, 55-68.
    • (1977) Adv. Appl. Prob. , vol.9 , pp. 55-68
    • Nash, P.1    Gittins, J.C.2
  • 165
    • 77956958088 scopus 로고
    • Maximizing submodular set functions: formulations and analysis of algorithms
    • in P. Hansen, editor, Annals of Discrete Mathematics 11, Studies on Graphs and Discrete Programming, North-Holland
    • Nemhauser, G.L. and Wolsey, L.A. (1981). Maximizing submodular set functions: formulations and analysis of algorithms, in P. Hansen, editor, Annals of Discrete Mathematics 11, Studies on Graphs and Discrete Programming, North-Holland, pages 279-301.
    • (1981) , pp. 279-301
    • Nemhauser, G.L.1    Wolsey, L.A.2
  • 167
    • 49349091331 scopus 로고    scopus 로고
    • A (2/3)n3 fast-pivoting algorithm for the Gittins index and optimal stopping of a Markov chain
    • Niño-Mora, J. (2007). A (2/3)n3 fast-pivoting algorithm for the Gittins index and optimal stopping of a Markov chain, INFORMS J. on Computing 19, 596-606.
    • (2007) INFORMS J. on Computing , vol.19 , pp. 596-606
    • Niño-mora, J.1
  • 168
    • 84886660487 scopus 로고    scopus 로고
    • Klimov's model
    • in J.J. Cochran, editor, Wiley Encyclopedia of Operations Research and Management Science, John Wiley & Sons, Inc
    • Niño-Mora, J. (2011a). Klimov's model, in J.J. Cochran, editor, Wiley Encyclopedia of Operations Research and Management Science, John Wiley & Sons, Inc., pages 2265-74.
    • (2011) , pp. 2265-2274
    • Niño-mora, J.1
  • 169
    • 85086012069 scopus 로고    scopus 로고
    • Conservation laws and related applications
    • in J.J. Cochran, editor, Wiley Encyclopedia of Operations Research and Management Science, John Wiley & Sons, Inc
    • Niño-Mora, J. (2011b). Conservation laws and related applications, in J.J. Cochran, editor, Wiley Encyclopedia of Operations Research and Management Science, John Wiley & Sons, Inc., pages 939-50.
    • (2011) , pp. 939-950
    • Niño-mora, J.1
  • 170
    • 84886695003 scopus 로고
    • Cost-minimum priorities in queueing systems of type M/G/1
    • von Olivier, G. (1972). Cost-minimum priorities in queueing systems of type M/G/1, Elektron. Rechenanl. 14, 262-71.
    • (1972) Elektron. Rechenanl. , vol.14 , pp. 262-271
    • von Olivier, G.1
  • 171
    • 0008550960 scopus 로고
    • Optimal stochastic scheduling of forest networks with switching penalties
    • van Oyen, M.P. and Teneketzis, D. (1994). Optimal stochastic scheduling of forest networks with switching penalties, Adv. Appl. Prob. 26, 474-97.
    • (1994) Adv. Appl. Prob. , vol.26 , pp. 474-497
    • Van Oyen, M.P.1    Teneketzis, D.2
  • 172
    • 0346932893 scopus 로고
    • On the optimality of the Gittins index rule for multi-armed bandits with multiple plays
    • Pandelis, D.G. and Teneketzis, D. (1991). On the optimality of the Gittins index rule for multi-armed bandits with multiple plays, Math. Meth. Oper. Res. 50, 449-61.
    • (1991) Math. Meth. Oper. Res. , vol.50 , pp. 449-461
    • Pandelis, D.G.1    Teneketzis, D.2
  • 173
    • 0003725604 scopus 로고
    • Combinatorial Optimization: Algorithms and Complexity
    • Prentice-Hall
    • Papadimitriou, C.H. and Steiglitz, K. (1982). Combinatorial Optimization: Algorithms and Complexity, Prentice-Hall.
    • (1982)
    • Papadimitriou, C.H.1    Steiglitz, K.2
  • 174
    • 0032628612 scopus 로고    scopus 로고
    • The complexity of optimal queuing network control
    • Papadimitriou, C.H. and Tsitsiklis, J.N. (1999). The complexity of optimal queuing network control, Math. Oper. Res. 24, 293-305.
    • (1999) Math. Oper. Res. , vol.24 , pp. 293-305
    • Papadimitriou, C.H.1    Tsitsiklis, J.N.2
  • 175
    • 0037573224 scopus 로고    scopus 로고
    • Some memoryless bandit policies
    • Peköz, E.A. (2003). Some memoryless bandit policies, J. Appl. Prob. 40, 250-6.
    • (2003) J. Appl. Prob. , vol.40 , pp. 250-256
    • Peköz, E.A.1
  • 177
    • 84892234592 scopus 로고    scopus 로고
    • Scheduling: Theory
    • Algorithms and Systems, 3rd edition, Springer
    • Pinedo, M.L. (2008). Scheduling: Theory, Algorithms and Systems, 3rd edition, Springer.
    • (2008)
    • Pinedo, M.L.1
  • 178
    • 0002308024 scopus 로고
    • Sequential Control with Incomplete Information: The Bayesian Approach
    • Nauka
    • Presman, E.L. and Sonin, I.M. (1982). Sequential Control with Incomplete Information: The Bayesian Approach, Nauka.
    • (1982)
    • Presman, E.L.1    Sonin, I.M.2
  • 179
    • 85102627959 scopus 로고    scopus 로고
    • Markov Decision Processes: Discrete Stochastic Dynamic Programming
    • 2nd revised edition, John Wiley & Sons, Ltd
    • Puterman, M.L. (2005). Markov Decision Processes: Discrete Stochastic Dynamic Programming, 2nd revised edition, John Wiley & Sons, Ltd.
    • (2005)
    • Puterman, M.L.1
  • 180
    • 0004124101 scopus 로고
    • Applied Statistical Decision Theory
    • Harvard Business School
    • Raiffa, H. and Schlaifer, R. (1961). Applied Statistical Decision Theory, Harvard Business School.
    • (1961)
    • Raiffa, H.1    Schlaifer, R.2
  • 181
    • 0001101258 scopus 로고
    • On a characterization of the Poisson distribution
    • Rao, C.R. and Rubin, H. (1964). On a characterization of the Poisson distribution, Sankhya A 32, 265-70.
    • (1964) Sankhya A , vol.32 , pp. 265-270
    • Rao, C.R.1    Rubin, H.2
  • 183
    • 84966203785 scopus 로고
    • Some aspects of the sequential design of experiments
    • Robbins, H. (1952). Some aspects of the sequential design of experiments, Bull. Amer. Math. Soc. 58, 527-35.
    • (1952) Bull. Amer. Math. Soc. , vol.58 , pp. 527-535
    • Robbins, H.1
  • 184
    • 0020116650 scopus 로고
    • Algorithms for evaluating the dynamic allocation index
    • Robinson, D.R. (1982). Algorithms for evaluating the dynamic allocation index, Op. Res. Letters 1, 72-4.
    • (1982) Op. Res. Letters , vol.1 , pp. 72-74
    • Robinson, D.R.1
  • 185
    • 0024664332 scopus 로고
    • Randomized and past-dependent policies for Markov decision processes with multiple constraints
    • Ross, K.W. (1989). Randomized and past-dependent policies for Markov decision processes with multiple constraints, Oper. Res. 37, 474-7.
    • (1989) Oper. Res. , vol.37 , pp. 474-477
    • Ross, K.W.1
  • 186
    • 0003644137 scopus 로고
    • Applied Probability Models with Optimization Applications
    • Holden-Day
    • Ross, S.M. (1970). Applied Probability Models with Optimization Applications, Holden-Day.
    • (1970)
    • Ross, S.M.1
  • 187
    • 0004038871 scopus 로고
    • Introduction to Stochastic Dynamic Programming
    • Academic Press
    • Ross, S.M. (1983). Introduction to Stochastic Dynamic Programming, Academic Press.
    • (1983)
    • Ross, S.M.1
  • 188
    • 0005372684 scopus 로고
    • Scheduling with random service times
    • Rothkopf, M.H. (1966). Scheduling with random service times, Manage. Sci. 12, 707-13.
    • (1966) Manage. Sci. , vol.12 , pp. 707-713
    • Rothkopf, M.H.1
  • 189
    • 0015960322 scopus 로고
    • Scheduling for minimum total loss using service time distributions
    • Sevcik, K.C. (1974). Scheduling for minimum total loss using service time distributions, JACM 21, 66-75.
    • (1974) JACM , vol.21 , pp. 66-75
    • Sevcik, K.C.1
  • 190
    • 0026861383 scopus 로고
    • Multiclass queueing systems: polymatroidal structure and optimal scheduling control
    • Shanthikumar, J.G. and Yao, D.D. (1992). Multiclass queueing systems: polymatroidal structure and optimal scheduling control, Oper. Res., 40, S293-9.
    • (1992) Oper. Res , vol.40
    • Shanthikumar, J.G.1    Yao, D.D.2
  • 191
    • 0016483826 scopus 로고
    • Decomposition algorithms for single-machine scheduling with precedence relations and deferral costs
    • Sidney, J.B. (1975). Decomposition algorithms for single-machine scheduling with precedence relations and deferral costs, Oper. Res. 23, 283-93.
    • (1975) Oper. Res. , vol.23 , pp. 283-293
    • Sidney, J.B.1
  • 192
    • 0002066821 scopus 로고
    • Various optimizers for single-stage production
    • Smith, W.E. (1956). Various optimizers for single-stage production, Naval Res. Logist. Quart., 1-2, 59-63.
    • (1956) Naval Res. Logist. Quart , vol.1-2 , pp. 59-63
    • Smith, W.E.1
  • 193
    • 49349084225 scopus 로고    scopus 로고
    • A generalized Gittins index for a Markov chain and its recursive calculation
    • Sonin, I.M. (2008). A generalized Gittins index for a Markov chain and its recursive calculation, Statist. Probab. Lett. 78, 1526-33.
    • (2008) Statist. Probab. Lett. , vol.78 , pp. 1526-1533
    • Sonin, I.M.1
  • 194
    • 0004245209 scopus 로고
    • Theory of Optimal Search
    • Academic Press
    • Stone, L.D. (1975). Theory of Optimal Search, Academic Press.
    • (1975)
    • Stone, L.D.1
  • 195
    • 84886646373 scopus 로고
    • Search theory index
    • Technical Report TN-017-80, Institute for Maritime Technology, Simonstown, S. Africa
    • Strümpfer, J. (1980). Search theory index, Technical Report TN-017-80, Institute for Maritime Technology, Simonstown, S. Africa.
    • (1980)
    • Strümpfer, J.1
  • 196
    • 0017469807 scopus 로고
    • Optimal control of single-server queuing networks and multi-class M/G/1 queues with feedback
    • Tcha, D.W. and Pliska, S.R. (1977). Optimal control of single-server queuing networks and multi-class M/G/1 queues with feedback, Oper. Res. 25, 248-58.
    • (1977) Oper. Res. , vol.25 , pp. 248-258
    • Tcha, D.W.1    Pliska, S.R.2
  • 197
    • 0039074926 scopus 로고
    • A bibliography on semi-Markov processes
    • Teugels, J.L. (1976). A bibliography on semi-Markov processes, J. Comp. Appl. Maths., 2, 44.
    • (1976) J. Comp. Appl. Maths , vol.2 , pp. 44
    • Teugels, J.L.1
  • 198
    • 84886689602 scopus 로고
    • The multi-armed bandit problem and optimality of the Gittins index strategy
    • private communication
    • Thron, C. (1984). The multi-armed bandit problem and optimality of the Gittins index strategy, private communication.
    • (1984)
    • Thron, C.1
  • 199
    • 0022739904 scopus 로고
    • A lemma on the multiarmed bandit problem
    • Tsitsiklis, J.N. (1986). A lemma on the multiarmed bandit problem, IEEE T. Automat. Contr. 31, 576-7.
    • (1986) IEEE T. Automat. Contr. , vol.31 , pp. 576-577
    • Tsitsiklis, J.N.1
  • 200
    • 0242590668 scopus 로고
    • A short proof of the Gittins index theorem
    • Tsitsiklis, J.N. (1994). A short proof of the Gittins index theorem, Ann. Appl. Prob. 4, 194-9.
    • (1994) Ann. Appl. Prob. , vol.4 , pp. 194-199
    • Tsitsiklis, J.N.1
  • 201
    • 0003573650 scopus 로고
    • The region of achieveable performance in a model of Klimov
    • Technical report, Research Report RC16543, IBM T.J. Watson Research Center, Yorktown Heights, NY
    • Tsoucas, P. (1991). The region of achieveable performance in a model of Klimov, Technical report, Research Report RC16543, IBM T.J. Watson Research Center, Yorktown Heights, NY.
    • (1991)
    • Tsoucas, P.1
  • 202
    • 0003782186 scopus 로고    scopus 로고
    • Linear Programming: Foundations and Extensions
    • Springer
    • Vanderbei, R. (2007). Linear Programming: Foundations and Extensions, Springer.
    • (2007)
    • Vanderbei, R.1
  • 203
    • 0022060331 scopus 로고
    • Extensions of the multiarmed bandit problem: the discounted case
    • Varaiya, P., Walrand, J. and Buyukkoc, C. (1985). Extensions of the multiarmed bandit problem: the discounted case, IEEE T. Automat. Contr. 30, 426-39.
    • (1985) IEEE T. Automat. Contr. , vol.30 , pp. 426-439
    • Varaiya, P.1    Walrand, J.2    Buyukkoc, C.3
  • 204
    • 0001473026 scopus 로고
    • On finding optimal policies in discrete dynamic programming with no discounting
    • Veinott, A., Jr. (1966). On finding optimal policies in discrete dynamic programming with no discounting, Ann. Math. Stat. 37, 128-94.
    • (1966) Ann. Math. Stat. , vol.37 , pp. 128-194
    • Veinott, A.1
  • 205
    • 0003903696 scopus 로고
    • Statistical Decision Functions
    • John Wiley & Sons, Inc
    • Wald, A. (1950). Statistical Decision Functions, John Wiley & Sons, Inc.
    • (1950)
    • Wald, A.1
  • 206
    • 0003944095 scopus 로고
    • An Introduction to Queueing Networks
    • Prentice Hall
    • Walrand, J. (1988). An Introduction to Queueing Networks, Prentice Hall.
    • (1988)
    • Walrand, J.1
  • 207
    • 79955773062 scopus 로고    scopus 로고
    • Error bounds for calculation of the Gittins indices
    • Wang, Y.G. (1997). Error bounds for calculation of the Gittins indices, Austral. J. Statist. 39, 225-33.
    • (1997) Austral. J. Statist. , vol.39 , pp. 225-233
    • Wang, Y.G.1
  • 208
    • 61449109791 scopus 로고    scopus 로고
    • Application of multi-armed bandits to sensor management
    • in A.O. Hero III, D.A. Castanon, D. Cochran and K. Kastella, editors, Foundations and Applications of Sensor Management, Springer
    • Washburn, A. (2008). Application of multi-armed bandits to sensor management, in A.O. Hero III, D.A. Castanon, D. Cochran and K. Kastella, editors, Foundations and Applications of Sensor Management, Springer, pages 153-75.
    • (2008) , pp. 153-175
    • Washburn, A.1
  • 209
    • 84886703566 scopus 로고
    • Multi-server Stochastic Scheduling
    • Ph.D. thesis, Cambridge
    • Weber, R.R. (1980). Multi-server Stochastic Scheduling, Ph.D. thesis, Cambridge.
    • (1980)
    • Weber, R.R.1
  • 210
    • 0020100028 scopus 로고
    • Scheduling jobs with stochastic processing requirement on parallel machines to minimise makespan or flowtime
    • Weber, R.R. (1982). Scheduling jobs with stochastic processing requirement on parallel machines to minimise makespan or flowtime, J. Appl. Prob. 19, 167-82.
    • (1982) J. Appl. Prob. , vol.19 , pp. 167-182
    • Weber, R.R.1
  • 211
    • 0001072450 scopus 로고
    • On the Gittins index for multiarmed bandits
    • Weber, R.R. (1992). On the Gittins index for multiarmed bandits, Ann. Appl. Prob. 2, 1024-33.
    • (1992) Ann. Appl. Prob. , vol.2 , pp. 1024-1033
    • Weber, R.R.1
  • 212
    • 51149093231 scopus 로고    scopus 로고
    • Comments on: Dynamic priority allocation via restless bandit marginal productivity indices
    • Weber, R.R. (2007). Comments on: Dynamic priority allocation via restless bandit marginal productivity indices, TOP 15, 211-16.
    • (2007) TOP , vol.15 , pp. 211-216
    • Weber, R.R.1
  • 213
    • 0002327722 scopus 로고
    • On an index policy for restless bandits
    • Weber, R.R. and Weiss, G. (1990). On an index policy for restless bandits, J. Appl. Prob. 27, 637-48.
    • (1990) J. Appl. Prob. , vol.27 , pp. 637-648
    • Weber, R.R.1    Weiss, G.2
  • 214
    • 0000397511 scopus 로고
    • Addendum to 'On an index policy for restless bandits'
    • Weber, R.R. and Weiss, G. (1991). Addendum to 'On an index policy for restless bandits', Adv. Appl. Prob. 23, 429-30.
    • (1991) Adv. Appl. Prob. , vol.23 , pp. 429-430
    • Weber, R.R.1    Weiss, G.2
  • 215
    • 84975987963 scopus 로고
    • Branching bandit processes
    • Weiss, G. (1988). Branching bandit processes, Prob. Eng. Inform. Sci. 2, 269-78.
    • (1988) Prob. Eng. Inform. Sci. , vol.2 , pp. 269-278
    • Weiss, G.1
  • 216
    • 0000531531 scopus 로고
    • Optimal search for the best alternative
    • Weitzman, M.L. (1979). Optimal search for the best alternative, Econometrica 47, 641-54.
    • (1979) Econometrica , vol.47 , pp. 641-654
    • Weitzman, M.L.1
  • 217
    • 0000248624 scopus 로고
    • Multi-armed bandits and the Gittins index
    • Whittle, P. (1980). Multi-armed bandits and the Gittins index, J. R. Statist. Soc. B 42, 143-49.
    • (1980) J. R. Statist. Soc. B , vol.42 , pp. 143-149
    • Whittle, P.1
  • 218
    • 0000595228 scopus 로고
    • Arm-acquiring bandits
    • Whittle, P. (1981). Arm-acquiring bandits, Ann. Prob. 9, 284-92.
    • (1981) Ann. Prob. , vol.9 , pp. 284-292
    • Whittle, P.1
  • 219
    • 0003789938 scopus 로고
    • Optimization over Time
    • John Wiley & Sons, Ltd, 2
    • Whittle, P. (1983). Optimization over Time, John Wiley & Sons, Ltd; volume 1, 1982, volume 2, 1983.
    • (1982) , vol.1
    • Whittle, P.1
  • 220
    • 0001043843 scopus 로고
    • Restless bandits: activity allocation in a changing world
    • in J. Gani, editor, A Celebration of Applied Probability, Applied Probability Special Volume 25A, Applied Probability Trust
    • Whittle, P. (1988). Restless bandits: activity allocation in a changing world, in J. Gani, editor, A Celebration of Applied Probability, Applied Probability Special Volume 25A, Applied Probability Trust, pages 287-98.
    • (1988) , pp. 287-298
    • Whittle, P.1
  • 221
    • 84886711028 scopus 로고
    • Book review: Multi-armed Bandit Allocation Indices
    • Whittle, P. (1991). Book review: Multi-armed Bandit Allocation Indices, Stoch. Stoch. Rep. 35, 125-8.
    • (1991) Stoch. Stoch. Rep. , vol.35 , pp. 125-128
    • Whittle, P.1
  • 222
    • 0003427290 scopus 로고    scopus 로고
    • Optimal Control: Basics and Beyond
    • John Wiley & Sons, Ltd
    • Whittle, P. (1996). Optimal Control: Basics and Beyond, John Wiley & Sons, Ltd.
    • (1996)
    • Whittle, P.1
  • 223
    • 27944487945 scopus 로고    scopus 로고
    • Tax problems in the undiscounted case
    • Whittle, P. (2005). Tax problems in the undiscounted case, J. Appl. Prob. 42, 754-65.
    • (2005) J. Appl. Prob. , vol.42 , pp. 754-765
    • Whittle, P.1
  • 224
    • 84926203212 scopus 로고    scopus 로고
    • Networks: Optimisation and Evolution
    • Cambridge University Press
    • Whittle, P. (2007). Networks: Optimisation and Evolution, Cambridge University Press.
    • (2007)
    • Whittle, P.1
  • 225
    • 67650362301 scopus 로고    scopus 로고
    • Some results on the Gittins index for a normal reward process
    • in H. Ho, C. Ing and T. Lai, editors, Time Series and Related Topics: In Memory of Ching-Zong Wei, Institute of Mathematical Statistics
    • Yao, Y.C. (2006). Some results on the Gittins index for a normal reward process, in H. Ho, C. Ing and T. Lai, editors, Time Series and Related Topics: In Memory of Ching-Zong Wei, Institute of Mathematical Statistics, pages 284-94.
    • (2006) , pp. 284-294
    • Yao, Y.C.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.