메뉴 건너뛰기




Volumn 17, Issue 2, 1996, Pages 122-142

Optimal adaptive policies for sequential allocation problems

Author keywords

[No Author keywords available]

Indexed keywords


EID: 0030159874     PISSN: 01968858     EISSN: None     Source Type: Journal    
DOI: 10.1006/aama.1996.0007     Document Type: Article
Times cited : (176)

References (33)
  • 1
    • 0011427037 scopus 로고
    • Iterative adaptive control of denumerable state, average-cost Markov systems
    • 1. R. Acost-Abreu and O. Hernandez-Lerma, Iterative adaptive control of denumerable state, average-cost Markov systems, Control Cybernet. 14 (1985), 313-322.
    • (1985) Control Cybernet. , vol.14 , pp. 313-322
    • Acost-Abreu, R.1    Hernandez-Lerma, O.2
  • 2
    • 0024886640 scopus 로고
    • Asymtotically efficient adaptive allocation schemes for controlled markov chains: Finite parameter space
    • 2. R. Agrawal, D. Teneketzis, and V. Anantharam, Asymtotically efficient adaptive allocation schemes for controlled Markov chains: Finite parameter space, Trans. Automat. Control IEEE 34 (1989), 1249-1259.
    • (1989) Trans. Automat. Control IEEE , vol.34 , pp. 1249-1259
    • Agrawal, R.1    Teneketzis, D.2    Anantharam, V.3
  • 3
    • 0003787146 scopus 로고
    • Princeton Univ. Press, Princeton, NJ
    • 3. R. Bellman, "Dynamic Programming," Princeton Univ. Press, Princeton, NJ, 1957.
    • (1957) Dynamic Programming
    • Bellman, R.1
  • 5
    • 0001849338 scopus 로고
    • Routing in queueing networks under imperfect information: Stochastic dominance and thresholds
    • 5. F. J. Beutler and D. Teneketzis, Routing in queueing networks under imperfect information: Stochastic dominance and thresholds, Stochastics Stochastics Rep. 26 (1989), 81-100.
    • (1989) Stochastics Stochastics Rep. , vol.26 , pp. 81-100
    • Beutler, F.J.1    Teneketzis, D.2
  • 7
    • 84976024933 scopus 로고
    • On sequencing two types of tasks on a single processor under incomplete information
    • 7. A. N. Burnetas and M. N. Katehakis, On sequencing two types of tasks on a single processor under incomplete information, Probab. Engrg. Inform. Sci. 7 (1993) 85-119.
    • (1993) Probab. Engrg. Inform. Sci. , vol.7 , pp. 85-119
    • Burnetas, A.N.1    Katehakis, M.N.2
  • 12
    • 0019577148 scopus 로고
    • Nonstatoinary markov decision problems with converging parameters
    • 12. A. Federgruen and P. Schweitzer, Nonstatoinary Markov decision problems with converging parameters, J. Optim. Theory Appl. 34 (1981), 207-241.
    • (1981) J. Optim. Theory Appl. , vol.34 , pp. 207-241
    • Federgruen, A.1    Schweitzer, P.2
  • 14
    • 0011413450 scopus 로고
    • Adaptive policies for markov renewal programs
    • 14. B. L. Fox and J. F. Rolph, Adaptive policies for Markov renewal programs, Ann. Statist. 1 (1973), 334-341.
    • (1973) Ann. Statist. , vol.1 , pp. 334-341
    • Fox, B.L.1    Rolph, J.F.2
  • 15
    • 0000169010 scopus 로고
    • Bandit processes and dynamic allocation indices (with discussion)
    • 15. J. C. Gittins, Bandit processes and dynamic allocation indices (with discussion), J. Roy. Statist. Soc. Ser. B 41 (1979), 335-340.
    • (1979) J. Roy. Statist. Soc. Ser. B , vol.41 , pp. 335-340
    • Gittins, J.C.1
  • 16
    • 0000634558 scopus 로고
    • On bayesian models in stochastic scheduling
    • 16. J. C. Gittins and K. D. Glazebrook, On Bayesian models in stochastic scheduling, J. Appl. Probab. 14 (1977), 556-565.
    • (1977) J. Appl. Probab. , vol.14 , pp. 556-565
    • Gittins, J.C.1    Glazebrook, K.D.2
  • 17
    • 0018709825 scopus 로고
    • A dynamic allocation index for the discounted multarmed bandit problem
    • 17. J. C. Gittins and D. M. Jones, A dynamic allocation index for the discounted multarmed bandit problem, Biometrika 66 (1979), 561-565.
    • (1979) Biometrika , vol.66 , pp. 561-565
    • Gittins, J.C.1    Jones, D.M.2
  • 19
    • 0011511430 scopus 로고
    • Computing optimal sequential allocation rules in clinical trials
    • J. van Ryzin, Ed. IMS Lecture Notes - Monograph Series, Inst. Math .Statist., Hayward, CA
    • 19. M. N. Katehakis and C. Derman, Computing optimal sequential allocation rules in clinical trials, in "Adaptive Statistical Procedures and Related Topics," (J. van Ryzin, Ed. Vol. 8, pp. 29-39, IMS Lecture Notes - Monograph Series, Inst. Math .Statist., Hayward, CA, 1985.
    • (1985) Adaptive Statistical Procedures and Related Topics , vol.8 , pp. 29-39
    • Katehakis, M.N.1    Derman, C.2
  • 20
    • 0023345261 scopus 로고
    • The multi-armed bandit problem: Decomposition and computation
    • 20. M. N. Katehakis and A. F. Vienott, Jr., The multi-armed bandit problem: Decomposition and computation, Math. Oper. Res. 12 (1987), 262-268.
    • (1987) Math. Oper. Res. , vol.12 , pp. 262-268
    • Katehakis, M.N.1    Vienott A.F., Jr.2
  • 22
    • 0022062142 scopus 로고
    • A survey of some results in stochastic adaptive control
    • 22. P. R. Kumar, A survey of some results in stochastic adaptive control, SIAM J. Control Optim. 23 (1985) 329-380.
    • (1985) SIAM J. Control Optim. , vol.23 , pp. 329-380
    • Kumar, P.R.1
  • 23
    • 0000854435 scopus 로고
    • Adaptive treatment allocation and the multi-armed bandit problem
    • 23. T. L. Lai, Adaptive treatment allocation and the multi-armed bandit problem, Ann. Statist. 15 (1987), 1091-1114.
    • (1987) Ann. Statist. , vol.15 , pp. 1091-1114
    • Lai, T.L.1
  • 24
    • 0001732282 scopus 로고
    • Asymptotically optimal allocation of treatments in sequential experiments
    • T. J. Santner and A. C. Tamhane, Eds., Dekker, New York
    • 24. T. L. Lai and H. Robbins, Asymptotically optimal allocation of treatments in sequential experiments, in "Design of Experients: Ranking and Selection: Essays in Honor of Robert E. Bechhofer" (T. J. Santner and A. C. Tamhane, Eds.), Vol. 56, pp. 127-142, Dekker, New York, 1984.
    • (1984) Design of Experients: Ranking and Selection: Essays in Honor of Robert E. Bechhofer , vol.56 , pp. 127-142
    • Lai, T.L.1    Robbins, H.2
  • 25
    • 0002899547 scopus 로고
    • Asymptotically efficient adaptive allocation rules
    • 25. T. L. Lai and H. Robbins, Asymptotically efficient adaptive allocation rules, Adv. in Appl. Math. 6 (1985), 4-22.
    • (1985) Adv. in Appl. Math. , vol.6 , pp. 4-22
    • Lai, T.L.1    Robbins, H.2
  • 26
    • 0011412168 scopus 로고
    • Asymptotically efficient allocation rules for two Bernoulli populations
    • 26. Z. Li and C. Zhang, Asymptotically efficient allocation rules for two Bernoulli populations, J. Roy. Statist. Soc. Ser. B 54 (1992), 609-616.
    • (1992) J. Roy. Statist. Soc. Ser. B , vol.54 , pp. 609-616
    • Li, Z.1    Zhang, C.2
  • 27
    • 0002807389 scopus 로고
    • Estimation and control in Markov chains
    • 27. P. Mandl, Estimation and control in Markov chains, Adv. in Appl. Probab. 6 (1974), 40-60.
    • (1974) Adv. in Appl. Probab. , vol.6 , pp. 40-60
    • Mandl, P.1
  • 28
    • 0011459550 scopus 로고
    • Markov decision processes with unknown transition law: The average return case
    • D. J. White R. Hartley, and L. C. Thomas, Eds., Academic Press, New York
    • 28. K. M. Van Hee, Markov decision processes with unknown transition law: The average return case, in "Recent Developments in Markov Decision Processes" (D. J. White R. Hartley, and L. C. Thomas, Eds.), pp. 227-244. Academic Press, New York, 1980.
    • (1980) Recent Developments in Markov Decision Processes , pp. 227-244
    • Van Hee, K.M.1
  • 29
    • 0011457292 scopus 로고
    • A weak contrast function approach to adaptive semi-Markov decision models
    • S. Tzafestas and C. Watanabe, Eds., Dekker, New York
    • 29. R. A. Milito and J. B. Cruz, Jr., A weak contrast function approach to adaptive semi-Markov decision models, in "Stochastic Large Scale Engineering Systems" (S. Tzafestas and C. Watanabe, Eds.), pp. 253-278, Dekker, New York, 1992.
    • (1992) Stochastic Large Scale Engineering Systems , pp. 253-278
    • Milito, R.A.1    Cruz J.B., Jr.2
  • 31
    • 84966203785 scopus 로고
    • Some aspects of the sequential design of experiments
    • 31. H. Robbins, Some aspects of the sequential design of experiments, Bull. Amer. Math. Monthly 58 (1952), 527-536.
    • (1952) Bull. Amer. Math. Monthly , vol.58 , pp. 527-536
    • Robbins, H.1
  • 33
    • 0000607073 scopus 로고
    • Nonparametric bandit methods
    • 33. S. Yakowitz and W. Lowe, Nonparametric bandit methods, Ann. Oper. Res. 28 (1991), 297-312.
    • (1991) Ann. Oper. Res. , vol.28 , pp. 297-312
    • Yakowitz, S.1    Lowe, W.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.