메뉴 건너뛰기




Volumn 12, Issue , 2011, Pages 2931-2974

Hierarchical knowledge gradient for sequential sampling

Author keywords

Adaptive learning; Bayesian statistics; Hierarchical statistics; Ranking and selection; Sequential experimental design

Indexed keywords

ADAPTIVE LEARNING; BAYESIAN STATISTICS; HIERARCHICAL STATISTICS; RANKING AND SELECTION; SEQUENTIAL EXPERIMENTAL DESIGN;

EID: 80555145311     PISSN: 15324435     EISSN: 15337928     Source Type: Journal    
DOI: None     Document Type: Article
Times cited : (24)

References (56)
  • 2
    • 0036568025 scopus 로고    scopus 로고
    • Finite-time analysis of the multiarmed bandit problem
    • DOI 10.1023/A:1013689704352, Computational Learning Theory
    • Peter Auer, Nicolò Cesa-Bianchi, and Paul Fischer. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47(2-3):235-256, 2002. (Pubitemid 34126111)
    • (2002) Machine Learning , vol.47 , Issue.2-3 , pp. 235-256
    • Auer, P.1    Cesa-Bianchi, N.2    Fischer, P.3
  • 5
    • 0002426110 scopus 로고
    • A single-sample multiple decision procedure for ranking means of normal populations with known variances
    • Robert E. Bechhofer. A single-sample multiple decision procedure for ranking means of normal populations with known variances. The Annals of Mathematical Statistics, 25(1):16-39, 1954.
    • (1954) The Annals of Mathematical Statistics , vol.25 , Issue.1 , pp. 16-39
    • Bechhofer, R.E.1
  • 7
    • 0024680419 scopus 로고
    • Adaptive aggregation methods for infinite horizon dynamic programming
    • Dimitri P. Bertsekas and David A. Castanon. Adaptive aggregation methods for infinite horizon dynamic programming. IEEE Transactions on Automatic Control, 34(6):589-598, 1989.
    • (1989) IEEE Transactions on Automatic Control , vol.34 , Issue.6 , pp. 589-598
    • Bertsekas, D.P.1    Castanon, D.A.2
  • 9
    • 0041965975 scopus 로고    scopus 로고
    • R-MAX - A general polynomial time algorithm for near-optimal reinforcement learning
    • Ronen I. Brafman and Moshe Tennenholtz. R-MAX - a general polynomial time algorithm for near-optimal reinforcement learning. Journal of Machine Learning Research, 3:213-231, 2003.
    • (2003) Journal of Machine Learning Research , vol.3 , pp. 213-231
    • Brafman, R.I.1    Tennenholtz, M.2
  • 10
    • 0000137128 scopus 로고
    • Widely convergent method for finding multiple solutions of simultaneous nonlinear equations
    • Franklin H. Branin. Widely convergent method for finding multiple solutions of simultaneous nonlinear equations. IBM Journal of Reseach and Development, 16(5):504-522, 1972.
    • (1972) IBM Journal of Reseach and Development , vol.16 , Issue.5 , pp. 504-522
    • Branin, F.H.1
  • 11
    • 77958068642 scopus 로고    scopus 로고
    • A tutorial on Bayesian optimization of expensive cost functions, with application to active user modeling and hierarchical reinforcement learning
    • University of British Columbia
    • Erik Brochu, Mike Cora, and Nando de Freitas. A tutorial on Bayesian optimization of expensive cost functions, with application to active user modeling and hierarchical reinforcement learning. Technical Report TR-2009-023, Department of Computer Science, University of British Columbia, 2009.
    • (2009) Technical Report TR-2009-023, Department of Computer Science
    • Brochu, E.1    Cora, M.2    De Freitas, N.3
  • 14
    • 0030387008 scopus 로고    scopus 로고
    • A gradient approach for smartly allocating computing budget for discrete event simulation
    • Chun-Hung Chen, Hsiao-Chang Chen, and Liyi Dai. A gradient approach for smartly allocating computing budget for discrete event simulation. In Proceedings of the 28th Conference on Winter Simulation, pages 398-405, 1996.
    • (1996) Proceedings of the 28th Conference on Winter Simulation , pp. 398-405
    • Chen, C.1    Chen, H.2    Dai, L.3
  • 15
    • 0035460965 scopus 로고    scopus 로고
    • New two-stage and sequential procedures for selecting the best simulated system
    • Stephen E. Chick and Koichiro Inoue. New two-stage and sequential procedures for selecting the best simulated system. Operations Research, 49(5):732-743, 2001. (Pubitemid 34011324)
    • (2001) Operations Research , vol.49 , Issue.5 , pp. 732-743
    • Chick, S.E.1    Inoue, K.2
  • 16
    • 77949359798 scopus 로고    scopus 로고
    • Sequential sampling to myopically maximize the expected value of information
    • Stephen E. Chick, Jurgen Branke, and Christian Schmidt. Sequential sampling to myopically maximize the expected value of information. INFORMS Journal on Computing, 22(1):71-80, 2010.
    • (2010) INFORMS Journal on Computing , vol.22 , Issue.1 , pp. 71-80
    • Chick, S.E.1    Branke, J.2    Schmidt, C.3
  • 19
    • 20744454447 scopus 로고    scopus 로고
    • Online convex optimization in the bandit setting: Gradient descent without a gradient
    • Proceedings of the Sixteenth Annual ACM-SIAM Symposium on Discrete Algorithms
    • Abraham D. Flaxman, Adam Tauman Kalai, and H. Brendan McMahan. Online convex optimization in the bandit setting: gradient descent without a gradient. In Proceedings of the Sixteenth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA '05, pages 385-394, 2005. (Pubitemid 40851394)
    • (2005) Proceedings of the Annual ACM-SIAM Symposium on Discrete Algorithms , pp. 385-394
    • Flaxman, A.D.1    Kalai, A.T.2    McMahan, H.B.3
  • 20
    • 55549135706 scopus 로고    scopus 로고
    • A knowledge-gradient policy for sequential information collection
    • Peter I. Frazier, Warren B. Powell, and Savas Dayanik. A knowledge-gradient policy for sequential information collection. SIAM Journal on Control and Optimization, 47(5):2410-2439, 2008.
    • (2008) SIAM Journal on Control and Optimization , vol.47 , Issue.5 , pp. 2410-2439
    • Frazier, P.I.1    Powell, W.B.2    Dayanik, S.3
  • 21
    • 70449498873 scopus 로고    scopus 로고
    • The knowledge-gradient policy for correlated normal beliefs
    • Peter I. Frazier, Warren B. Powell, and Savas Dayanik. The knowledge-gradient policy for correlated normal beliefs. INFORMS Journal on Computing, 21(4):599-613, 2009.
    • (2009) INFORMS Journal on Computing , vol.21 , Issue.4 , pp. 599-613
    • Frazier, P.I.1    Powell, W.B.2    Dayanik, S.3
  • 22
    • 56349109509 scopus 로고    scopus 로고
    • Value function approximation using multiple aggregation for multiattribute resource management
    • Abraham George, Warren B. Powell, and Sanjeev R. Kulkarni. Value function approximation using multiple aggregation for multiattribute resource management. Journal of Machine Learning Research, 9:2079-2111, 2008.
    • (2008) Journal of Machine Learning Research , vol.9 , pp. 2079-2111
    • George, A.1    Powell, W.B.2    Kulkarni, S.R.3
  • 25
    • 0030590294 scopus 로고    scopus 로고
    • Bayesian look ahead one-stage sampling allocations for selection of the best population
    • DOI 10.1016/0378-3758(95)00169-7
    • Shanti S. Gupta and Klaus J. Miescke. Bayesian look ahead one-stage sampling allocations for selection of the best population. Journal of Statistical Planning and Inference, 54(2):229-244, 1996. (Pubitemid 126161097)
    • (1996) Journal of Statistical Planning and Inference , vol.54 , Issue.2 , pp. 229-244
    • Gupta, S.S.1    Miescke, K.J.2
  • 27
    • 34548237750 scopus 로고    scopus 로고
    • Opportunity cost and OCBA selection procedures in ordinal optimization for a fixed number of alternative systems
    • DOI 10.1109/TSMCC.2007.900656
    • Donghai He, Stephen E. Chick, and Chun-Hung Chen. Opportunity cost and OCBA selection procedures in ordinal optimization for a fixed number of alternative systems. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews, 37(5):951-961, 2007. (Pubitemid 47323564)
    • (2007) IEEE Transactions on Systems, Man and Cybernetics Part C: Applications and Reviews , vol.37 , Issue.5 , pp. 951-961
    • He, D.1    Chick, S.E.2    Chen, C.-H.3
  • 28
    • 33644791173 scopus 로고    scopus 로고
    • Global optimization of stochastic black-box systems via sequential kriging meta-models
    • DOI 10.1007/s10898-005-2454-3
    • Deng Huang, Theodore T. Allen, William I. Notz, and Ning Zheng. Global optimization of stochastic black-box systems via sequential kriging meta-models. Journal of Global Optimization, 34(3):441-466, 2006. (Pubitemid 43352177)
    • (2006) Journal of Global Optimization , vol.34 , Issue.3 , pp. 441-466
    • Huang, D.1    Allen, T.T.2    Notz, W.I.3    Zeng, N.4
  • 30
    • 0000561424 scopus 로고    scopus 로고
    • Efficient Global Optimization of Expensive Black-Box Functions
    • Donald R. Jones, Matthias Schonlau, and William J. Welch. Efficient global optimization of expensive black-box functions. Journal of Global Optimization, 13(4):455-492, 1998. (Pubitemid 128507405)
    • (1998) Journal of Global Optimization , vol.13 , Issue.4 , pp. 455-492
    • Jones, D.R.1    Schonlau, M.2    Welch, W.J.3
  • 32
    • 0036832954 scopus 로고    scopus 로고
    • Near-optimal reinforcement learning in polynomial time
    • DOI 10.1023/A:1017984413808
    • Michael Kearns and Satinder Singh. Near-optimal reinforcement learning in polynomial time. Machine Learning, 49(2-3):209-232, 2002. (Pubitemid 34325687)
    • (2002) Machine Learning , vol.49 , Issue.2-3 , pp. 209-232
    • Kearns, M.1    Singh, S.2
  • 36
    • 84998710865 scopus 로고
    • A new method of locating the maximum of an arbitrary multipeak curve in the presence of noise
    • Harold J. Kushner. A new method of locating the maximum of an arbitrary multipeak curve in the presence of noise. Journal of Basic Engineering, 86:97-106, 1964.
    • (1964) Journal of Basic Engineering , vol.86 , pp. 97-106
    • Kushner, H.J.1
  • 37
    • 0000854435 scopus 로고
    • Adaptive treatment allocation and the multi-armed bandit problem
    • Tze L. Lai. Adaptive treatment allocation and the multi-armed bandit problem. The Annals of Statistics, 15(3):1091-1114, 1987.
    • (1987) The Annals of Statistics , vol.15 , Issue.3 , pp. 1091-1114
    • Lai, T.L.1
  • 43
    • 85035066177 scopus 로고
    • On Bayesian methods for seeking the extremum
    • G. Marchuk, editor, July 17, 1974, of Lecture Notes in Computer Science, Springer Berlin/Heidelberg
    • Jonas Mockus. On Bayesian methods for seeking the extremum. In G. Marchuk, editor, Optimization Techniques IFIP Technical Conference Novosibirsk, July 17, 1974, volume 27 of Lecture Notes in Computer Science, pages 400-404. Springer Berlin/Heidelberg, 1975.
    • (1975) Optimization Techniques IFIP Technical Conference Novosibirsk , vol.27 , pp. 400-404
    • Mockus, J.1
  • 47
    • 0000606094 scopus 로고
    • Aggregation and disaggregation techniques and methodology in optimization
    • David F. Rogers, Robert D. Plante, Richard T. Wong, and James R. Evans. Aggregation and disaggregation techniques and methodology in optimization. Operations Research, 39(4):553-582, 1991.
    • (1991) Operations Research , vol.39 , Issue.4 , pp. 553-582
    • Rogers, D.F.1    Plante, R.D.2    Wong, R.T.3    Evans, J.R.4
  • 50
    • 63449141834 scopus 로고    scopus 로고
    • Powell. An approximate dynamic programming algorithm for large-scale fleet management: A case application
    • Hugo P. Simao, Jeff Day, Abraham P. George, Ted Gifford, John Nienow, and Warren B. Powell. An approximate dynamic programming algorithm for large-scale fleet management: A case application. Transportation Science, 43(2):178-197, 2009.
    • (2009) Transportation Science , vol.43 , Issue.2 , pp. 178-197
    • Simao, H.P.1    Day, J.2    George, A.P.3    Gifford, T.4    Nienow, J.5    Warren, B.6
  • 54
    • 77954033598 scopus 로고    scopus 로고
    • Convergence properties of the expected improvement algorithm with fixed mean and covariance functions
    • Emmanuel Vazquez and Julien Bect. Convergence properties of the expected improvement algorithm with fixed mean and covariance functions. Journal of Statistical Planning and Inference, 140(11):3088-3095, 2010.
    • (2010) Journal of Statistical Planning and Inference , vol.140 , Issue.11 , pp. 3088-3095
    • Vazquez, E.1    Bect, J.2
  • 55
    • 67650938640 scopus 로고    scopus 로고
    • An informational approach to the global optimization of expensive-to-evaluate functions
    • Julien Villemonteix, Emmanuel Vazquez, and Eric Walter. An informational approach to the global optimization of expensive-to-evaluate functions. Journal of Global Optimization, 44(4):509-534, 2009.
    • (2009) Journal of Global Optimization , vol.44 , Issue.4 , pp. 509-534
    • Villemonteix, J.1    Vazquez, E.2    Walter, E.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.