-
2
-
-
0036568025
-
Finite-time analysis of the multiarmed bandit problem
-
DOI 10.1023/A:1013689704352, Computational Learning Theory
-
Peter Auer, Nicolò Cesa-Bianchi, and Paul Fischer. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47(2-3):235-256, 2002. (Pubitemid 34126111)
-
(2002)
Machine Learning
, vol.47
, Issue.2-3
, pp. 235-256
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
3
-
-
80555137396
-
High-probability regret bounds for bandit online linear optimization
-
Peter L. Bartlett, Varsha Dani, Thomas P. Hayes, Sham Kakade, Alexander Rakhlin, and Ambuj Tewari. High-probability regret bounds for bandit online linear optimization. In Proceedings of the 21st Annual Conference on Learning Theory (COLT), pages 335-342, 2008.
-
(2008)
Proceedings of the 21st Annual Conference on Learning Theory (COLT)
, pp. 335-342
-
-
Bartlett, P.L.1
Dani, V.2
Hayes, T.P.3
Kakade, S.4
Rakhlin, A.5
Tewari, A.6
-
4
-
-
77950468650
-
Metamodel-based simulation optimization
-
Shane G. Henderson and Barry L. Nelson, editors, Elsevier
-
Russell R. Barton and Martin Meckesheimer. Metamodel-based simulation optimization. In Shane G. Henderson and Barry L. Nelson, editors, Simulation, volume 13 of Handbooks in Operations Research and Management Science, pages 535-574. Elsevier, 2006.
-
(2006)
Simulation, Volume 13 of Handbooks in Operations Research and Management Science
, pp. 535-574
-
-
Barton, R.R.1
Meckesheimer, M.2
-
5
-
-
0002426110
-
A single-sample multiple decision procedure for ranking means of normal populations with known variances
-
Robert E. Bechhofer. A single-sample multiple decision procedure for ranking means of normal populations with known variances. The Annals of Mathematical Statistics, 25(1):16-39, 1954.
-
(1954)
The Annals of Mathematical Statistics
, vol.25
, Issue.1
, pp. 16-39
-
-
Bechhofer, R.E.1
-
6
-
-
0003513355
-
-
John Wiley & Sons, New York, NY
-
Robert E. Bechhofer, Thomas J. Santner, and David M. Goldsman. Design and Analysis of Experiments for Statistical Selection, Screening and Multiple Comparisons. John Wiley & Sons, New York, NY, 1995.
-
(1995)
Design and Analysis of Experiments for Statistical Selection, Screening and Multiple Comparisons
-
-
Bechhofer, R.E.1
Santner, T.J.2
Goldsman, D.M.3
-
7
-
-
0024680419
-
Adaptive aggregation methods for infinite horizon dynamic programming
-
Dimitri P. Bertsekas and David A. Castanon. Adaptive aggregation methods for infinite horizon dynamic programming. IEEE Transactions on Automatic Control, 34(6):589-598, 1989.
-
(1989)
IEEE Transactions on Automatic Control
, vol.34
, Issue.6
, pp. 589-598
-
-
Bertsekas, D.P.1
Castanon, D.A.2
-
9
-
-
0041965975
-
R-MAX - A general polynomial time algorithm for near-optimal reinforcement learning
-
Ronen I. Brafman and Moshe Tennenholtz. R-MAX - a general polynomial time algorithm for near-optimal reinforcement learning. Journal of Machine Learning Research, 3:213-231, 2003.
-
(2003)
Journal of Machine Learning Research
, vol.3
, pp. 213-231
-
-
Brafman, R.I.1
Tennenholtz, M.2
-
10
-
-
0000137128
-
Widely convergent method for finding multiple solutions of simultaneous nonlinear equations
-
Franklin H. Branin. Widely convergent method for finding multiple solutions of simultaneous nonlinear equations. IBM Journal of Reseach and Development, 16(5):504-522, 1972.
-
(1972)
IBM Journal of Reseach and Development
, vol.16
, Issue.5
, pp. 504-522
-
-
Branin, F.H.1
-
11
-
-
77958068642
-
A tutorial on Bayesian optimization of expensive cost functions, with application to active user modeling and hierarchical reinforcement learning
-
University of British Columbia
-
Erik Brochu, Mike Cora, and Nando de Freitas. A tutorial on Bayesian optimization of expensive cost functions, with application to active user modeling and hierarchical reinforcement learning. Technical Report TR-2009-023, Department of Computer Science, University of British Columbia, 2009.
-
(2009)
Technical Report TR-2009-023, Department of Computer Science
-
-
Brochu, E.1
Cora, M.2
De Freitas, N.3
-
14
-
-
0030387008
-
A gradient approach for smartly allocating computing budget for discrete event simulation
-
Chun-Hung Chen, Hsiao-Chang Chen, and Liyi Dai. A gradient approach for smartly allocating computing budget for discrete event simulation. In Proceedings of the 28th Conference on Winter Simulation, pages 398-405, 1996.
-
(1996)
Proceedings of the 28th Conference on Winter Simulation
, pp. 398-405
-
-
Chen, C.1
Chen, H.2
Dai, L.3
-
15
-
-
0035460965
-
New two-stage and sequential procedures for selecting the best simulated system
-
Stephen E. Chick and Koichiro Inoue. New two-stage and sequential procedures for selecting the best simulated system. Operations Research, 49(5):732-743, 2001. (Pubitemid 34011324)
-
(2001)
Operations Research
, vol.49
, Issue.5
, pp. 732-743
-
-
Chick, S.E.1
Inoue, K.2
-
16
-
-
77949359798
-
Sequential sampling to myopically maximize the expected value of information
-
Stephen E. Chick, Jurgen Branke, and Christian Schmidt. Sequential sampling to myopically maximize the expected value of information. INFORMS Journal on Computing, 22(1):71-80, 2010.
-
(2010)
INFORMS Journal on Computing
, vol.22
, Issue.1
, pp. 71-80
-
-
Chick, S.E.1
Branke, J.2
Schmidt, C.3
-
19
-
-
20744454447
-
Online convex optimization in the bandit setting: Gradient descent without a gradient
-
Proceedings of the Sixteenth Annual ACM-SIAM Symposium on Discrete Algorithms
-
Abraham D. Flaxman, Adam Tauman Kalai, and H. Brendan McMahan. Online convex optimization in the bandit setting: gradient descent without a gradient. In Proceedings of the Sixteenth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA '05, pages 385-394, 2005. (Pubitemid 40851394)
-
(2005)
Proceedings of the Annual ACM-SIAM Symposium on Discrete Algorithms
, pp. 385-394
-
-
Flaxman, A.D.1
Kalai, A.T.2
McMahan, H.B.3
-
20
-
-
55549135706
-
A knowledge-gradient policy for sequential information collection
-
Peter I. Frazier, Warren B. Powell, and Savas Dayanik. A knowledge-gradient policy for sequential information collection. SIAM Journal on Control and Optimization, 47(5):2410-2439, 2008.
-
(2008)
SIAM Journal on Control and Optimization
, vol.47
, Issue.5
, pp. 2410-2439
-
-
Frazier, P.I.1
Powell, W.B.2
Dayanik, S.3
-
21
-
-
70449498873
-
The knowledge-gradient policy for correlated normal beliefs
-
Peter I. Frazier, Warren B. Powell, and Savas Dayanik. The knowledge-gradient policy for correlated normal beliefs. INFORMS Journal on Computing, 21(4):599-613, 2009.
-
(2009)
INFORMS Journal on Computing
, vol.21
, Issue.4
, pp. 599-613
-
-
Frazier, P.I.1
Powell, W.B.2
Dayanik, S.3
-
22
-
-
56349109509
-
Value function approximation using multiple aggregation for multiattribute resource management
-
Abraham George, Warren B. Powell, and Sanjeev R. Kulkarni. Value function approximation using multiple aggregation for multiattribute resource management. Journal of Machine Learning Research, 9:2079-2111, 2008.
-
(2008)
Journal of Machine Learning Research
, vol.9
, pp. 2079-2111
-
-
George, A.1
Powell, W.B.2
Kulkarni, S.R.3
-
25
-
-
0030590294
-
Bayesian look ahead one-stage sampling allocations for selection of the best population
-
DOI 10.1016/0378-3758(95)00169-7
-
Shanti S. Gupta and Klaus J. Miescke. Bayesian look ahead one-stage sampling allocations for selection of the best population. Journal of Statistical Planning and Inference, 54(2):229-244, 1996. (Pubitemid 126161097)
-
(1996)
Journal of Statistical Planning and Inference
, vol.54
, Issue.2
, pp. 229-244
-
-
Gupta, S.S.1
Miescke, K.J.2
-
26
-
-
0003684449
-
-
Springer series in Statistics, New York, NY
-
Trevor Hastie, Robert Tibshirani, and Jerome H. Friedman. The Elements of Statistical Learning. Springer series in Statistics, New York, NY, 2001.
-
(2001)
The Elements of Statistical Learning
-
-
Hastie, T.1
Tibshirani, R.2
Friedman, J.H.3
-
27
-
-
34548237750
-
Opportunity cost and OCBA selection procedures in ordinal optimization for a fixed number of alternative systems
-
DOI 10.1109/TSMCC.2007.900656
-
Donghai He, Stephen E. Chick, and Chun-Hung Chen. Opportunity cost and OCBA selection procedures in ordinal optimization for a fixed number of alternative systems. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews, 37(5):951-961, 2007. (Pubitemid 47323564)
-
(2007)
IEEE Transactions on Systems, Man and Cybernetics Part C: Applications and Reviews
, vol.37
, Issue.5
, pp. 951-961
-
-
He, D.1
Chick, S.E.2
Chen, C.-H.3
-
28
-
-
33644791173
-
Global optimization of stochastic black-box systems via sequential kriging meta-models
-
DOI 10.1007/s10898-005-2454-3
-
Deng Huang, Theodore T. Allen, William I. Notz, and Ning Zheng. Global optimization of stochastic black-box systems via sequential kriging meta-models. Journal of Global Optimization, 34(3):441-466, 2006. (Pubitemid 43352177)
-
(2006)
Journal of Global Optimization
, vol.34
, Issue.3
, pp. 441-466
-
-
Huang, D.1
Allen, T.T.2
Notz, W.I.3
Zeng, N.4
-
30
-
-
0000561424
-
Efficient Global Optimization of Expensive Black-Box Functions
-
Donald R. Jones, Matthias Schonlau, and William J. Welch. Efficient global optimization of expensive black-box functions. Journal of Global Optimization, 13(4):455-492, 1998. (Pubitemid 128507405)
-
(1998)
Journal of Global Optimization
, vol.13
, Issue.4
, pp. 455-492
-
-
Jones, D.R.1
Schonlau, M.2
Welch, W.J.3
-
32
-
-
0036832954
-
Near-optimal reinforcement learning in polynomial time
-
DOI 10.1023/A:1017984413808
-
Michael Kearns and Satinder Singh. Near-optimal reinforcement learning in polynomial time. Machine Learning, 49(2-3):209-232, 2002. (Pubitemid 34325687)
-
(2002)
Machine Learning
, vol.49
, Issue.2-3
, pp. 209-232
-
-
Kearns, M.1
Singh, S.2
-
36
-
-
84998710865
-
A new method of locating the maximum of an arbitrary multipeak curve in the presence of noise
-
Harold J. Kushner. A new method of locating the maximum of an arbitrary multipeak curve in the presence of noise. Journal of Basic Engineering, 86:97-106, 1964.
-
(1964)
Journal of Basic Engineering
, vol.86
, pp. 97-106
-
-
Kushner, H.J.1
-
37
-
-
0000854435
-
Adaptive treatment allocation and the multi-armed bandit problem
-
Tze L. Lai. Adaptive treatment allocation and the multi-armed bandit problem. The Annals of Statistics, 15(3):1091-1114, 1987.
-
(1987)
The Annals of Statistics
, vol.15
, Issue.3
, pp. 1091-1114
-
-
Lai, T.L.1
-
43
-
-
85035066177
-
On Bayesian methods for seeking the extremum
-
G. Marchuk, editor, July 17, 1974, of Lecture Notes in Computer Science, Springer Berlin/Heidelberg
-
Jonas Mockus. On Bayesian methods for seeking the extremum. In G. Marchuk, editor, Optimization Techniques IFIP Technical Conference Novosibirsk, July 17, 1974, volume 27 of Lecture Notes in Computer Science, pages 400-404. Springer Berlin/Heidelberg, 1975.
-
(1975)
Optimization Techniques IFIP Technical Conference Novosibirsk
, vol.27
, pp. 400-404
-
-
Mockus, J.1
-
47
-
-
0000606094
-
Aggregation and disaggregation techniques and methodology in optimization
-
David F. Rogers, Robert D. Plante, Richard T. Wong, and James R. Evans. Aggregation and disaggregation techniques and methodology in optimization. Operations Research, 39(4):553-582, 1991.
-
(1991)
Operations Research
, vol.39
, Issue.4
, pp. 553-582
-
-
Rogers, D.F.1
Plante, R.D.2
Wong, R.T.3
Evans, J.R.4
-
50
-
-
63449141834
-
Powell. An approximate dynamic programming algorithm for large-scale fleet management: A case application
-
Hugo P. Simao, Jeff Day, Abraham P. George, Ted Gifford, John Nienow, and Warren B. Powell. An approximate dynamic programming algorithm for large-scale fleet management: A case application. Transportation Science, 43(2):178-197, 2009.
-
(2009)
Transportation Science
, vol.43
, Issue.2
, pp. 178-197
-
-
Simao, H.P.1
Day, J.2
George, A.P.3
Gifford, T.4
Nienow, J.5
Warren, B.6
-
54
-
-
77954033598
-
Convergence properties of the expected improvement algorithm with fixed mean and covariance functions
-
Emmanuel Vazquez and Julien Bect. Convergence properties of the expected improvement algorithm with fixed mean and covariance functions. Journal of Statistical Planning and Inference, 140(11):3088-3095, 2010.
-
(2010)
Journal of Statistical Planning and Inference
, vol.140
, Issue.11
, pp. 3088-3095
-
-
Vazquez, E.1
Bect, J.2
-
55
-
-
67650938640
-
An informational approach to the global optimization of expensive-to-evaluate functions
-
Julien Villemonteix, Emmanuel Vazquez, and Eric Walter. An informational approach to the global optimization of expensive-to-evaluate functions. Journal of Global Optimization, 44(4):509-534, 2009.
-
(2009)
Journal of Global Optimization
, vol.44
, Issue.4
, pp. 509-534
-
-
Villemonteix, J.1
Vazquez, E.2
Walter, E.3
|