-
1
-
-
84898063697
-
Competing in the dark: An efficient algorithm for bandit linear optimization
-
Abernethy, J., Hazan, E. & Rakhlin, A. (2008), Competing in the dark: An efficient algorithm for bandit linear optimization, in'Proceedings of the 21st Annual Conference on Learning Theory', pp. 263-274.
-
(2008)
Proceedings of the 21st Annual Conference on Learning Theory
, pp. 263-274
-
-
Abernethy, J.1
Hazan, E.2
Rakhlin, A.3
-
2
-
-
0041656119
-
Optimal learning by experimentation
-
Aghion, P., Bolton, P., Harris, C. & Jullien, B. (1991), 'Optimal learning by experimentation', The Review of Economic Studies 58, 621-654.
-
(1991)
The Review of Economic Studies
, vol.58
, pp. 621-654
-
-
Aghion, P.1
Bolton, P.2
Harris, C.3
Jullien, B.4
-
3
-
-
0000616723
-
Sample mean based index policies with O (log n) regret for the multi-armed bandit problem
-
Agrawal, R. (1995), 'Sample mean based index policies with O (log n) regret for the multi-armed bandit problem', Advance s in Applied Probability 27(4), 1054-1078.
-
(1995)
Advance s in Applied Probability
, vol.27
, Issue.4
, pp. 1054-1078
-
-
Agrawal, R.1
-
4
-
-
0344182998
-
A simulated annealing algorithm with constant temperature for discrete stochastic optimization
-
Alrefaei, M. H. & Andradottir, S. (1999), 'A simulated annealing algorithm with constant temperature for discrete stochastic optimization', Management Science 45,748-764.
-
(1999)
Management Science
, vol.45
, pp. 748-764
-
-
Alrefaei, M.H.1
Andradottir, S.2
-
5
-
-
77956926259
-
Adaptive random search for continuous simulation optimization
-
Andradottir, S. & Prudius, A. A. (2010), 'Adaptive random search for continuous simulation optimization', Naval Research Logistics 57(6), 583-604.
-
(2010)
Naval Research Logistics
, vol.57
, Issue.6
, pp. 583-604
-
-
Andradottir, S.1
Prudius, A.A.2
-
6
-
-
77951184800
-
Stochastic kriging for simulation metamodeling
-
Ankenman, B., Nelson, B. L. & Staum, J. (2010), 'Stochastic kriging for simulation metamodeling', Operations Research 58(2), 371-382.
-
(2010)
Operations Research
, vol.58
, Issue.2
, pp. 371-382
-
-
Ankenman, B.1
Nelson, B.L.2
Staum, J.3
-
7
-
-
70350228783
-
Dynamic pricing for non-perishable products with demand learning
-
Araman, V. & Caldentey, R. (2009), 'Dynamic pricing for non-perishable products with demand learning', Operations Research 57(5), 1169-1188.
-
(2009)
Operations Research
, vol.57
, Issue.5
, pp. 1169-1188
-
-
Araman, V.1
Caldentey, R.2
-
8
-
-
84864970677
-
Best arm identification in multi-armed bandits
-
Audibert, J. Y., Bubeck, S. & Munos, R. (2010), Best arm identification in multi-armed bandits, in'Proceedings of the 23rd Annual Conference on Learning Theory (COLT)', pp. 1-14.
-
(2010)
Proceedings of the 23rd Annual Conference on Learning Theory (COLT)
, pp. 1-14
-
-
Audibert, J.Y.1
Bubeck, S.2
Munos, R.3
-
9
-
-
0036568025
-
Finite-time analysis of the multi-armed bandit problem
-
Auer, P., Cesa-Bianchi, N. & Fischer, P. (2002), 'Finite-time analysis of the multi-armed bandit problem', Machine Learning 47(2), 235-256.
-
(2002)
Machine Learning
, vol.47
, Issue.2
, pp. 235-256
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
10
-
-
73549103329
-
Near-optimal regret bounds for reinforcement learning
-
D. Koller, Y. Bengio, D. Schuurmans, L. Bottou & A. Culotta, eds
-
Auer, P., Jaksch, T. & Ortner, R. (2008), Near-optimal regret bounds for reinforcement learning, in D. Koller, Y. Bengio, D. Schuurmans, L. Bottou & A. Culotta, eds,'Advances in Neural Information Processing Systems', Vol. 21, pp. 89-96.
-
(2008)
Advances in Neural Information Processing Systems
, vol.21
, pp. 89-96
-
-
Auer, P.1
Jaksch, T.2
Ortner, R.3
-
11
-
-
25844499294
-
A partially observed Markov decision process for dynamic pricing
-
Aviv, Y. & Pazgal, A. (2005), 'A partially observed Markov decision process for dynamic pricing', Management Science 51(9), 1400-1416.
-
(2005)
Management Science
, vol.51
, Issue.9
, pp. 1400-1416
-
-
Aviv, Y.1
Pazgal, A.2
-
12
-
-
0011155531
-
Optimality proof for the symmetric Fibonacci search technique
-
Avriel, M. & Wilde, D. (1966), 'Optimality proof for the symmetric Fibonacci search technique', Fibonacci Quarterly 4, 265-269.
-
(1966)
Fibonacci Quarterly
, vol.4
, pp. 265-269
-
-
Avriel, M.1
Wilde, D.2
-
13
-
-
0033078286
-
Simulation optimization with qualitative variables and structural model changes: A genetic algorithm approach
-
Azadivar, F. & Tompkins, G. (1999), 'Simulation optimization with qualitative variables and structural model changes: A genetic algorithm approach', European Journal of Operational Research 113, 169-182.
-
(1999)
European Journal of Operational Research
, vol.113
, pp. 169-182
-
-
Azadivar, F.1
Tompkins, G.2
-
14
-
-
0034694502
-
Facility Layout Optimization Using Simulation and Genetic Algorithms
-
Azadivar, F. & Wang, J. (2000), 'Facility Layout Optimization Using Simulation and Genetic Algorithms', International Journal of Production Research 38( 17), 43694383.
-
(2000)
International Journal of Production Research
, vol.38
, Issue.17
, pp. 43694383
-
-
Azadivar, F.1
Wang, J.2
-
15
-
-
0003990308
-
-
Prentice-Hall, Inc., Englewood Cliffs, N.J
-
Banks, J., Nelson, B. L. & J. S. Carson, 1.1. (1996), Discrete-Event System Simulation, Prentice-Hall, Inc., Englewood Cliffs, N.J.
-
(1996)
Discrete-Event System Simulation
-
-
Banks, J.1
Nelson, B.L.2
Carson, J.S.3
-
16
-
-
0000541772
-
An introduction to ranking and selection procedures
-
Barr, D. R. & Rizvi, M. H. (1966), 'An introduction to ranking and selection procedures', J. Amer. Statist. Assoc. 61(315), 640-646.
-
(1966)
J. Amer. Statist. Assoc.
, vol.61
, Issue.315
, pp. 640-646
-
-
Barr, D.R.1
Rizvi, M.H.2
-
17
-
-
80555137396
-
High-probability regret bounds for bandit online linear optimization
-
Bartlett, P., Dani, V., Hayes, T, Kakade, S., Rakhlin, A. & Tewari, A. (2008), High-probability regret bounds for bandit online linear optimization, in'Proceedings of the 21st Annual Conference on Learning Theory', pp. 335-342.
-
(2008)
Proceedings of the 21st Annual Conference on Learning Theory
, pp. 335-342
-
-
Bartlett, P.1
Dani, V.2
Hayes, T.3
Kakade, S.4
Rakhlin, A.5
Tewari, A.6
-
18
-
-
0002426110
-
A Single-Sample Multiple Decision Procedure for Ranking Means of Normal Populations with known Variances
-
Bechhofer, R. E. (1954), 'A Single-Sample Multiple Decision Procedure for Ranking Means of Normal Populations with known Variances', The Annals of Mathematical Statistics 25, 16-39.
-
(1954)
The Annals of Mathematical Statistics
, vol.25
, pp. 16-39
-
-
Bechhofer, R.E.1
-
19
-
-
0004125919
-
-
University of Chicago Press, Chicago
-
Bechhofer, R. E., Kiefer, J. & Sobel, M. (1968), Sequential Identification and Ranking Procedures, University of Chicago Press, Chicago.
-
(1968)
Sequential Identification and Ranking Procedures
-
-
Bechhofer, R.E.1
Kiefer, J.2
Sobel, M.3
-
20
-
-
0003513355
-
-
J.Wiley & Sons, New York
-
Bechhofer, R., Santner, T. & Goldsman, D. (1995), Design and Analysis of Experiments for Statistical Selection, Screening and Multiple Comparisons, J.Wiley & Sons, New York.
-
(1995)
Design and Analysis of Experiments for Statistical Selection, Screening and Multiple Comparisons
-
-
Bechhofer, R.1
Santner, T.2
Goldsman, D.3
-
21
-
-
84938011869
-
On adaptive control processes
-
Bellman, R. & Kalaba, R. (1959), 'On adaptive control processes', IRE Trans. 4,1-9.
-
(1959)
IRE Trans.
, vol.4
, pp. 1-9
-
-
Bellman, R.1
Kalaba, R.2
-
22
-
-
84884052304
-
-
Princeton University Press, Princeton NJ
-
Ben-Tal, A., Ghaoui, L. E. & Nemirovski, A. (2009), Robust Optimization, Princeton University Press, Princeton NJ.
-
(2009)
Robust Optimization
-
-
Ben-Tal, A.1
Ghaoui, L.E.2
Nemirovski, A.3
-
23
-
-
0003778897
-
-
Springer-Verlag, New York
-
Benveniste, A., Metivier, M. & Priouret, P. (1990), Adaptive Algorithms and Stochastic Approximations, Springer-Verlag, New York.
-
(1990)
Adaptive Algorithms and Stochastic Approximations
-
-
Benveniste, A.1
Metivier, M.2
Priouret, P.3
-
24
-
-
0030352286
-
Learning and strategic pricing
-
Bergemann, D. & Välimäki, J. (1996), 'Learning and strategic pricing', Econometrica 64(5), 1125-1149.
-
(1996)
Econometrica
, vol.64
, Issue.5
, pp. 1125-1149
-
-
Bergemann, D.1
Välimäki, J.2
-
26
-
-
0003487482
-
-
Athena Scientific, Belmont, MA
-
Bertsekas, D. P. & Tsitsiklis, J. N. (1996), Neuro-dynamic programming, Athena Scientific, Belmont, MA.
-
(1996)
Neuro-dynamic programming
-
-
Bertsekas, D.P.1
Tsitsiklis, J.N.2
-
27
-
-
0343441515
-
Restless bandits, linear programming relaxations, and a primal-dual index heuristic
-
Bertsimas, D. J. & Nino-Mora, J. (2000), 'Restless bandits, linear programming relaxations, and a primal-dual index heuristic', Operations Research 48(1), 8090.
-
(2000)
Operations Research
, vol.48
, Issue.1
, pp. 8090
-
-
Bertsimas, D.J.1
Nino-Mora, J.2
-
28
-
-
58149250414
-
Simulation optimization: applications in risk management
-
Better, M., Glover, F. W., Kochenberger, G. & Wang, H. (2008), 'Simulation optimization: applications in risk management', International Journal of Information Technology and Decision Making 7(4), 571-587.
-
(2008)
International Journal of Information Technology and Decision Making
, vol.7
, Issue.4
, pp. 571-587
-
-
Better, M.1
Glover, F.W.2
Kochenberger, G.3
Wang, H.4
-
29
-
-
34249059079
-
Robust optimization: A comprehensive survey
-
Beyer, H. & Sendhoff, B. (2007), 'Robust optimization: A comprehensive survey', Computer Methods in Applied Mechanics and Engineering 196(33-34), 3190-3218.
-
(2007)
Computer Methods in Applied Mechanics and Engineering
, vol.196
, Issue.33-34
, pp. 3190-3218
-
-
Beyer, H.1
Sendhoff, B.2
-
32
-
-
0041965975
-
R-max - a general polynomial time algorithm for near-optimal reinforcement learning
-
Brafman, R. I. & Tennenholtz, M. (2003), 'R-max - a general polynomial time algorithm for near-optimal reinforcement learning', Journal of Machine Learning Research 3,213-231.
-
(2003)
Journal of Machine Learning Research
, vol.3
, pp. 213-231
-
-
Brafman, R.I.1
Tennenholtz, M.2
-
33
-
-
33745765950
-
New developments in ranking and selection: an empirical comparison of the three main approaches
-
M. Kuhl, N. Steiger, F. Argstrong & J. Joines, eds, IEEE, Inc., Piscataway, NJ
-
Branke, J., Chick, S. E. & Schmidt, C. (2005), New developments in ranking and selection: an empirical comparison of the three main approaches, in M. Kuhl, N. Steiger, F. Argstrong & J. Joines, eds,'Proc. 2005 Winter Simulation Conference', IEEE, Inc., Piscataway, NJ, pp. 708-717.
-
(2005)
Proc. 2005 Winter Simulation Conference
, pp. 708-717
-
-
Branke, J.1
Chick, S.E.2
Schmidt, C.3
-
34
-
-
38549097195
-
Selecting a Selection Procedure
-
Branke, J., Chick, S. E. & Schmidt, C. (2007), 'Selecting a Selection Procedure', Management Science 53, 1916-1932.
-
(2007)
Management Science
, vol.53
, pp. 1916-1932
-
-
Branke, J.1
Chick, S.E.2
Schmidt, C.3
-
35
-
-
0036334330
-
Optimal learning and experimentation in bandit problems
-
Brezzi, M. & Lai, T. L. (2002), 'Optimal learning and experimentation in bandit problems', Journal of Economic Dynamics and Control 27(1), 87-108.
-
(2002)
Journal of Economic Dynamics and Control
, vol.27
, Issue.1
, pp. 87-108
-
-
Brezzi, M.1
Lai, T.L.2
-
38
-
-
0001596834
-
A unified approach to a class of best choice problems with an unknown number of options
-
Bruss, F. (1984), 'A unified approach to a class of best choice problems with an unknown number of options', The Annals of Probability 12(3), 882-889.
-
(1984)
The Annals of Probability
, vol.12
, Issue.3
, pp. 882-889
-
-
Bruss, F.1
-
40
-
-
79960128338
-
X-Armed Bandits
-
Bubeck, S., Munos, R., Stoltz, G. & Szepesvari, C. (2011), 'X-Armed Bandits', Journal of Machine Learning Research 12, 1655-1695.
-
(2011)
Journal of Machine Learning Research
, vol.12
, pp. 1655-1695
-
-
Bubeck, S.1
Munos, R.2
Stoltz, G.3
Szepesvari, C.4
-
41
-
-
84949574363
-
Convergence rates of efficient global optimization algorithms
-
Bull, A. D. (2011), 'Convergence rates of efficient global optimization algorithms', Submitted for publication.
-
(2011)
Submitted for publication
-
-
Bull, A.D.1
-
42
-
-
60749134420
-
A comparative study of genetic algorithm components in simulation-based optimization
-
S. Mason, R. Hill, L. Mönch, O. Rose, T. Jefferson & J. Fowler, eds
-
Can, B., Beham, A. & Heavey, C. (2008), A comparative study of genetic algorithm components in simulation-based optimization, in S. Mason, R. Hill, L. Mönch, O. Rose, T. Jefferson & J. Fowler, eds,'Proceedings of the 2008 Winter Simulation Conference', pp. 1829-1837.
-
(2008)
Proceedings of the 2008 Winter Simulation Conference
, pp. 1829-1837
-
-
Can, B.1
Beham, A.2
Heavey, C.3
-
43
-
-
0141815596
-
Dynamic pricing and reinforcement learning
-
Carvalho, A. & Puterman, M. (2003), Dynamic pricing and reinforcement learning, in'Proceedings of the 2003 International Joint Conference on Neural Networks', Vol. 4, pp. 2916-2921.
-
(2003)
Proceedings of the 2003 International Joint Conference on Neural Networks
, vol.4
, pp. 2916-2921
-
-
Carvalho, A.1
Puterman, M.2
-
44
-
-
34547314820
-
Learning and pricing in an Internet environment with binomial demands
-
Carvalho, A. & Puterman, M. (2005), 'Learning and pricing in an Internet environment with binomial demands', Journal of Revenue and Pricing Management 3(4), 320-336.
-
(2005)
Journal of Revenue and Pricing Management
, vol.3
, Issue.4
, pp. 320-336
-
-
Carvalho, A.1
Puterman, M.2
-
45
-
-
1842266137
-
Mathematical questions with their solutions, No. 4528
-
Cayley, A. (1875), 'Mathematical questions with their solutions, No. 4528', Educational Times.
-
(1875)
Educational Times
-
-
Cayley, A.1
-
46
-
-
0036921026
-
Another Look at the Radner-Stiglitz Nonconcavity in the Value of Information
-
Chade, H. & Schlee, E. E. (2002), 'Another Look at the Radner-Stiglitz Nonconcavity in the Value of Information', Journal of Economic Theory 107(2), 421-452.
-
(2002)
Journal of Economic Theory
, vol.107
, Issue.2
, pp. 421-452
-
-
Chade, H.1
Schlee, E.E.2
-
47
-
-
34547120053
-
-
Springer, New York
-
Chang, H. S., Fu, M. C, Hu, J. & Marcus, S. I. (2007), Simulation-Based Algorithms for Markov Decision Processes, Springer, New York.
-
(2007)
Simulation-Based Algorithms for Markov Decision Processes
-
-
Chang, H.S.1
Fu, M.C.2
Hu, J.3
Marcus, S.I.4
-
48
-
-
27844503922
-
Application of genetic algorithms in production and operations management: a review
-
Chaudhry, S. & Luo, W. (2005), 'Application of genetic algorithms in production and operations management: a review', International Journal of Production Research 43(19), 4083-4101.
-
(2005)
International Journal of Production Research
, vol.43
, Issue.19
, pp. 4083-4101
-
-
Chaudhry, S.1
Luo, W.2
-
50
-
-
60749111637
-
Simulation and optimization
-
P. Gray, Z.-L. Chen & S. Raghavan, eds
-
Chen, C.-H., Fu, M. C. & Shi, L. (2008), Simulation and optimization, in P. Gray, Z.-L. Chen & S. Raghavan, eds,'2008 TutORials in Operations Research', pp. 247-260.
-
(2008)
2008 TutORials in Operations Research
, pp. 247-260
-
-
Chen, C.-H.1
Fu, M.C.2
Shi, L.3
-
51
-
-
33845807891
-
Efficient Dynamic Simulation Allocation in Ordinal Optimization
-
Chen, C.-H., He, D. & Fu, M. C. (2006), 'Efficient Dynamic Simulation Allocation in Ordinal Optimization', IEEE Transactions Automatic Control 51, 2005-2009.
-
(2006)
IEEE Transactions Automatic Control
, vol.51
, pp. 2005-2009
-
-
Chen, C.-H.1
He, D.2
Fu, M.C.3
-
52
-
-
0034225544
-
Simulation budget allocation for further enhancing the efficiency of ordinal optimization
-
Chen, C.-H., Lin, J., Yücesan, E. & Chick, S. E. (2000), 'Simulation budget allocation for further enhancing the efficiency of ordinal optimization', Discrete Event Dynamic Systems 10(3), 251-270.
-
(2000)
Discrete Event Dynamic Systems
, vol.10
, Issue.3
, pp. 251-270
-
-
Chen, C.-H.1
Lin, J.2
Yücesan, E.3
Chick, S.E.4
-
54
-
-
84949574364
-
Dynamic pricing with non-conjugate Pareto priors, Technical report
-
Rensselaer Polytechnic Institute
-
Chhabra, M. & Das, S. (2012), Dynamic pricing with non-conjugate Pareto priors, Technical report, Rensselaer Polytechnic Institute.
-
(2012)
-
-
Chhabra, M.1
Das, S.2
-
55
-
-
1642437901
-
Expected opportunity cost guarantees and indifference zone selection procedures
-
S. E. Chick, P. J. Sánchez, D. Ferrin & D. J. Morrice, eds
-
Chick, S. E. (2003), Expected opportunity cost guarantees and indifference zone selection procedures, in S. E. Chick, P. J. Sánchez, D. Ferrin & D. J. Morrice, eds,'Proceedings of the 2003 Winter Simulation Conference', pp. 465-473.
-
(2003)
Proceedings of the 2003 Winter Simulation Conference
, pp. 465-473
-
-
Chick, S.E.1
-
56
-
-
77950462837
-
Subjective Probability and Bayesian Methodology
-
S. Hen-derson & B. Nelson, eds, Simulation', North-Holland Publishing, Amsterdam
-
Chick, S. E. (2006), Subjective Probability and Bayesian Methodology, in S. Hen-derson & B. Nelson, eds,'Handbooks of Operations Research and Management Science, vol. 13: Simulation', North-Holland Publishing, Amsterdam, pp. 225-258.
-
(2006)
Handbooks of Operations Research and Management Science
, vol.13
, pp. 225-258
-
-
Chick, S.E.1
-
57
-
-
67649990621
-
Economic analysis of simulation selection problems
-
Chick, S. E. & Gans, N. (2009), 'Economic analysis of simulation selection problems', Management Science 55(3), 421-37.
-
(2009)
Management Science
, vol.55
, Issue.3
, pp. 421-437
-
-
Chick, S.E.1
Gans, N.2
-
58
-
-
0035460965
-
New two-stage and sequential procedures for selecting the best simulated system
-
Chick, S. E. & Inoue, K. (2001), 'New two-stage and sequential procedures for selecting the best simulated system', Operations Research 49(5), 732-743.
-
(2001)
Operations Research
, vol.49
, Issue.5
, pp. 732-743
-
-
Chick, S.E.1
Inoue, K.2
-
59
-
-
27344432350
-
Selection procedures with frequentist expected opportunity cost bounds
-
Chick, S. E. & Wu, Y. (2005), 'Selection procedures with frequentist expected opportunity cost bounds', Operations Research 53(5), 867-878.
-
(2005)
Operations Research
, vol.53
, Issue.5
, pp. 867-878
-
-
Chick, S.E.1
Wu, Y.2
-
60
-
-
77949359798
-
Sequential Sampling to Myopically Maximize the Expected Value of Information
-
Chick, S. E., Branke, J. & Schmidt, C. (2010), 'Sequential Sampling to Myopically Maximize the Expected Value of Information', INFORMS Journal on Computing 22(1), 71-80.
-
(2010)
INFORMS Journal on Computing
, vol.22
, Issue.1
, pp. 71-80
-
-
Chick, S.E.1
Branke, J.2
Schmidt, C.3
-
61
-
-
34548237750
-
Opportunity cost and OCBA selection procedures in ordinal optimization for a fixed number of alternative systems
-
Chick, S. E., He, D. H. & Chen, C.-H. (2007), 'Opportunity cost and OCBA selection procedures in ordinal optimization for a fixed number of alternative systems', IEEE Transactions on Systems Man and Cybernetics C37, 951-961.
-
(2007)
IEEE Transactions on Systems Man and Cybernetics
, vol.C37
, pp. 951-961
-
-
Chick, S.E.1
He, D.H.2
Chen, C.-H.3
-
64
-
-
34250348767
-
Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration.
-
Cohen, J. D., McClure, S. M. & Yu, A. J. (2007), 'Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration.', Philosophical transactions of the Royal Society of London B362(1481), 933-42.
-
(2007)
Philosophical transactions of the Royal Society of London
, vol.B362
, Issue.1481
, pp. 933-942
-
-
Cohen, J.D.1
McClure, S.M.2
Yu, A.J.3
-
65
-
-
0028424239
-
Improving generalization with active learning
-
Cohn, D., Atlas, L. & Ladner, R. (1994), 'Improving generalization with active learning', Machine Learning 5(2201), 221.
-
(1994)
Machine Learning
, vol.5
, Issue.2201
, pp. 221
-
-
Cohn, D.1
Atlas, L.2
Ladner, R.3
-
66
-
-
0029679131
-
Active learning with statistical models
-
Cohn, D., Ghahramani, Z. & Jordan, M. (1996), 'Active learning with statistical models', Journal of Artificial Intelligence Research 4, 129-145.
-
(1996)
Journal of Artificial Intelligence Research
, vol.4
, pp. 129-145
-
-
Cohn, D.1
Ghahramani, Z.2
Jordan, M.3
-
67
-
-
84900550689
-
Markov decision processes with uncertain transition probabilities
-
Technical Report 11, Operations Research Center, MIT
-
Cozzolino, J., Gonzalez-Zubieta, R. & Miller, R. (1965), Markov decision processes with uncertain transition probabilities, Technical Report 11, Operations Research Center, MIT.
-
(1965)
-
-
Cozzolino, J.1
Gonzalez-Zubieta, R.2
Miller, R.3
-
70
-
-
49349084239
-
Index policies for discounted bandit problems with availability constraints
-
Dayanik, S., Powell, W. B. & Yamazaki, K. (2008), 'Index policies for discounted bandit problems with availability constraints', Adv. in Appl. Probab 40, 377-100.
-
(2008)
Adv. in Appl. Probab
, vol.40
, pp. 377-100
-
-
Dayanik, S.1
Powell, W.B.2
Yamazaki, K.3
-
71
-
-
1142281527
-
Model-based Bayesian Exploration
-
Dearden, R., Friedman, N. & Andre, D. (1999), Model-based Bayesian Exploration, in'Proceedings of the 15th Conference on Uncertainty in Artificial Intelligence', pp. 150-159.
-
(1999)
Proceedings of the 15th Conference on Uncertainty in Artificial Intelligence
, pp. 150-159
-
-
Dearden, R.1
Friedman, N.2
Andre, D.3
-
72
-
-
0031619316
-
Bayesian Q-learning
-
Dearden, R., Friedman, N. & Russell, S. (1998), Bayesian Q-learning, in'Proceedings of the 15th National Conference on Artificial Intelligence', pp. 761-768.
-
(1998)
Proceedings of the 15th National Conference on Artificial Intelligence
, pp. 761-768
-
-
Dearden, R.1
Friedman, N.2
Russell, S.3
-
74
-
-
36048940444
-
A tight sufficient condition for Radner-Stiglitz non-concavity in the value of information
-
Delara, M. & Gilotte, L. (2007), 'A tight sufficient condition for Radner-Stiglitz non-concavity in the value of information', Journal of Economic Theory 137(1), 696-708.
-
(2007)
Journal of Economic Theory
, vol.137
, Issue.1
, pp. 696-708
-
-
Delara, M.1
Gilotte, L.2
-
76
-
-
16244388049
-
Local bandit approximation for optimal learning problems
-
M. Mozer, M. Jordan & T. Pesche, eds, Cambridge, MA: MIT Press
-
Duff, M. & Barto, A. (1996), Local bandit approximation for optimal learning problems, in M. Mozer, M. Jordan & T. Pesche, eds,'Advances in Neural Information Processing Systems', Vol. 9, Cambridge, MA: MIT Press, pp. 1019-1025.
-
(1996)
Advances in Neural Information Processing Systems
, vol.9
, pp. 1019-1025
-
-
Duff, M.1
Barto, A.2
-
77
-
-
0000145537
-
Multistage Stochastic Programs - The State of the Art and Selected Bibliography
-
Dupacova, J. (1995), 'Multistage Stochastic Programs - The State of the Art and Selected Bibliography', Kybernetica 31, 151-174.
-
(1995)
Kybernetica
, vol.31
, pp. 151-174
-
-
Dupacova, J.1
-
78
-
-
77249163740
-
Dynamic pricing with a prior on market response
-
Farias, V. & Van Roy, B. (2010), 'Dynamic pricing with a prior on market response', Operations Research 58(1), 16-29.
-
(2010)
Operations Research
, vol.58
, Issue.1
, pp. 16-29
-
-
Farias, V.1
Van Roy, B.2
-
80
-
-
78651309095
-
Paradoxes in Learning and the Marginal Value of Information
-
Frazier, P. I. & Powell, W. B. (2010), 'Paradoxes in Learning and the Marginal Value of Information', Decision Analysis 7(4), 378-403.
-
(2010)
Decision Analysis
, vol.7
, Issue.4
, pp. 378-403
-
-
Frazier, P.I.1
Powell, W.B.2
-
81
-
-
79952951436
-
Consistency of Sequential Bayesian Sampling Policies
-
Frazier, P. I. & Powell, W. B. (2011), 'Consistency of Sequential Bayesian Sampling Policies', SI AM Journal on Control and Optimization 49(2), 712-731.
-
(2011)
SI AM Journal on Control and Optimization
, vol.49
, Issue.2
, pp. 712-731
-
-
Frazier, P.I.1
Powell, W.B.2
-
82
-
-
55549135706
-
A Knowledge Gradient Policy for Sequential Information Collection
-
Frazier, P. I., Powell, W. B. & Dayanik, S. (2008), 'A Knowledge Gradient Policy for Sequential Information Collection', SI AM Journal on Control and Optimization 47(5), 2410-2439.
-
(2008)
SI AM Journal on Control and Optimization
, vol.47
, Issue.5
, pp. 2410-2439
-
-
Frazier, P.I.1
Powell, W.B.2
Dayanik, S.3
-
83
-
-
70449498873
-
The Knowledge-Gradient Policy for Correlated Normal Beliefs
-
Frazier, P. I., Powell, W. B. & Dayanik, S. (2009), 'The Knowledge-Gradient Policy for Correlated Normal Beliefs', INFORMS Journal on Computing 21(4), 599-613.
-
(2009)
INFORMS Journal on Computing
, vol.21
, Issue.4
, pp. 599-613
-
-
Frazier, P.I.1
Powell, W.B.2
Dayanik, S.3
-
84
-
-
0012260296
-
Optimization for simulation: Theory vs. practice
-
Fu, M. C. (2002), 'Optimization for simulation: Theory vs. practice', INFORMS Journal on Computing 14(3), 192-215.
-
(2002)
INFORMS Journal on Computing
, vol.14
, Issue.3
, pp. 192-215
-
-
Fu, M.C.1
-
85
-
-
33846679442
-
Simulation optimization: a review, new developments, and applications
-
M. E. Kuhl, N. M. Steiger, F. B. Armstrong & J. A. Joines, eds
-
Fu, M. C, Glover, F. & April, J. (2005), Simulation optimization: a review, new developments, and applications, in M. E. Kuhl, N. M. Steiger, F. B. Armstrong & J. A. Joines, eds,'Proceedings of the 2005 Winter Simulation Conference', pp. 83-95.
-
(2005)
Proceedings of the 2005 Winter Simulation Conference
, pp. 83-95
-
-
Fu, M.C.1
Glover, F.2
April, J.3
-
86
-
-
17744377782
-
Optimal computing budget allocation under correlated sampling
-
R. G. Ingalls, M. D. Rossetti, J. S. Smith & B. A. Peters, eds
-
Fu, M. C, Hu, J. Q., Chen, C.-H. & Xiong, X. (2004), Optimal computing budget allocation under correlated sampling, in R. G. Ingalls, M. D. Rossetti, J. S. Smith & B. A. Peters, eds,'Proceedings of the 2004 Winter Simulation Conference', pp. 595-603.
-
(2004)
Proceedings of the 2004 Winter Simulation Conference
, pp. 595-603
-
-
Fu, M.C.1
Hu, J.Q.2
Chen, C.-H.3
Xiong, X.4
-
87
-
-
0028480132
-
Optimal dynamic pricing of inventories with stochastic demand over finite horizons
-
Gallego, G. & Van Ryzin, G. (1994), 'Optimal dynamic pricing of inventories with stochastic demand over finite horizons', Management Science 40(8), 999-1020.
-
(1994)
Management Science
, vol.40
, Issue.8
, pp. 999-1020
-
-
Gallego, G.1
Van Ryzin, G.2
-
88
-
-
0004012196
-
Bayesian Data Analysis
-
2nd ed', Chapman & Hall, New York
-
Gelman, A., Carlin, J. B., Stern, H. S. & Rubin, D. B. (2004), 'Bayesian Data Analysis, 2nd ed', Chapman & Hall, New York p. 63.
-
(2004)
, pp. 63
-
-
Gelman, A.1
Carlin, J.B.2
Stern, H.S.3
Rubin, D.B.4
-
89
-
-
0001942829
-
Neural networks and the bias/variance dilemma
-
Geman, S., Bienenstock, E. & Doursat, R. (1992), 'Neural networks and the bias/variance dilemma', Neural computation 4(1), 1-58.
-
(1992)
Neural computation
, vol.4
, Issue.1
, pp. 1-58
-
-
Geman, S.1
Bienenstock, E.2
Doursat, R.3
-
90
-
-
27144549876
-
Sensitivity analysis in linear optimization: Invariant support set intervals
-
Ghaffari-Hadigheh, A. & Terlaky, T. (2006), 'Sensitivity analysis in linear optimization: Invariant support set intervals', European Journal of Operational Research 169(3), 1158-1175.
-
(2006)
European Journal of Operational Research
, vol.169
, Issue.3
, pp. 1158-1175
-
-
Ghaffari-Hadigheh, A.1
Terlaky, T.2
-
92
-
-
0000169010
-
Bandit processes and dynamic allocation indices
-
Gittins, J. C. (1979), 'Bandit processes and dynamic allocation indices', Journal of the Royal Statistical Society B41(2), 148-177.
-
(1979)
Journal of the Royal Statistical Society
, vol.B41
, Issue.2
, pp. 148-177
-
-
Gittins, J.C.1
-
93
-
-
84891584370
-
Multi-armed Bandit Allocation Indices
-
Wiley and Sons: New York
-
Gittins, J. C. (1989), 'Multi-armed Bandit Allocation Indices', Wiley and Sons: New York.
-
(1989)
-
-
Gittins, J.C.1
-
94
-
-
0002955623
-
A dynamic allocation index for the sequential design of experiments
-
J. Gani, ed., North Holland, Amsterdam
-
Gittins, J. C. & Jones, D. M. (1974), A dynamic allocation index for the sequential design of experiments, in J. Gani, ed.,'Progress in statistics', North Holland, Amsterdam, pp. 241-266.
-
(1974)
Progress in statistics
, pp. 241-266
-
-
Gittins, J.C.1
Jones, D.M.2
-
95
-
-
84891584370
-
-
John Wiley & Sons, New York
-
Gittins, J. C, Glazebrook, K. D. & Weber, R. R. (2011), Multi-Armed Bandit Allocation Indices, John Wiley & Sons, New York.
-
(2011)
Multi-Armed Bandit Allocation Indices
-
-
Gittins, J.C.1
Glazebrook, K.D.2
Weber, R.R.3
-
97
-
-
0002232604
-
On the evaluation of suboptimal strategies for families of alternative bandit processes
-
Glazebrook, K. D. (1982), 'On the evaluation of suboptimal strategies for families of alternative bandit processes', Journal of Applied Probability 19(3), 716-722.
-
(1982)
Journal of Applied Probability
, vol.19
, Issue.3
, pp. 716-722
-
-
Glazebrook, K.D.1
-
98
-
-
67649922844
-
A Generalized Gittins Index for a Class of Multiarmed Bandits with General Resource Requirements
-
Glazebrook, K. D. & Minty, R. (2009), 'A Generalized Gittins Index for a Class of Multiarmed Bandits with General Resource Requirements', Mathematics of Operations Research.
-
(2009)
Mathematics of Operations Research
-
-
Glazebrook, K.D.1
Minty, R.2
-
99
-
-
0024561585
-
A comparative analysis of selection schemes used in genetic algorithms
-
G. Rawlings, ed., Morgan Kaufmann Publishers, San Mateo, CA
-
Goldberg, D. E. & Deb, K. (1991), A comparative analysis of selection schemes used in genetic algorithms, in G. Rawlings, ed.,'Foundations of genetic algorithms', Morgan Kaufmann Publishers, San Mateo, CA, pp. 69-93.
-
(1991)
Foundations of genetic algorithms
, pp. 69-93
-
-
Goldberg, D.E.1
Deb, K.2
-
101
-
-
84859489788
-
Optimization under unknown constraints
-
Arxiv preprint arXiv:1004.4027
-
Gramacy, R. B. & Lee, H. K. H. (2011), 'Optimization under unknown constraints', Arxiv preprint arXiv:1004.4027.
-
(2011)
-
-
Gramacy, R.B.1
Lee, H.K.H.2
-
102
-
-
0000511415
-
Bayesian look ahead one stage sampling allocations for selecting the largest normal mean
-
Gupta, S. S. & Miescke, K. J. (1994), 'Bayesian look ahead one stage sampling allocations for selecting the largest normal mean', Statistical Papers 35,169-177.
-
(1994)
Statistical Papers
, vol.35
, pp. 169-177
-
-
Gupta, S.S.1
Miescke, K.J.2
-
103
-
-
0030590294
-
Bayesian look ahead one-stage sampling allocations for selection of the best population
-
Gupta, S. S. & Miescke, K. J. (1996), 'Bayesian look ahead one-stage sampling allocations for selection of the best population', Journal of Statistical Planning and Inference 54, 229-244.
-
(1996)
Journal of Statistical Planning and Inference
, vol.54
, pp. 229-244
-
-
Gupta, S.S.1
Miescke, K.J.2
-
104
-
-
84866344056
-
Bayesian dynamic pricing policies: Learning and earning under a binary prior distribution
-
Technical report, Working paper, Columbia and Stanford University
-
Harrison, J., Keskin, N. & Zeevi, A. (2010), Bayesian dynamic pricing policies: Learning and earning under a binary prior distribution, Technical report, Working paper, Columbia and Stanford University.
-
(2010)
-
-
Harrison, J.1
Keskin, N.2
Zeevi, A.3
-
105
-
-
0003684449
-
-
Springer, New York
-
Hastie, T., Tibshirani, R. & Friedman, J. (2009), The elements of statistical learning: data mining, inference and prediction, Springer, New York.
-
(2009)
The elements of statistical learning: data mining, inference and prediction
-
-
Hastie, T.1
Tibshirani, R.2
Friedman, J.3
-
106
-
-
0003684449
-
-
Springer, New York
-
Hastie, T., Tibshirani, R., Friedman, J. & Franklin, J. (2005), The elements of statistical learning: data mining, inference and prediction, Vol. 27, Springer, New York.
-
(2005)
The elements of statistical learning: data mining, inference and prediction
, vol.27
-
-
Hastie, T.1
Tibshirani, R.2
Friedman, J.3
Franklin, J.4
-
107
-
-
0035696433
-
A Genetic Algorithm and an Indifference-Zone Ranking and Selection Framework for Simulation Optimization
-
B. A. Peters, J. S. Smith, D. J. Medeiros & M. W. Rohrer, eds
-
Hedlund, H. E. & Mollaghasemi, M. (2001), A Genetic Algorithm and an Indifference-Zone Ranking and Selection Framework for Simulation Optimization, in B. A. Peters, J. S. Smith, D. J. Medeiros & M. W. Rohrer, eds,'Proceedings of the 2001 Winter Simulation Conference', pp. 417-421.
-
(2001)
Proceedings of the 2001 Winter Simulation Conference
, pp. 417-421
-
-
Hedlund, H.E.1
Mollaghasemi, M.2
-
108
-
-
33644525898
-
Discrete Optimization via Simulation Using COMPASS
-
Hong, L. J. & Nelson, B. L. (2006), 'Discrete Optimization via Simulation Using COMPASS', Operations Research 54(1), 115-129.
-
(2006)
Operations Research
, vol.54
, Issue.1
, pp. 115-129
-
-
Hong, L.J.1
Nelson, B.L.2
-
109
-
-
77951527013
-
A Brief Introduction To Optimization Via Simulation
-
M. Rosetti, R. Hill, B. Johansson, A. Dunkin & R. Ingalls, eds
-
Hong, L. J. & Nelson, B. L. (2009), A Brief Introduction To Optimization Via Simulation, in M. Rosetti, R. Hill, B. Johansson, A. Dunkin & R. Ingalls, eds,'Proceedings of the 2009 Winter Simulation Conference', pp. 75-85.
-
(2009)
Proceedings of the 2009 Winter Simulation Conference
, pp. 75-85
-
-
Hong, L.J.1
Nelson, B.L.2
-
110
-
-
39549108095
-
Ranking inequality: Applications of multivariate subset selection
-
Horrace, W., Marchand, J. & Smeeding, T. (2008), 'Ranking inequality: Applications of multivariate subset selection', Journal of Economic Inequality 6(1), 5-32.
-
(2008)
Journal of Economic Inequality
, vol.6
, Issue.1
, pp. 5-32
-
-
Horrace, W.1
Marchand, J.2
Smeeding, T.3
-
111
-
-
84939051589
-
Sequential transmission using noiseless feedback
-
Horstein, M. (1963), 'Sequential transmission using noiseless feedback', IEEE Trans-actions on Information Theory 9(3), 136-143.
-
(1963)
IEEE Trans-actions on Information Theory
, vol.9
, Issue.3
, pp. 136-143
-
-
Horstein, M.1
-
113
-
-
79951643127
-
An Approximate Annealing Search Algorithm to Global Optimization and its Connection to Stochastic Approximation
-
B. Johansson, S. Jain, J. Montoya-Torres, J. Hugan & E. Yücesan, eds
-
Hu, J. & Hu, P. (2010), An Approximate Annealing Search Algorithm to Global Optimization and its Connection to Stochastic Approximation, in B. Johansson, S. Jain, J. Montoya-Torres, J. Hugan & E. Yücesan, eds,'Proceedings of the 2010 Winter Simulation Conference', pp. 1223-1234.
-
(2010)
Proceedings of the 2010 Winter Simulation Conference
, pp. 1223-1234
-
-
Hu, J.1
Hu, P.2
-
114
-
-
79960273055
-
Annealing Adaptive Search, Cross-Entropy, and Stochastic Approximation in Global Optimization
-
to appear
-
Hu, J. & Hu, P. (2011), 'Annealing Adaptive Search, Cross-Entropy, and Stochastic Approximation in Global Optimization', Naval Research Logistics (to appear).
-
(2011)
Naval Research Logistics
-
-
Hu, J.1
Hu, P.2
-
115
-
-
84863275874
-
Discrete Optimization Via Approximate Annealing Adaptive Search With Stochastic Averaging
-
S. Jain, R. R. Creasey, J. Himmelspach, K. P. White & M. C. Fu, eds
-
Hu, J. & Wang, C. (2011), Discrete Optimization Via Approximate Annealing Adaptive Search With Stochastic Averaging, in S. Jain, R. R. Creasey, J. Himmelspach, K. P. White & M. C. Fu, eds,'Proceedings of the 2011 Winter Simulation Conference'.
-
(2011)
Proceedings of the 2011 Winter Simulation Conference
-
-
Hu, J.1
Wang, C.2
-
116
-
-
37249005626
-
A model reference adaptive search method for global optimization
-
Hu, J., Fu, M. & Marcus, S. (2007), 'A model reference adaptive search method for global optimization', Operations Research 55(3), 549-568.
-
(2007)
Operations Research
, vol.55
, Issue.3
, pp. 549-568
-
-
Hu, J.1
Fu, M.2
Marcus, S.3
-
117
-
-
33644791173
-
Global Optimization of Stochastic Black-Box Systems via Sequential Kriging Meta-Models
-
Huang, D., Allen, T. T., Notz, W. I. & Zeng, N. (2006), 'Global Optimization of Stochastic Black-Box Systems via Sequential Kriging Meta-Models', Journal of Global Optimization 34(3), 441-466.
-
(2006)
Journal of Global Optimization
, vol.34
, Issue.3
, pp. 441-466
-
-
Huang, D.1
Allen, T.T.2
Notz, W.I.3
Zeng, N.4
-
118
-
-
0012353958
-
An empirical evaluation of several methods to select the best system
-
381-4-07
-
Inoue, K., Chick, S. E. & Chen, C.-H. (1999), 'An empirical evaluation of several methods to select the best system', ACM Transactions on Modeling and Computer Simulation 9, 381-4-07.
-
(1999)
ACM Transactions on Modeling and Computer Simulation 9
-
-
Inoue, K.1
Chick, S.E.2
Chen, C.-H.3
-
119
-
-
0042685161
-
Bayesian parameter estimation via variational methods
-
Jaakkola, T. & Jordan, M. (2000), 'Bayesian parameter estimation via variational methods', Statistics and Computing 10(1), 25-37.
-
(2000)
Statistics and Computing
, vol.10
, Issue.1
, pp. 25-37
-
-
Jaakkola, T.1
Jordan, M.2
-
121
-
-
0000561424
-
Efficient global optimization of expensive black-box functions
-
Jones, D., Schonlau, M. & Welch, W. (1998), 'Efficient global optimization of expensive black-box functions', Journal of Global Optimization 13(4), 455-492.
-
(1998)
Journal of Global Optimization
, vol.13
, Issue.4
, pp. 455-492
-
-
Jones, D.1
Schonlau, M.2
Welch, W.3
-
123
-
-
0030305926
-
Pricing decisions under demand uncertainty: A Bayesian mixture model approach
-
Kalyanam, K. (1996), 'Pricing decisions under demand uncertainty: A Bayesian mixture model approach', Marketing Science 15(3), 207-221.
-
(1996)
Marketing Science
, vol.15
, Issue.3
, pp. 207-221
-
-
Kalyanam, K.1
-
124
-
-
0015207267
-
Discrete square root filtering: A survey of current techniques
-
Kaminski, P., Bryson, A. & Schmidt, S. (1971), 'Discrete square root filtering: A survey of current techniques', IEEE Transactions on Automatic Control 16(6), 727-736.
-
(1971)
IEEE Transactions on Automatic Control
, vol.16
, Issue.6
, pp. 727-736
-
-
Kaminski, P.1
Bryson, A.2
Schmidt, S.3
-
125
-
-
0023345261
-
The Multi-Armed Bandit Problem: Decomposition and Computation
-
Katehakis, M. & Veinott, A. (1987), 'The Multi-Armed Bandit Problem: Decomposition and Computation', Mathematics of Operations Research 12(2), 262-268.
-
(1987)
Mathematics of Operations Research
, vol.12
, Issue.2
, pp. 262-268
-
-
Katehakis, M.1
Veinott, A.2
-
126
-
-
0017707946
-
Application of the Free-Wilson Technique to Structurally Related Series of Homologues
-
Katz, R. & Ionescu, F. (1977), 'Application of the Free-Wilson Technique to Structurally Related Series of Homologues. Quantitative S tract ure-Activity Relations hip Studies of Narcotic Analgetics', 20(11), 1413-1419.
-
(1977)
Quantitative S tract ure-Activity Relations hip Studies of Narcotic Analgetics
, vol.20
, Issue.11
, pp. 1413-1419
-
-
Katz, R.1
Ionescu, F.2
-
127
-
-
0036832954
-
Near-optimal reinforcement learning in polynomial time.
-
Kearns, M. & Singh, S. (2002), 'Near-optimal reinforcement learning in polynomial time.', Machine Learning 49, 209-232.
-
(2002)
Machine Learning
, vol.49
, pp. 209-232
-
-
Kearns, M.1
Singh, S.2
-
128
-
-
0346963698
-
A fully sequential procedure for indifference-zone selection in simulation
-
Kim, S.-H. & Nelson, B. L. (2001), 'A fully sequential procedure for indifference-zone selection in simulation', ACM Trans. Model. Comput. Simul. 11, 251-273.
-
(2001)
ACM Trans. Model. Comput. Simul.
, vol.11
, pp. 251-273
-
-
Kim, S.-H.1
Nelson, B.L.2
-
129
-
-
33744788406
-
On the asymptotic validity of fully sequential selection procedures for steady-state simulation
-
Kim, S.-H. & Nelson, B. L. (2006), 'On the asymptotic validity of fully sequential selection procedures for steady-state simulation', Operations Research 54, 475-488.
-
(2006)
Operations Research
, vol.54
, pp. 475-488
-
-
Kim, S.-H.1
Nelson, B.L.2
-
130
-
-
38049011420
-
Nearly Tight Bounds for the Continuum-Armed Bandit Problem
-
L. Saul, Y. Weiss & L. Bottou, eds, MIT Press, Cambridge, MA
-
Kleinberg, R. (2004), Nearly Tight Bounds for the Continuum-Armed Bandit Problem, in L. Saul, Y. Weiss & L. Bottou, eds,'Advances in Neural Information Processing Systems', MIT Press, Cambridge, MA, pp. 697-704.
-
(2004)
Advances in Neural Information Processing Systems
, pp. 697-704
-
-
Kleinberg, R.1
-
131
-
-
77955660815
-
Regret bounds for sleeping experts and bandits
-
Kleinberg, R., Niculescu-Mizil, A. & Sharma, Y. (2010), 'Regret bounds for sleeping experts and bandits', Machine Learning 80(2), 245-272.
-
(2010)
Machine Learning
, vol.80
, Issue.2
, pp. 245-272
-
-
Kleinberg, R.1
Niculescu-Mizil, A.2
Sharma, Y.3
-
132
-
-
0022781718
-
Shortest paths in networks with exponentially distributed arc lengths
-
Kulkarni, V. (1986), 'Shortest paths in networks with exponentially distributed arc lengths', Networks 16, 255-274.
-
(1986)
Networks
, vol.16
, pp. 255-274
-
-
Kulkarni, V.1
-
134
-
-
0000854435
-
Adaptive treatment allocation and the multi-armed bandit problem
-
Lai, T. L. (1987), 'Adaptive treatment allocation and the multi-armed bandit problem', The Annals of Statistics 15(3), 1091-1114.
-
(1987)
The Annals of Statistics
, vol.15
, Issue.3
, pp. 1091-1114
-
-
Lai, T.L.1
-
135
-
-
0002899547
-
Asymptotically Efficient Adaptive Allocation Rules
-
Lai, T. L. & Robbins, H. (1985), 'Asymptotically Efficient Adaptive Allocation Rules', Advances in Applied Mathematics 6, 4-22.
-
(1985)
Advances in Applied Mathematics
, vol.6
, pp. 4-22
-
-
Lai, T.L.1
Robbins, H.2
-
136
-
-
0001388964
-
Continuous multi-armed bandits and multiparameter processes
-
Mandelbaum, A. (1987), 'Continuous multi-armed bandits and multiparameter processes', The Annals of Probability 15(4), 1527-1556.
-
(1987)
The Annals of Probability
, vol.15
, Issue.4
, pp. 1527-1556
-
-
Mandelbaum, A.1
-
138
-
-
0001036667
-
Two-stage multiple comparisons with the best for computer simulation
-
Matejcik, F. & Nelson, B. (1995), 'Two-stage multiple comparisons with the best for computer simulation', Operations Research 43(4), 633-640.
-
(1995)
Operations Research
, vol.43
, Issue.4
, pp. 633-640
-
-
Matejcik, F.1
Nelson, B.2
-
139
-
-
84949574365
-
-
U.S. Greenhouse Gas Abatement Mapping Initiative, Executive Report
-
McKinsey & Company (2007), 'Reducing U.S. Greenhouse Gas Emissions: How Much at What Cost?', U.S. Greenhouse Gas Abatement Mapping Initiative, Executive Report.
-
(2007)
-
-
McKinsey1
Company2
-
143
-
-
79961092747
-
The Knowledge-Gradient Algorithm for Sequencing Experiments in Drug Discovery
-
Negoescu, D. M., Frazier, P. I. & Powell, W. B. (2011), 'The Knowledge-Gradient Algorithm for Sequencing Experiments in Drug Discovery', INFORMS Journal on Computing 23(3), 346-363.
-
(2011)
INFORMS Journal on Computing
, vol.23
, Issue.3
, pp. 346-363
-
-
Negoescu, D.M.1
Frazier, P.I.2
Powell, W.B.3
-
144
-
-
0001338090
-
Using common random numbers for indifference-zone selection and multiple comparisons in simulation
-
Nelson, B. L. & Matejcik, F. J. (1995), 'Using common random numbers for indifference-zone selection and multiple comparisons in simulation', Management Science 41(12), 1935-1945.
-
(1995)
Management Science
, vol.41
, Issue.12
, pp. 1935-1945
-
-
Nelson, B.L.1
Matejcik, F.J.2
-
145
-
-
79955755016
-
Computing a Classic Index for Finite-Horizon Bandits
-
Nino-Mora, J. (2010), 'Computing a Classic Index for Finite-Horizon Bandits', INFORMS Journal on Computing 23(2), 254-267.
-
(2010)
INFORMS Journal on Computing
, vol.23
, Issue.2
, pp. 254-267
-
-
Nino-Mora, J.1
-
146
-
-
79952563147
-
The stochastic root-finding problem: Overview, solutions, and open questions
-
Pasupathy, R. & Kim, S. (2011), 'The stochastic root-finding problem: Overview, solutions, and open questions', ACM Transactions on Modeling and Computer Simulation 21(3), 19:1-19:23.
-
(2011)
ACM Transactions on Modeling and Computer Simulation
, vol.21
, Issue.3
, pp. 191-1923
-
-
Pasupathy, R.1
Kim, S.2
-
147
-
-
0000145493
-
A Sequential Procedure for Selecting the Population with the Largest Mean from k Normal Populations
-
Paulson, E. (1964), 'A Sequential Procedure for Selecting the Population with the Largest Mean from k Normal Populations', The Annals of Mathematical Statistics 35, 174-180.
-
(1964)
The Annals of Mathematical Statistics
, vol.35
, pp. 174-180
-
-
Paulson, E.1
-
150
-
-
84949764394
-
-
2nd. edn, John Wiley & Sons, Hoboken, NJ
-
Powell, W. B. (2011), Approximate Dynamic Programming: Solving the curses of dimensionality, 2nd. edn, John Wiley & Sons, Hoboken, NJ.
-
(2011)
Approximate Dynamic Programming: Solving the curses of dimensionality
-
-
Powell, W.B.1
-
152
-
-
0003029778
-
A Nonconcavity in the Value of Information
-
M. Boyer & R. Kihlstrom, eds, North-Holland, Amsterdam, chapter 3
-
Radner, R. & Stiglitz, J. (1984), A Nonconcavity in the Value of Information, in M. Boyer & R. Kihlstrom, eds,'Bayesian models in economic theory', Vol. 5, North-Holland, Amsterdam, chapter 3, pp. 33-52.
-
(1984)
Bayesian models in economic theory
, vol.5
, pp. 33-52
-
-
Radner, R.1
Stiglitz, J.2
-
154
-
-
0001552833
-
On two-stage selection procedures and related probability inequalities
-
Rinott, Y. (1978), 'On two-stage selection procedures and related probability inequalities', Communications in Statistics A7 pp. 799-811.
-
(1978)
Communications in Statistics A7
, pp. 799-811
-
-
Rinott, Y.1
-
155
-
-
0000016172
-
A stochastic approximation method
-
Robbins, H. & Monro, S. (1951), 'A stochastic approximation method', The Annals of Mathematical Statistics 22(3), 400-407.
-
(1951)
The Annals of Mathematical Statistics
, vol.22
, Issue.3
, pp. 400-407
-
-
Robbins, H.1
Monro, S.2
-
157
-
-
84974012804
-
Simulated annealing and adaptive search in global optimization
-
Romeijn, H. & Smith, R. (1994), 'Simulated annealing and adaptive search in global optimization', Probability in the Engineering and Informational Sciences 8(4), 571-590.
-
(1994)
Probability in the Engineering and Informational Sciences
, vol.8
, Issue.4
, pp. 571-590
-
-
Romeijn, H.1
Smith, R.2
-
158
-
-
0001058483
-
A two-armed bandit theory of market pricing
-
Rothschild, M. (1974), 'A two-armed bandit theory of market pricing', Journal of Economic Theory 9(2), 185-202.
-
(1974)
Journal of Economic Theory
, vol.9
, Issue.2
, pp. 185-202
-
-
Rothschild, M.1
-
160
-
-
77953111834
-
Linearly Parameterized Bandits
-
Rusmevichientong, R & Tsitsiklis, J. N. (2010), 'Linearly Parameterized Bandits', Mathematics of Operations Research 35(2), 395-411.
-
(2010)
Mathematics of Operations Research
, vol.35
, Issue.2
, pp. 395-411
-
-
Rusmevichientong, R.1
Tsitsiklis, J.N.2
-
161
-
-
77950479202
-
-
Elsevier, Amsterdam
-
Ruszczynski, A. & Shapiro, A. (2003), Handbooks in Operations Research and Man-agement Science: Stochastic Programming, Vol. 10, Elsevier, Amsterdam.
-
(2003)
Handbooks in Operations Research and Man-agement Science: Stochastic Programming
, vol.10
-
-
Ruszczynski, A.1
Shapiro, A.2
-
162
-
-
77951568757
-
A Monte-Carlo Knowledge Gradient Method For Learning Abatement Potential Of Emissions Reduction Technologies
-
M. D. Rossetti, R. R. Hill, B. Johansson, A. Dunkin & R. G. Ingalls, eds
-
Ryzhov, I. O. & Powell, W. B. (2009a), A Monte-Carlo Knowledge Gradient Method For Learning Abatement Potential Of Emissions Reduction Technologies, in M. D. Rossetti, R. R. Hill, B. Johansson, A. Dunkin & R. G. Ingalls, eds,'Proceedings of the 2009 Winter Simulation Conference'.
-
(2009)
Proceedings of the 2009 Winter Simulation Conference
-
-
Ryzhov, I.O.1
Powell, W.B.2
-
164
-
-
84949574366
-
Information collection for linear programs with unknown objective coefficients
-
Submitted for publication
-
Ryzhov, I. O. & Powell, W. B. (201 la),'Information collection for linear programs with unknown objective coefficients', Submitted for publication.
-
(2011)
-
-
Ryzhov, I.O.1
Powell, W.B.2
-
165
-
-
79952942276
-
Information Collection on a Graph
-
Ryzhov, I. O. & Powell, W. B. (2011b),'Information Collection on a Graph', Operations Research 59(1), 188-201.
-
(2011)
Operations Research
, vol.59
, Issue.1
, pp. 188-201
-
-
Ryzhov, I.O.1
Powell, W.B.2
-
167
-
-
84877933547
-
The knowledge gradient algorithm for a general class of online learning problems
-
to appear
-
Ryzhov, I. O., Powell, W. B. & Frazier, P. I. (2011), 'The knowledge gradient algorithm for a general class of online learning problems', Operations Research (to appear).
-
(2011)
Operations Research
-
-
Ryzhov, I.O.1
Powell, W.B.2
Frazier, P.I.3
-
168
-
-
79951586758
-
Optimal Learning of Transition Probabilities in the Two-Agent Newsvendor Problem
-
B. Johansson, S. Jain, J. Montoya-Torres, J. Hugan & E. Yucesan, eds
-
Ryzhov, I. O., Valdez-Vivas, M. R. & Powell, W B. (2010), Optimal Learning of Transition Probabilities in the Two-Agent Newsvendor Problem, in B. Johansson, S. Jain, J. Montoya-Torres, J. Hugan & E. Yucesan, eds,'Proceedings of the 2010 Winter Simulation Conference', pp. 1088-1098.
-
(2010)
Proceedings of the 2010 Winter Simulation Conference
, pp. 1088-1098
-
-
Ryzhov, I.O.1
Valdez-Vivas, M.R.2
Powell, W.B.3
-
169
-
-
84972517827
-
Design and analysis of computer experiments
-
Sacks, J., Welch, W, Mitchell, T. J. & Wynn, H. P. (1989), 'Design and analysis of computer experiments', Statistical Science 4(4), 409-423.
-
(1989)
Statistical Science
, vol.4
, Issue.4
, pp. 409-423
-
-
Sacks, J.1
Welch, W.2
Mitchell, T.J.3
Wynn, H.P.4
-
170
-
-
77249140907
-
Markov decision processes with imprecise transition probabilities
-
Satia, J. & Lave, R. (1973), 'Markov decision processes with imprecise transition probabilities', Operations Research 21(3), 755-763.
-
(1973)
Operations Research
, vol.21
, Issue.3
, pp. 755-763
-
-
Satia, J.1
Lave, R.2
-
171
-
-
80054748080
-
The Correlated Knowledge Gradient for Simulation Optimization of Continuous Parameters using Gaussian Process Regression
-
Scott, W. R., Frazier, P. I. & Powell, W. B. (2011), 'The Correlated Knowledge Gradient for Simulation Optimization of Continuous Parameters using Gaussian Process Regression', SI AM Journal on Optimization 21(3), 996-1026.
-
(2011)
SI AM Journal on Optimization
, vol.21
, Issue.3
, pp. 996-1026
-
-
Scott, W.R.1
Frazier, P.I.2
Powell, W.B.3
-
172
-
-
79951666282
-
Calibrating simulation models using the knowledge gradient with continuous parameters
-
B. Johansson, S. Jain, J. Montoya-Torres, J. Hugan & E. Yiicesan, eds
-
Scott, W. R., Powell, W. B. & Simão, H. P. (2010), Calibrating simulation models using the knowledge gradient with continuous parameters, in B. Johansson, S. Jain, J. Montoya-Torres, J. Hugan & E. Yiicesan, eds,'Proceedings of the 2010 Winter Simulation Conference', pp. 1099-1109.
-
(2010)
Proceedings of the 2010 Winter Simulation Conference
, pp. 1099-1109
-
-
Scott, W.R.1
Powell, W.B.2
Simão, H.P.3
-
173
-
-
68949137209
-
-
Computer Sciences Technical Report 1648, University of Wisconsin-Madison
-
Settles, B. (2009), Active learning literature survey, Computer Sciences Technical Report 1648, University of Wisconsin-Madison.
-
(2009)
Active learning literature survey
-
-
Settles, B.1
-
174
-
-
18544370594
-
-
Si, J., Barto, A. G., Powell, W. B. & Wunsch, D. (2005), Learning and Approximate Dynamic Programming.
-
(2005)
Learning and Approximate Dynamic Programming
-
-
Si, J.1
Barto, A.G.2
Powell, W.B.3
Wunsch, D.4
-
176
-
-
0033901602
-
Convergence results for single-step on-policy reinforcement-learning algorithms
-
Singh, S. P., Jaakkola, T., Szepesvari, C. & Littman, M. (2000), 'Convergence results for single-step on-policy reinforcement-learning algorithms', Machine Learning 38(3), 287-308.
-
(2000)
Machine Learning
, vol.38
, Issue.3
, pp. 287-308
-
-
Singh, S.P.1
Jaakkola, T.2
Szepesvari, C.3
Littman, M.4
-
177
-
-
34248179010
-
Probabilistic networks and network algorithms
-
M. Ball, T. Magnanti & C. Monma, eds, Networks', North-Holland Publishing, Amsterdam
-
Snyder, T. & Steele, J. (1995), Probabilistic networks and network algorithms, in M. Ball, T. Magnanti & C. Monma, eds,'Handbooks of Operations Research and Management Science, vol. 7: Networks', North-Holland Publishing, Amsterdam, pp. 401-424.
-
(1995)
Handbooks of Operations Research and Management Science
, vol.7
, pp. 401-424
-
-
Snyder, T.1
Steele, J.2
-
178
-
-
0013025914
-
-
John Wiley & Sons, Hoboken, NJ
-
Spall, J. C. (2003), Introduction to Stochastic Search and Optimization: Estimation, Simulation and Control, John Wiley & Sons, Hoboken, NJ.
-
(2003)
Introduction to Stochastic Search and Optimization: Estimation, Simulation and Control
-
-
Spall, J.C.1
-
179
-
-
77956501313
-
Gaussian process optimization in the bandit setting: No regret and experimental design
-
Srinivas, N., Krause, A., Kakade, S. M. & Seeger, M. (2010), Gaussian process optimization in the bandit setting: No regret and experimental design, in'Proceedings of the 27th International Conference on Machine Learning', pp. 1015-1022.
-
(2010)
Proceedings of the 27th International Conference on Machine Learning
, pp. 1015-1022
-
-
Srinivas, N.1
Krause, A.2
Kakade, S.M.3
Seeger, M.4
-
182
-
-
33750375100
-
A simple distribution-free approach to the max k-armed bandit problem
-
Lecture Notes in Computer Science
-
Streeter, M. & Smith, S. (2006), A simple distribution-free approach to the max k-armed bandit problem, in'Principles and Practice of Constraint Programming', Vol. 4204 of Lecture Notes in Computer Science, pp. 560-574.
-
(2006)
Principles and Practice of Constraint Programming
, vol.4204
, pp. 560-574
-
-
Streeter, M.1
Smith, S.2
-
183
-
-
0004007508
-
-
MIT Press, Cambridge, MA
-
Sutton, R. S. & Barto, A. G. (1998), Reinforcement Learning, Vol. 35, MIT Press, Cambridge, MA.
-
(1998)
Reinforcement Learning
, vol.35
-
-
Sutton, R.S.1
Barto, A.G.2
-
184
-
-
71149099079
-
Fast gradient-descent methods for temporal-difference learning with linear function approximation
-
Sutton, R. S., Maei, H. R., Precup, D., Bhatnagar, S., Silver, D., Szepesvari, C. & Wiewiora, E. (2009), Fast gradient-descent methods for temporal-difference learning with linear function approximation, in'Proceedings of the 26th International Conference on Machine Learning', pp. 993-1000.
-
(2009)
Proceedings of the 26th International Conference on Machine Learning
, pp. 993-1000
-
-
Sutton, R.S.1
Maei, H.R.2
Precup, D.3
Bhatnagar, S.4
Silver, D.5
Szepesvari, C.6
Wiewiora, E.7
-
185
-
-
84949574367
-
-
of Synthesis Lectures on Artificial Intelligence and Machine Learning, Morgan & Claypool Publishers
-
Szepesvári, C. (2010), Algorithms for reinforcement learning, Vol. 4 of Synthesis Lectures on Artificial Intelligence and Machine Learning, Morgan & Claypool Publishers.
-
(2010)
Algorithms for reinforcement learning
, vol.4
-
-
Szepesvári, C.1
-
187
-
-
0001046225
-
Practical Issues in Temporal Difference Learning
-
Tesauro, G. (1992), 'Practical Issues in Temporal Difference Learning', Machine Learning 8(3-4), 257-277.
-
(1992)
Machine Learning
, vol.8
, Issue.3-4
, pp. 257-277
-
-
Tesauro, G.1
-
188
-
-
2242471756
-
The optimal choice of a subset of a population
-
Vanderbei, R. J. (1980), 'The optimal choice of a subset of a population', Mathematics of Operations Research 5(4), 481-186.
-
(1980)
Mathematics of Operations Research
, vol.5
, Issue.4
, pp. 481-186
-
-
Vanderbei, R.J.1
-
191
-
-
0000193326
-
Optimum Character of the Sequential Probability Ratio Test
-
Wald, A. & Wolfowitz, J. (1948), 'Optimum Character of the Sequential Probability Ratio Test', The Annals of Mathematical Statistics 19, 326-339.
-
(1948)
The Annals of Mathematical Statistics
, vol.19
, pp. 326-339
-
-
Wald, A.1
Wolfowitz, J.2
-
192
-
-
0002327722
-
On an index policy for restless bandits
-
Weber, R. R. & Weiss, G. (1990), 'On an index policy for restless bandits', J. Appl. Prob. 27(3), 637-648.
-
(1990)
J. Appl. Prob.
, vol.27
, Issue.3
, pp. 637-648
-
-
Weber, R.R.1
Weiss, G.2
-
193
-
-
34547236489
-
Better May be Worse: Some Monotonicity Results and Paradoxes in Discrete Choice Under Uncertainty
-
Weibull, J. W., Mattsson, L.-G. & Voorneveld, M. (2007), 'Better May be Worse: Some Monotonicity Results and Paradoxes in Discrete Choice Under Uncertainty', Theory and Decision 63(2), 121-151.
-
(2007)
Theory and Decision
, vol.63
, Issue.2
, pp. 121-151
-
-
Weibull, J.W.1
Mattsson, L.-G.2
Voorneveld, M.3
-
195
-
-
0000248624
-
Multi-armed bandits and the Gittins index
-
Whittle, P. (1980), 'Multi-armed bandits and the Gittins index', Journal of the Royal Statistical Society B42(2), 143-149.
-
(1980)
Journal of the Royal Statistical Society
, vol.B42
, Issue.2
, pp. 143-149
-
-
Whittle, P.1
-
196
-
-
0001043843
-
Restless bandits: Activity Allocation in a Changing World
-
Whittle, P. (1988), 'Restless bandits: Activity Allocation in a Changing World', J. Appl. Prob. 25(1988), 287-298.
-
(1988)
J. Appl. Prob.
, vol.25
, Issue.1988
, pp. 287-298
-
-
Whittle, P.1
|