SCOPUS information search platform

Optimal Learning

2012, Pages 1-384

Optimal Learning

(2) Powell, Warren B. (a); Ryzhov, Ilya O. (b)

(a) Princeton University (United States)

(b) University of Maryland (United States)

Author keywords

[No Author keywords available]

Indexed keywords

MATHEMATICAL PROGRAMMING; MATLAB; TABLE LOOKUP;

ADAPTIVE LEARNING; CONTINUOUS MEASUREMENTS; OFF-LINE PROBLEMS; PROBABILITY AND STATISTICS; RANKING AND SELECTION; SIMULATION OPTIMIZATION; SPECIFIC LEARNING; WEBSITE FEATURES;

APPLICATION PROGRAMS;

EID: 84871543700
PISSN: None
EISSN: None
Source Type: Book
DOI: 10.1002/9781118309858
Document Type: Book

Times cited : (245)

References (196)

1
- 84898063697
- Competing in the dark: An efficient algorithm for bandit linear optimization
- Abernethy, J., Hazan, E. & Rakhlin, A. (2008), Competing in the dark: An efficient algorithm for bandit linear optimization, in 'Proceedings of the 21st Annual Conference on Learning Theory', pp. 263-274.
- (2008) Proceedings of the 21st Annual Conference on Learning Theory , pp. 263-274
- Abernethy, J.¹ Hazan, E.² Rakhlin, A.³

2
- 0041656119
- Optimal learning by experimentation
- Aghion, P., Bolton, P., Harris, C. & Jullien, B. (1991), 'Optimal learning by experimentation', The Review of Economic Studies 58, 621-654.
- (1991) The Review of Economic Studies , vol.58 , pp. 621-654
- Aghion, P.¹ Bolton, P.² Harris, C.³ Jullien, B.⁴

3
- 0000616723
- Sample mean based index policies with O(log n) regret for the multi-armed bandit problem
- Agrawal, R. (1995), 'Sample mean based index policies with O(log n) regret for the multi-armed bandit problem', Advances in Applied Probability 27(4), 1054-1078.
- (1995) Advances in Applied Probability , vol.27 , Issue.4 , pp. 1054-1078
- Agrawal, R.¹

4
- 0344182998
- A simulated annealing algorithm with constant temperature for discrete stochastic optimization
- Alrefaei, M. H. & Andradottir, S. (1999), 'A simulated annealing algorithm with constant temperature for discrete stochastic optimization', Management Science 45, 748-764.
- (1999) Management Science , vol.45 , pp. 748-764
- Alrefaei, M.H.¹ Andradottir, S.²

5
- 77956926259
- Adaptive random search for continuous simulation optimization
- Andradottir, S. & Prudius, A. A. (2010), 'Adaptive random search for continuous simulation optimization', Naval Research Logistics 57(6), 583-604.
- (2010) Naval Research Logistics , vol.57 , Issue.6 , pp. 583-604
- Andradottir, S.¹ Prudius, A.A.²

6
- 77951184800
- Stochastic kriging for simulation metamodeling
- Ankenman, B., Nelson, B. L. & Staum, J. (2010), 'Stochastic kriging for simulation metamodeling', Operations Research 58(2), 371-382.
- (2010) Operations Research , vol.58 , Issue.2 , pp. 371-382
- Ankenman, B.¹ Nelson, B.L.² Staum, J.³

7
- 70350228783
- Dynamic pricing for non-perishable products with demand learning
- Araman, V. & Caldentey, R. (2009), 'Dynamic pricing for non-perishable products with demand learning', Operations Research 57(5), 1169-1188.
- (2009) Operations Research , vol.57 , Issue.5 , pp. 1169-1188
- Araman, V.¹ Caldentey, R.²

8
- 84864970677
- Best arm identification in multi-armed bandits
- Audibert, J. Y., Bubeck, S. & Munos, R. (2010), Best arm identification in multi-armed bandits, in 'Proceedings of the 23rd Annual Conference on Learning Theory (COLT)', pp. 1-14.
- (2010) Proceedings of the 23rd Annual Conference on Learning Theory (COLT) , pp. 1-14
- Audibert, J.Y.¹ Bubeck, S.² Munos, R.³

9
- 0036568025
- Finite-time analysis of the multi-armed bandit problem
- Auer, P., Cesa-Bianchi, N. & Fischer, P. (2002), 'Finite-time analysis of the multi-armed bandit problem', Machine Learning 47(2), 235-256.
- (2002) Machine Learning , vol.47 , Issue.2 , pp. 235-256
- Auer, P.¹ Cesa-Bianchi, N.² Fischer, P.³

10
- 73549103329
- Near-optimal regret bounds for reinforcement learning
- D. Koller, Y. Bengio, D. Schuurmans, L. Bottou & A. Culotta, eds
- Auer, P., Jaksch, T. & Ortner, R. (2008), Near-optimal regret bounds for reinforcement learning, in D. Koller, Y. Bengio, D. Schuurmans, L. Bottou & A. Culotta, eds, 'Advances in Neural Information Processing Systems', Vol. 21, pp. 89-96.
- (2008) Advances in Neural Information Processing Systems , vol.21 , pp. 89-96
- Auer, P.¹ Jaksch, T.² Ortner, R.³

11
- 25844499294
- A partially observed Markov decision process for dynamic pricing
- Aviv, Y. & Pazgal, A. (2005), 'A partially observed Markov decision process for dynamic pricing', Management Science 51(9), 1400-1416.
- (2005) Management Science , vol.51 , Issue.9 , pp. 1400-1416
- Aviv, Y.¹ Pazgal, A.²

12
- 0011155531
- Optimality proof for the symmetric Fibonacci search technique
- Avriel, M. & Wilde, D. (1966), 'Optimality proof for the symmetric Fibonacci search technique', Fibonacci Quarterly 4, 265-269.
- (1966) Fibonacci Quarterly , vol.4 , pp. 265-269
- Avriel, M.¹ Wilde, D.²

13
- 0033078286
- Simulation optimization with qualitative variables and structural model changes: A genetic algorithm approach
- Azadivar, F. & Tompkins, G. (1999), 'Simulation optimization with qualitative variables and structural model changes: A genetic algorithm approach', European Journal of Operational Research 113, 169-182.
- (1999) European Journal of Operational Research , vol.113 , pp. 169-182
- Azadivar, F.¹ Tompkins, G.²

14
- 0034694502
- Facility Layout Optimization Using Simulation and Genetic Algorithms
- Azadivar, F. & Wang, J. (2000), 'Facility Layout Optimization Using Simulation and Genetic Algorithms', International Journal of Production Research 38(17), 4369-4383.
- (2000) International Journal of Production Research , vol.38 , Issue.17 , pp. 4369-4383
- Azadivar, F.¹ Wang, J.²

15
- 0003990308
- Prentice-Hall, Inc., Englewood Cliffs, N.J
- Banks, J., Nelson, B. L. & J. S. Carson, II (1996), Discrete-Event System Simulation, Prentice-Hall, Inc., Englewood Cliffs, N.J.
- (1996) Discrete-Event System Simulation
- Banks, J.¹ Nelson, B.L.² Carson, J.S.³

16
- 0000541772
- An introduction to ranking and selection procedures
- Barr, D. R. & Rizvi, M. H. (1966), 'An introduction to ranking and selection procedures', J. Amer. Statist. Assoc. 61(315), 640-646.
- (1966) J. Amer. Statist. Assoc. , vol.61 , Issue.315 , pp. 640-646
- Barr, D.R.¹ Rizvi, M.H.²

17
- 80555137396
- High-probability regret bounds for bandit online linear optimization
- Bartlett, P., Dani, V., Hayes, T., Kakade, S., Rakhlin, A. & Tewari, A. (2008), High-probability regret bounds for bandit online linear optimization, in 'Proceedings of the 21st Annual Conference on Learning Theory', pp. 335-342.
- (2008) Proceedings of the 21st Annual Conference on Learning Theory , pp. 335-342
- Bartlett, P.¹ Dani, V.² Hayes, T.³ Kakade, S.⁴ Rakhlin, A.⁵ Tewari, A.⁶

18
- 0002426110
- A Single-Sample Multiple Decision Procedure for Ranking Means of Normal Populations with known Variances
- Bechhofer, R. E. (1954), 'A Single-Sample Multiple Decision Procedure for Ranking Means of Normal Populations with known Variances', The Annals of Mathematical Statistics 25, 16-39.
- (1954) The Annals of Mathematical Statistics , vol.25 , pp. 16-39
- Bechhofer, R.E.¹

19
- 0004125919
- University of Chicago Press, Chicago
- Bechhofer, R. E., Kiefer, J. & Sobel, M. (1968), Sequential Identification and Ranking Procedures, University of Chicago Press, Chicago.
- (1968) Sequential Identification and Ranking Procedures
- Bechhofer, R.E.¹ Kiefer, J.² Sobel, M.³

20
- 0003513355
- J. Wiley & Sons, New York
- Bechhofer, R., Santner, T. & Goldsman, D. (1995), Design and Analysis of Experiments for Statistical Selection, Screening and Multiple Comparisons, J. Wiley & Sons, New York.
- (1995) Design and Analysis of Experiments for Statistical Selection, Screening and Multiple Comparisons
- Bechhofer, R.¹ Santner, T.² Goldsman, D.³

21
- 84938011869
- On adaptive control processes
- Bellman, R. & Kalaba, R. (1959), 'On adaptive control processes', IRE Trans. 4, 1-9.
- (1959) IRE Trans. , vol.4 , pp. 1-9
- Bellman, R.¹ Kalaba, R.²

22
- 84884052304
- Princeton University Press, Princeton NJ
- Ben-Tal, A., Ghaoui, L. E. & Nemirovski, A. (2009), Robust Optimization, Princeton University Press, Princeton NJ.
- (2009) Robust Optimization
- Ben-Tal, A.¹ Ghaoui, L.E.² Nemirovski, A.³

23
- 0003778897
- Springer-Verlag, New York
- Benveniste, A., Metivier, M. & Priouret, P. (1990), Adaptive Algorithms and Stochastic Approximations, Springer-Verlag, New York.
- (1990) Adaptive Algorithms and Stochastic Approximations
- Benveniste, A.¹ Metivier, M.² Priouret, P.³

24
- 0030352286
- Learning and strategic pricing
- Bergemann, D. & Välimäki, J. (1996), 'Learning and strategic pricing', Econometrica 64(5), 1125-1149.
- (1996) Econometrica , vol.64 , Issue.5 , pp. 1125-1149
- Bergemann, D.¹ Välimäki, J.²

25
- 0004218171
- Chapman and Hall, London
- Berry, D. A. & Fristedt, B. (1985), Bandit Problems, Chapman and Hall, London.
- (1985) Bandit Problems
- Berry, D.A.¹ Fristedt, B.²

26
- 0003487482
- Athena Scientific, Belmont, MA
- Bertsekas, D. P. & Tsitsiklis, J. N. (1996), Neuro-dynamic programming, Athena Scientific, Belmont, MA.
- (1996) Neuro-dynamic programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

27
- 0343441515
- Restless bandits, linear programming relaxations, and a primal-dual index heuristic
- Bertsimas, D. J. & Nino-Mora, J. (2000), 'Restless bandits, linear programming relaxations, and a primal-dual index heuristic', Operations Research 48(1), 80-90.
- (2000) Operations Research , vol.48 , Issue.1 , pp. 80-90
- Bertsimas, D.J.¹ Nino-Mora, J.²

28
- 58149250414
- Simulation optimization: applications in risk management
- Better, M., Glover, F. W., Kochenberger, G. & Wang, H. (2008), 'Simulation optimization: applications in risk management', International Journal of Information Technology and Decision Making 7(4), 571-587.
- (2008) International Journal of Information Technology and Decision Making , vol.7 , Issue.4 , pp. 571-587
- Better, M.¹ Glover, F.W.² Kochenberger, G.³ Wang, H.⁴

29
- 34249059079
- Robust optimization: A comprehensive survey
- Beyer, H. & Sendhoff, B. (2007), 'Robust optimization: A comprehensive survey', Computer Methods in Applied Mechanics and Engineering 196(33-34), 3190-3218.
- (2007) Computer Methods in Applied Mechanics and Engineering , vol.196 , Issue.33-34 , pp. 3190-3218
- Beyer, H.¹ Sendhoff, B.²

30
- 84917186888
- Routledge, London
- Birchler, U. & Butler, M. (2007), Information Economics, Routledge, London.
- (2007) Information Economics
- Birchler, U.¹ Butler, M.²

31
- 0002436850
- Tractable inference for complex stochastic processes
- Boyen, X. & Koller, D. (1998), Tractable inference for complex stochastic processes, in 'Proceedings of the 14th Conference on Uncertainty in Artificial Intelligence', pp. 33-42.
- (1998) Proceedings of the 14th Conference on Uncertainty in Artificial Intelligence , pp. 33-42
- Boyen, X.¹ Koller, D.²

32
- 0041965975
- R-max - a general polynomial time algorithm for near-optimal reinforcement learning
- Brafman, R. I. & Tennenholtz, M. (2003), 'R-max - a general polynomial time algorithm for near-optimal reinforcement learning', Journal of Machine Learning Research 3, 213-231.
- (2003) Journal of Machine Learning Research , vol.3 , pp. 213-231
- Brafman, R.I.¹ Tennenholtz, M.²

33
- 33745765950
- New developments in ranking and selection: an empirical comparison of the three main approaches
- M. Kuhl, N. Steiger, F. Armstrong & J. Joines, eds, IEEE, Inc., Piscataway, NJ
- Branke, J., Chick, S. E. & Schmidt, C. (2005), New developments in ranking and selection: an empirical comparison of the three main approaches, in M. Kuhl, N. Steiger, F. Armstrong & J. Joines, eds, 'Proc. 2005 Winter Simulation Conference', IEEE, Inc., Piscataway, NJ, pp. 708-717.
- (2005) Proc. 2005 Winter Simulation Conference , pp. 708-717
- Branke, J.¹ Chick, S.E.² Schmidt, C.³

34
- 38549097195
- Selecting a Selection Procedure
- Branke, J., Chick, S. E. & Schmidt, C. (2007), 'Selecting a Selection Procedure', Management Science 53, 1916-1932.
- (2007) Management Science , vol.53 , pp. 1916-1932
- Branke, J.¹ Chick, S.E.² Schmidt, C.³

35
- 0036334330
- Optimal learning and experimentation in bandit problems
- Brezzi, M. & Lai, T. L. (2002), 'Optimal learning and experimentation in bandit problems', Journal of Economic Dynamics and Control 27(1), 87-108.
- (2002) Journal of Economic Dynamics and Control , vol.27 , Issue.1 , pp. 87-108
- Brezzi, M.¹ Lai, T.L.²

36
- 84949547476
- Technical report, Cornell University
- Broder, J. & Rusmevichientong, P. (2010a), Dynamic pricing under a general parametric choice model, Technical report, Cornell University.
- (2010) Dynamic pricing under a general parametric choice model
- Broder, J.¹ Rusmevichientong, P.²

37
- 84949558069
- Technical report, Cornell University
- Broder, J. & Rusmevichientong, P. (2010b), Dynamic pricing under a logit choice model, Technical report, Cornell University.
- (2010) Dynamic pricing under a logit choice model
- Broder, J.¹ Rusmevichientong, P.²

38
- 0001596834
- A unified approach to a class of best choice problems with an unknown number of options
- Bruss, F. (1984), 'A unified approach to a class of best choice problems with an unknown number of options', The Annals of Probability 12(3), 882-889.
- (1984) The Annals of Probability , vol.12 , Issue.3 , pp. 882-889
- Bruss, F.¹

39
- 84862281056
- PhD thesis, Universite Lille
- Bubeck, S. (2010), Bandits Games and Clustering Foundations, PhD thesis, Universite Lille.
- (2010) Bandits Games and Clustering Foundations
- Bubeck, S.¹

40
- 79960128338
- X-Armed Bandits
- Bubeck, S., Munos, R., Stoltz, G. & Szepesvari, C. (2011), 'X-Armed Bandits', Journal of Machine Learning Research 12, 1655-1695.
- (2011) Journal of Machine Learning Research , vol.12 , pp. 1655-1695
- Bubeck, S.¹ Munos, R.² Stoltz, G.³ Szepesvari, C.⁴

41
- 84949574363
- Convergence rates of efficient global optimization algorithms
- Bull, A. D. (2011), 'Convergence rates of efficient global optimization algorithms', Submitted for publication.
- (2011) Submitted for publication
- Bull, A.D.¹

42
- 60749134420
- A comparative study of genetic algorithm components in simulation-based optimization
- S. Mason, R. Hill, L. Mönch, O. Rose, T. Jefferson & J. Fowler, eds
- Can, B., Beham, A. & Heavey, C. (2008), A comparative study of genetic algorithm components in simulation-based optimization, in S. Mason, R. Hill, L. Mönch, O. Rose, T. Jefferson & J. Fowler, eds, 'Proceedings of the 2008 Winter Simulation Conference', pp. 1829-1837.
- (2008) Proceedings of the 2008 Winter Simulation Conference , pp. 1829-1837
- Can, B.¹ Beham, A.² Heavey, C.³

43
- 0141815596
- Dynamic pricing and reinforcement learning
- Carvalho, A. & Puterman, M. (2003), Dynamic pricing and reinforcement learning, in 'Proceedings of the 2003 International Joint Conference on Neural Networks', Vol. 4, pp. 2916-2921.
- (2003) Proceedings of the 2003 International Joint Conference on Neural Networks , vol.4 , pp. 2916-2921
- Carvalho, A.¹ Puterman, M.²

44
- 34547314820
- Learning and pricing in an Internet environment with binomial demands
- Carvalho, A. & Puterman, M. (2005), 'Learning and pricing in an Internet environment with binomial demands', Journal of Revenue and Pricing Management 3(4), 320-336.
- (2005) Journal of Revenue and Pricing Management , vol.3 , Issue.4 , pp. 320-336
- Carvalho, A.¹ Puterman, M.²

45
- 1842266137
- Mathematical questions with their solutions, No. 4528
- Cayley, A. (1875), 'Mathematical questions with their solutions, No. 4528', Educational Times.
- (1875) Educational Times
- Cayley, A.¹

46
- 0036921026
- Another Look at the Radner-Stiglitz Nonconcavity in the Value of Information
- Chade, H. & Schlee, E. E. (2002), 'Another Look at the Radner-Stiglitz Nonconcavity in the Value of Information', Journal of Economic Theory 107(2), 421-452.
- (2002) Journal of Economic Theory , vol.107 , Issue.2 , pp. 421-452
- Chade, H.¹ Schlee, E.E.²

47
- 34547120053
- Springer, New York
- Chang, H. S., Fu, M. C., Hu, J. & Marcus, S. I. (2007), Simulation-Based Algorithms for Markov Decision Processes, Springer, New York.
- (2007) Simulation-Based Algorithms for Markov Decision Processes
- Chang, H.S.¹ Fu, M.C.² Hu, J.³ Marcus, S.I.⁴

48
- 27844503922
- Application of genetic algorithms in production and operations management: a review
- Chaudhry, S. & Luo, W. (2005), 'Application of genetic algorithms in production and operations management: a review', International Journal of Production Research 43(19), 4083-4101.
- (2005) International Journal of Production Research , vol.43 , Issue.19 , pp. 4083-4101
- Chaudhry, S.¹ Luo, W.²

49
- 85080610007
- World Scientific
- Chen, C.-H. & Lee, L. H. (2010), Stochastic Simulation Optimization: An Optimal Computing Budget Allocation, World Scientific.
- (2010) Stochastic Simulation Optimization: An Optimal Computing Budget Allocation
- Chen, C.-H.¹ Lee, L.H.²

50
- 60749111637
- Simulation and optimization
- P. Gray, Z.-L. Chen & S. Raghavan, eds
- Chen, C.-H., Fu, M. C. & Shi, L. (2008), Simulation and optimization, in P. Gray, Z.-L. Chen & S. Raghavan, eds, '2008 TutORials in Operations Research', pp. 247-260.
- (2008) 2008 TutORials in Operations Research , pp. 247-260
- Chen, C.-H.¹ Fu, M.C.² Shi, L.³

51
- 33845807891
- Efficient Dynamic Simulation Allocation in Ordinal Optimization
- Chen, C.-H., He, D. & Fu, M. C. (2006), 'Efficient Dynamic Simulation Allocation in Ordinal Optimization', IEEE Transactions Automatic Control 51, 2005-2009.
- (2006) IEEE Transactions Automatic Control , vol.51 , pp. 2005-2009
- Chen, C.-H.¹ He, D.² Fu, M.C.³

52
- 0034225544
- Simulation budget allocation for further enhancing the efficiency of ordinal optimization
- Chen, C.-H., Lin, J., Yücesan, E. & Chick, S. E. (2000), 'Simulation budget allocation for further enhancing the efficiency of ordinal optimization', Discrete Event Dynamic Systems 10(3), 251-270.
- (2000) Discrete Event Dynamic Systems , vol.10 , Issue.3 , pp. 251-270
- Chen, C.-H.¹ Lin, J.² Yücesan, E.³ Chick, S.E.⁴

53
- 84899413507
- Learning the Demand Curve in Posted-Price Digital Goods Auctions
- Chhabra, M. & Das, S. (2011), Learning the Demand Curve in Posted-Price Digital Goods Auctions, in 'Proceedings of the 10th International Conference on Autonomous Agents and Multi-Agent Systems', pp. 63-70.
- (2011) Proceedings of the 10th International Conference on Autonomous Agents and Multi-Agent Systems , pp. 63-70
- Chhabra, M.¹ Das, S.²

54
- 84949574364
- Dynamic pricing with non-conjugate Pareto priors, Technical report
- Rensselaer Polytechnic Institute
- Chhabra, M. & Das, S. (2012), Dynamic pricing with non-conjugate Pareto priors, Technical report, Rensselaer Polytechnic Institute.
- (2012)
- Chhabra, M.¹ Das, S.²

55
- 1642437901
- Expected opportunity cost guarantees and indifference zone selection procedures
- S. E. Chick, P. J. Sánchez, D. Ferrin & D. J. Morrice, eds
- Chick, S. E. (2003), Expected opportunity cost guarantees and indifference zone selection procedures, in S. E. Chick, P. J. Sánchez, D. Ferrin & D. J. Morrice, eds, 'Proceedings of the 2003 Winter Simulation Conference', pp. 465-473.
- (2003) Proceedings of the 2003 Winter Simulation Conference , pp. 465-473
- Chick, S.E.¹

56
- 77950462837
- Subjective Probability and Bayesian Methodology
- S. Henderson & B. Nelson, eds, North-Holland Publishing, Amsterdam
- Chick, S. E. (2006), Subjective Probability and Bayesian Methodology, in S. Henderson & B. Nelson, eds, 'Handbooks of Operations Research and Management Science, vol. 13: Simulation', North-Holland Publishing, Amsterdam, pp. 225-258.
- (2006) Handbooks of Operations Research and Management Science , vol.13 , pp. 225-258
- Chick, S.E.¹

57
- 67649990621
- Economic analysis of simulation selection problems
- Chick, S. E. & Gans, N. (2009), 'Economic analysis of simulation selection problems', Management Science 55(3), 421-437.
- (2009) Management Science , vol.55 , Issue.3 , pp. 421-437
- Chick, S.E.¹ Gans, N.²

58
- 0035460965
- New two-stage and sequential procedures for selecting the best simulated system
- Chick, S. E. & Inoue, K. (2001), 'New two-stage and sequential procedures for selecting the best simulated system', Operations Research 49(5), 732-743.
- (2001) Operations Research , vol.49 , Issue.5 , pp. 732-743
- Chick, S.E.¹ Inoue, K.²

59
- 27344432350
- Selection procedures with frequentist expected opportunity cost bounds
- Chick, S. E. & Wu, Y. (2005), 'Selection procedures with frequentist expected opportunity cost bounds', Operations Research 53(5), 867-878.
- (2005) Operations Research , vol.53 , Issue.5 , pp. 867-878
- Chick, S.E.¹ Wu, Y.²

60
- 77949359798
- Sequential Sampling to Myopically Maximize the Expected Value of Information
- Chick, S. E., Branke, J. & Schmidt, C. (2010), 'Sequential Sampling to Myopically Maximize the Expected Value of Information', INFORMS Journal on Computing 22(1), 71-80.
- (2010) INFORMS Journal on Computing , vol.22 , Issue.1 , pp. 71-80
- Chick, S.E.¹ Branke, J.² Schmidt, C.³

61
- 34548237750
- Opportunity cost and OCBA selection procedures in ordinal optimization for a fixed number of alternative systems
- Chick, S. E., He, D. H. & Chen, C.-H. (2007), 'Opportunity cost and OCBA selection procedures in ordinal optimization for a fixed number of alternative systems', IEEE Transactions on Systems Man and Cybernetics C37, 951-961.
- (2007) IEEE Transactions on Systems Man and Cybernetics , vol.C37 , pp. 951-961
- Chick, S.E.¹ He, D.H.² Chen, C.-H.³

62
- 0004139889
- WH Freeman
- Chvátal, V. (1983), Linear programming, WH Freeman.
- (1983) Linear programming
- Chvátal, V.¹

63
- 77954579628
- Springer
- Cinlar, E. (2011), Probability and Stochastics, Springer.
- (2011) Probability and Stochastics
- Cinlar, E.¹

64
- 34250348767
- Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration.
- Cohen, J. D., McClure, S. M. & Yu, A. J. (2007), 'Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration.', Philosophical transactions of the Royal Society of London B362(1481), 933-942.
- (2007) Philosophical transactions of the Royal Society of London , vol.B362 , Issue.1481 , pp. 933-942
- Cohen, J.D.¹ McClure, S.M.² Yu, A.J.³

65
- 0028424239
- Improving generalization with active learning
- Cohn, D., Atlas, L. & Ladner, R. (1994), 'Improving generalization with active learning', Machine Learning 15(2), 201-221.
- (1994) Machine Learning , vol.15 , Issue.2 , pp. 201-221
- Cohn, D.¹ Atlas, L.² Ladner, R.³

66
- 0029679131
- Active learning with statistical models
- Cohn, D., Ghahramani, Z. & Jordan, M. (1996), 'Active learning with statistical models', Journal of Artificial Intelligence Research 4, 129-145.
- (1996) Journal of Artificial Intelligence Research , vol.4 , pp. 129-145
- Cohn, D.¹ Ghahramani, Z.² Jordan, M.³

67
- 84900550689
- Markov decision processes with uncertain transition probabilities
- Technical Report 11, Operations Research Center, MIT
- Cozzolino, J., Gonzalez-Zubieta, R. & Miller, R. (1965), Markov decision processes with uncertain transition probabilities, Technical Report 11, Operations Research Center, MIT.
- (1965)
- Cozzolino, J.¹ Gonzalez-Zubieta, R.² Miller, R.³

68
- 84894196956
- Technical Report 002v2, Lawrence Berkeley National Laboratory
- Crooks, G. (2009), Logistic approximation to the logistic-normal integral, Technical Report 002v2, Lawrence Berkeley National Laboratory.
- (2009) Logistic approximation to the logistic-normal integral
- Crooks, G.¹

69
- 0003849948
- Van Nostrand Reinhold
- Davis, L. & Mitchell, M. (1991), Handbook of genetic algorithms, Van Nostrand Reinhold.
- (1991) Handbook of genetic algorithms
- Davis, L.¹ Mitchell, M.²

70
- 49349084239
- Index policies for discounted bandit problems with availability constraints
- Dayanik, S., Powell, W. B. & Yamazaki, K. (2008), 'Index policies for discounted bandit problems with availability constraints', Adv. in Appl. Probab 40, 377-400.
- (2008) Adv. in Appl. Probab , vol.40 , pp. 377-400
- Dayanik, S.¹ Powell, W.B.² Yamazaki, K.³

71
- 1142281527
- Model-based Bayesian Exploration
- Dearden, R., Friedman, N. & Andre, D. (1999), Model-based Bayesian Exploration, in 'Proceedings of the 15th Conference on Uncertainty in Artificial Intelligence', pp. 150-159.
- (1999) Proceedings of the 15th Conference on Uncertainty in Artificial Intelligence , pp. 150-159
- Dearden, R.¹ Friedman, N.² Andre, D.³

72
- 0031619316
- Bayesian Q-learning
- Dearden, R., Friedman, N. & Russell, S. (1998), Bayesian Q-learning, in 'Proceedings of the 15th National Conference on Artificial Intelligence', pp. 761-768.
- (1998) Proceedings of the 15th National Conference on Artificial Intelligence , pp. 761-768
- Dearden, R.¹ Friedman, N.² Russell, S.³

73
- 0003759417
- John Wiley and Sons
- DeGroot, M. H. (1970), Optimal Statistical Decisions, John Wiley and Sons.
- (1970) Optimal Statistical Decisions
- DeGroot, M.H.¹

74
- 36048940444
- A tight sufficient condition for Radner-Stiglitz non-concavity in the value of information
- Delara, M. & Gilotte, L. (2007), 'A tight sufficient condition for Radner-Stiglitz non-concavity in the value of information', Journal of Economic Theory 137(1), 696-708.
- (2007) Journal of Economic Theory , vol.137 , Issue.1 , pp. 696-708
- Delara, M.¹ Gilotte, L.²

75
- 1942421168
- Design for an optimal probe
- Duff, M. (2003), Design for an optimal probe, in 'Proceedings of the 20th International Conference on Machine Learning', pp. 131-138.
- (2003) Proceedings of the 20th International Conference on Machine Learning , pp. 131-138
- Duff, M.¹

76
- 16244388049
- Local bandit approximation for optimal learning problems
- M. Mozer, M. Jordan & T. Pesche, eds, Cambridge, MA: MIT Press
- Duff, M. & Barto, A. (1996), Local bandit approximation for optimal learning problems, in M. Mozer, M. Jordan & T. Pesche, eds, 'Advances in Neural Information Processing Systems', Vol. 9, Cambridge, MA: MIT Press, pp. 1019-1025.
- (1996) Advances in Neural Information Processing Systems , vol.9 , pp. 1019-1025
- Duff, M.¹ Barto, A.²

77
- 0000145537
- Multistage Stochastic Programs - The State of the Art and Selected Bibliography
- Dupacova, J. (1995), 'Multistage Stochastic Programs - The State of the Art and Selected Bibliography', Kybernetika 31, 151-174.
- (1995) Kybernetika , vol.31 , pp. 151-174
- Dupacova, J.¹

78
- 77249163740
- Dynamic pricing with a prior on market response
- Farias, V. & Van Roy, B. (2010), 'Dynamic pricing with a prior on market response', Operations Research 58(1), 16-29.
- (2010) Operations Research , vol.58 , Issue.1 , pp. 16-29
- Farias, V.¹ Van Roy, B.²

79
- 58549105531
- John Wiley and Sons
- Forrester, A. I. J., Sobester, A. & Keane, A. J. (2008), Engineering design via surrogate modelling: a practical guide, John Wiley and Sons.
- (2008) Engineering design via surrogate modelling: a practical guide
- Forrester, A.I.J.¹ Sobester, A.² Keane, A.J.³

80
- 78651309095
- Paradoxes in Learning and the Marginal Value of Information
- Frazier, P. I. & Powell, W. B. (2010), 'Paradoxes in Learning and the Marginal Value of Information', Decision Analysis 7(4), 378-403.
- (2010) Decision Analysis , vol.7 , Issue.4 , pp. 378-403
- Frazier, P.I.¹ Powell, W.B.²

81
- 79952951436
- Consistency of Sequential Bayesian Sampling Policies
- Frazier, P. I. & Powell, W. B. (2011), 'Consistency of Sequential Bayesian Sampling Policies', SIAM Journal on Control and Optimization 49(2), 712-731.
- (2011) SIAM Journal on Control and Optimization , vol.49 , Issue.2 , pp. 712-731
- Frazier, P.I.¹ Powell, W.B.²

82
- 55549135706
- A Knowledge Gradient Policy for Sequential Information Collection
- Frazier, P. I., Powell, W. B. & Dayanik, S. (2008), 'A Knowledge Gradient Policy for Sequential Information Collection', SIAM Journal on Control and Optimization 47(5), 2410-2439.
- (2008) SIAM Journal on Control and Optimization , vol.47 , Issue.5 , pp. 2410-2439
- Frazier, P.I.¹ Powell, W.B.² Dayanik, S.³

83
- 70449498873
- The Knowledge-Gradient Policy for Correlated Normal Beliefs
- Frazier, P. I., Powell, W. B. & Dayanik, S. (2009), 'The Knowledge-Gradient Policy for Correlated Normal Beliefs', INFORMS Journal on Computing 21(4), 599-613.
- (2009) INFORMS Journal on Computing , vol.21 , Issue.4 , pp. 599-613
- Frazier, P.I.¹ Powell, W.B.² Dayanik, S.³

84
- 0012260296
- Optimization for simulation: Theory vs. practice
- Fu, M. C. (2002), 'Optimization for simulation: Theory vs. practice', INFORMS Journal on Computing 14(3), 192-215.
- (2002) INFORMS Journal on Computing , vol.14 , Issue.3 , pp. 192-215
- Fu, M.C.¹

85
- 33846679442
- Simulation optimization: a review, new developments, and applications
- M. E. Kuhl, N. M. Steiger, F. B. Armstrong & J. A. Joines, eds
- Fu, M. C., Glover, F. & April, J. (2005), Simulation optimization: a review, new developments, and applications, in M. E. Kuhl, N. M. Steiger, F. B. Armstrong & J. A. Joines, eds, 'Proceedings of the 2005 Winter Simulation Conference', pp. 83-95.
- (2005) Proceedings of the 2005 Winter Simulation Conference , pp. 83-95
- Fu, M.C.¹ Glover, F.² April, J.³

86
- 17744377782
- Optimal computing budget allocation under correlated sampling
- R. G. Ingalls, M. D. Rossetti, J. S. Smith & B. A. Peters, eds
- Fu, M. C., Hu, J. Q., Chen, C.-H. & Xiong, X. (2004), Optimal computing budget allocation under correlated sampling, in R. G. Ingalls, M. D. Rossetti, J. S. Smith & B. A. Peters, eds, 'Proceedings of the 2004 Winter Simulation Conference', pp. 595-603.
- (2004) Proceedings of the 2004 Winter Simulation Conference , pp. 595-603
- Fu, M.C.¹ Hu, J.Q.² Chen, C.-H.³ Xiong, X.⁴

87
- 0028480132
- Optimal dynamic pricing of inventories with stochastic demand over finite horizons
- Gallego, G. & Van Ryzin, G. (1994), 'Optimal dynamic pricing of inventories with stochastic demand over finite horizons', Management Science 40(8), 999-1020.
- (1994) Management Science , vol.40 , Issue.8 , pp. 999-1020
- Gallego, G.¹ Van Ryzin, G.²

88
- 0004012196
- Bayesian Data Analysis
- 2nd ed., Chapman & Hall, New York
- Gelman, A., Carlin, J. B., Stern, H. S. & Rubin, D. B. (2004), 'Bayesian Data Analysis, 2nd ed', Chapman & Hall, New York, p. 63.
- (2004) , pp. 63
- Gelman, A.¹ Carlin, J.B.² Stern, H.S.³ Rubin, D.B.⁴

89
- 0001942829
- Neural networks and the bias/variance dilemma
- Geman, S., Bienenstock, E. & Doursat, R. (1992), 'Neural networks and the bias/variance dilemma', Neural computation 4(1), 1-58.
- (1992) Neural computation , vol.4 , Issue.1 , pp. 1-58
- Geman, S.¹ Bienenstock, E.² Doursat, R.³

90
- 27144549876
- Sensitivity analysis in linear optimization: Invariant support set intervals
- Ghaffari-Hadigheh, A. & Terlaky, T. (2006), 'Sensitivity analysis in linear optimization: Invariant support set intervals', European Journal of Operational Research 169(3), 1158-1175.
- (2006) European Journal of Operational Research , vol.169 , Issue.3 , pp. 1158-1175
- Ghaffari-Hadigheh, A.¹ Terlaky, T.²

91
- 0000532482
- Response Surface Bandits
- Ginebra, J. & Clayton, M. K. (1995), 'Response Surface Bandits', Journal of the Royal Statistical Society B57, 771-784.
- (1995) Journal of the Royal Statistical Society , vol.B57 , pp. 771-784
- Ginebra, J.¹ Clayton, M.K.²

92
- 0000169010
- Bandit processes and dynamic allocation indices
- Gittins, J. C. (1979), 'Bandit processes and dynamic allocation indices', Journal of the Royal Statistical Society B41(2), 148-177.
- (1979) Journal of the Royal Statistical Society , vol.B41 , Issue.2 , pp. 148-177
- Gittins, J.C.¹

93
- 84891584370
- Multi-armed Bandit Allocation Indices
- Wiley and Sons: New York
- Gittins, J. C. (1989), 'Multi-armed Bandit Allocation Indices', Wiley and Sons: New York.
- (1989)
- Gittins, J.C.¹

94
- 0002955623
- A dynamic allocation index for the sequential design of experiments
- J. Gani, ed., North Holland, Amsterdam
- Gittins, J. C. & Jones, D. M. (1974), A dynamic allocation index for the sequential design of experiments, in J. Gani, ed., 'Progress in statistics', North Holland, Amsterdam, pp. 241-266.
- (1974) Progress in statistics , pp. 241-266
- Gittins, J.C.¹ Jones, D.M.²

95
- 84891584370
- John Wiley & Sons, New York
- Gittins, J. C., Glazebrook, K. D. & Weber, R. R. (2011), Multi-Armed Bandit Allocation Indices, John Wiley & Sons, New York.
- (2011) Multi-Armed Bandit Allocation Indices
- Gittins, J.C.¹ Glazebrook, K.D.² Weber, R.R.³

96
- 1542342296
- Springer-Verlag, New York
- Glasserman, P. (2004), Monte Carlo Methods in Financial Engineering, Springer-Verlag, New York.
- (2004) Monte Carlo Methods in Financial Engineering
- Glasserman, P.¹

97
- 0002232604
- On the evaluation of suboptimal strategies for families of alternative bandit processes
- Glazebrook, K. D. (1982), 'On the evaluation of suboptimal strategies for families of alternative bandit processes', Journal of Applied Probability 19(3), 716-722.
- (1982) Journal of Applied Probability , vol.19 , Issue.3 , pp. 716-722
- Glazebrook, K.D.¹

98
- 67649922844
- A Generalized Gittins Index for a Class of Multiarmed Bandits with General Resource Requirements
- Glazebrook, K. D. & Minty, R. (2009), 'A Generalized Gittins Index for a Class of Multiarmed Bandits with General Resource Requirements', Mathematics of Operations Research.
- (2009) Mathematics of Operations Research
- Glazebrook, K.D.¹ Minty, R.²

99
- 0024561585
- A comparative analysis of selection schemes used in genetic algorithms
- G. Rawlings, ed., Morgan Kaufmann Publishers, San Mateo, CA
- Goldberg, D. E. & Deb, K. (1991), A comparative analysis of selection schemes used in genetic algorithms, in G. Rawlings, ed., 'Foundations of genetic algorithms', Morgan Kaufmann Publishers, San Mateo, CA, pp. 69-93.
- (1991) Foundations of genetic algorithms , pp. 69-93
- Goldberg, D.E.¹ Deb, K.²

100
- 84888630832
- Kluwer Academic Publishers, Norwell, MA
- Gosavi, A. (2003), Simulation-Based Optimization, Kluwer Academic Publishers, Norwell, MA.
- (2003) Simulation-Based Optimization
- Gosavi, A.¹

101
- 84859489788
- Optimization under unknown constraints
- Arxiv preprint arXiv:1004.4027
- Gramacy, R. B. & Lee, H. K. H. (2011), 'Optimization under unknown constraints', Arxiv preprint arXiv:1004.4027.
- (2011)
- Gramacy, R.B.¹ Lee, H.K.H.²

102
- 0000511415
- Bayesian look ahead one stage sampling allocations for selecting the largest normal mean
- Gupta, S. S. & Miescke, K. J. (1994), 'Bayesian look ahead one stage sampling allocations for selecting the largest normal mean', Statistical Papers 35, 169-177.
- (1994) Statistical Papers , vol.35 , pp. 169-177
- Gupta, S.S.¹ Miescke, K.J.²

103
- 0030590294
- Bayesian look ahead one-stage sampling allocations for selection of the best population
- Gupta, S. S. & Miescke, K. J. (1996), 'Bayesian look ahead one-stage sampling allocations for selection of the best population', Journal of Statistical Planning and Inference 54, 229-244.
- (1996) Journal of Statistical Planning and Inference , vol.54 , pp. 229-244
- Gupta, S.S.¹ Miescke, K.J.²

104
- 84866344056
- Bayesian dynamic pricing policies: Learning and earning under a binary prior distribution
- Technical report, Working paper, Columbia and Stanford University
- Harrison, J., Keskin, N. & Zeevi, A. (2010), Bayesian dynamic pricing policies: Learning and earning under a binary prior distribution, Technical report, Working paper, Columbia and Stanford University.
- (2010)
- Harrison, J.¹ Keskin, N.² Zeevi, A.³

105
- 0003684449
- Springer, New York
- Hastie, T., Tibshirani, R. & Friedman, J. (2009), The elements of statistical learning: data mining, inference and prediction, Springer, New York.
- (2009) The elements of statistical learning: data mining, inference and prediction
- Hastie, T.¹ Tibshirani, R.² Friedman, J.³

106
- 0003684449
- Springer, New York
- Hastie, T., Tibshirani, R., Friedman, J. & Franklin, J. (2005), The elements of statistical learning: data mining, inference and prediction, Vol. 27, Springer, New York.
- (2005) The elements of statistical learning: data mining, inference and prediction , vol.27
- Hastie, T.¹ Tibshirani, R.² Friedman, J.³ Franklin, J.⁴

107
- 0035696433
- A Genetic Algorithm and an Indifference-Zone Ranking and Selection Framework for Simulation Optimization
- B. A. Peters, J. S. Smith, D. J. Medeiros & M. W. Rohrer, eds
- Hedlund, H. E. & Mollaghasemi, M. (2001), A Genetic Algorithm and an Indifference-Zone Ranking and Selection Framework for Simulation Optimization, in B. A. Peters, J. S. Smith, D. J. Medeiros & M. W. Rohrer, eds, 'Proceedings of the 2001 Winter Simulation Conference', pp. 417-421.
- (2001) Proceedings of the 2001 Winter Simulation Conference , pp. 417-421
- Hedlund, H.E.¹ Mollaghasemi, M.²

108
- 33644525898
- Discrete Optimization via Simulation Using COMPASS
- Hong, L. J. & Nelson, B. L. (2006), 'Discrete Optimization via Simulation Using COMPASS', Operations Research 54(1), 115-129.
- (2006) Operations Research , vol.54 , Issue.1 , pp. 115-129
- Hong, L.J.¹ Nelson, B.L.²

109
- 77951527013
- A Brief Introduction To Optimization Via Simulation
- M. Rosetti, R. Hill, B. Johansson, A. Dunkin & R. Ingalls, eds
- Hong, L. J. & Nelson, B. L. (2009), A Brief Introduction To Optimization Via Simulation, in M. Rosetti, R. Hill, B. Johansson, A. Dunkin & R. Ingalls, eds, 'Proceedings of the 2009 Winter Simulation Conference', pp. 75-85.
- (2009) Proceedings of the 2009 Winter Simulation Conference , pp. 75-85
- Hong, L.J.¹ Nelson, B.L.²

110
- 39549108095
- Ranking inequality: Applications of multivariate subset selection
- Horrace, W., Marchand, J. & Smeeding, T. (2008), 'Ranking inequality: Applications of multivariate subset selection', Journal of Economic Inequality 6(1), 5-32.
- (2008) Journal of Economic Inequality , vol.6 , Issue.1 , pp. 5-32
- Horrace, W.¹ Marchand, J.² Smeeding, T.³

111
- 84939051589
- Sequential transmission using noiseless feedback
- Horstein, M. (1963), 'Sequential transmission using noiseless feedback', IEEE Transactions on Information Theory 9(3), 136-143.
- (1963) IEEE Transactions on Information Theory , vol.9 , Issue.3 , pp. 136-143
- Horstein, M.¹

112
- 84939003870
- Information value theory
- Howard, R. A. (1966), 'Information value theory', IEEE Transactions on systems science and cybernetics 2(1), 22-26.
- (1966) IEEE Transactions on systems science and cybernetics , vol.2 , Issue.1 , pp. 22-26
- Howard, R.A.¹

113
- 79951643127
- An Approximate Annealing Search Algorithm to Global Optimization and its Connection to Stochastic Approximation
- B. Johansson, S. Jain, J. Montoya-Torres, J. Hugan & E. Yücesan, eds
- Hu, J. & Hu, P. (2010), An Approximate Annealing Search Algorithm to Global Optimization and its Connection to Stochastic Approximation, in B. Johansson, S. Jain, J. Montoya-Torres, J. Hugan & E. Yücesan, eds, 'Proceedings of the 2010 Winter Simulation Conference', pp. 1223-1234.
- (2010) Proceedings of the 2010 Winter Simulation Conference , pp. 1223-1234
- Hu, J.¹ Hu, P.²

114
- 79960273055
- Annealing Adaptive Search, Cross-Entropy, and Stochastic Approximation in Global Optimization
- to appear
- Hu, J. & Hu, P. (2011), 'Annealing Adaptive Search, Cross-Entropy, and Stochastic Approximation in Global Optimization', Naval Research Logistics (to appear).
- (2011) Naval Research Logistics
- Hu, J.¹ Hu, P.²

115
- 84863275874
- Discrete Optimization Via Approximate Annealing Adaptive Search With Stochastic Averaging
- S. Jain, R. R. Creasey, J. Himmelspach, K. P. White & M. C. Fu, eds
- Hu, J. & Wang, C. (2011), Discrete Optimization Via Approximate Annealing Adaptive Search With Stochastic Averaging, in S. Jain, R. R. Creasey, J. Himmelspach, K. P. White & M. C. Fu, eds, 'Proceedings of the 2011 Winter Simulation Conference'.
- (2011) Proceedings of the 2011 Winter Simulation Conference
- Hu, J.¹ Wang, C.²

116
- 37249005626
- A model reference adaptive search method for global optimization
- Hu, J., Fu, M. & Marcus, S. (2007), 'A model reference adaptive search method for global optimization', Operations Research 55(3), 549-568.
- (2007) Operations Research , vol.55 , Issue.3 , pp. 549-568
- Hu, J.¹ Fu, M.² Marcus, S.³

117
- 33644791173
- Global Optimization of Stochastic Black-Box Systems via Sequential Kriging Meta-Models
- Huang, D., Allen, T. T., Notz, W. I. & Zeng, N. (2006), 'Global Optimization of Stochastic Black-Box Systems via Sequential Kriging Meta-Models', Journal of Global Optimization 34(3), 441-466.
- (2006) Journal of Global Optimization , vol.34 , Issue.3 , pp. 441-466
- Huang, D.¹ Allen, T.T.² Notz, W.I.³ Zeng, N.⁴

118
- 0012353958
- An empirical evaluation of several methods to select the best system
- 381-407
- Inoue, K., Chick, S. E. & Chen, C.-H. (1999), 'An empirical evaluation of several methods to select the best system', ACM Transactions on Modeling and Computer Simulation 9, 381-407.
- (1999) ACM Transactions on Modeling and Computer Simulation , vol.9 , pp. 381-407
- Inoue, K.¹ Chick, S.E.² Chen, C.-H.³

119
- 0042685161
- Bayesian parameter estimation via variational methods
- Jaakkola, T. & Jordan, M. (2000), 'Bayesian parameter estimation via variational methods', Statistics and Computing 10(1), 25-37.
- (2000) Statistics and Computing , vol.10 , Issue.1 , pp. 25-37
- Jaakkola, T.¹ Jordan, M.²

120
- 84858041260
- Questions with noise: Bayes optimal policies for entropy loss
- to appear
- Jedynak, B., Frazier, P. I. & Sznitman, R. (2011), 'Questions with noise: Bayes optimal policies for entropy loss', Journal of Applied Probability (to appear).
- (2011) Journal of Applied Probability
- Jedynak, B.¹ Frazier, P.I.² Sznitman, R.³

121
- 0000561424
- Efficient global optimization of expensive black-box functions
- Jones, D., Schonlau, M. & Welch, W. (1998), 'Efficient global optimization of expensive black-box functions', Journal of Global Optimization 13(4), 455-492.
- (1998) Journal of Global Optimization , vol.13 , Issue.4 , pp. 455-492
- Jones, D.¹ Schonlau, M.² Welch, W.³

122
- 0004280606
- MIT Press, Cambridge, MA
- Kaelbling, L. P. (1993), Learning in embedded systems, MIT Press, Cambridge, MA.
- (1993) Learning in embedded systems
- Kaelbling, L.P.¹

123
- 0030305926
- Pricing decisions under demand uncertainty: A Bayesian mixture model approach
- Kalyanam, K. (1996), 'Pricing decisions under demand uncertainty: A Bayesian mixture model approach', Marketing Science 15(3), 207-221.
- (1996) Marketing Science , vol.15 , Issue.3 , pp. 207-221
- Kalyanam, K.¹

124
- 0015207267
- Discrete square root filtering: A survey of current techniques
- Kaminski, P., Bryson, A. & Schmidt, S. (1971), 'Discrete square root filtering: A survey of current techniques', IEEE Transactions on Automatic Control 16(6), 727-736.
- (1971) IEEE Transactions on Automatic Control , vol.16 , Issue.6 , pp. 727-736
- Kaminski, P.¹ Bryson, A.² Schmidt, S.³

125
- 0023345261
- The Multi-Armed Bandit Problem: Decomposition and Computation
- Katehakis, M. & Veinott, A. (1987), 'The Multi-Armed Bandit Problem: Decomposition and Computation', Mathematics of Operations Research 12(2), 262-268.
- (1987) Mathematics of Operations Research , vol.12 , Issue.2 , pp. 262-268
- Katehakis, M.¹ Veinott, A.²

126
- 0017707946
- Application of the Free-Wilson Technique to Structurally Related Series of Homologues
- Katz, R. & Ionescu, F. (1977), 'Application of the Free-Wilson Technique to Structurally Related Series of Homologues. Quantitative Structure-Activity Relationship Studies of Narcotic Analgetics', 20(11), 1413-1419.
- (1977) Quantitative Structure-Activity Relationship Studies of Narcotic Analgetics , vol.20 , Issue.11 , pp. 1413-1419
- Katz, R.¹ Ionescu, F.²

127
- 0036832954
- Near-optimal reinforcement learning in polynomial time.
- Kearns, M. & Singh, S. (2002), 'Near-optimal reinforcement learning in polynomial time.', Machine Learning 49, 209-232.
- (2002) Machine Learning , vol.49 , pp. 209-232
- Kearns, M.¹ Singh, S.²

128
- 0346963698
- A fully sequential procedure for indifference-zone selection in simulation
- Kim, S.-H. & Nelson, B. L. (2001), 'A fully sequential procedure for indifference-zone selection in simulation', ACM Trans. Model. Comput. Simul. 11, 251-273.
- (2001) ACM Trans. Model. Comput. Simul. , vol.11 , pp. 251-273
- Kim, S.-H.¹ Nelson, B.L.²

129
- 33744788406
- On the asymptotic validity of fully sequential selection procedures for steady-state simulation
- Kim, S.-H. & Nelson, B. L. (2006), 'On the asymptotic validity of fully sequential selection procedures for steady-state simulation', Operations Research 54, 475-488.
- (2006) Operations Research , vol.54 , pp. 475-488
- Kim, S.-H.¹ Nelson, B.L.²

130
- 38049011420
- Nearly Tight Bounds for the Continuum-Armed Bandit Problem
- L. Saul, Y. Weiss & L. Bottou, eds, MIT Press, Cambridge, MA
- Kleinberg, R. (2004), Nearly Tight Bounds for the Continuum-Armed Bandit Problem, in L. Saul, Y. Weiss & L. Bottou, eds, 'Advances in Neural Information Processing Systems', MIT Press, Cambridge, MA, pp. 697-704.
- (2004) Advances in Neural Information Processing Systems , pp. 697-704
- Kleinberg, R.¹

131
- 77955660815
- Regret bounds for sleeping experts and bandits
- Kleinberg, R., Niculescu-Mizil, A. & Sharma, Y. (2010), 'Regret bounds for sleeping experts and bandits', Machine Learning 80(2), 245-272.
- (2010) Machine Learning , vol.80 , Issue.2 , pp. 245-272
- Kleinberg, R.¹ Niculescu-Mizil, A.² Sharma, Y.³

132
- 0022781718
- Shortest paths in networks with exponentially distributed arc lengths
- Kulkarni, V. (1986), 'Shortest paths in networks with exponentially distributed arc lengths', Networks 16, 255-274.
- (1986) Networks , vol.16 , pp. 255-274
- Kulkarni, V.¹

133
- 9944258743
- Springer
- Kushner, H. J. & Yin, G. G. (2003), Stochastic Approximation and Recursive Algorithms and Applications, Springer.
- (2003) Stochastic Approximation and Recursive Algorithms and Applications
- Kushner, H.J.¹ Yin, G.G.²

134
- 0000854435
- Adaptive treatment allocation and the multi-armed bandit problem
- Lai, T. L. (1987), 'Adaptive treatment allocation and the multi-armed bandit problem', The Annals of Statistics 15(3), 1091-1114.
- (1987) The Annals of Statistics , vol.15 , Issue.3 , pp. 1091-1114
- Lai, T.L.¹

135
- 0002899547
- Asymptotically Efficient Adaptive Allocation Rules
- Lai, T. L. & Robbins, H. (1985), 'Asymptotically Efficient Adaptive Allocation Rules', Advances in Applied Mathematics 6, 4-22.
- (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
- Lai, T.L.¹ Robbins, H.²

136
- 0001388964
- Continuous multi-armed bandits and multiparameter processes
- Mandelbaum, A. (1987), 'Continuous multi-armed bandits and multiparameter processes', The Annals of Probability 15(4), 1527-1556.
- (1987) The Annals of Probability , vol.15 , Issue.4 , pp. 1527-1556
- Mandelbaum, A.¹

137
- 0040157510
- John Wiley and Sons
- Martin, J. (1967), Bayesian Decision Problems and Markov Chains, John Wiley and Sons.
- (1967) Bayesian Decision Problems and Markov Chains
- Martin, J.¹

138
- 0001036667
- Two-stage multiple comparisons with the best for computer simulation
- Matejcik, F. & Nelson, B. (1995), 'Two-stage multiple comparisons with the best for computer simulation', Operations Research 43(4), 633-640.
- (1995) Operations Research , vol.43 , Issue.4 , pp. 633-640
- Matejcik, F.¹ Nelson, B.²

139
- 84949574365
- U.S. Greenhouse Gas Abatement Mapping Initiative, Executive Report
- McKinsey & Company (2007), 'Reducing U.S. Greenhouse Gas Emissions: How Much at What Cost?', U.S. Greenhouse Gas Abatement Mapping Initiative, Executive Report.
- (2007)
- McKinsey & Company¹

140
- 0003971926
- CRC Press
- Miller, A. (2002), Subset selection in regression, CRC Press.
- (2002) Subset selection in regression
- Miller, A.¹

141
- 0038387331
- PhD thesis, Massachusetts Institute of Technology
- Minka, T. (2001), A family of algorithms for approximate Bayesian inference, PhD thesis, Massachusetts Institute of Technology.
- (2001) A family of algorithms for approximate Bayesian inference
- Minka, T.¹

142
- 0003486756
- John Wiley and Sons
- Montgomery, D. C. (2008), Design and analysis of experiments (7th ed.), John Wiley and Sons.
- (2008) Design and analysis of experiments (7th ed.)
- Montgomery, D.C.¹

143
- 79961092747
- The Knowledge-Gradient Algorithm for Sequencing Experiments in Drug Discovery
- Negoescu, D. M., Frazier, P. I. & Powell, W. B. (2011), 'The Knowledge-Gradient Algorithm for Sequencing Experiments in Drug Discovery', INFORMS Journal on Computing 23(3), 346-363.
- (2011) INFORMS Journal on Computing , vol.23 , Issue.3 , pp. 346-363
- Negoescu, D.M.¹ Frazier, P.I.² Powell, W.B.³

144
- 0001338090
- Using common random numbers for indifference-zone selection and multiple comparisons in simulation
- Nelson, B. L. & Matejcik, F. J. (1995), 'Using common random numbers for indifference-zone selection and multiple comparisons in simulation', Management Science 41(12), 1935-1945.
- (1995) Management Science , vol.41 , Issue.12 , pp. 1935-1945
- Nelson, B.L.¹ Matejcik, F.J.²

145
- 79955755016
- Computing a Classic Index for Finite-Horizon Bandits
- Nino-Mora, J. (2010), 'Computing a Classic Index for Finite-Horizon Bandits', INFORMS Journal on Computing 23(2), 254-267.
- (2010) INFORMS Journal on Computing , vol.23 , Issue.2 , pp. 254-267
- Nino-Mora, J.¹

146
- 79952563147
- The stochastic root-finding problem: Overview, solutions, and open questions
- Pasupathy, R. & Kim, S. (2011), 'The stochastic root-finding problem: Overview, solutions, and open questions', ACM Transactions on Modeling and Computer Simulation 21(3), 19:1-19:23.
- (2011) ACM Transactions on Modeling and Computer Simulation , vol.21 , Issue.3 , pp. 19:1-19:23
- Pasupathy, R.¹ Kim, S.²

147
- 0000145493
- A Sequential Procedure for Selecting the Population with the Largest Mean from k Normal Populations
- Paulson, E. (1964), 'A Sequential Procedure for Selecting the Population with the Largest Mean from k Normal Populations', The Annals of Mathematical Statistics 35, 174-180.
- (1964) The Annals of Mathematical Statistics , vol.35 , pp. 174-180
- Paulson, E.¹

148
- 34248185608
- Finding the shortest path in stochastic networks
- Peer, S. & Sharma, D. (2007), 'Finding the shortest path in stochastic networks', Computers and Mathematics with Applications 53, 729-740.
- (2007) Computers and Mathematics with Applications , vol.53 , pp. 729-740
- Peer, S.¹ Sharma, D.²

149
- 47349092417
- John Wiley & Sons, Hoboken, NJ
- Powell, W. B. (2007), Approximate Dynamic Programming: Solving the curses of dimensionality, John Wiley & Sons, Hoboken, NJ.
- (2007) Approximate Dynamic Programming: Solving the curses of dimensionality
- Powell, W.B.¹

150
- 84949764394
- 2nd. edn, John Wiley & Sons, Hoboken, NJ
- Powell, W. B. (2011), Approximate Dynamic Programming: Solving the curses of dimensionality, 2nd. edn, John Wiley & Sons, Hoboken, NJ.
- (2011) Approximate Dynamic Programming: Solving the curses of dimensionality
- Powell, W.B.¹

151
- 0003998452
- 1st edn, John Wiley and Sons, Hoboken
- Puterman, M. L. (1994), Markov Decision Processes, 1st edn, John Wiley and Sons, Hoboken.
- (1994) Markov Decision Processes
- Puterman, M.L.¹

152
- 0003029778
- A Nonconcavity in the Value of Information
- M. Boyer & R. Kihlstrom, eds, North-Holland, Amsterdam, chapter 3
- Radner, R. & Stiglitz, J. (1984), A Nonconcavity in the Value of Information, in M. Boyer & R. Kihlstrom, eds, 'Bayesian models in economic theory', Vol. 5, North-Holland, Amsterdam, chapter 3, pp. 33-52.
- (1984) Bayesian models in economic theory , vol.5 , pp. 33-52
- Radner, R.¹ Stiglitz, J.²

153
- 84856651677
- Gaussian Processes for Machine Learning
- Rasmussen, C. E. & Williams, C. K. I. (2006), 'Gaussian Processes for Machine Learning', The MIT Press.
- (2006) The MIT Press
- Rasmussen, C.E.¹ Williams, C.K.I.²

154
- 0001552833
- On two-stage selection procedures and related probability inequalities
- Rinott, Y. (1978), 'On two-stage selection procedures and related probability inequalities', Communications in Statistics A7 pp. 799-811.
- (1978) Communications in Statistics A7 , pp. 799-811
- Rinott, Y.¹

155
- 0000016172
- A stochastic approximation method
- Robbins, H. & Monro, S. (1951), 'A stochastic approximation method', The Annals of Mathematical Statistics 22(3), 400-407.
- (1951) The Annals of Mathematical Statistics , vol.22 , Issue.3 , pp. 400-407
- Robbins, H.¹ Monro, S.²

156
- 0003919677
- Springer-Verlag, New York
- Robert, C. P. & Casella, G. (2004), Monte Carlo Statistical Methods, Springer-Verlag, New York.
- (2004) Monte Carlo Statistical Methods
- Robert, C.P.¹ Casella, G.²

157
- 84974012804
- Simulated annealing and adaptive search in global optimization
- Romeijn, H. & Smith, R. (1994), 'Simulated annealing and adaptive search in global optimization', Probability in the Engineering and Informational Sciences 8(4), 571-590.
- (1994) Probability in the Engineering and Informational Sciences , vol.8 , Issue.4 , pp. 571-590
- Romeijn, H.¹ Smith, R.²

158
- 0001058483
- A two-armed bandit theory of market pricing
- Rothschild, M. (1974), 'A two-armed bandit theory of market pricing', Journal of Economic Theory 9(2), 185-202.
- (1974) Journal of Economic Theory , vol.9 , Issue.2 , pp. 185-202
- Rothschild, M.¹

159
- 0004080531
- Wiley-Interscience, New York
- Rubinstein, R. Y. & Kroese, D. P. (2008), Simulation and the Monte Carlo Method, Wiley-Interscience, New York.
- (2008) Simulation and the Monte Carlo Method
- Rubinstein, R.Y.¹ Kroese, D.P.²

160
- 77953111834
- Linearly Parameterized Bandits
- Rusmevichientong, P. & Tsitsiklis, J. N. (2010), 'Linearly Parameterized Bandits', Mathematics of Operations Research 35(2), 395-411.
- (2010) Mathematics of Operations Research , vol.35 , Issue.2 , pp. 395-411
- Rusmevichientong, P.¹ Tsitsiklis, J.N.²

161
- 77950479202
- Elsevier, Amsterdam
- Ruszczynski, A. & Shapiro, A. (2003), Handbooks in Operations Research and Management Science: Stochastic Programming, Vol. 10, Elsevier, Amsterdam.
- (2003) Handbooks in Operations Research and Management Science: Stochastic Programming , vol.10
- Ruszczynski, A.¹ Shapiro, A.²

162
- 77951568757
- A Monte-Carlo Knowledge Gradient Method For Learning Abatement Potential Of Emissions Reduction Technologies
- M. D. Rossetti, R. R. Hill, B. Johansson, A. Dunkin & R. G. Ingalls, eds
- Ryzhov, I. O. & Powell, W. B. (2009a), A Monte-Carlo Knowledge Gradient Method For Learning Abatement Potential Of Emissions Reduction Technologies, in M. D. Rossetti, R. R. Hill, B. Johansson, A. Dunkin & R. G. Ingalls, eds, 'Proceedings of the 2009 Winter Simulation Conference'.
- (2009) Proceedings of the 2009 Winter Simulation Conference
- Ryzhov, I.O.¹ Powell, W.B.²

163
- 67650505320
- The knowledge gradient algorithm for online subset selection
- Ryzhov, I. O. & Powell, W. B. (2009b), The knowledge gradient algorithm for online subset selection, in 'Proceedings of the 2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning', pp. 137-144.
- (2009) Proceedings of the 2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning , pp. 137-144
- Ryzhov, I.O.¹ Powell, W.B.²

164
- 84949574366
- Information collection for linear programs with unknown objective coefficients
- Submitted for publication
- Ryzhov, I. O. & Powell, W. B. (2011a), 'Information collection for linear programs with unknown objective coefficients', Submitted for publication.
- (2011)
- Ryzhov, I.O.¹ Powell, W.B.²

165
- 79952942276
- Information Collection on a Graph
- Ryzhov, I. O. & Powell, W. B. (2011b), 'Information Collection on a Graph', Operations Research 59(1), 188-201.
- (2011) Operations Research , vol.59 , Issue.1 , pp. 188-201
- Ryzhov, I.O.¹ Powell, W.B.²

166
- 79958262308
- The value of information in multi-armed bandits with exponentially distributed rewards
- Ryzhov, I. O. & Powell, W. B. (2011c), The value of information in multi-armed bandits with exponentially distributed rewards, in 'Proceedings of the 2011 International Conference on Computational Science', pp. 1363-1372.
- (2011) Proceedings of the 2011 International Conference on Computational Science , pp. 1363-1372
- Ryzhov, I.O.¹ Powell, W.B.²

167
- 84877933547
- The knowledge gradient algorithm for a general class of online learning problems
- to appear
- Ryzhov, I. O., Powell, W. B. & Frazier, P. I. (2011), 'The knowledge gradient algorithm for a general class of online learning problems', Operations Research (to appear).
- (2011) Operations Research
- Ryzhov, I.O.¹ Powell, W.B.² Frazier, P.I.³

168
- 79951586758
- Optimal Learning of Transition Probabilities in the Two-Agent Newsvendor Problem
- B. Johansson, S. Jain, J. Montoya-Torres, J. Hugan & E. Yücesan, eds
- Ryzhov, I. O., Valdez-Vivas, M. R. & Powell, W. B. (2010), Optimal Learning of Transition Probabilities in the Two-Agent Newsvendor Problem, in B. Johansson, S. Jain, J. Montoya-Torres, J. Hugan & E. Yücesan, eds, 'Proceedings of the 2010 Winter Simulation Conference', pp. 1088-1098.
- (2010) Proceedings of the 2010 Winter Simulation Conference , pp. 1088-1098
- Ryzhov, I.O.¹ Valdez-Vivas, M.R.² Powell, W.B.³

169
- 84972517827
- Design and analysis of computer experiments
- Sacks, J., Welch, W., Mitchell, T. J. & Wynn, H. P. (1989), 'Design and analysis of computer experiments', Statistical Science 4(4), 409-423.
- (1989) Statistical Science , vol.4 , Issue.4 , pp. 409-423
- Sacks, J.¹ Welch, W.² Mitchell, T.J.³ Wynn, H.P.⁴

170
- 77249140907
- Markov decision processes with imprecise transition probabilities
- Satia, J. & Lave, R. (1973), 'Markov decision processes with imprecise transition probabilities', Operations Research 21(3), 755-763.
- (1973) Operations Research , vol.21 , Issue.3 , pp. 755-763
- Satia, J.¹ Lave, R.²

171
- 80054748080
- The Correlated Knowledge Gradient for Simulation Optimization of Continuous Parameters using Gaussian Process Regression
- Scott, W. R., Frazier, P. I. & Powell, W. B. (2011), 'The Correlated Knowledge Gradient for Simulation Optimization of Continuous Parameters using Gaussian Process Regression', SIAM Journal on Optimization 21(3), 996-1026.
- (2011) SIAM Journal on Optimization , vol.21 , Issue.3 , pp. 996-1026
- Scott, W.R.¹ Frazier, P.I.² Powell, W.B.³

172
- 79951666282
- Calibrating simulation models using the knowledge gradient with continuous parameters
- B. Johansson, S. Jain, J. Montoya-Torres, J. Hugan & E. Yücesan, eds
- Scott, W. R., Powell, W. B. & Simão, H. P. (2010), Calibrating simulation models using the knowledge gradient with continuous parameters, in B. Johansson, S. Jain, J. Montoya-Torres, J. Hugan & E. Yücesan, eds, 'Proceedings of the 2010 Winter Simulation Conference', pp. 1099-1109.
- (2010) Proceedings of the 2010 Winter Simulation Conference , pp. 1099-1109
- Scott, W.R.¹ Powell, W.B.² Simão, H.P.³

173
- 68949137209
- Computer Sciences Technical Report 1648, University of Wisconsin-Madison
- Settles, B. (2009), Active learning literature survey, Computer Sciences Technical Report 1648, University of Wisconsin-Madison.
- (2009) Active learning literature survey
- Settles, B.¹

174
- 18544370594
- Si, J., Barto, A. G., Powell, W. B. & Wunsch, D. (2005), Learning and Approximate Dynamic Programming.
- (2005) Learning and Approximate Dynamic Programming
- Si, J.¹ Barto, A.G.² Powell, W.B.³ Wunsch, D.⁴

175
- 34547984629
- Technical Report 1, Operations Research Center, MIT
- Silver, E. (1963), Markovian decision processes with uncertain transition probabilities or rewards, Technical Report 1, Operations Research Center, MIT.
- (1963) Markovian decision processes with uncertain transition probabilities or rewards
- Silver, E.¹

176
- 0033901602
- Convergence results for single-step on-policy reinforcement-learning algorithms
- Singh, S. P., Jaakkola, T., Szepesvari, C. & Littman, M. (2000), 'Convergence results for single-step on-policy reinforcement-learning algorithms', Machine Learning 38(3), 287-308.
- (2000) Machine Learning , vol.38 , Issue.3 , pp. 287-308
- Singh, S.P.¹ Jaakkola, T.² Szepesvari, C.³ Littman, M.⁴

177
- 34248179010
- Probabilistic networks and network algorithms
- M. Ball, T. Magnanti & C. Monma, eds, 'Networks', North-Holland Publishing, Amsterdam
- Snyder, T. & Steele, J. (1995), Probabilistic networks and network algorithms, in M. Ball, T. Magnanti & C. Monma, eds, 'Handbooks of Operations Research and Management Science, vol. 7: Networks', North-Holland Publishing, Amsterdam, pp. 401-424.
- (1995) Handbooks of Operations Research and Management Science , vol.7 , pp. 401-424
- Snyder, T.¹ Steele, J.²

178
- 0013025914
- John Wiley & Sons, Hoboken, NJ
- Spall, J. C. (2003), Introduction to Stochastic Search and Optimization: Estimation, Simulation and Control, John Wiley & Sons, Hoboken, NJ.
- (2003) Introduction to Stochastic Search and Optimization: Estimation, Simulation and Control
- Spall, J.C.¹

179
- 77956501313
- Gaussian process optimization in the bandit setting: No regret and experimental design
- Srinivas, N., Krause, A., Kakade, S. M. & Seeger, M. (2010), Gaussian process optimization in the bandit setting: No regret and experimental design, in 'Proceedings of the 27th International Conference on Machine Learning', pp. 1015-1022.
- (2010) Proceedings of the 27th International Conference on Machine Learning , pp. 1015-1022
- Srinivas, N.¹ Krause, A.² Kakade, S.M.³ Seeger, M.⁴

180
- 0004307223
- Springer, New York
- Steele, J. M. (2000), Stochastic Calculus and Financial Applications, Springer, New York.
- (2000) Stochastic Calculus and Financial Applications
- Steele, J.M.¹

181
- 0003693218
- Springer Verlag
- Stein, M. (1999), Interpolation of Spatial Data: Some theory for kriging, Springer Verlag.
- (1999) Interpolation of Spatial Data: Some theory for kriging
- Stein, M.¹

182
- 33750375100
- A simple distribution-free approach to the max k-armed bandit problem
- Lecture Notes in Computer Science
- Streeter, M. & Smith, S. (2006), A simple distribution-free approach to the max k-armed bandit problem, in 'Principles and Practice of Constraint Programming', Vol. 4204 of Lecture Notes in Computer Science, pp. 560-574.
- (2006) Principles and Practice of Constraint Programming , vol.4204 , pp. 560-574
- Streeter, M.¹ Smith, S.²

183
- 0004007508
- MIT Press, Cambridge, MA
- Sutton, R. S. & Barto, A. G. (1998), Reinforcement Learning, Vol. 35, MIT Press, Cambridge, MA.
- (1998) Reinforcement Learning , vol.35
- Sutton, R.S.¹ Barto, A.G.²

184
- 71149099079
- Fast gradient-descent methods for temporal-difference learning with linear function approximation
- Sutton, R. S., Maei, H. R., Precup, D., Bhatnagar, S., Silver, D., Szepesvari, C. & Wiewiora, E. (2009), Fast gradient-descent methods for temporal-difference learning with linear function approximation, in 'Proceedings of the 26th International Conference on Machine Learning', pp. 993-1000.
- (2009) Proceedings of the 26th International Conference on Machine Learning , pp. 993-1000
- Sutton, R.S.¹ Maei, H.R.² Precup, D.³ Bhatnagar, S.⁴ Silver, D.⁵ Szepesvari, C.⁶ Wiewiora, E.⁷

185
- 84949574367
- of Synthesis Lectures on Artificial Intelligence and Machine Learning, Morgan & Claypool Publishers
- Szepesvári, C. (2010), Algorithms for reinforcement learning, Vol. 4 of Synthesis Lectures on Artificial Intelligence and Machine Learning, Morgan & Claypool Publishers.
- (2010) Algorithms for reinforcement learning , vol.4
- Szepesvári, C.¹

186
- 3142657664
- Path kernels and multiplicative updates
- Takimoto, E. & Warmuth, M. K. (2003), 'Path kernels and multiplicative updates', Journal of Machine Learning Research 4, 773-818.
- (2003) Journal of Machine Learning Research , vol.4 , pp. 773-818
- Takimoto, E.¹ Warmuth, M.K.²

187
- 0001046225
- Practical Issues in Temporal Difference Learning
- Tesauro, G. (1992), 'Practical Issues in Temporal Difference Learning', Machine Learning 8(3-4), 257-277.
- (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 257-277
- Tesauro, G.¹

188
- 2242471756
- The optimal choice of a subset of a population
- Vanderbei, R. J. (1980), 'The optimal choice of a subset of a population', Mathematics of Operations Research 5(4), 481-486.
- (1980) Mathematics of Operations Research , vol.5 , Issue.4 , pp. 481-486
- Vanderbei, R.J.¹

189
- 0003782186
- (3rd ed.), Springer
- Vanderbei, R. J. (2008), Linear programming: foundations and extensions (3rd ed.), Springer.
- (2008) Linear programming: foundations and extensions
- Vanderbei, R.J.¹

190
- 33646406807
- Multi-armed bandit algorithms and empirical evaluation
- Vermorel, J. & Mohri, M. (2005), 'Multi-armed bandit algorithms and empirical evaluation', Proceedings of the 16th European Conference on Machine Learning pp. 437-448.
- (2005) Proceedings of the 16th European Conference on Machine Learning , pp. 437-448
- Vermorel, J.¹ Mohri, M.²

191
- 0000193326
- Optimum Character of the Sequential Probability Ratio Test
- Wald, A. & Wolfowitz, J. (1948), 'Optimum Character of the Sequential Probability Ratio Test', The Annals of Mathematical Statistics 19, 326-339.
- (1948) The Annals of Mathematical Statistics , vol.19 , pp. 326-339
- Wald, A.¹ Wolfowitz, J.²

192
- 0002327722
- On an index policy for restless bandits
- Weber, R. R. & Weiss, G. (1990), 'On an index policy for restless bandits', J. Appl. Prob. 27(3), 637-648.
- (1990) J. Appl. Prob. , vol.27 , Issue.3 , pp. 637-648
- Weber, R.R.¹ Weiss, G.²

193
- 34547236489
- Better May be Worse: Some Monotonicity Results and Paradoxes in Discrete Choice Under Uncertainty
- Weibull, J. W., Mattsson, L.-G. & Voorneveld, M. (2007), 'Better May be Worse: Some Monotonicity Results and Paradoxes in Discrete Choice Under Uncertainty', Theory and Decision 63(2), 121-151.
- (2007) Theory and Decision , vol.63 , Issue.2 , pp. 121-151
- Weibull, J.W.¹ Mattsson, L.-G.² Voorneveld, M.³

194
- 0004227773
- Chapman & Hall
- Wetherill, G. B. & Glazebrook, K. D. (1986), Sequential methods in statistics, Chapman & Hall.
- (1986) Sequential methods in statistics
- Wetherill, G.B.¹ Glazebrook, K.D.²

195
- 0000248624
- Multi-armed bandits and the Gittins index
- Whittle, P. (1980), 'Multi-armed bandits and the Gittins index', Journal of the Royal Statistical Society B42(2), 143-149.
- (1980) Journal of the Royal Statistical Society , vol.B42 , Issue.2 , pp. 143-149
- Whittle, P.¹

196
- 0001043843
- Restless bandits: Activity Allocation in a Changing World
- Whittle, P. (1988), 'Restless bandits: Activity Allocation in a Changing World', J. Appl. Prob. 25(1988), 287-298.
- (1988) J. Appl. Prob. , vol.25 , Issue.1988 , pp. 287-298
- Whittle, P.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.