메뉴 건너뛰기




Volumn , Issue , 2012, Pages 1-384

Optimal Learning

Author keywords

[No Author keywords available]

Indexed keywords

MATHEMATICAL PROGRAMMING; MATLAB; TABLE LOOKUP;

EID: 84871543700     PISSN: None     EISSN: None     Source Type: Book    
DOI: 10.1002/9781118309858     Document Type: Book
Times cited : (245)

References (196)
  • 3
    • 0000616723 scopus 로고
    • Sample mean based index policies with O (log n) regret for the multi-armed bandit problem
    • Agrawal, R. (1995), 'Sample mean based index policies with O (log n) regret for the multi-armed bandit problem', Advance s in Applied Probability 27(4), 1054-1078.
    • (1995) Advance s in Applied Probability , vol.27 , Issue.4 , pp. 1054-1078
    • Agrawal, R.1
  • 4
    • 0344182998 scopus 로고    scopus 로고
    • A simulated annealing algorithm with constant temperature for discrete stochastic optimization
    • Alrefaei, M. H. & Andradottir, S. (1999), 'A simulated annealing algorithm with constant temperature for discrete stochastic optimization', Management Science 45,748-764.
    • (1999) Management Science , vol.45 , pp. 748-764
    • Alrefaei, M.H.1    Andradottir, S.2
  • 5
    • 77956926259 scopus 로고    scopus 로고
    • Adaptive random search for continuous simulation optimization
    • Andradottir, S. & Prudius, A. A. (2010), 'Adaptive random search for continuous simulation optimization', Naval Research Logistics 57(6), 583-604.
    • (2010) Naval Research Logistics , vol.57 , Issue.6 , pp. 583-604
    • Andradottir, S.1    Prudius, A.A.2
  • 6
    • 77951184800 scopus 로고    scopus 로고
    • Stochastic kriging for simulation metamodeling
    • Ankenman, B., Nelson, B. L. & Staum, J. (2010), 'Stochastic kriging for simulation metamodeling', Operations Research 58(2), 371-382.
    • (2010) Operations Research , vol.58 , Issue.2 , pp. 371-382
    • Ankenman, B.1    Nelson, B.L.2    Staum, J.3
  • 7
    • 70350228783 scopus 로고    scopus 로고
    • Dynamic pricing for non-perishable products with demand learning
    • Araman, V. & Caldentey, R. (2009), 'Dynamic pricing for non-perishable products with demand learning', Operations Research 57(5), 1169-1188.
    • (2009) Operations Research , vol.57 , Issue.5 , pp. 1169-1188
    • Araman, V.1    Caldentey, R.2
  • 9
    • 0036568025 scopus 로고    scopus 로고
    • Finite-time analysis of the multi-armed bandit problem
    • Auer, P., Cesa-Bianchi, N. & Fischer, P. (2002), 'Finite-time analysis of the multi-armed bandit problem', Machine Learning 47(2), 235-256.
    • (2002) Machine Learning , vol.47 , Issue.2 , pp. 235-256
    • Auer, P.1    Cesa-Bianchi, N.2    Fischer, P.3
  • 10
    • 73549103329 scopus 로고    scopus 로고
    • Near-optimal regret bounds for reinforcement learning
    • D. Koller, Y. Bengio, D. Schuurmans, L. Bottou & A. Culotta, eds
    • Auer, P., Jaksch, T. & Ortner, R. (2008), Near-optimal regret bounds for reinforcement learning, in D. Koller, Y. Bengio, D. Schuurmans, L. Bottou & A. Culotta, eds,'Advances in Neural Information Processing Systems', Vol. 21, pp. 89-96.
    • (2008) Advances in Neural Information Processing Systems , vol.21 , pp. 89-96
    • Auer, P.1    Jaksch, T.2    Ortner, R.3
  • 11
    • 25844499294 scopus 로고    scopus 로고
    • A partially observed Markov decision process for dynamic pricing
    • Aviv, Y. & Pazgal, A. (2005), 'A partially observed Markov decision process for dynamic pricing', Management Science 51(9), 1400-1416.
    • (2005) Management Science , vol.51 , Issue.9 , pp. 1400-1416
    • Aviv, Y.1    Pazgal, A.2
  • 12
    • 0011155531 scopus 로고
    • Optimality proof for the symmetric Fibonacci search technique
    • Avriel, M. & Wilde, D. (1966), 'Optimality proof for the symmetric Fibonacci search technique', Fibonacci Quarterly 4, 265-269.
    • (1966) Fibonacci Quarterly , vol.4 , pp. 265-269
    • Avriel, M.1    Wilde, D.2
  • 13
    • 0033078286 scopus 로고    scopus 로고
    • Simulation optimization with qualitative variables and structural model changes: A genetic algorithm approach
    • Azadivar, F. & Tompkins, G. (1999), 'Simulation optimization with qualitative variables and structural model changes: A genetic algorithm approach', European Journal of Operational Research 113, 169-182.
    • (1999) European Journal of Operational Research , vol.113 , pp. 169-182
    • Azadivar, F.1    Tompkins, G.2
  • 14
    • 0034694502 scopus 로고    scopus 로고
    • Facility Layout Optimization Using Simulation and Genetic Algorithms
    • Azadivar, F. & Wang, J. (2000), 'Facility Layout Optimization Using Simulation and Genetic Algorithms', International Journal of Production Research 38( 17), 43694383.
    • (2000) International Journal of Production Research , vol.38 , Issue.17 , pp. 43694383
    • Azadivar, F.1    Wang, J.2
  • 16
    • 0000541772 scopus 로고
    • An introduction to ranking and selection procedures
    • Barr, D. R. & Rizvi, M. H. (1966), 'An introduction to ranking and selection procedures', J. Amer. Statist. Assoc. 61(315), 640-646.
    • (1966) J. Amer. Statist. Assoc. , vol.61 , Issue.315 , pp. 640-646
    • Barr, D.R.1    Rizvi, M.H.2
  • 18
    • 0002426110 scopus 로고
    • A Single-Sample Multiple Decision Procedure for Ranking Means of Normal Populations with known Variances
    • Bechhofer, R. E. (1954), 'A Single-Sample Multiple Decision Procedure for Ranking Means of Normal Populations with known Variances', The Annals of Mathematical Statistics 25, 16-39.
    • (1954) The Annals of Mathematical Statistics , vol.25 , pp. 16-39
    • Bechhofer, R.E.1
  • 21
    • 84938011869 scopus 로고
    • On adaptive control processes
    • Bellman, R. & Kalaba, R. (1959), 'On adaptive control processes', IRE Trans. 4,1-9.
    • (1959) IRE Trans. , vol.4 , pp. 1-9
    • Bellman, R.1    Kalaba, R.2
  • 24
    • 0030352286 scopus 로고    scopus 로고
    • Learning and strategic pricing
    • Bergemann, D. & Välimäki, J. (1996), 'Learning and strategic pricing', Econometrica 64(5), 1125-1149.
    • (1996) Econometrica , vol.64 , Issue.5 , pp. 1125-1149
    • Bergemann, D.1    Välimäki, J.2
  • 27
    • 0343441515 scopus 로고    scopus 로고
    • Restless bandits, linear programming relaxations, and a primal-dual index heuristic
    • Bertsimas, D. J. & Nino-Mora, J. (2000), 'Restless bandits, linear programming relaxations, and a primal-dual index heuristic', Operations Research 48(1), 8090.
    • (2000) Operations Research , vol.48 , Issue.1 , pp. 8090
    • Bertsimas, D.J.1    Nino-Mora, J.2
  • 32
    • 0041965975 scopus 로고    scopus 로고
    • R-max - a general polynomial time algorithm for near-optimal reinforcement learning
    • Brafman, R. I. & Tennenholtz, M. (2003), 'R-max - a general polynomial time algorithm for near-optimal reinforcement learning', Journal of Machine Learning Research 3,213-231.
    • (2003) Journal of Machine Learning Research , vol.3 , pp. 213-231
    • Brafman, R.I.1    Tennenholtz, M.2
  • 33
    • 33745765950 scopus 로고    scopus 로고
    • New developments in ranking and selection: an empirical comparison of the three main approaches
    • M. Kuhl, N. Steiger, F. Argstrong & J. Joines, eds, IEEE, Inc., Piscataway, NJ
    • Branke, J., Chick, S. E. & Schmidt, C. (2005), New developments in ranking and selection: an empirical comparison of the three main approaches, in M. Kuhl, N. Steiger, F. Argstrong & J. Joines, eds,'Proc. 2005 Winter Simulation Conference', IEEE, Inc., Piscataway, NJ, pp. 708-717.
    • (2005) Proc. 2005 Winter Simulation Conference , pp. 708-717
    • Branke, J.1    Chick, S.E.2    Schmidt, C.3
  • 35
    • 0036334330 scopus 로고    scopus 로고
    • Optimal learning and experimentation in bandit problems
    • Brezzi, M. & Lai, T. L. (2002), 'Optimal learning and experimentation in bandit problems', Journal of Economic Dynamics and Control 27(1), 87-108.
    • (2002) Journal of Economic Dynamics and Control , vol.27 , Issue.1 , pp. 87-108
    • Brezzi, M.1    Lai, T.L.2
  • 38
    • 0001596834 scopus 로고
    • A unified approach to a class of best choice problems with an unknown number of options
    • Bruss, F. (1984), 'A unified approach to a class of best choice problems with an unknown number of options', The Annals of Probability 12(3), 882-889.
    • (1984) The Annals of Probability , vol.12 , Issue.3 , pp. 882-889
    • Bruss, F.1
  • 41
    • 84949574363 scopus 로고    scopus 로고
    • Convergence rates of efficient global optimization algorithms
    • Bull, A. D. (2011), 'Convergence rates of efficient global optimization algorithms', Submitted for publication.
    • (2011) Submitted for publication
    • Bull, A.D.1
  • 42
    • 60749134420 scopus 로고    scopus 로고
    • A comparative study of genetic algorithm components in simulation-based optimization
    • S. Mason, R. Hill, L. Mönch, O. Rose, T. Jefferson & J. Fowler, eds
    • Can, B., Beham, A. & Heavey, C. (2008), A comparative study of genetic algorithm components in simulation-based optimization, in S. Mason, R. Hill, L. Mönch, O. Rose, T. Jefferson & J. Fowler, eds,'Proceedings of the 2008 Winter Simulation Conference', pp. 1829-1837.
    • (2008) Proceedings of the 2008 Winter Simulation Conference , pp. 1829-1837
    • Can, B.1    Beham, A.2    Heavey, C.3
  • 44
    • 34547314820 scopus 로고    scopus 로고
    • Learning and pricing in an Internet environment with binomial demands
    • Carvalho, A. & Puterman, M. (2005), 'Learning and pricing in an Internet environment with binomial demands', Journal of Revenue and Pricing Management 3(4), 320-336.
    • (2005) Journal of Revenue and Pricing Management , vol.3 , Issue.4 , pp. 320-336
    • Carvalho, A.1    Puterman, M.2
  • 45
    • 1842266137 scopus 로고
    • Mathematical questions with their solutions, No. 4528
    • Cayley, A. (1875), 'Mathematical questions with their solutions, No. 4528', Educational Times.
    • (1875) Educational Times
    • Cayley, A.1
  • 46
    • 0036921026 scopus 로고    scopus 로고
    • Another Look at the Radner-Stiglitz Nonconcavity in the Value of Information
    • Chade, H. & Schlee, E. E. (2002), 'Another Look at the Radner-Stiglitz Nonconcavity in the Value of Information', Journal of Economic Theory 107(2), 421-452.
    • (2002) Journal of Economic Theory , vol.107 , Issue.2 , pp. 421-452
    • Chade, H.1    Schlee, E.E.2
  • 48
    • 27844503922 scopus 로고    scopus 로고
    • Application of genetic algorithms in production and operations management: a review
    • Chaudhry, S. & Luo, W. (2005), 'Application of genetic algorithms in production and operations management: a review', International Journal of Production Research 43(19), 4083-4101.
    • (2005) International Journal of Production Research , vol.43 , Issue.19 , pp. 4083-4101
    • Chaudhry, S.1    Luo, W.2
  • 50
    • 60749111637 scopus 로고    scopus 로고
    • Simulation and optimization
    • P. Gray, Z.-L. Chen & S. Raghavan, eds
    • Chen, C.-H., Fu, M. C. & Shi, L. (2008), Simulation and optimization, in P. Gray, Z.-L. Chen & S. Raghavan, eds,'2008 TutORials in Operations Research', pp. 247-260.
    • (2008) 2008 TutORials in Operations Research , pp. 247-260
    • Chen, C.-H.1    Fu, M.C.2    Shi, L.3
  • 51
    • 33845807891 scopus 로고    scopus 로고
    • Efficient Dynamic Simulation Allocation in Ordinal Optimization
    • Chen, C.-H., He, D. & Fu, M. C. (2006), 'Efficient Dynamic Simulation Allocation in Ordinal Optimization', IEEE Transactions Automatic Control 51, 2005-2009.
    • (2006) IEEE Transactions Automatic Control , vol.51 , pp. 2005-2009
    • Chen, C.-H.1    He, D.2    Fu, M.C.3
  • 52
    • 0034225544 scopus 로고    scopus 로고
    • Simulation budget allocation for further enhancing the efficiency of ordinal optimization
    • Chen, C.-H., Lin, J., Yücesan, E. & Chick, S. E. (2000), 'Simulation budget allocation for further enhancing the efficiency of ordinal optimization', Discrete Event Dynamic Systems 10(3), 251-270.
    • (2000) Discrete Event Dynamic Systems , vol.10 , Issue.3 , pp. 251-270
    • Chen, C.-H.1    Lin, J.2    Yücesan, E.3    Chick, S.E.4
  • 54
    • 84949574364 scopus 로고    scopus 로고
    • Dynamic pricing with non-conjugate Pareto priors, Technical report
    • Rensselaer Polytechnic Institute
    • Chhabra, M. & Das, S. (2012), Dynamic pricing with non-conjugate Pareto priors, Technical report, Rensselaer Polytechnic Institute.
    • (2012)
    • Chhabra, M.1    Das, S.2
  • 55
    • 1642437901 scopus 로고    scopus 로고
    • Expected opportunity cost guarantees and indifference zone selection procedures
    • S. E. Chick, P. J. Sánchez, D. Ferrin & D. J. Morrice, eds
    • Chick, S. E. (2003), Expected opportunity cost guarantees and indifference zone selection procedures, in S. E. Chick, P. J. Sánchez, D. Ferrin & D. J. Morrice, eds,'Proceedings of the 2003 Winter Simulation Conference', pp. 465-473.
    • (2003) Proceedings of the 2003 Winter Simulation Conference , pp. 465-473
    • Chick, S.E.1
  • 56
    • 77950462837 scopus 로고    scopus 로고
    • Subjective Probability and Bayesian Methodology
    • S. Hen-derson & B. Nelson, eds, Simulation', North-Holland Publishing, Amsterdam
    • Chick, S. E. (2006), Subjective Probability and Bayesian Methodology, in S. Hen-derson & B. Nelson, eds,'Handbooks of Operations Research and Management Science, vol. 13: Simulation', North-Holland Publishing, Amsterdam, pp. 225-258.
    • (2006) Handbooks of Operations Research and Management Science , vol.13 , pp. 225-258
    • Chick, S.E.1
  • 57
    • 67649990621 scopus 로고    scopus 로고
    • Economic analysis of simulation selection problems
    • Chick, S. E. & Gans, N. (2009), 'Economic analysis of simulation selection problems', Management Science 55(3), 421-37.
    • (2009) Management Science , vol.55 , Issue.3 , pp. 421-437
    • Chick, S.E.1    Gans, N.2
  • 58
    • 0035460965 scopus 로고    scopus 로고
    • New two-stage and sequential procedures for selecting the best simulated system
    • Chick, S. E. & Inoue, K. (2001), 'New two-stage and sequential procedures for selecting the best simulated system', Operations Research 49(5), 732-743.
    • (2001) Operations Research , vol.49 , Issue.5 , pp. 732-743
    • Chick, S.E.1    Inoue, K.2
  • 59
    • 27344432350 scopus 로고    scopus 로고
    • Selection procedures with frequentist expected opportunity cost bounds
    • Chick, S. E. & Wu, Y. (2005), 'Selection procedures with frequentist expected opportunity cost bounds', Operations Research 53(5), 867-878.
    • (2005) Operations Research , vol.53 , Issue.5 , pp. 867-878
    • Chick, S.E.1    Wu, Y.2
  • 60
    • 77949359798 scopus 로고    scopus 로고
    • Sequential Sampling to Myopically Maximize the Expected Value of Information
    • Chick, S. E., Branke, J. & Schmidt, C. (2010), 'Sequential Sampling to Myopically Maximize the Expected Value of Information', INFORMS Journal on Computing 22(1), 71-80.
    • (2010) INFORMS Journal on Computing , vol.22 , Issue.1 , pp. 71-80
    • Chick, S.E.1    Branke, J.2    Schmidt, C.3
  • 61
    • 34548237750 scopus 로고    scopus 로고
    • Opportunity cost and OCBA selection procedures in ordinal optimization for a fixed number of alternative systems
    • Chick, S. E., He, D. H. & Chen, C.-H. (2007), 'Opportunity cost and OCBA selection procedures in ordinal optimization for a fixed number of alternative systems', IEEE Transactions on Systems Man and Cybernetics C37, 951-961.
    • (2007) IEEE Transactions on Systems Man and Cybernetics , vol.C37 , pp. 951-961
    • Chick, S.E.1    He, D.H.2    Chen, C.-H.3
  • 64
    • 34250348767 scopus 로고    scopus 로고
    • Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration.
    • Cohen, J. D., McClure, S. M. & Yu, A. J. (2007), 'Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration.', Philosophical transactions of the Royal Society of London B362(1481), 933-42.
    • (2007) Philosophical transactions of the Royal Society of London , vol.B362 , Issue.1481 , pp. 933-942
    • Cohen, J.D.1    McClure, S.M.2    Yu, A.J.3
  • 65
    • 0028424239 scopus 로고
    • Improving generalization with active learning
    • Cohn, D., Atlas, L. & Ladner, R. (1994), 'Improving generalization with active learning', Machine Learning 5(2201), 221.
    • (1994) Machine Learning , vol.5 , Issue.2201 , pp. 221
    • Cohn, D.1    Atlas, L.2    Ladner, R.3
  • 67
    • 84900550689 scopus 로고
    • Markov decision processes with uncertain transition probabilities
    • Technical Report 11, Operations Research Center, MIT
    • Cozzolino, J., Gonzalez-Zubieta, R. & Miller, R. (1965), Markov decision processes with uncertain transition probabilities, Technical Report 11, Operations Research Center, MIT.
    • (1965)
    • Cozzolino, J.1    Gonzalez-Zubieta, R.2    Miller, R.3
  • 70
    • 49349084239 scopus 로고    scopus 로고
    • Index policies for discounted bandit problems with availability constraints
    • Dayanik, S., Powell, W. B. & Yamazaki, K. (2008), 'Index policies for discounted bandit problems with availability constraints', Adv. in Appl. Probab 40, 377-100.
    • (2008) Adv. in Appl. Probab , vol.40 , pp. 377-100
    • Dayanik, S.1    Powell, W.B.2    Yamazaki, K.3
  • 74
    • 36048940444 scopus 로고    scopus 로고
    • A tight sufficient condition for Radner-Stiglitz non-concavity in the value of information
    • Delara, M. & Gilotte, L. (2007), 'A tight sufficient condition for Radner-Stiglitz non-concavity in the value of information', Journal of Economic Theory 137(1), 696-708.
    • (2007) Journal of Economic Theory , vol.137 , Issue.1 , pp. 696-708
    • Delara, M.1    Gilotte, L.2
  • 76
    • 16244388049 scopus 로고    scopus 로고
    • Local bandit approximation for optimal learning problems
    • M. Mozer, M. Jordan & T. Pesche, eds, Cambridge, MA: MIT Press
    • Duff, M. & Barto, A. (1996), Local bandit approximation for optimal learning problems, in M. Mozer, M. Jordan & T. Pesche, eds,'Advances in Neural Information Processing Systems', Vol. 9, Cambridge, MA: MIT Press, pp. 1019-1025.
    • (1996) Advances in Neural Information Processing Systems , vol.9 , pp. 1019-1025
    • Duff, M.1    Barto, A.2
  • 77
    • 0000145537 scopus 로고
    • Multistage Stochastic Programs - The State of the Art and Selected Bibliography
    • Dupacova, J. (1995), 'Multistage Stochastic Programs - The State of the Art and Selected Bibliography', Kybernetica 31, 151-174.
    • (1995) Kybernetica , vol.31 , pp. 151-174
    • Dupacova, J.1
  • 78
    • 77249163740 scopus 로고    scopus 로고
    • Dynamic pricing with a prior on market response
    • Farias, V. & Van Roy, B. (2010), 'Dynamic pricing with a prior on market response', Operations Research 58(1), 16-29.
    • (2010) Operations Research , vol.58 , Issue.1 , pp. 16-29
    • Farias, V.1    Van Roy, B.2
  • 80
    • 78651309095 scopus 로고    scopus 로고
    • Paradoxes in Learning and the Marginal Value of Information
    • Frazier, P. I. & Powell, W. B. (2010), 'Paradoxes in Learning and the Marginal Value of Information', Decision Analysis 7(4), 378-403.
    • (2010) Decision Analysis , vol.7 , Issue.4 , pp. 378-403
    • Frazier, P.I.1    Powell, W.B.2
  • 83
    • 70449498873 scopus 로고    scopus 로고
    • The Knowledge-Gradient Policy for Correlated Normal Beliefs
    • Frazier, P. I., Powell, W. B. & Dayanik, S. (2009), 'The Knowledge-Gradient Policy for Correlated Normal Beliefs', INFORMS Journal on Computing 21(4), 599-613.
    • (2009) INFORMS Journal on Computing , vol.21 , Issue.4 , pp. 599-613
    • Frazier, P.I.1    Powell, W.B.2    Dayanik, S.3
  • 84
    • 0012260296 scopus 로고    scopus 로고
    • Optimization for simulation: Theory vs. practice
    • Fu, M. C. (2002), 'Optimization for simulation: Theory vs. practice', INFORMS Journal on Computing 14(3), 192-215.
    • (2002) INFORMS Journal on Computing , vol.14 , Issue.3 , pp. 192-215
    • Fu, M.C.1
  • 85
    • 33846679442 scopus 로고    scopus 로고
    • Simulation optimization: a review, new developments, and applications
    • M. E. Kuhl, N. M. Steiger, F. B. Armstrong & J. A. Joines, eds
    • Fu, M. C, Glover, F. & April, J. (2005), Simulation optimization: a review, new developments, and applications, in M. E. Kuhl, N. M. Steiger, F. B. Armstrong & J. A. Joines, eds,'Proceedings of the 2005 Winter Simulation Conference', pp. 83-95.
    • (2005) Proceedings of the 2005 Winter Simulation Conference , pp. 83-95
    • Fu, M.C.1    Glover, F.2    April, J.3
  • 86
    • 17744377782 scopus 로고    scopus 로고
    • Optimal computing budget allocation under correlated sampling
    • R. G. Ingalls, M. D. Rossetti, J. S. Smith & B. A. Peters, eds
    • Fu, M. C, Hu, J. Q., Chen, C.-H. & Xiong, X. (2004), Optimal computing budget allocation under correlated sampling, in R. G. Ingalls, M. D. Rossetti, J. S. Smith & B. A. Peters, eds,'Proceedings of the 2004 Winter Simulation Conference', pp. 595-603.
    • (2004) Proceedings of the 2004 Winter Simulation Conference , pp. 595-603
    • Fu, M.C.1    Hu, J.Q.2    Chen, C.-H.3    Xiong, X.4
  • 87
    • 0028480132 scopus 로고
    • Optimal dynamic pricing of inventories with stochastic demand over finite horizons
    • Gallego, G. & Van Ryzin, G. (1994), 'Optimal dynamic pricing of inventories with stochastic demand over finite horizons', Management Science 40(8), 999-1020.
    • (1994) Management Science , vol.40 , Issue.8 , pp. 999-1020
    • Gallego, G.1    Van Ryzin, G.2
  • 88
    • 0004012196 scopus 로고    scopus 로고
    • Bayesian Data Analysis
    • 2nd ed', Chapman & Hall, New York
    • Gelman, A., Carlin, J. B., Stern, H. S. & Rubin, D. B. (2004), 'Bayesian Data Analysis, 2nd ed', Chapman & Hall, New York p. 63.
    • (2004) , pp. 63
    • Gelman, A.1    Carlin, J.B.2    Stern, H.S.3    Rubin, D.B.4
  • 89
    • 0001942829 scopus 로고
    • Neural networks and the bias/variance dilemma
    • Geman, S., Bienenstock, E. & Doursat, R. (1992), 'Neural networks and the bias/variance dilemma', Neural computation 4(1), 1-58.
    • (1992) Neural computation , vol.4 , Issue.1 , pp. 1-58
    • Geman, S.1    Bienenstock, E.2    Doursat, R.3
  • 90
    • 27144549876 scopus 로고    scopus 로고
    • Sensitivity analysis in linear optimization: Invariant support set intervals
    • Ghaffari-Hadigheh, A. & Terlaky, T. (2006), 'Sensitivity analysis in linear optimization: Invariant support set intervals', European Journal of Operational Research 169(3), 1158-1175.
    • (2006) European Journal of Operational Research , vol.169 , Issue.3 , pp. 1158-1175
    • Ghaffari-Hadigheh, A.1    Terlaky, T.2
  • 92
    • 0000169010 scopus 로고
    • Bandit processes and dynamic allocation indices
    • Gittins, J. C. (1979), 'Bandit processes and dynamic allocation indices', Journal of the Royal Statistical Society B41(2), 148-177.
    • (1979) Journal of the Royal Statistical Society , vol.B41 , Issue.2 , pp. 148-177
    • Gittins, J.C.1
  • 93
    • 84891584370 scopus 로고    scopus 로고
    • Multi-armed Bandit Allocation Indices
    • Wiley and Sons: New York
    • Gittins, J. C. (1989), 'Multi-armed Bandit Allocation Indices', Wiley and Sons: New York.
    • (1989)
    • Gittins, J.C.1
  • 94
    • 0002955623 scopus 로고
    • A dynamic allocation index for the sequential design of experiments
    • J. Gani, ed., North Holland, Amsterdam
    • Gittins, J. C. & Jones, D. M. (1974), A dynamic allocation index for the sequential design of experiments, in J. Gani, ed.,'Progress in statistics', North Holland, Amsterdam, pp. 241-266.
    • (1974) Progress in statistics , pp. 241-266
    • Gittins, J.C.1    Jones, D.M.2
  • 97
    • 0002232604 scopus 로고
    • On the evaluation of suboptimal strategies for families of alternative bandit processes
    • Glazebrook, K. D. (1982), 'On the evaluation of suboptimal strategies for families of alternative bandit processes', Journal of Applied Probability 19(3), 716-722.
    • (1982) Journal of Applied Probability , vol.19 , Issue.3 , pp. 716-722
    • Glazebrook, K.D.1
  • 98
    • 67649922844 scopus 로고    scopus 로고
    • A Generalized Gittins Index for a Class of Multiarmed Bandits with General Resource Requirements
    • Glazebrook, K. D. & Minty, R. (2009), 'A Generalized Gittins Index for a Class of Multiarmed Bandits with General Resource Requirements', Mathematics of Operations Research.
    • (2009) Mathematics of Operations Research
    • Glazebrook, K.D.1    Minty, R.2
  • 99
    • 0024561585 scopus 로고
    • A comparative analysis of selection schemes used in genetic algorithms
    • G. Rawlings, ed., Morgan Kaufmann Publishers, San Mateo, CA
    • Goldberg, D. E. & Deb, K. (1991), A comparative analysis of selection schemes used in genetic algorithms, in G. Rawlings, ed.,'Foundations of genetic algorithms', Morgan Kaufmann Publishers, San Mateo, CA, pp. 69-93.
    • (1991) Foundations of genetic algorithms , pp. 69-93
    • Goldberg, D.E.1    Deb, K.2
  • 101
    • 84859489788 scopus 로고    scopus 로고
    • Optimization under unknown constraints
    • Arxiv preprint arXiv:1004.4027
    • Gramacy, R. B. & Lee, H. K. H. (2011), 'Optimization under unknown constraints', Arxiv preprint arXiv:1004.4027.
    • (2011)
    • Gramacy, R.B.1    Lee, H.K.H.2
  • 102
    • 0000511415 scopus 로고
    • Bayesian look ahead one stage sampling allocations for selecting the largest normal mean
    • Gupta, S. S. & Miescke, K. J. (1994), 'Bayesian look ahead one stage sampling allocations for selecting the largest normal mean', Statistical Papers 35,169-177.
    • (1994) Statistical Papers , vol.35 , pp. 169-177
    • Gupta, S.S.1    Miescke, K.J.2
  • 103
    • 0030590294 scopus 로고    scopus 로고
    • Bayesian look ahead one-stage sampling allocations for selection of the best population
    • Gupta, S. S. & Miescke, K. J. (1996), 'Bayesian look ahead one-stage sampling allocations for selection of the best population', Journal of Statistical Planning and Inference 54, 229-244.
    • (1996) Journal of Statistical Planning and Inference , vol.54 , pp. 229-244
    • Gupta, S.S.1    Miescke, K.J.2
  • 104
    • 84866344056 scopus 로고    scopus 로고
    • Bayesian dynamic pricing policies: Learning and earning under a binary prior distribution
    • Technical report, Working paper, Columbia and Stanford University
    • Harrison, J., Keskin, N. & Zeevi, A. (2010), Bayesian dynamic pricing policies: Learning and earning under a binary prior distribution, Technical report, Working paper, Columbia and Stanford University.
    • (2010)
    • Harrison, J.1    Keskin, N.2    Zeevi, A.3
  • 107
    • 0035696433 scopus 로고    scopus 로고
    • A Genetic Algorithm and an Indifference-Zone Ranking and Selection Framework for Simulation Optimization
    • B. A. Peters, J. S. Smith, D. J. Medeiros & M. W. Rohrer, eds
    • Hedlund, H. E. & Mollaghasemi, M. (2001), A Genetic Algorithm and an Indifference-Zone Ranking and Selection Framework for Simulation Optimization, in B. A. Peters, J. S. Smith, D. J. Medeiros & M. W. Rohrer, eds,'Proceedings of the 2001 Winter Simulation Conference', pp. 417-421.
    • (2001) Proceedings of the 2001 Winter Simulation Conference , pp. 417-421
    • Hedlund, H.E.1    Mollaghasemi, M.2
  • 108
    • 33644525898 scopus 로고    scopus 로고
    • Discrete Optimization via Simulation Using COMPASS
    • Hong, L. J. & Nelson, B. L. (2006), 'Discrete Optimization via Simulation Using COMPASS', Operations Research 54(1), 115-129.
    • (2006) Operations Research , vol.54 , Issue.1 , pp. 115-129
    • Hong, L.J.1    Nelson, B.L.2
  • 109
    • 77951527013 scopus 로고    scopus 로고
    • A Brief Introduction To Optimization Via Simulation
    • M. Rosetti, R. Hill, B. Johansson, A. Dunkin & R. Ingalls, eds
    • Hong, L. J. & Nelson, B. L. (2009), A Brief Introduction To Optimization Via Simulation, in M. Rosetti, R. Hill, B. Johansson, A. Dunkin & R. Ingalls, eds,'Proceedings of the 2009 Winter Simulation Conference', pp. 75-85.
    • (2009) Proceedings of the 2009 Winter Simulation Conference , pp. 75-85
    • Hong, L.J.1    Nelson, B.L.2
  • 110
    • 39549108095 scopus 로고    scopus 로고
    • Ranking inequality: Applications of multivariate subset selection
    • Horrace, W., Marchand, J. & Smeeding, T. (2008), 'Ranking inequality: Applications of multivariate subset selection', Journal of Economic Inequality 6(1), 5-32.
    • (2008) Journal of Economic Inequality , vol.6 , Issue.1 , pp. 5-32
    • Horrace, W.1    Marchand, J.2    Smeeding, T.3
  • 111
    • 84939051589 scopus 로고
    • Sequential transmission using noiseless feedback
    • Horstein, M. (1963), 'Sequential transmission using noiseless feedback', IEEE Trans-actions on Information Theory 9(3), 136-143.
    • (1963) IEEE Trans-actions on Information Theory , vol.9 , Issue.3 , pp. 136-143
    • Horstein, M.1
  • 113
    • 79951643127 scopus 로고    scopus 로고
    • An Approximate Annealing Search Algorithm to Global Optimization and its Connection to Stochastic Approximation
    • B. Johansson, S. Jain, J. Montoya-Torres, J. Hugan & E. Yücesan, eds
    • Hu, J. & Hu, P. (2010), An Approximate Annealing Search Algorithm to Global Optimization and its Connection to Stochastic Approximation, in B. Johansson, S. Jain, J. Montoya-Torres, J. Hugan & E. Yücesan, eds,'Proceedings of the 2010 Winter Simulation Conference', pp. 1223-1234.
    • (2010) Proceedings of the 2010 Winter Simulation Conference , pp. 1223-1234
    • Hu, J.1    Hu, P.2
  • 114
    • 79960273055 scopus 로고    scopus 로고
    • Annealing Adaptive Search, Cross-Entropy, and Stochastic Approximation in Global Optimization
    • to appear
    • Hu, J. & Hu, P. (2011), 'Annealing Adaptive Search, Cross-Entropy, and Stochastic Approximation in Global Optimization', Naval Research Logistics (to appear).
    • (2011) Naval Research Logistics
    • Hu, J.1    Hu, P.2
  • 115
    • 84863275874 scopus 로고    scopus 로고
    • Discrete Optimization Via Approximate Annealing Adaptive Search With Stochastic Averaging
    • S. Jain, R. R. Creasey, J. Himmelspach, K. P. White & M. C. Fu, eds
    • Hu, J. & Wang, C. (2011), Discrete Optimization Via Approximate Annealing Adaptive Search With Stochastic Averaging, in S. Jain, R. R. Creasey, J. Himmelspach, K. P. White & M. C. Fu, eds,'Proceedings of the 2011 Winter Simulation Conference'.
    • (2011) Proceedings of the 2011 Winter Simulation Conference
    • Hu, J.1    Wang, C.2
  • 116
    • 37249005626 scopus 로고    scopus 로고
    • A model reference adaptive search method for global optimization
    • Hu, J., Fu, M. & Marcus, S. (2007), 'A model reference adaptive search method for global optimization', Operations Research 55(3), 549-568.
    • (2007) Operations Research , vol.55 , Issue.3 , pp. 549-568
    • Hu, J.1    Fu, M.2    Marcus, S.3
  • 117
    • 33644791173 scopus 로고    scopus 로고
    • Global Optimization of Stochastic Black-Box Systems via Sequential Kriging Meta-Models
    • Huang, D., Allen, T. T., Notz, W. I. & Zeng, N. (2006), 'Global Optimization of Stochastic Black-Box Systems via Sequential Kriging Meta-Models', Journal of Global Optimization 34(3), 441-466.
    • (2006) Journal of Global Optimization , vol.34 , Issue.3 , pp. 441-466
    • Huang, D.1    Allen, T.T.2    Notz, W.I.3    Zeng, N.4
  • 119
    • 0042685161 scopus 로고    scopus 로고
    • Bayesian parameter estimation via variational methods
    • Jaakkola, T. & Jordan, M. (2000), 'Bayesian parameter estimation via variational methods', Statistics and Computing 10(1), 25-37.
    • (2000) Statistics and Computing , vol.10 , Issue.1 , pp. 25-37
    • Jaakkola, T.1    Jordan, M.2
  • 121
    • 0000561424 scopus 로고    scopus 로고
    • Efficient global optimization of expensive black-box functions
    • Jones, D., Schonlau, M. & Welch, W. (1998), 'Efficient global optimization of expensive black-box functions', Journal of Global Optimization 13(4), 455-492.
    • (1998) Journal of Global Optimization , vol.13 , Issue.4 , pp. 455-492
    • Jones, D.1    Schonlau, M.2    Welch, W.3
  • 123
    • 0030305926 scopus 로고    scopus 로고
    • Pricing decisions under demand uncertainty: A Bayesian mixture model approach
    • Kalyanam, K. (1996), 'Pricing decisions under demand uncertainty: A Bayesian mixture model approach', Marketing Science 15(3), 207-221.
    • (1996) Marketing Science , vol.15 , Issue.3 , pp. 207-221
    • Kalyanam, K.1
  • 124
  • 125
    • 0023345261 scopus 로고
    • The Multi-Armed Bandit Problem: Decomposition and Computation
    • Katehakis, M. & Veinott, A. (1987), 'The Multi-Armed Bandit Problem: Decomposition and Computation', Mathematics of Operations Research 12(2), 262-268.
    • (1987) Mathematics of Operations Research , vol.12 , Issue.2 , pp. 262-268
    • Katehakis, M.1    Veinott, A.2
  • 127
    • 0036832954 scopus 로고    scopus 로고
    • Near-optimal reinforcement learning in polynomial time.
    • Kearns, M. & Singh, S. (2002), 'Near-optimal reinforcement learning in polynomial time.', Machine Learning 49, 209-232.
    • (2002) Machine Learning , vol.49 , pp. 209-232
    • Kearns, M.1    Singh, S.2
  • 128
    • 0346963698 scopus 로고    scopus 로고
    • A fully sequential procedure for indifference-zone selection in simulation
    • Kim, S.-H. & Nelson, B. L. (2001), 'A fully sequential procedure for indifference-zone selection in simulation', ACM Trans. Model. Comput. Simul. 11, 251-273.
    • (2001) ACM Trans. Model. Comput. Simul. , vol.11 , pp. 251-273
    • Kim, S.-H.1    Nelson, B.L.2
  • 129
    • 33744788406 scopus 로고    scopus 로고
    • On the asymptotic validity of fully sequential selection procedures for steady-state simulation
    • Kim, S.-H. & Nelson, B. L. (2006), 'On the asymptotic validity of fully sequential selection procedures for steady-state simulation', Operations Research 54, 475-488.
    • (2006) Operations Research , vol.54 , pp. 475-488
    • Kim, S.-H.1    Nelson, B.L.2
  • 130
    • 38049011420 scopus 로고    scopus 로고
    • Nearly Tight Bounds for the Continuum-Armed Bandit Problem
    • L. Saul, Y. Weiss & L. Bottou, eds, MIT Press, Cambridge, MA
    • Kleinberg, R. (2004), Nearly Tight Bounds for the Continuum-Armed Bandit Problem, in L. Saul, Y. Weiss & L. Bottou, eds,'Advances in Neural Information Processing Systems', MIT Press, Cambridge, MA, pp. 697-704.
    • (2004) Advances in Neural Information Processing Systems , pp. 697-704
    • Kleinberg, R.1
  • 131
    • 77955660815 scopus 로고    scopus 로고
    • Regret bounds for sleeping experts and bandits
    • Kleinberg, R., Niculescu-Mizil, A. & Sharma, Y. (2010), 'Regret bounds for sleeping experts and bandits', Machine Learning 80(2), 245-272.
    • (2010) Machine Learning , vol.80 , Issue.2 , pp. 245-272
    • Kleinberg, R.1    Niculescu-Mizil, A.2    Sharma, Y.3
  • 132
    • 0022781718 scopus 로고
    • Shortest paths in networks with exponentially distributed arc lengths
    • Kulkarni, V. (1986), 'Shortest paths in networks with exponentially distributed arc lengths', Networks 16, 255-274.
    • (1986) Networks , vol.16 , pp. 255-274
    • Kulkarni, V.1
  • 134
    • 0000854435 scopus 로고
    • Adaptive treatment allocation and the multi-armed bandit problem
    • Lai, T. L. (1987), 'Adaptive treatment allocation and the multi-armed bandit problem', The Annals of Statistics 15(3), 1091-1114.
    • (1987) The Annals of Statistics , vol.15 , Issue.3 , pp. 1091-1114
    • Lai, T.L.1
  • 135
    • 0002899547 scopus 로고
    • Asymptotically Efficient Adaptive Allocation Rules
    • Lai, T. L. & Robbins, H. (1985), 'Asymptotically Efficient Adaptive Allocation Rules', Advances in Applied Mathematics 6, 4-22.
    • (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
    • Lai, T.L.1    Robbins, H.2
  • 136
    • 0001388964 scopus 로고
    • Continuous multi-armed bandits and multiparameter processes
    • Mandelbaum, A. (1987), 'Continuous multi-armed bandits and multiparameter processes', The Annals of Probability 15(4), 1527-1556.
    • (1987) The Annals of Probability , vol.15 , Issue.4 , pp. 1527-1556
    • Mandelbaum, A.1
  • 138
    • 0001036667 scopus 로고
    • Two-stage multiple comparisons with the best for computer simulation
    • Matejcik, F. & Nelson, B. (1995), 'Two-stage multiple comparisons with the best for computer simulation', Operations Research 43(4), 633-640.
    • (1995) Operations Research , vol.43 , Issue.4 , pp. 633-640
    • Matejcik, F.1    Nelson, B.2
  • 139
    • 84949574365 scopus 로고    scopus 로고
    • U.S. Greenhouse Gas Abatement Mapping Initiative, Executive Report
    • McKinsey & Company (2007), 'Reducing U.S. Greenhouse Gas Emissions: How Much at What Cost?', U.S. Greenhouse Gas Abatement Mapping Initiative, Executive Report.
    • (2007)
    • McKinsey1    Company2
  • 143
    • 79961092747 scopus 로고    scopus 로고
    • The Knowledge-Gradient Algorithm for Sequencing Experiments in Drug Discovery
    • Negoescu, D. M., Frazier, P. I. & Powell, W. B. (2011), 'The Knowledge-Gradient Algorithm for Sequencing Experiments in Drug Discovery', INFORMS Journal on Computing 23(3), 346-363.
    • (2011) INFORMS Journal on Computing , vol.23 , Issue.3 , pp. 346-363
    • Negoescu, D.M.1    Frazier, P.I.2    Powell, W.B.3
  • 144
    • 0001338090 scopus 로고
    • Using common random numbers for indifference-zone selection and multiple comparisons in simulation
    • Nelson, B. L. & Matejcik, F. J. (1995), 'Using common random numbers for indifference-zone selection and multiple comparisons in simulation', Management Science 41(12), 1935-1945.
    • (1995) Management Science , vol.41 , Issue.12 , pp. 1935-1945
    • Nelson, B.L.1    Matejcik, F.J.2
  • 145
    • 79955755016 scopus 로고    scopus 로고
    • Computing a Classic Index for Finite-Horizon Bandits
    • Nino-Mora, J. (2010), 'Computing a Classic Index for Finite-Horizon Bandits', INFORMS Journal on Computing 23(2), 254-267.
    • (2010) INFORMS Journal on Computing , vol.23 , Issue.2 , pp. 254-267
    • Nino-Mora, J.1
  • 146
    • 79952563147 scopus 로고    scopus 로고
    • The stochastic root-finding problem: Overview, solutions, and open questions
    • Pasupathy, R. & Kim, S. (2011), 'The stochastic root-finding problem: Overview, solutions, and open questions', ACM Transactions on Modeling and Computer Simulation 21(3), 19:1-19:23.
    • (2011) ACM Transactions on Modeling and Computer Simulation , vol.21 , Issue.3 , pp. 191-1923
    • Pasupathy, R.1    Kim, S.2
  • 147
    • 0000145493 scopus 로고
    • A Sequential Procedure for Selecting the Population with the Largest Mean from k Normal Populations
    • Paulson, E. (1964), 'A Sequential Procedure for Selecting the Population with the Largest Mean from k Normal Populations', The Annals of Mathematical Statistics 35, 174-180.
    • (1964) The Annals of Mathematical Statistics , vol.35 , pp. 174-180
    • Paulson, E.1
  • 152
    • 0003029778 scopus 로고
    • A Nonconcavity in the Value of Information
    • M. Boyer & R. Kihlstrom, eds, North-Holland, Amsterdam, chapter 3
    • Radner, R. & Stiglitz, J. (1984), A Nonconcavity in the Value of Information, in M. Boyer & R. Kihlstrom, eds,'Bayesian models in economic theory', Vol. 5, North-Holland, Amsterdam, chapter 3, pp. 33-52.
    • (1984) Bayesian models in economic theory , vol.5 , pp. 33-52
    • Radner, R.1    Stiglitz, J.2
  • 154
    • 0001552833 scopus 로고
    • On two-stage selection procedures and related probability inequalities
    • Rinott, Y. (1978), 'On two-stage selection procedures and related probability inequalities', Communications in Statistics A7 pp. 799-811.
    • (1978) Communications in Statistics A7 , pp. 799-811
    • Rinott, Y.1
  • 158
    • 0001058483 scopus 로고
    • A two-armed bandit theory of market pricing
    • Rothschild, M. (1974), 'A two-armed bandit theory of market pricing', Journal of Economic Theory 9(2), 185-202.
    • (1974) Journal of Economic Theory , vol.9 , Issue.2 , pp. 185-202
    • Rothschild, M.1
  • 162
    • 77951568757 scopus 로고    scopus 로고
    • A Monte-Carlo Knowledge Gradient Method For Learning Abatement Potential Of Emissions Reduction Technologies
    • M. D. Rossetti, R. R. Hill, B. Johansson, A. Dunkin & R. G. Ingalls, eds
    • Ryzhov, I. O. & Powell, W. B. (2009a), A Monte-Carlo Knowledge Gradient Method For Learning Abatement Potential Of Emissions Reduction Technologies, in M. D. Rossetti, R. R. Hill, B. Johansson, A. Dunkin & R. G. Ingalls, eds,'Proceedings of the 2009 Winter Simulation Conference'.
    • (2009) Proceedings of the 2009 Winter Simulation Conference
    • Ryzhov, I.O.1    Powell, W.B.2
  • 164
    • 84949574366 scopus 로고    scopus 로고
    • Information collection for linear programs with unknown objective coefficients
    • Submitted for publication
    • Ryzhov, I. O. & Powell, W. B. (201 la),'Information collection for linear programs with unknown objective coefficients', Submitted for publication.
    • (2011)
    • Ryzhov, I.O.1    Powell, W.B.2
  • 165
    • 79952942276 scopus 로고    scopus 로고
    • Information Collection on a Graph
    • Ryzhov, I. O. & Powell, W. B. (2011b),'Information Collection on a Graph', Operations Research 59(1), 188-201.
    • (2011) Operations Research , vol.59 , Issue.1 , pp. 188-201
    • Ryzhov, I.O.1    Powell, W.B.2
  • 167
    • 84877933547 scopus 로고    scopus 로고
    • The knowledge gradient algorithm for a general class of online learning problems
    • to appear
    • Ryzhov, I. O., Powell, W. B. & Frazier, P. I. (2011), 'The knowledge gradient algorithm for a general class of online learning problems', Operations Research (to appear).
    • (2011) Operations Research
    • Ryzhov, I.O.1    Powell, W.B.2    Frazier, P.I.3
  • 168
    • 79951586758 scopus 로고    scopus 로고
    • Optimal Learning of Transition Probabilities in the Two-Agent Newsvendor Problem
    • B. Johansson, S. Jain, J. Montoya-Torres, J. Hugan & E. Yucesan, eds
    • Ryzhov, I. O., Valdez-Vivas, M. R. & Powell, W B. (2010), Optimal Learning of Transition Probabilities in the Two-Agent Newsvendor Problem, in B. Johansson, S. Jain, J. Montoya-Torres, J. Hugan & E. Yucesan, eds,'Proceedings of the 2010 Winter Simulation Conference', pp. 1088-1098.
    • (2010) Proceedings of the 2010 Winter Simulation Conference , pp. 1088-1098
    • Ryzhov, I.O.1    Valdez-Vivas, M.R.2    Powell, W.B.3
  • 170
    • 77249140907 scopus 로고
    • Markov decision processes with imprecise transition probabilities
    • Satia, J. & Lave, R. (1973), 'Markov decision processes with imprecise transition probabilities', Operations Research 21(3), 755-763.
    • (1973) Operations Research , vol.21 , Issue.3 , pp. 755-763
    • Satia, J.1    Lave, R.2
  • 171
    • 80054748080 scopus 로고    scopus 로고
    • The Correlated Knowledge Gradient for Simulation Optimization of Continuous Parameters using Gaussian Process Regression
    • Scott, W. R., Frazier, P. I. & Powell, W. B. (2011), 'The Correlated Knowledge Gradient for Simulation Optimization of Continuous Parameters using Gaussian Process Regression', SI AM Journal on Optimization 21(3), 996-1026.
    • (2011) SI AM Journal on Optimization , vol.21 , Issue.3 , pp. 996-1026
    • Scott, W.R.1    Frazier, P.I.2    Powell, W.B.3
  • 172
    • 79951666282 scopus 로고    scopus 로고
    • Calibrating simulation models using the knowledge gradient with continuous parameters
    • B. Johansson, S. Jain, J. Montoya-Torres, J. Hugan & E. Yiicesan, eds
    • Scott, W. R., Powell, W. B. & Simão, H. P. (2010), Calibrating simulation models using the knowledge gradient with continuous parameters, in B. Johansson, S. Jain, J. Montoya-Torres, J. Hugan & E. Yiicesan, eds,'Proceedings of the 2010 Winter Simulation Conference', pp. 1099-1109.
    • (2010) Proceedings of the 2010 Winter Simulation Conference , pp. 1099-1109
    • Scott, W.R.1    Powell, W.B.2    Simão, H.P.3
  • 173
    • 68949137209 scopus 로고    scopus 로고
    • Computer Sciences Technical Report 1648, University of Wisconsin-Madison
    • Settles, B. (2009), Active learning literature survey, Computer Sciences Technical Report 1648, University of Wisconsin-Madison.
    • (2009) Active learning literature survey
    • Settles, B.1
  • 176
    • 0033901602 scopus 로고    scopus 로고
    • Convergence results for single-step on-policy reinforcement-learning algorithms
    • Singh, S. P., Jaakkola, T., Szepesvari, C. & Littman, M. (2000), 'Convergence results for single-step on-policy reinforcement-learning algorithms', Machine Learning 38(3), 287-308.
    • (2000) Machine Learning , vol.38 , Issue.3 , pp. 287-308
    • Singh, S.P.1    Jaakkola, T.2    Szepesvari, C.3    Littman, M.4
  • 177
    • 34248179010 scopus 로고
    • Probabilistic networks and network algorithms
    • M. Ball, T. Magnanti & C. Monma, eds, Networks', North-Holland Publishing, Amsterdam
    • Snyder, T. & Steele, J. (1995), Probabilistic networks and network algorithms, in M. Ball, T. Magnanti & C. Monma, eds,'Handbooks of Operations Research and Management Science, vol. 7: Networks', North-Holland Publishing, Amsterdam, pp. 401-424.
    • (1995) Handbooks of Operations Research and Management Science , vol.7 , pp. 401-424
    • Snyder, T.1    Steele, J.2
  • 182
    • 33750375100 scopus 로고    scopus 로고
    • A simple distribution-free approach to the max k-armed bandit problem
    • Lecture Notes in Computer Science
    • Streeter, M. & Smith, S. (2006), A simple distribution-free approach to the max k-armed bandit problem, in'Principles and Practice of Constraint Programming', Vol. 4204 of Lecture Notes in Computer Science, pp. 560-574.
    • (2006) Principles and Practice of Constraint Programming , vol.4204 , pp. 560-574
    • Streeter, M.1    Smith, S.2
  • 185
    • 84949574367 scopus 로고    scopus 로고
    • of Synthesis Lectures on Artificial Intelligence and Machine Learning, Morgan & Claypool Publishers
    • Szepesvári, C. (2010), Algorithms for reinforcement learning, Vol. 4 of Synthesis Lectures on Artificial Intelligence and Machine Learning, Morgan & Claypool Publishers.
    • (2010) Algorithms for reinforcement learning , vol.4
    • Szepesvári, C.1
  • 187
    • 0001046225 scopus 로고
    • Practical Issues in Temporal Difference Learning
    • Tesauro, G. (1992), 'Practical Issues in Temporal Difference Learning', Machine Learning 8(3-4), 257-277.
    • (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 257-277
    • Tesauro, G.1
  • 188
    • 2242471756 scopus 로고
    • The optimal choice of a subset of a population
    • Vanderbei, R. J. (1980), 'The optimal choice of a subset of a population', Mathematics of Operations Research 5(4), 481-186.
    • (1980) Mathematics of Operations Research , vol.5 , Issue.4 , pp. 481-186
    • Vanderbei, R.J.1
  • 191
    • 0000193326 scopus 로고
    • Optimum Character of the Sequential Probability Ratio Test
    • Wald, A. & Wolfowitz, J. (1948), 'Optimum Character of the Sequential Probability Ratio Test', The Annals of Mathematical Statistics 19, 326-339.
    • (1948) The Annals of Mathematical Statistics , vol.19 , pp. 326-339
    • Wald, A.1    Wolfowitz, J.2
  • 192
    • 0002327722 scopus 로고
    • On an index policy for restless bandits
    • Weber, R. R. & Weiss, G. (1990), 'On an index policy for restless bandits', J. Appl. Prob. 27(3), 637-648.
    • (1990) J. Appl. Prob. , vol.27 , Issue.3 , pp. 637-648
    • Weber, R.R.1    Weiss, G.2
  • 193
    • 34547236489 scopus 로고    scopus 로고
    • Better May be Worse: Some Monotonicity Results and Paradoxes in Discrete Choice Under Uncertainty
    • Weibull, J. W., Mattsson, L.-G. & Voorneveld, M. (2007), 'Better May be Worse: Some Monotonicity Results and Paradoxes in Discrete Choice Under Uncertainty', Theory and Decision 63(2), 121-151.
    • (2007) Theory and Decision , vol.63 , Issue.2 , pp. 121-151
    • Weibull, J.W.1    Mattsson, L.-G.2    Voorneveld, M.3
  • 195
    • 0000248624 scopus 로고
    • Multi-armed bandits and the Gittins index
    • Whittle, P. (1980), 'Multi-armed bandits and the Gittins index', Journal of the Royal Statistical Society B42(2), 143-149.
    • (1980) Journal of the Royal Statistical Society , vol.B42 , Issue.2 , pp. 143-149
    • Whittle, P.1
  • 196
    • 0001043843 scopus 로고
    • Restless bandits: Activity Allocation in a Changing World
    • Whittle, P. (1988), 'Restless bandits: Activity Allocation in a Changing World', J. Appl. Prob. 25(1988), 287-298.
    • (1988) J. Appl. Prob. , vol.25 , Issue.1988 , pp. 287-298
    • Whittle, P.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.