-
1
-
-
0031287072
-
An experimental analysis of the bandit problem
-
Banks, J., Olson, M., & Porter, D. (2013). An experimental analysis of the bandit problem. Economic Theory, 10, 55-77.
-
(2013)
Economic Theory
, vol.10
, pp. 55-77
-
-
Banks, J.1
Olson, M.2
Porter, D.3
-
2
-
-
34250348767
-
Should I stay or should I go? Exploration versus exploitation
-
Cohen, J. D., McClure, S. M., & Yu, A. J. (2007). Should I stay or should I go? Exploration versus exploitation. Philosophical Transactions of the Royal Society B: Biological Sciences, 362, 933-942.
-
(2007)
Philosophical Transactions of the Royal Society B: Biological Sciences
, vol.362
, pp. 933-942
-
-
Cohen, J. D.1
McClure, S. M.2
Yu, A. J.3
-
3
-
-
33745223257
-
Cortical substrates for exploratory decisions in humans
-
Daw, N. D., O'Doherty, J. P., Dayan, P., Seymour, B., & Dolan, R. J. (2006). Cortical substrates for exploratory decisions in humans. Nature, 441, 876-879.
-
(2006)
Nature
, vol.441
, pp. 876-879
-
-
Daw, N. D.1
O'Doherty, J. P.2
Dayan, P.3
Seymour, B.4
Dolan, R. J.5
-
4
-
-
55549135706
-
A knowledge-gradient policy for sequential information collection
-
Frazier, P., Powell, W., & Dayanik, S. (2008). A knowledge-gradient policy for sequential information collection. SIAM Journal on Control and Optimization, 47, 2410-2439.
-
(2008)
SIAM Journal on Control and Optimization
, vol.47
, pp. 2410-2439
-
-
Frazier, P.1
Powell, W.2
Dayanik, S.3
-
5
-
-
0004012196
-
-
(2 ed). Boca Raton, FL: Chapman & Hall/CRC
-
Gelman, A., Carlin, J. B., Stern, H. S., & Rubin, D. B. (2004). Bayesian data analysis (2 ed.). Boca Raton, FL: Chapman & Hall/CRC.
-
(2004)
Bayesian data analysis
-
-
Gelman, A.1
Carlin, J. B.2
Stern, H. S.3
Rubin, D. B.4
-
7
-
-
0029679044
-
Reinforcement learning: A survey
-
Kaebling, L. P., Littman, M. L., & Moore, A. W. (1996). Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4, 237-285.
-
(1996)
Journal of Artificial Intelligence Research
, vol.4
, pp. 237-285
-
-
Kaebling, L. P.1
Littman, M. L.2
Moore, A. W.3
-
8
-
-
79952189388
-
Psychological models of human and optimal performance in bandit problems
-
Lee, M. D., Zhang, S., Munro, M., & Steyvers, M. (2011). Psychological models of human and optimal performance in bandit problems. Cognitive Systems Research, 12, 164-174.
-
(2011)
Cognitive Systems Research
, vol.12
, pp. 164-174
-
-
Lee, M. D.1
Zhang, S.2
Munro, M.3
Steyvers, M.4
-
10
-
-
84966203785
-
Some aspects of the sequential design of experiments
-
Robbins, H. (1952). Some aspects of the sequential design of experiments. Bulletin of the American Mathematical Society, 58, 527-535.
-
(1952)
Bulletin of the American Mathematical Society
, vol.58
, pp. 527-535
-
-
Robbins, H.1
-
11
-
-
84859621831
-
The knowledge gradient algorithm for a general class of online learning problems
-
Ryzhov, I., Powell, W., & Frazier, P. (2012). The knowledge gradient algorithm for a general class of online learning problems. Operations Research, 60, 180-195.
-
(2012)
Operations Research
, vol.60
, pp. 180-195
-
-
Ryzhov, I.1
Powell, W.2
Frazier, P.3
-
12
-
-
67349268975
-
A bayesian analysis of human decision-making on bandit problems
-
Steyvers, M., Lee, M. D., & Wagenmakers, E.-J. (2009). A bayesian analysis of human decision-making on bandit problems. Journal of Mathematical Psychology, 53, 168-179.
-
(2009)
Journal of Mathematical Psychology
, vol.53
, pp. 168-179
-
-
Steyvers, M.1
Lee, M. D.2
Wagenmakers, E.-J.3
-
14
-
-
84858789760
-
Sequential effects: Superstition or rational behavior?
-
Cambridge, MA.: MIT Press
-
Yu, A. J., & Cohen, J. D. (2009). Sequential effects: Superstition or rational behavior? In Advances in neural information processing systems (Vol. 21, p. 1873-1880). Cambridge, MA.: MIT Press.
-
(2009)
Advances in neural information processing systems
, vol.21
, pp. 1873-1880
-
-
Yu, A. J.1
Cohen, J. D.2
|