SCOPUS Information Retrieval Platform

Cooperative Minds: Social Interaction and Group Dynamics - Proceedings of the 35th Annual Meeting of the Cognitive Science Society, CogSci 2013

Volume , Issue , 2013, Pages 1647-1652

Cheap but Clever: Human Active Learning in a Bandit Setting

(2) Zhang, Shunan a; Yu, Angela J. a

a UNIVERSITY OF CALIFORNIA (United States)

Author keywords

Bandit problems; human active learning; human decision making; knowledge gradient

Indexed keywords

APPROXIMATION ALGORITHMS; ARTIFICIAL INTELLIGENCE; BEHAVIORAL RESEARCH; OPTIMIZATION;

ACTIVE LEARNING; BANDIT PROBLEMS; DECISION POLICY; HEURISTIC POLICIES; HUMAN ACTIVE LEARNING; HUMAN DECISION-MAKING; KNOWLEDGE GRADIENT; KNOWN ENVIRONMENTS; LONG-TERM GOALS; OPTIMAL POLICIES;

DECISION MAKING;

EID: 84898947296 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (10)

References (15)

1
- 0031287072
- An experimental analysis of the bandit problem
- Banks, J., Olson, M., & Porter, D. (1997). An experimental analysis of the bandit problem. Economic Theory, 10, 55-77.
- (1997) Economic Theory , vol.10 , pp. 55-77
- Banks, J.¹ Olson, M.² Porter, D.³

2
- 34250348767
- Should I stay or should I go? Exploration versus exploitation
- Cohen, J. D., McClure, S. M., & Yu, A. J. (2007). Should I stay or should I go? Exploration versus exploitation. Philosophical Transactions of the Royal Society B: Biological Sciences, 362, 933-942.
- (2007) Philosophical Transactions of the Royal Society B: Biological Sciences , vol.362 , pp. 933-942
- Cohen, J. D.¹ McClure, S. M.² Yu, A. J.³

3
- 33745223257
- Cortical substrates for exploratory decisions in humans
- Daw, N. D., O'Doherty, J. P., Dayan, P., Seymour, B., & Dolan, R. J. (2006). Cortical substrates for exploratory decisions in humans. Nature, 441, 876-879.
- (2006) Nature , vol.441 , pp. 876-879
- Daw, N. D.¹ O'Doherty, J. P.² Dayan, P.³ Seymour, B.⁴ Dolan, R. J.⁵

4
- 55549135706
- A knowledge-gradient policy for sequential information collection
- Frazier, P., Powell, W., & Dayanik, S. (2008). A knowledge-gradient policy for sequential information collection. SIAM Journal on Control and Optimization, 47, 2410-2439.
- (2008) SIAM Journal on Control and Optimization , vol.47 , pp. 2410-2439
- Frazier, P.¹ Powell, W.² Dayanik, S.³

5
- 0004012196
- (2nd ed.). Boca Raton, FL: Chapman & Hall/CRC
- Gelman, A., Carlin, J. B., Stern, H. S., & Rubin, D. B. (2004). Bayesian data analysis (2nd ed.). Boca Raton, FL: Chapman & Hall/CRC.
- (2004) Bayesian data analysis
- Gelman, A.¹ Carlin, J. B.² Stern, H. S.³ Rubin, D. B.⁴

6
- 0000169010
- Bandit processes and dynamic allocation indices
- Gittins, J. C. (1979). Bandit processes and dynamic allocation indices. Journal of the Royal Statistical Society, Series B, 41, 148-177.
- (1979) Journal of the Royal Statistical Society, Series B , vol.41 , pp. 148-177
- Gittins, J. C.¹

7
- 0029679044
- Reinforcement learning: A survey
- Kaelbling, L. P., Littman, M. L., & Moore, A. W. (1996). Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4, 237-285.
- (1996) Journal of Artificial Intelligence Research , vol.4 , pp. 237-285
- Kaelbling, L. P.¹ Littman, M. L.² Moore, A. W.³

8
- 79952189388
- Psychological models of human and optimal performance in bandit problems
- Lee, M. D., Zhang, S., Munro, M., & Steyvers, M. (2011). Psychological models of human and optimal performance in bandit problems. Cognitive Systems Research, 12, 164-174.
- (2011) Cognitive Systems Research , vol.12 , pp. 164-174
- Lee, M. D.¹ Zhang, S.² Munro, M.³ Steyvers, M.⁴

9
- 84871543700
- (1st ed.). Wiley
- Powell, W., & Ryzhov, I. (2012). Optimal learning (1st ed.). Wiley.
- (2012) Optimal learning
- Powell, W.¹ Ryzhov, I.²

10
- 84966203785
- Some aspects of the sequential design of experiments
- Robbins, H. (1952). Some aspects of the sequential design of experiments. Bulletin of the American Mathematical Society, 58, 527-535.
- (1952) Bulletin of the American Mathematical Society , vol.58 , pp. 527-535
- Robbins, H.¹

11
- 84859621831
- The knowledge gradient algorithm for a general class of online learning problems
- Ryzhov, I., Powell, W., & Frazier, P. (2012). The knowledge gradient algorithm for a general class of online learning problems. Operations Research, 60, 180-195.
- (2012) Operations Research , vol.60 , pp. 180-195
- Ryzhov, I.¹ Powell, W.² Frazier, P.³

12
- 67349268975
- A Bayesian analysis of human decision-making on bandit problems
- Steyvers, M., Lee, M. D., & Wagenmakers, E.-J. (2009). A Bayesian analysis of human decision-making on bandit problems. Journal of Mathematical Psychology, 53, 168-179.
- (2009) Journal of Mathematical Psychology , vol.53 , pp. 168-179
- Steyvers, M.¹ Lee, M. D.² Wagenmakers, E.-J.³

13
- 0004102479
- Cambridge, MA: MIT Press
- Sutton, R. S., & Barto, A. G. (1998). Reinforcement learning: An introduction. Cambridge, MA: MIT Press.
- (1998) Reinforcement learning: An introduction
- Sutton, R. S.¹ Barto, A. G.²

14
- 84858789760
- Sequential effects: Superstition or rational behavior?
- Cambridge, MA: MIT Press
- Yu, A. J., & Cohen, J. D. (2009). Sequential effects: Superstition or rational behavior? In Advances in neural information processing systems (Vol. 21, pp. 1873-1880). Cambridge, MA: MIT Press.
- (2009) Advances in neural information processing systems , vol.21 , pp. 1873-1880
- Yu, A. J.¹ Cohen, J. D.²

15
- 85139491190
- Cognitive models and the wisdom of crowds
- N. Taatgen & H. van Rijn (Eds.), Austin, TX
- Zhang, S., & Lee, M. D. (2010). Cognitive models and the wisdom of crowds. In N. Taatgen & H. van Rijn (Eds.), Proceedings of the 32nd annual conference of the cognitive science society. Austin, TX.
- (2010) Proceedings of the 32nd annual conference of the cognitive science society
- Zhang, S.¹ Lee, M. D.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.