-
4
-
-
0037709910
-
The nonstochastic multiarmed bandit problem
-
Auer, P, Cesa-Bianchi, N, Freund, Y, and Schapire, R. The nonstochastic multiarmed bandit problem. SIAM J. on Comput, 32(1):48-77, 2002a.
-
(2002)
SIAM J. on Comput
, vol.32
, Issue.1
, pp. 48-77
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.4
-
5
-
-
0036568025
-
Finite-time analysis of the multiarmed bandit problem
-
Auer, P, Cesa-Bianchi, N, and Fischer, P. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47:235-256, 2002b.
-
(2002)
Machine Learning
, vol.47
, pp. 235-256
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
6
-
-
80053144086
-
Contextual bandit algorithms with supervised learning guarantees
-
Beygelzimer, A, Langford, J, Li, L, Reyzin, L, and Schapire, R. E. Contextual bandit algorithms with supervised learning guarantees. In Proceedings of International Conference on Artificial Intelligence and Statistics, 2011.
-
(2011)
Proceedings of International Conference on Artificial Intelligence and Statistics
-
-
Beygelzimer, A.1
Langford, J.2
Li, L.3
Reyzin, L.4
Schapire, R.E.5
-
7
-
-
84874045238
-
Regret analysis of stochastic and nonstochastic multi-armed bandit problems
-
Bubeck, S and Cesa-Bianchi, N. Regret analysis of stochastic and nonstochastic multi-armed bandit problems. Foundations and Trends in Machine Learning, 5(1): 1-122, 2012.
-
(2012)
Foundations and Trends in Machine Learning
, vol.5
, Issue.1
, pp. 1-122
-
-
Bubeck, S.1
Cesa-Bianchi, N.2
-
9
-
-
84897515317
-
Combinatorial multi-armed bandit: General framework, results and applications
-
Chen, W, Wang, Y, and Yuan, Y. Combinatorial multi-armed bandit: General framework, results and applications. In Proceedings of International Conference on Machine Learning, 2013a.
-
(2013)
Proceedings of International Conference on Machine Learning
-
-
Chen, W.1
Wang, Y.2
Yuan, Y.3
-
12
-
-
33745295134
-
Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems
-
Even-Dar, E, Mannor, S, and Mansour, Y. Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems. Journal of machine learning research, 7:1079-1105, 2006.
-
(2006)
Journal of Machine Learning Research
, vol.7
, pp. 1079-1105
-
-
Even-Dar, E.1
Mannor, S.2
Mansour, Y.3
-
13
-
-
84897504552
-
Adaptive task assignment for crowdsourced classification
-
Ho, C.-I, Jabbari, S, and Vaughan, J. W. Adaptive task assignment for crowdsourced classification. In ICML, 2013.
-
(2013)
ICML
-
-
Ho, C.-I.1
Jabbari, S.2
Vaughan, J.W.3
-
15
-
-
84867131498
-
PAC subset selection in stochastic multi-armed bandits
-
Kalyanakrishnan, S, Tewari, A, Auer, P, and Stone, P. PAC subset selection in stochastic multi-armed bandits. In Proceedings of International Conference on Machine Learning, 2012.
-
(2012)
Proceedings of International Conference on Machine Learning
-
-
Kalyanakrishnan, S.1
Tewari, A.2
Auer, P.3
Stone, P.4
-
16
-
-
84897539605
-
-
arXiv:1110.3564v3, 11
-
Karger, D. R, Oh, S, and Shah, D. Budget-optimal task allocation for reliable crowdsourcing systems. arXiv:1110.3564v3, 11 2012.
-
(2012)
Budget-optimal Task Allocation for Reliable Crowdsourcing Systems
-
-
Karger, D.R.1
Oh, S.2
Shah, D.3
-
18
-
-
0001640560
-
A procedure for selecting a subset of size m containing the I best of k independent normal populations, with applications to simulation
-
Koenig, L. W and Law, A. M. A procedure for selecting a subset of size m containing the I best of k independent normal populations, with applications to simulation. Communications in statistics. Simulation and computation, 14:719-734, 1985.
-
(1985)
Communications in Statistics. Simulation and Computation
, vol.14
, pp. 719-734
-
-
Koenig, L.W.1
Law, A.M.2
-
19
-
-
30044441333
-
The sample complexity of exploration in the multi-armed bandit problem
-
Mannor, S and Tsitsiklis, J. N. The sample complexity of exploration in the multi-armed bandit problem. Journal of Machine Learning Research, 5:623-648, 2004.
-
(2004)
Journal of Machine Learning Research
, vol.5
, pp. 623-648
-
-
Mannor, S.1
Tsitsiklis, J.N.2
-
20
-
-
77951954464
-
Learning from crowds
-
Raykar, V. C, Yu, S, Zhao, L. H, Valadez, G. H, Florin, C, Bogoni, L, and Moy, L. Learning from crowds. Journal of Machine Learning Research, 11:1297-1322, 2010.
-
(2010)
Journal of Machine Learning Research
, vol.11
, pp. 1297-1322
-
-
Raykar, V.C.1
Yu, S.2
Zhao, L.H.3
Valadez, G.H.4
Florin, C.5
Bogoni, L.6
Moy, L.7
-
22
-
-
80053360508
-
Cheap and fast - But is it good? Evaluating non-expert annotations for natural language tasks
-
Snow, R, Connor, B. O, Jurafsky, D, and Ng., A. Y. Cheap and fast - but is it good? evaluating non-expert annotations for natural language tasks. In EMNLP, 2008.
-
(2008)
EMNLP
-
-
Snow, R.1
Connor, B.O.2
Jurafsky, D.3
Ng., A.Y.4
-
23
-
-
0001395850
-
On the likelihood that one unknown probability exceeds another in view of the evidence of two samples
-
Thompson, W. On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. Biometrika, 25:285-294, 1933.
-
(1933)
Biometrika
, vol.25
, pp. 285-294
-
-
Thompson, W.1
-
24
-
-
84881239793
-
Learning from the wisdom of crowds by minimax conditional entropy
-
Zhou, D, Basu, S, Mao, Y, and Piatt, J. Learning from the wisdom of crowds by minimax conditional entropy. In NIPS. 2012.
-
(2012)
NIPS
-
-
Zhou, D.1
Basu, S.2
Mao, Y.3
Piatt, J.4
|