SCOPUS 정보 검색 플랫폼

Journal of Artificial Intelligence Research

Volumn 55, Issue , 2016, Pages 317-359

Adaptive contract design for crowdsourcing markets: Bandit algorithms for repeated principal-agent problems

(3) Ho, Chien Ju a Slivkins, Aleksandrs b Vaughan, Jennifer Wortman b

a Cornell University ^* (United States)

b MICROSOFT RESEARCH (United States)

Author keywords

[No Author keywords available]

Indexed keywords

BUDGET CONTROL; COMMERCE; COSTS;

CONTINGENT PAYMENTS; MULTI ARMED BANDIT; MULTI-ARMED BANDIT PROBLEM; POPULAR PLATFORM; PRICING PROBLEMS; PRINCIPAL AGENT MODELS; PRINCIPAL-AGENT PROBLEMS; STRATEGIC CHOICE;

CROWDSOURCING;

EID: 84958047672 PISSN: 10769757 EISSN: None Source Type: Journal
DOI: 10.1613/jair.4940 Document Type: Article

Times cited : (103)

References (60)

1
- 0345224411
- The continuum-armed bandit problem
- Agrawal, R. (1995). The continuum-armed bandit problem. SIAM J. Control and Opti-mization, 33 (6), 1926-1951.
- (1995) SIAM J. Control and Opti-mization , vol.33 , Issue.6 , pp. 1926-1951
- Agrawal, R.¹

2
- 84903162190
- Bandits with concave rewards and convex knapsacks
- Agrawal, S., & Devanur, N. R. (2014). Bandits with concave rewards and convex knapsacks. In 15th ACM Conf. on Economics and Computation (EC).
- (2014) 15th ACM Conf. on Economics and Computation EC
- Agrawal, S.¹ Devanur, N.R.²

3
- 84958071868
- Contextual bandits with global constraints and objective
- arXiv:1506.03374
- Agrawal, S., Devanur, N. R., & Li, L. (2015). Contextual bandits with global constraints and objective.. Technical report, arXiv:1506.03374.
- (2015) Technical Report
- Agrawal, S.¹ Devanur, N.R.² Li, L.³

4
- 84873311235
- Toward a classification of finite partial-monitoring games
- Antos, A., Bartók, G., Pál, D., & Szepesvári, C. (2013). Toward a classification of finite partial-monitoring games. Theor. Comput. Sci., 473, 77-99.
- (2013) Theor. Comput. Sci. , vol.473 , pp. 77-99
- Antos, A.¹ Bartók, G.² Pál, D.³ Szepesvári, C.⁴

5
- 78649420293
- Regret bounds and minimax policies under partial monitoring
- Audibert, J., & Bubeck, S. (2010). Regret Bounds and Minimax Policies under Partial Monitoring. J. of Machine Learning Research (JMLR), 11, 2785-2836.
- (2010) J. of Machine Learning Research (JMLR) , vol.11 , pp. 2785-2836
- Audibert, J.¹ Bubeck, S.²

6
- 0036568025
- Finite-time analysis of the multiarmed bandit problem
- Auer, P., Cesa-Bianchi, N., & Fischer, P. (2002). Finite-time analysis of the multiarmed bandit problem.. Machine Learning, 47 (2-3), 235-256.
- (2002) Machine Learning , vol.47 , Issue.2-3 , pp. 235-256
- Auer, P.¹ Cesa-Bianchi, N.² Fischer, P.³

7
- 38049040954
- Improved rates for the stochastic continuum-armed bandit problem
- Auer, P., Ortner, R., & Szepesvári, C. (2007). Improved Rates for the Stochastic Continuum-Armed Bandit Problem. In 20th Conf. on Learning Theory (COLT), pp. 454-468.
- (2007) 20th Conf. on Learning Theory (COLT) , pp. 454-468
- Auer, P.¹ Ortner, R.² Szepesvári, C.³

8
- 85037633842
- Dynamic pricing with limited supply
- Babaioff, M., Dughmi, S., Kleinberg, R. D., & Slivkins, A. (2015). Dynamic pricing with limited supply. ACM Trans. on Economics and Computation, 3 (1), 4.
- (2015) ACM Trans. on Economics and Computation , vol.3 , Issue.1 , pp. 4
- Babaioff, M.¹ Dughmi, S.² Kleinberg, R.D.³ Slivkins, A.⁴

9
- 33748714319
- Combinatorial agency
- Babaioff, M., Feldman, M., & Nisan, N. (2006). Combinatorial agency. In 7th ACM Conf. on Electronic Commerce (EC).
- (2006) 7th ACM Conf. on Electronic Commerce EC
- Babaioff, M.¹ Feldman, M.² Nisan, N.³

10
- 84863507274
- Learning on a budget: Posted price mechanisms for online procurement
- Badanidiyuru, A., Kleinberg, R., & Singer, Y. (2012). Learning on a budget: posted price mechanisms for online procurement. In 13th ACM Conf. on Electronic Commerce (EC), pp. 128-145.
- (2012) 13th ACM Conf. on Electronic Commerce (EC) , pp. 128-145
- Badanidiyuru, A.¹ Kleinberg, R.² Singer, Y.³

11
- 84893451322
- Bandits with knapsacks
- Badanidiyuru, A., Kleinberg, R., & Slivkins, A. (2013). Bandits with knapsacks. In 54th IEEE Symp. on Foundations of Computer Science (FOCS).
- (2013) 54th IEEE Symp. on Foundations of Computer Science FOCS
- Badanidiyuru, A.¹ Kleinberg, R.² Slivkins, A.³

12
- 84939636813
- Resourceful contextual bandits
- Badanidiyuru, A., Langford, J., & Slivkins, A. (2014). Resourceful contextual bandits. In 27th Conf. on Learning Theory (COLT).
- (2014) 27th Conf. on Learning Theory COLT
- Badanidiyuru, A.¹ Langford, J.² Slivkins, A.³

13
- 84908695406
- Partial monitoring-classification, regret bounds, and algorithms
- Bartók, G., Foster, D. P., Pál, D., Rakhlin, A., & Szepesvári, C. (2014). Partial monitoring-classification, regret bounds, and algorithms. Math. Oper. Res., 39 (4), 967-997.
- (2014) Math. Oper. Res. , vol.39 , Issue.4 , pp. 967-997
- Bartók, G.¹ Foster, D.P.² Pál, D.³ Rakhlin, A.⁴ Szepesvári, C.⁵

14
- 70350251174
- Dynamic pricing without knowing the demand function: Risk bounds and near-optimal algorithms
- Besbes, O., & Zeevi, A. (2009). Dynamic pricing without knowing the demand function: Risk bounds and near-optimal algorithms. Operations Research, 57, 1407-1420.
- (2009) Operations Research , vol.57 , pp. 1407-1420
- Besbes, O.¹ Zeevi, A.²

15
- 84871887590
- Blind network revenue management
- Besbes, O., & Zeevi, A. J. (2012). Blind network revenue management. Operations Research, 60 (6), 1537-1550.
- (2012) Operations Research , vol.60 , Issue.6 , pp. 1537-1550
- Besbes, O.¹ Zeevi, A.J.²

16
- 0037740018
- Online learning in online auctions
- Blum, A., Kumar, V., Rudra, A., & Wu, F. (2003). Online learning in online auctions. In 14th ACM-SIAM Symp. on Discrete Algorithms (SODA), pp. 202-204.
- (2003) 14th ACM-SIAM Symp. on Discrete Algorithms (SODA) , pp. 202-204
- Blum, A.¹ Kumar, V.² Rudra, A.³ Wu, F.⁴

17
- 84958071870
- Working paper
- Bohren, J. A., & Kravitz, T. (2013). Incentives for spot market labor when output is unverifiable. Working paper.
- (2013) Incentives for Spot Market Labor When Output Is Unverifiable
- Bohren, J.A.¹ Kravitz, T.²

18
- 84874045238
- Regret analysis of stochastic and nonstochastic multi-armed bandit problems
- Bubeck, S., & Cesa-Bianchi, N. (2012). Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems. Foundations and Trends in Machine Learning, 5 (1), 1-122.
- (2012) Foundations and Trends in Machine Learning , vol.5 , Issue.1 , pp. 1-122
- Bubeck, S.¹ Cesa-Bianchi, N.²

19
- 84860634388
- Online optimization in XArmed bandits
- Bubeck, S., Munos, R., Stoltz, G., & Szepesvari, C. (2011a). Online Optimization in XArmed Bandits. J. of Machine Learning Research (JMLR), 12, 1587-1627.
- (2011) J. of Machine Learning Research (JMLR) , vol.12 , pp. 1587-1627
- Bubeck, S.¹ Munos, R.² Stoltz, G.³ Szepesvari, C.⁴

20
- 80054092590
- Lipschitz bandits without the lipschitz constant
- Bubeck, S., Stoltz, G., & Yu, J. Y. (2011b). Lipschitz bandits without the lipschitz constant. In 22nd Intl. Conf. on Algorithmic Learning Theory (ALT), pp. 144-158.
- (2011) 22nd Intl. Conf. on Algorithmic Learning Theory (ALT) , pp. 144-158
- Bubeck, S.¹ Stoltz, G.² Yu, J.Y.³

21
- 84958071871
- Adaptive-treed bandits
- 1302.2489, arxiv.org
- Bull, A. D. (2013). Adaptive-treed bandits. Tech. rep. 1302.2489, arxiv.org.
- (2013) Tech. Rep.
- Bull, A.D.¹

22
- 84926078662
- Cambridge Univ. Press
- Cesa-Bianchi, N., & Lugosi, G. (2006). Prediction, learning, and games. Cambridge Univ. Press.
- (2006) Prediction, Learning, and Games
- Cesa-Bianchi, N.¹ Lugosi, G.²

23
- 34250720060
- Online learning algorithms for online principal-agent problems (and selling goods online)
- Conitzer, V., & Garera, N. (2006). Online learning algorithms for online principal-agent problems (and selling goods online). In International Conference on Machine Learning (ICML).
- (2006) International Conference on Machine Learning ICML
- Conitzer, V.¹ Garera, N.²

24
- 84927660607
- Dynamic pricing and learning: Historical origins, current research, and new directions
- Forthcoming
- den Boer, A. V. (2015). Dynamic pricing and learning: Historical origins, current research, and new directions. Surveys in Operations Research and Management Science. Forthcoming.
- (2015) Surveys in Operations Research and Management Science
- Den Boer, A.V.¹

25
- 80053154335
- Efficient optimal leanring for contextual bandits
- Dudik, M., Hsu, D., Kale, S., Karampatziakis, N., Langford, J., Reyzin, L., & Zhang, T. (2011). Efficient optimal leanring for contextual bandits. In 27th Conf. on Uncertainty in Artificial Intelligence (UAI).
- (2011) 27th Conf. on Uncertainty in Artificial Intelligence UAI
- Dudik, M.¹ Hsu, D.² Kale, S.³ Karampatziakis, N.⁴ Langford, J.⁵ Reyzin, L.⁶ Zhang, T.⁷

26
- 84898437076
- The KL-UCB algorithm for bounded stochastic bandits and beyond
- Garivier, A., & Cappé, O. (2011). The KL-UCB Algorithm for Bounded Stochastic Bandits and Beyond. In 24th Conf. on Learning Theory (COLT).
- (2011) 24th Conf. on Learning Theory COLT
- Garivier, A.¹ Cappé, O.²

27
- 79959648882
- A game-theoretic analysis of rank-order mechanisms for user-generated content
- Ghosh, A., & Hummel, P. (2011). A game-theoretic analysis of rank-order mechanisms for user-generated content. In 12th ACM Conf. on Electronic Commerce (EC).
- (2011) 12th ACM Conf. on Electronic Commerce EC
- Ghosh, A.¹ Hummel, P.²

28
- 84873376815
- Learning and incentives in user-generated content: Multiarmed bandits with endogenous arms
- Ghosh, A., & Hummel, P. (2013). Learning and incentives in user-generated content: Multiarmed bandits with endogenous arms. In Proc. 4th Conference on Innovations in Theoretical Computer Science (ITCS).
- (2013) Proc. 4th Conference on Innovations in Theoretical Computer Science ITCS
- Ghosh, A.¹ Hummel, P.²

29
- 79960249300
- Incentivizing high-quality user-generated content
- Ghosh, A., & McAfee, P. (2011). Incentivizing high-quality user-generated content. In 20th Intl. World Wide Web Conf. (WWW).
- (2011) 20th Intl. World Wide Web Conf. WWW
- Ghosh, A.¹ McAfee, P.²

30
- 84891584370
- John Wiley & Sons
- Gittins, J., Glazebrook, K., & Weber, R. (2011). Multi-Armed Bandit Allocation Indices. John Wiley & Sons.
- (2011) Multi-Armed Bandit Allocation Indices
- Gittins, J.¹ Glazebrook, K.² Weber, R.³

31
- 84856146589
- You're hired! An examination of crowdsourcing incentive models in human resource tasks
- Harris, C. G. (2011). You're hired! an examination of crowdsourcing incentive models in human resource tasks. In CSDM.
- (2011) CSDM
- Harris, C.G.¹

32
- 84968835134
- Incentivizing high quality crowdwork
- Ho, C., Slivkins, A., Suri, S., & Vaughan, J. W. (2015). Incentivizing high quality crowdwork. In 24th Intl. World Wide Web Conf. (WWW).
- (2015) 24th Intl. World Wide Web Conf. WWW
- Ho, C.¹ Slivkins, A.² Suri, S.³ Vaughan, J.W.⁴

33
- 84923881006
- Towards social norm design for crowdsourcing markets
- Ho, C.-J., Zhang, Y., Vaughan, J. W., & van der Schaar, M. (2012). Towards social norm design for crowdsourcing markets. In HCOMP.
- (2012) HCOMP
- Ho, C.-J.¹ Zhang, Y.² Vaughan, J.W.³ Van Der Schaar, M.⁴

34
- 77954742475
- The labor economics of paid crowdsourcing
- Horton, J. J., & Chilton, L. B. (2010). The labor economics of paid crowdsourcing. In 11th ACM Conf. on Electronic Commerce (EC).
- (2010) 11th ACM Conf. on Electronic Commerce EC
- Horton, J.J.¹ Chilton, L.B.²

35
- 84903198050
- Designing incentives for online question-and-answer forums
- Jain, S., Chen, Y., & Parkes, D. (2012). Designing incentives for online question-and-answer forums. Games and Economic Behavior.
- (2012) Games and Economic Behavior
- Jain, S.¹ Chen, Y.² Parkes, D.³

36
- 84898981061
- Nearly tight bounds for the continuum-armed bandit problem
- Kleinberg, R. (2004). Nearly tight bounds for the continuum-armed bandit problem. In 18th Advances in Neural Information Processing Systems (NIPS).
- (2004) 18th Advances in Neural Information Processing Systems NIPS
- Kleinberg, R.¹

37
- 0345412655
- The value of knowing a demand curve: Bounds on regret for online posted-price auctions
- Kleinberg, R., & Leighton, T. (2003). The value of knowing a demand curve: Bounds on regret for online posted-price auctions.. In 44th IEEE Symp. on Foundations of Computer Science (FOCS), pp. 594-605.
- (2003) 44th IEEE Symp. on Foundations of Computer Science (FOCS) , pp. 594-605
- Kleinberg, R.¹ Leighton, T.²

38
- 57049185311
- Multi-armed bandits in metric spaces
- Kleinberg, R., Slivkins, A., & Upfal, E. (2008). Multi-armed bandits in metric spaces. In 40th ACM Symp. on Theory of Computing (STOC), pp. 681-690.
- (2008) 40th ACM Symp. on Theory of Computing (STOC) , pp. 681-690
- Kleinberg, R.¹ Slivkins, A.² Upfal, E.³

39
- 0345412655
- The value of knowing a demand curve: Bounds on regret for online posted-price auctions
- Kleinberg, R. D., & Leighton, F. T. (2003). The value of knowing a demand curve: Bounds on regret for online posted-price auctions. In IEEE Symp. on Foundations of Computer Science (FOCS).
- (2003) IEEE Symp. on Foundations of Computer Science FOCS
- Kleinberg, R.D.¹ Leighton, F.T.²

40
- 33750293964
- Bandit based monte-carlo planning
- Kocsis, L., & Szepesvari, C. (2006). Bandit Based Monte-Carlo Planning. In 17th European Conf. on Machine Learning (ECML), pp. 282-293.
- (2006) 17th European Conf. on Machine Learning (ECML) , pp. 282-293
- Kocsis, L.¹ Szepesvari, C.²

41
- 84923993122
- Princeton University Press
- Laffont, J.-J., & Martimort, D. (2002). The Theory of Incentives: The Principal-Agent Model. Princeton University Press.
- (2002) The Theory of Incentives: The Principal-Agent Model
- Laffont, J.-J.¹ Martimort, D.²

42
- 0002899547
- Asymptotically efficient adaptive allocation rules
- Lai, T. L., & Robbins, H. (1985). Asymptotically efficient Adaptive Allocation Rules. Advances in Applied Mathematics, 6, 4-22.
- (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
- Lai, T.L.¹ Robbins, H.²

43
- 0036303754
- Optimal linear contracts with heterogeneous agents
- Levy, A., & Vukina, T. (2002). Optimal linear contracts with heterogeneous agents. In European Review of Agricultural Economics.
- (2002) European Review of Agricultural Economics
- Levy, A.¹ Vukina, T.²

44
- 70449657723
- Financial incentives and the "performance of crowds"
- Mason, W., & Watts, D. (2009). Financial incentives and the "performance of crowds". In HCOMP.
- (2009) HCOMP
- Mason, W.¹ Watts, D.²

45
- 84892372751
- Working Paper
- Misra, S., Nair, H. S., & Daljord, O. (2012). Homogenous contracts for heterogeneous agents: Aligning salesforce composition and compensation. Working Paper.
- (2012) Homogenous Contracts for Heterogeneous Agents: Aligning Salesforce Composition and Compensation
- Misra, S.¹ Nair, H.S.² Daljord, O.³

46
- 70349275222
- Bandit algorithms for tree search
- Munos, R., & Coquelin, P.-A. (2007). Bandit algorithms for tree search. In 23rd Conf. on Uncertainty in Artificial Intelligence (UAI).
- (2007) 23rd Conf. on Uncertainty in Artificial Intelligence UAI
- Munos, R.¹ Coquelin, P.-A.²

47
- 70049106076
- Bandits for taxonomies: A model-based approach
- Pandey, S., Agarwal, D., Chakrabarti, D., & Josifovski, V. (2007). Bandits for Taxonomies: A Model-based Approach. In SIAM Intl. Conf. on Data Mining (SDM).
- (2007) SIAM Intl. Conf. on Data Mining SDM
- Pandey, S.¹ Agarwal, D.² Chakrabarti, D.³ Josifovski, V.⁴

48
- 56449088596
- Learning diverse rankings with multiarmed bandits
- Radlinski, F., Kleinberg, R., & Joachims, T. (2008). Learning diverse rankings with multiarmed bandits. In 25th Intl. Conf. on Machine Learning (ICML), pp. 784-791.
- (2008) 25th Intl. Conf. on Machine Learning (ICML) , pp. 784-791
- Radlinski, F.¹ Kleinberg, R.² Joachims, T.³

49
- 45249101176
- A continuous-time version of the principal-agent problem
- Sannikov, Y. (2008). A continuous-time version of the principal-agent problem. In The Review of Economics Studies.
- (2008) The Review of Economics Studies
- Sannikov, Y.¹

50
- 84958071880
- Contracts: The theory of dynamic principal-agent relationships and the continuous-time approach
- Sannikov, Y. (2012). Contracts: The theory of dynamic principal-agent relationships and the continuous-time approach. In 10th World Congress of the Econometric Society.
- (2012) 10th World Congress of the Econometric Society
- Sannikov, Y.¹

51
- 84893063797
- Pricing mechanisms in crowdsourcing markets
- Singer, Y., & Mittal, M. (2013). Pricing mechanisms in crowdsourcing markets. In 22nd Intl. World Wide Web Conf. (WWW).
- (2013) 22nd Intl. World Wide Web Conf. WWW
- Singer, Y.¹ Mittal, M.²

52
- 84893043989
- Truthful incentives in crowdsourcing tasks using regret minimization mechanisms
- Singla, A., & Krause, A. (2013). Truthful incentives in crowdsourcing tasks using regret minimization mechanisms. In 22nd Intl. World Wide Web Conf. (WWW).
- (2013) 22nd Intl. World Wide Web Conf. WWW
- Singla, A.¹ Krause, A.²

53
- 85162320142
- Multi-armed bandits on implicit metric spaces
- Slivkins, A. (2011). Multi-armed bandits on implicit metric spaces. In 25th Advances in Neural Information Processing Systems (NIPS).
- (2011) 25th Advances in Neural Information Processing Systems NIPS
- Slivkins, A.¹

54
- 84907350147
- Contextual bandits with similarity information
- Preliminary version in COLT
- Slivkins, A. (2014). Contextual bandits with similarity information. J. of Machine Learning Research (JMLR), 15 (1), 2533-2568. Preliminary version in COLT 2011.
- (2011) J. of Machine Learning Research (JMLR) , vol.15 , Issue.1 , pp. 2533-2568
- Slivkins, A.¹

55
- 84875138796
- Ranked bandits in metric spaces: Learning optimally diverse rankings over large document collections
- Preliminary version in 27th ICML
- Slivkins, A., Radlinski, F., & Gollapudi, S. (2013). Ranked bandits in metric spaces: Learning optimally diverse rankings over large document collections. J. of Machine Learning Research (JMLR), 14 (Feb), 399-436. Preliminary version in 27th ICML, 2010.
- (2010) J. of Machine Learning Research (JMLR) , vol.14 , Issue.FEB , pp. 399-436
- Slivkins, A.¹ Radlinski, F.² Gollapudi, S.³

56
- 0001395850
- On the likelihood that one unknown probability exceeds another in view of the evidence of two samples
- Thompson, W. R. (1933). On the likelihood that one unknown probability exceeds another in view of the evidence of two samples.. Biometrika, 25 (3-4), 285-294.
- (1933) Biometrika , vol.25 , Issue.3-4 , pp. 285-294
- Thompson, W.R.¹

57
- 84899561628
- Close the gaps: A learning-while-doing algorithm for single-product revenue management problems
- Wang, Z., Deng, S., & Ye, Y. (2014). Close the gaps: A learning-while-doing algorithm for single-product revenue management problems. Operations Research, 62 (2), 318-331.
- (2014) Operations Research , vol.62 , Issue.2 , pp. 318-331
- Wang, Z.¹ Deng, S.² Ye, Y.³

58
- 24544431704
- Working Paper
- Williams, N. (2009). On dynamic principal-agent problems in continuous time. Working Paper.
- (2009) On Dynamic Principal-agent Problems in Continuous Time
- Williams, N.¹

59
- 84893402805
- The effects of performance-contingent financial incentives in online labor markets
- Yin, M., Chen, Y., & Sun, Y.-A. (2013). The effects of performance-contingent financial incentives in online labor markets. In AAAI.
- (2013) AAAI
- Yin, M.¹ Chen, Y.² Sun, Y.-A.³

60
- 84861603528
- Reputation-based incentive protocols in crowdsourcing applications
- Zhang, Y., & van der Schaar, M. (2012). Reputation-based incentive protocols in crowdsourcing applications. In Infocom.
- (2012) Infocom
- Zhang, Y.¹ Van Der Schaar, M.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.