메뉴 건너뛰기




Volumn 55, Issue , 2016, Pages 317-359

Adaptive contract design for crowdsourcing markets: Bandit algorithms for repeated principal-agent problems

Author keywords

[No Author keywords available]

Indexed keywords

BUDGET CONTROL; COMMERCE; COSTS;

EID: 84958047672     PISSN: 10769757     EISSN: None     Source Type: Journal    
DOI: 10.1613/jair.4940     Document Type: Article
Times cited : (103)

References (60)
  • 1
    • 0345224411 scopus 로고
    • The continuum-armed bandit problem
    • Agrawal, R. (1995). The continuum-armed bandit problem. SIAM J. Control and Opti-mization, 33 (6), 1926-1951.
    • (1995) SIAM J. Control and Opti-mization , vol.33 , Issue.6 , pp. 1926-1951
    • Agrawal, R.1
  • 3
    • 84958071868 scopus 로고    scopus 로고
    • Contextual bandits with global constraints and objective
    • arXiv:1506.03374
    • Agrawal, S., Devanur, N. R., & Li, L. (2015). Contextual bandits with global constraints and objective.. Technical report, arXiv:1506.03374.
    • (2015) Technical Report
    • Agrawal, S.1    Devanur, N.R.2    Li, L.3
  • 4
    • 84873311235 scopus 로고    scopus 로고
    • Toward a classification of finite partial-monitoring games
    • Antos, A., Bartók, G., Pál, D., & Szepesvári, C. (2013). Toward a classification of finite partial-monitoring games. Theor. Comput. Sci., 473, 77-99.
    • (2013) Theor. Comput. Sci. , vol.473 , pp. 77-99
    • Antos, A.1    Bartók, G.2    Pál, D.3    Szepesvári, C.4
  • 5
    • 78649420293 scopus 로고    scopus 로고
    • Regret bounds and minimax policies under partial monitoring
    • Audibert, J., & Bubeck, S. (2010). Regret Bounds and Minimax Policies under Partial Monitoring. J. of Machine Learning Research (JMLR), 11, 2785-2836.
    • (2010) J. of Machine Learning Research (JMLR) , vol.11 , pp. 2785-2836
    • Audibert, J.1    Bubeck, S.2
  • 6
    • 0036568025 scopus 로고    scopus 로고
    • Finite-time analysis of the multiarmed bandit problem
    • Auer, P., Cesa-Bianchi, N., & Fischer, P. (2002). Finite-time analysis of the multiarmed bandit problem.. Machine Learning, 47 (2-3), 235-256.
    • (2002) Machine Learning , vol.47 , Issue.2-3 , pp. 235-256
    • Auer, P.1    Cesa-Bianchi, N.2    Fischer, P.3
  • 13
    • 84908695406 scopus 로고    scopus 로고
    • Partial monitoring-classification, regret bounds, and algorithms
    • Bartók, G., Foster, D. P., Pál, D., Rakhlin, A., & Szepesvári, C. (2014). Partial monitoring-classification, regret bounds, and algorithms. Math. Oper. Res., 39 (4), 967-997.
    • (2014) Math. Oper. Res. , vol.39 , Issue.4 , pp. 967-997
    • Bartók, G.1    Foster, D.P.2    Pál, D.3    Rakhlin, A.4    Szepesvári, C.5
  • 14
    • 70350251174 scopus 로고    scopus 로고
    • Dynamic pricing without knowing the demand function: Risk bounds and near-optimal algorithms
    • Besbes, O., & Zeevi, A. (2009). Dynamic pricing without knowing the demand function: Risk bounds and near-optimal algorithms. Operations Research, 57, 1407-1420.
    • (2009) Operations Research , vol.57 , pp. 1407-1420
    • Besbes, O.1    Zeevi, A.2
  • 15
    • 84871887590 scopus 로고    scopus 로고
    • Blind network revenue management
    • Besbes, O., & Zeevi, A. J. (2012). Blind network revenue management. Operations Research, 60 (6), 1537-1550.
    • (2012) Operations Research , vol.60 , Issue.6 , pp. 1537-1550
    • Besbes, O.1    Zeevi, A.J.2
  • 18
    • 84874045238 scopus 로고    scopus 로고
    • Regret analysis of stochastic and nonstochastic multi-armed bandit problems
    • Bubeck, S., & Cesa-Bianchi, N. (2012). Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems. Foundations and Trends in Machine Learning, 5 (1), 1-122.
    • (2012) Foundations and Trends in Machine Learning , vol.5 , Issue.1 , pp. 1-122
    • Bubeck, S.1    Cesa-Bianchi, N.2
  • 21
    • 84958071871 scopus 로고    scopus 로고
    • Adaptive-treed bandits
    • 1302.2489, arxiv.org
    • Bull, A. D. (2013). Adaptive-treed bandits. Tech. rep. 1302.2489, arxiv.org.
    • (2013) Tech. Rep.
    • Bull, A.D.1
  • 23
    • 34250720060 scopus 로고    scopus 로고
    • Online learning algorithms for online principal-agent problems (and selling goods online)
    • Conitzer, V., & Garera, N. (2006). Online learning algorithms for online principal-agent problems (and selling goods online). In International Conference on Machine Learning (ICML).
    • (2006) International Conference on Machine Learning ICML
    • Conitzer, V.1    Garera, N.2
  • 24
    • 84927660607 scopus 로고    scopus 로고
    • Dynamic pricing and learning: Historical origins, current research, and new directions
    • Forthcoming
    • den Boer, A. V. (2015). Dynamic pricing and learning: Historical origins, current research, and new directions. Surveys in Operations Research and Management Science. Forthcoming.
    • (2015) Surveys in Operations Research and Management Science
    • Den Boer, A.V.1
  • 27
    • 79959648882 scopus 로고    scopus 로고
    • A game-theoretic analysis of rank-order mechanisms for user-generated content
    • Ghosh, A., & Hummel, P. (2011). A game-theoretic analysis of rank-order mechanisms for user-generated content. In 12th ACM Conf. on Electronic Commerce (EC).
    • (2011) 12th ACM Conf. on Electronic Commerce EC
    • Ghosh, A.1    Hummel, P.2
  • 31
    • 84856146589 scopus 로고    scopus 로고
    • You're hired! An examination of crowdsourcing incentive models in human resource tasks
    • Harris, C. G. (2011). You're hired! an examination of crowdsourcing incentive models in human resource tasks. In CSDM.
    • (2011) CSDM
    • Harris, C.G.1
  • 42
    • 0002899547 scopus 로고
    • Asymptotically efficient adaptive allocation rules
    • Lai, T. L., & Robbins, H. (1985). Asymptotically efficient Adaptive Allocation Rules. Advances in Applied Mathematics, 6, 4-22.
    • (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
    • Lai, T.L.1    Robbins, H.2
  • 44
    • 70449657723 scopus 로고    scopus 로고
    • Financial incentives and the "performance of crowds"
    • Mason, W., & Watts, D. (2009). Financial incentives and the "performance of crowds". In HCOMP.
    • (2009) HCOMP
    • Mason, W.1    Watts, D.2
  • 49
    • 45249101176 scopus 로고    scopus 로고
    • A continuous-time version of the principal-agent problem
    • Sannikov, Y. (2008). A continuous-time version of the principal-agent problem. In The Review of Economics Studies.
    • (2008) The Review of Economics Studies
    • Sannikov, Y.1
  • 50
    • 84958071880 scopus 로고    scopus 로고
    • Contracts: The theory of dynamic principal-agent relationships and the continuous-time approach
    • Sannikov, Y. (2012). Contracts: The theory of dynamic principal-agent relationships and the continuous-time approach. In 10th World Congress of the Econometric Society.
    • (2012) 10th World Congress of the Econometric Society
    • Sannikov, Y.1
  • 52
    • 84893043989 scopus 로고    scopus 로고
    • Truthful incentives in crowdsourcing tasks using regret minimization mechanisms
    • Singla, A., & Krause, A. (2013). Truthful incentives in crowdsourcing tasks using regret minimization mechanisms. In 22nd Intl. World Wide Web Conf. (WWW).
    • (2013) 22nd Intl. World Wide Web Conf. WWW
    • Singla, A.1    Krause, A.2
  • 54
    • 84907350147 scopus 로고    scopus 로고
    • Contextual bandits with similarity information
    • Preliminary version in COLT
    • Slivkins, A. (2014). Contextual bandits with similarity information. J. of Machine Learning Research (JMLR), 15 (1), 2533-2568. Preliminary version in COLT 2011.
    • (2011) J. of Machine Learning Research (JMLR) , vol.15 , Issue.1 , pp. 2533-2568
    • Slivkins, A.1
  • 55
    • 84875138796 scopus 로고    scopus 로고
    • Ranked bandits in metric spaces: Learning optimally diverse rankings over large document collections
    • Preliminary version in 27th ICML
    • Slivkins, A., Radlinski, F., & Gollapudi, S. (2013). Ranked bandits in metric spaces: Learning optimally diverse rankings over large document collections. J. of Machine Learning Research (JMLR), 14 (Feb), 399-436. Preliminary version in 27th ICML, 2010.
    • (2010) J. of Machine Learning Research (JMLR) , vol.14 , Issue.FEB , pp. 399-436
    • Slivkins, A.1    Radlinski, F.2    Gollapudi, S.3
  • 56
    • 0001395850 scopus 로고
    • On the likelihood that one unknown probability exceeds another in view of the evidence of two samples
    • Thompson, W. R. (1933). On the likelihood that one unknown probability exceeds another in view of the evidence of two samples.. Biometrika, 25 (3-4), 285-294.
    • (1933) Biometrika , vol.25 , Issue.3-4 , pp. 285-294
    • Thompson, W.R.1
  • 57
    • 84899561628 scopus 로고    scopus 로고
    • Close the gaps: A learning-while-doing algorithm for single-product revenue management problems
    • Wang, Z., Deng, S., & Ye, Y. (2014). Close the gaps: A learning-while-doing algorithm for single-product revenue management problems. Operations Research, 62 (2), 318-331.
    • (2014) Operations Research , vol.62 , Issue.2 , pp. 318-331
    • Wang, Z.1    Deng, S.2    Ye, Y.3
  • 59
    • 84893402805 scopus 로고    scopus 로고
    • The effects of performance-contingent financial incentives in online labor markets
    • Yin, M., Chen, Y., & Sun, Y.-A. (2013). The effects of performance-contingent financial incentives in online labor markets. In AAAI.
    • (2013) AAAI
    • Yin, M.1    Chen, Y.2    Sun, Y.-A.3
  • 60
    • 84861603528 scopus 로고    scopus 로고
    • Reputation-based incentive protocols in crowdsourcing applications
    • Zhang, Y., & van der Schaar, M. (2012). Reputation-based incentive protocols in crowdsourcing applications. In Infocom.
    • (2012) Infocom
    • Zhang, Y.1    Van Der Schaar, M.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.