메뉴 건너뛰기




Volumn 35, Issue , 2014, Pages 1109-1134

Resourceful contextual bandits

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE;

EID: 84939636813     PISSN: 15324435     EISSN: 15337928     Source Type: Journal    
DOI: None     Document Type: Conference Paper
Times cited : (134)

References (36)
  • 3
    • 84939620711 scopus 로고    scopus 로고
    • Bandits with concave rewards and convex knapsacks
    • Shipra Agrawal and Nikhil R. Devanur. Bandits with concave rewards and convex knapsacks. In 15th, 2014.
    • (2014) 15th
    • Agrawal, S.1    Devanur, N.R.2
  • 4
    • 0041966002 scopus 로고    scopus 로고
    • Using confidence bounds for exploitation-exploration trade-offs
    • Preliminary version in 41st IEEE FOCS
    • Peter Auer. Using confidence bounds for exploitation-exploration trade-offs. J. of Machine Learning Research (JMLR), 3:397-422, 2002. Preliminary version in 41st IEEE FOCS, 2000.
    • (2000) J. of Machine Learning Research (JMLR) , vol.3 , pp. 397-422
    • Auer, P.1
  • 5
    • 0037709910 scopus 로고
    • The nonstochastic multiarmed bandit problem
    • Preliminary version in 36th IEEE FOCS
    • Peter Auer, Nicolò Cesa-Bianchi, Yoav Freund, and Robert E. Schapire. The nonstochastic multiarmed bandit problem. SIAM J. Comput., 32(1):48-77, 2002. Preliminary version in 36th IEEE FOCS, 1995.
    • (1995) SIAM J. Comput. , vol.32 , Issue.1 , pp. 48-77
    • Auer, P.1    Cesa-Bianchi, N.2    Freund, Y.3    Schapire, R.E.4
  • 10
    • 70350251174 scopus 로고    scopus 로고
    • Dynamic pricing without knowing the demand function: Risk bounds and near-optimal algorithms
    • Omar Besbes and Assaf Zeevi. Dynamic pricing without knowing the demand function: Risk bounds and near-optimal algorithms. Operations Research, 57:1407-1420, 2009.
    • (2009) Operations Research , vol.57 , pp. 1407-1420
    • Besbes, O.1    Zeevi, A.2
  • 11
    • 84871887590 scopus 로고    scopus 로고
    • Blind network revenue management
    • Omar Besbes and Assaf J. Zeevi. Blind network revenue management. Operations Research, 60(6):1537-1550, 2012.
    • (2012) Operations Research , vol.60 , Issue.6 , pp. 1537-1550
    • Besbes, O.1    Zeevi, A.J.2
  • 13
    • 84874045238 scopus 로고    scopus 로고
    • Regret analysis of stochastic and nonstochastic multiarmed bandit problems
    • Sébastien Bubeck and Nicolo Cesa-Bianchi. Regret Analysis of Stochastic and Nonstochastic Multiarmed Bandit Problems. Foundations and Trends in Machine Learning, 5(1):1-122, 2012.
    • (2012) Foundations and Trends in Machine Learning , vol.5 , Issue.1 , pp. 1-122
    • Bubeck, S.1    Cesa-Bianchi, N.2
  • 15
    • 4544235463 scopus 로고    scopus 로고
    • The spending constraint model for market equilibrium: Algorithmic, existence and uniqueness results
    • Nikhil Devanur and Vijay Vazirani. The spending constraint model for market equilibrium: Algorithmic, existence and uniqueness results. In 36th ACM Symp. on Theory of Computing (STOC), 2004.
    • (2004) 36th ACM Symp. on Theory of Computing (STOC)
    • Devanur, N.1    Vazirani, V.2
  • 16
    • 76749148680 scopus 로고    scopus 로고
    • The AdWords problem: Online keyword matching with budgeted bidders under random permutations
    • Nikhil R. Devanur and Thomas P. Hayes. The AdWords problem: Online keyword matching with budgeted bidders under random permutations. In 10th ACM Conf. on Electronic Commerce (EC), pages 71-78, 2009.
    • (2009) 10th ACM Conf. on Electronic Commerce (EC) , pp. 71-78
    • Devanur, N.R.1    Hayes, T.P.2
  • 17
    • 79959593176 scopus 로고    scopus 로고
    • Near optimal online algorithms and fast approximation algorithms for resource allocation problems
    • Nikhil R. Devanur, Kamal Jain, Balasubramanian Sivan, and Christopher A. Wilkens. Near optimal online algorithms and fast approximation algorithms for resource allocation problems. In 12th ACM Conf. on Electronic Commerce (EC), pages 29-38, 2011.
    • (2011) 12th ACM Conf. on Electronic Commerce (EC) , pp. 29-38
    • Devanur, N.R.1    Jain, K.2    Sivan, B.3    Wilkens, C.A.4
  • 19
    • 0002384441 scopus 로고
    • On tail probabilities for martingales
    • D. A. Freedman. On tail probabilities for martingales. The Annals of Probability, 3:100-118, 1975.
    • (1975) The Annals of Probability , vol.3 , pp. 100-118
    • Freedman, D.A.1
  • 22
    • 69449097218 scopus 로고    scopus 로고
    • Approximation algorithms for restless bandit problems.
    • Combined final version of papers in
    • Sudipta Guha, Kamesh Munagala, and Peng Shi. Approximation algorithms for restless bandit problems., 2010. Combined final version of papers in IEEE FOCS 2007 and ACM-SIAM SODA 2009.
    • (2009) IEEE FOCS 2007 and ACM-SIAM SODA
    • Guha, S.1    Munagala, K.2    Shi, P.3
  • 29
    • 84893043989 scopus 로고    scopus 로고
    • Truthful incentives in crowdsourcing tasks using regret minimization mechanisms
    • Adish Singla and Andreas Krause. Truthful incentives in crowdsourcing tasks using regret minimization mechanisms. In 22nd Intl. World Wide Web Conf. (WWW), pages 1167-1178, 2013.
    • (2013) 22nd Intl. World Wide Web Conf. (WWW) , pp. 1167-1178
    • Singla, A.1    Krause, A.2
  • 30
    • 84972513554 scopus 로고
    • On general minimax theorems
    • Maurice Sion. On general minimax theorems. Pac. J. Math., 8:171176, 1958.
    • (1958) Pac. J. Math. , vol.8 , pp. 171176
    • Sion, M.1
  • 31
    • 84874058621 scopus 로고    scopus 로고
    • Contextual bandits with similarity information
    • To appear in J. of Machine Learning Research JMLR
    • Aleksandrs Slivkins. Contextual Bandits with Similarity Information. In 24th Conf. on Learning Theory (COLT), 2011. To appear in J. of Machine Learning Research (JMLR), 2014.
    • (2011) 24th Conf. on Learning Theory (COLT)
    • Slivkins, A.1
  • 33
    • 84963496106 scopus 로고    scopus 로고
    • Online decision making in crowdsourcing markets: Theoretical challenges
    • December, Position Paper and survey
    • Aleksandrs Slivkins and Jennifer Wortman Vaughan. Online decision making in crowdsourcing markets: Theoretical challenges. SIGecom Exchanges, 12(2), December 2013. Position Paper and survey.
    • (2013) SIGecom Exchanges , vol.12 , Issue.2
    • Slivkins, A.1    Vaughan, J.W.2
  • 34
    • 0001395850 scopus 로고
    • On the likelihood that one unknown probability exceeds another in view of the evidence of two samples
    • William R. Thompson. On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. Biometrika, 25(3-4):285294, 1933.
    • (1933) Biometrika , vol.25 , Issue.3-4 , pp. 285294
    • Thompson, W.R.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.