SCOPUS 정보 검색 플랫폼

Volumn 8139 LNAI, Issue , 2013, Pages 234-248

An efficient algorithm for learning with semi-bandit feedback

Author keywords

bandit problems; combinatorial optimization; Follow the perturbed leader; online learning

Indexed keywords

BANDIT PROBLEMS; BINARY VECTORS; FOLLOW-THE-PERTURBED-LEADER; FULL INFORMATIONS; LOSS ESTIMATION; ONLINE LEARNING; PREDICTION METHODS; REGRET BOUNDS;

COMBINATORIAL OPTIMIZATION;

ALGORITHMS;

EID: 84887500930 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-642-40935-6_17 Document Type: Conference Paper

Times cited : (71)

References (17)

2
- 78649420293
- Regret bounds and minimax policies under partial monitoring
- Audibert, J.-Y., Bubeck, S.: Regret bounds and minimax policies under partial monitoring. Journal of Machine Learning Research 11, 2635-2686 (2010)
- (2010) Journal of Machine Learning Research , vol.11 , pp. 2635-2686
- Audibert, J.-Y.¹ Bubeck, S.²

4
- 0037709910
- The nonstochastic multiarmed bandit problem
- Auer, P., Cesa-Bianchi, N., Freund, Y., Schapire, R.E.: The nonstochastic multiarmed bandit problem. SIAM J. Comput. 32(1), 48-77 (2002)
- (2002) SIAM J. Comput. , vol.32 , Issue.1 , pp. 48-77
- Auer, P.¹ Cesa-Bianchi, N.² Freund, Y.³ Schapire, R.E.⁴

5
- 4544345025
- Adaptive routing with end-to-end feedback: Distributed learning and geometric approaches
- Awerbuch, B., Kleinberg, R.D.: Adaptive routing with end-to-end feedback: distributed learning and geometric approaches. In: Proceedings of the 36th ACM Symposium on Theory of Computing, pp. 45-53 (2004)
- (2004) Proceedings of the 36th ACM Symposium on Theory of Computing , pp. 45-53
- Awerbuch, B.¹ Kleinberg, R.D.²

6
- 84898039203
- Towards minimax policies for online linear optimization with bandit feedback
- Bubeck, S., Cesa-Bianchi, N., Kakade, S.M.: Towards minimax policies for online linear optimization with bandit feedback. In: Proceedings of the 25th Annual Conference on Learning Theory (COLT), pp. 1-14 (2012)
- (2012) Proceedings of the 25th Annual Conference on Learning Theory (COLT) , pp. 1-14
- Bubeck, S.¹ Cesa-Bianchi, N.² Kakade, S.M.³

7
- 84926078662
- Cambridge University Press, New York
- Cesa-Bianchi, N., Lugosi, G.: Prediction, Learning, and Games. Cambridge University Press, New York (2006)
- (2006) Prediction, Learning, and Games
- Cesa-Bianchi, N.¹ Lugosi, G.²

8
- 84861620768
- Combinatorial bandits
- Cesa-Bianchi, N., Lugosi, G.: Combinatorial bandits. Journal of Computer and System Sciences 78, 1404-1422 (2012)
- (2012) Journal of Computer and System Sciences , vol.78 , pp. 1404-1422
- Cesa-Bianchi, N.¹ Lugosi, G.²

9
- 85162050055
- The price of bandit information for online optimization
- Dani, V., Hayes, T., Kakade, S.: The price of bandit information for online optimization. In: Advances in Neural Information Processing Systems (NIPS), vol. 20, pp. 345-352 (2008)
- (2008) Advances in Neural Information Processing Systems (NIPS) , vol.20 , pp. 345-352
- Dani, V.¹ Hayes, T.² Kakade, S.³

10
- 35948943542
- The on-line shortest path problem under partial monitoring
- György, A., Linder, T., Lugosi, G., Ottucsák, G.: The on-line shortest path problem under partial monitoring. Journal of Machine Learning Research 8, 2369-2403 (2007)
- (2007) Journal of Machine Learning Research , vol.8 , pp. 2369-2403
- György, A.¹ Linder, T.² Lugosi, G.³ Ottucsák, G.⁴

11
- 0001976283
- Approximation to Bayes risk in repeated play
- Hannan, J.: Approximation to Bayes risk in repeated play. Contributions to the Theory of Games 3, 97-139 (1957)
- (1957) Contributions to the Theory of Games , vol.3 , pp. 97-139
- Hannan, J.¹

12
- 24644463787
- Efficient algorithms for online decision problems
- Kalai, A., Vempala, S.: Efficient algorithms for online decision problems. Journal of Computer and System Sciences 71, 291-307 (2005)
- (2005) Journal of Computer and System Sciences , vol.71 , pp. 291-307
- Kalai, A.¹ Vempala, S.²

13
- 84860647444
- Hedging structured concepts
- Koolen, W., Warmuth, M., Kivinen, J.: Hedging structured concepts. In: Proceedings of the 23rd Annual Conference on Learning Theory (COLT), pp. 93-105 (2010)
- (2010) Proceedings of the 23rd Annual Conference on Learning Theory (COLT) , pp. 93-105
- Koolen, W.¹ Warmuth, M.² Kivinen, J.³

17
- 3142657664
- Paths kernels and multiplicative updates
- Takimoto, E., Warmuth, M.: Paths kernels and multiplicative updates. Journal of Machine Learning Research 4, 773-818 (2003)
- (2003) Journal of Machine Learning Research , vol.4 , pp. 773-818
- Takimoto, E.¹ Warmuth, M.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.