SCOPUS 정보 검색 플랫폼

31st International Conference on Machine Learning, ICML 2014

Volumn 3, Issue , 2014, Pages 2512-2537

Combinatorial partial monitoring game with linear feedback and its applications

(5) Lin, Tian a Abrahao, Bruno b Kleinberg, Robert b Lui, John C S c Chen, Wei d

a TSINGHUA UNIVERSITY (China)

b Cornell University ^* (United States)

c CHINESE UNIVERSITY OF HONG KONG (Hong Kong)

d MICROSOFT RESEARCH (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; E-LEARNING; LEARNING ALGORITHMS; LEARNING SYSTEMS; SOCIAL NETWORKING (ONLINE);

CONFIDENCE BOUNDS; EFFICIENT LEARNING; ITS APPLICATIONS; LIMITED FEEDBACK; MODEL AND ALGORITHMS; MULTI ARMED BANDIT; OFF-LINE OPTIMIZATION; ONLINE LEARNING;

COMBINATORIAL MATHEMATICS;

EID: 84919902752 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (17)

References (17)

1
- 84919945245
- Toward a classification of finite partial- monitoring games
- Antos, Andras, Bartok, Gabor, Pal, David, and Szepesvari, Csaba. Toward a classification of finite partial- monitoring games. Theoretical Computer Science, 2012.
- (2012) Theoretical Computer Science
- Antos, A.¹ Bartok, G.² Pal, D.³ Szepesvari, C.⁴

2
- 84898079018
- Minimax policies for adversarial and stochastic bandits
- Audibert, Jean-Yves and Bubeck, Sebastien. Minimax policies for adversarial and stochastic bandits. In COLT, 2009.
- (2009) COLT
- Audibert, J.-Y.¹ Bubeck, S.²

3
- 0036568025
- Finite-time analysis of the multi armed bandit problem
- Auer, Peter, Cesa-Bianchi, Nicolo, and Fischer, Paul. Finite-time analysis of the multi armed bandit problem. Machine learning, 47(2-3):235-256, 2002.
- (2002) Machine Learning , vol.47 , Issue.2-3 , pp. 235-256
- Auer, P.¹ Cesa-Bianchi, N.² Fischer, P.³

4
- 84919930512
- An adaptive algorithm for finite stochastic partial monitoring (extended version)
- June
- Bartok, G., Zolghadr, N., and Szepesvari, Cs. An adaptive algorithm for finite stochastic partial monitoring (extended version). In ICML, pp. 1-20, June 2012.
- (2012) ICML , pp. 1-20
- Bartok, G.¹ Zolghadr, N.² Szepesvari, C.³

5
- 84867849262
- Minimax regret of finite partial-monitoring games in stochastic environments
- Bartok, Gabor, Pal, David, and Szepesvari, Csaba. Minimax regret of finite partial-monitoring games in stochastic environments. Journal of Machine Learning Research-Proceedings Track, 19:133-154, 2011.
- (2011) Journal of Machine Learning Research-Proceedings Track , vol.19 , pp. 133-154
- Bartok, G.¹ Pal, D.² Szepesvari, C.³

6
- 84874045238
- Regret analysis of stochastic and non stochastic multi-armed bandit problems
- Bubeck, Sebastien and Cesa-Bianchi, Nicolo. Regret analysis of stochastic and non stochastic multi-armed bandit problems. Foundations and Trends in Machine Learning, 5(1):1-122, 2012.
- (2012) Foundations and Trends in Machine Learning , vol.5 , Issue.1 , pp. 1-122
- Bubeck, S.¹ Cesa-Bianchi, N.²

7
- 84926078662
- Cambridge University Press
- Cesa-Bianchi, Nicolo and Lugosi, Gabor. Prediction, learning, and games. Cambridge University Press, 2006.
- (2006) Prediction, Learning, and Games
- Cesa-Bianchi, N.¹ Lugosi, G.²

8
- 84861620768
- Combinatorial bandits
- Cesa-Bianchi, Nicolo and Lugosi, Gabor. Combinatorial bandits. Journal of Computer and System Sciences, 78 (5): 1404-1422, 2012.
- (2012) Journal of Computer and System Sciences , vol.78 , Issue.5 , pp. 1404-1422
- Cesa-Bianchi, N.¹ Lugosi, G.²

9
- 33748442333
- Regret minimization under partial monitoring
- Cesa-Bianchi, Nicolo, Lugosi, Gabor, and Stoltz, Gilles. Regret minimization under partial monitoring. Mathematics of Operations Research, 31(3):562-580, 2006.
- (2006) Mathematics of Operations Research , vol.31 , Issue.3 , pp. 562-580
- Cesa-Bianchi, N.¹ Lugosi, G.² Stoltz, G.³

10
- 84897515317
- Combinatorial multi-armed bandit: General framework and applications
- Chen, Wei, Wang, Yajun, and Yuan, Yang. Combinatorial multi-armed bandit: General framework and applications. In Proceedings of the 30th International Conference on Machine Learning (ICML-13), pp. 151-159, 2013.
- (2013) Proceedings of the 30th International Conference on Machine Learning (ICML-13) , pp. 151-159
- Chen, W.¹ Wang, Y.² Yuan, Y.³

11
- 84867858040
- Combinatorial network optimization with unknown variables: Multi-armed bandits with linear rewards and individual observations
- October
- Gai, Yi, Krishnamachari, Bhaskar, and Jain, Rahul. Combinatorial network optimization with unknown variables: Multi-armed bandits with linear rewards and individual observations. IEEE/ACM Trans. Netw., 20(5):1466- 1478, October 2012. ISSN 1063-6692.
- (2012) IEEE/ACM Trans. Netw. , vol.20 , Issue.5 , pp. 1466-1478
- Gai, Y.¹ Krishnamachari, B.² Jain, R.³

12
- 0003953052
- Springer
- Kozen, Dexter. The design and analysis of algorithms. Springer, 1992.
- (1992) The Design and Analysis of Algorithms
- Kozen, D.¹

13
- 0002899547
- Asymptotically efficient adaptive allocation rules
- Lai, Tze Leung and Robbins, Herbert. Asymptotically efficient adaptive allocation rules. Advances in applied mathematics, 6(1):4-22, 1985.
- (1985) Advances in Applied Mathematics , vol.6 , Issue.1 , pp. 4-22
- Lai, T.L.¹ Robbins, H.²

14
- 0024766543
- The weighted majority algorithm
- IEEE
- Littlestone, Nick and Warmuth, Manfred K. The weighted majority algorithm. In Foundations of Computer Science, 1989., 30th Annual Symposium on, pp. 256-261. IEEE, 1989.
- (1989) Foundations of Computer Science, 1989., 30th Annual Symposium on , pp. 256-261
- Littlestone, N.¹ Warmuth, M.K.²

15
- 84898041886
- Discrete prediction games with arbitrary feedback and loss
- Springer
- Piccolboni, Antonio and Schindelhauer, Christian. Discrete prediction games with arbitrary feedback and loss. In Computational Learning Theory, pp. 208-223. Springer, 2001.
- (2001) Computational Learning Theory , pp. 208-223
- Piccolboni, A.¹ Schindelhauer, C.²

16
- 84893549814
- Some aspects of the sequential design of experiments
- Springer
- Robbins, Herbert. Some aspects of the sequential design of experiments. In Herbert Robbins Selected Papers, pp. 169-177. Springer, 1985.
- (1985) Herbert Robbins Selected Papers , pp. 169-177
- Robbins, H.¹

17
- 85048665932
- Aggregating strategies
- Morgan Kaufmann
- Vovk, Volodimir G. Aggregating strategies. In Proc. Third Workshop on Computational Learning Theory, pp. 371- 383. Morgan Kaufmann, 1990.
- (1990) Proc. Third Workshop on Computational Learning Theory , pp. 371-383
- Vovk, V.G.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.