SCOPUS Information Search Platform

Machine Learning

Volume 90, Issue 3, 2013, Pages 347-383

Multiclass classification with bandit feedback using adaptive regularization

(2) Crammer, Koby a; Gentile, Claudio b

a TECHNION ISRAEL INSTITUTE OF TECHNOLOGY (Israel)

b UNIVERSITY OF INSUBRIA (Italy)

Author keywords

Online learning; Regret; Upper confidence bound

Indexed keywords

ADAPTIVE REGULARIZATION; BANDIT FEEDBACKS; EXPLORATION AND EXPLOITATION; MULTI-CLASS; MULTI-CLASS CLASSIFICATION; ON-LINE ALGORITHMS; ONLINE LEARNING; PARTIAL FEEDBACK; PERCEPTRON; PROBABILISTIC MODELS; RANDOM SAMPLING; REAL-WORLD; REGRET; SECOND ORDERS; SINGLE-BIT; TEXT CLASSIFICATION; UPPER CONFIDENCE BOUND; VOWEL RECOGNITION;

LEARNING ALGORITHMS;

CLASSIFICATION (OF INFORMATION);

EID: 84874710652
PISSN: 08856125
EISSN: 15730565
Source Type: Journal
DOI: 10.1007/s10994-012-5321-8
Document Type: Article

Times cited: 59

References (31)

1
- 0041966002
- Using confidence bounds for exploitation-exploration trade-offs
- 1984023 1084.68543
- Auer, P. (2003). Using confidence bounds for exploitation-exploration trade-offs. Journal of Machine Learning Research, 3, 397-422.
- (2003) Journal of Machine Learning Research , vol.3 , pp. 397-422
- Auer, P.¹

2
- 0035370926
- Relative loss bounds for online density estimation with the exponential family of distributions
- 0988.68173 10.1023/A:1010896012157
- Azoury, K. S., & Warmuth, M. K. (2001). Relative loss bounds for online density estimation with the exponential family of distributions. Machine Learning, 43, 211-246.
- (2001) Machine Learning , vol.43 , pp. 211-246
- Azoury, K.S.¹ Warmuth, M.K.²

3
- 84860524227
- Biographies, Bollywood, boom-boxes and blenders: Domain adaptation for sentiment classification
- Blitzer, J., Dredze, M., & Pereira, F. (2007). Biographies, Bollywood, boom-boxes and blenders: domain adaptation for sentiment classification. In Association of computational linguistics (ACL).
- (2007) Association of Computational Linguistics (ACL)
- Blitzer, J.¹ Dredze, M.² Pereira, F.³

4
- 20544470498
- A second-order perceptron algorithm
- 2137076 10.1137/S0097539703432542
- Cesa-Bianchi, N., Conconi, A., & Gentile, C. (2005). A second-order perceptron algorithm. SIAM Journal on Computing, 34, 640-668.
- (2005) SIAM Journal on Computing , vol.34 , pp. 640-668
- Cesa-Bianchi, N.¹ Conconi, A.² Gentile, C.³

5
- 71149102767
- Robust bounds for classification via selective sampling
- Cesa-Bianchi, N., Gentile, C., & Orabona, F. (2009). Robust bounds for classification via selective sampling. In Proc. 26th ICML.
- (2009) Proc. 26th ICML
- Cesa-Bianchi, N.¹ Gentile, C.² Orabona, F.³

6
- 0036568032
- On the learnability and design of output codes for multiclass problems
- 1012.68155 10.1023/A:1013637720281
- Crammer, K., & Singer, Y. (2002). On the learnability and design of output codes for multiclass problems. Machine Learning, 47, 201-233.
- (2002) Machine Learning , vol.47 , pp. 201-233
- Crammer, K.¹ Singer, Y.²

7
- 0141496132
- Ultraconservative online algorithms for multiclass problems
- 1983939 1112.68497
- Crammer, K., & Singer, Y. (2003). Ultraconservative online algorithms for multiclass problems. Journal of Machine Learning Research, 3, 951-991.
- (2003) Journal of Machine Learning Research , vol.3 , pp. 951-991
- Crammer, K.¹ Singer, Y.²

8
- 79953651835
- Multi-class confidence weighted algorithms
- Crammer, K., Dredze, M., & Kulesza, A. (2009a). Multi-class confidence weighted algorithms. In EMNLP 2009.
- (2009) EMNLP 2009
- Crammer, K.¹ Dredze, M.² Kulesza, A.³

9
- 84874117035
- Adaptive regularization of weight vectors
- Crammer, K., Kulesza, A., & Dredze, M. (2009b). Adaptive regularization of weight vectors. In NIPS 2009.
- (2009) NIPS 2009
- Crammer, K.¹ Kulesza, A.² Dredze, M.³

10
- 84898072179
- Stochastic linear optimization under bandit feedback
- Dani, V., Hayes, T., & Kakade, S. (2008). Stochastic linear optimization under bandit feedback. In COLT 2008.
- (2008) COLT 2008
- Dani, V.¹ Hayes, T.² Kakade, S.³

11
- 84875634609
- Robust selective sampling from single and multiple teachers
- Dekel, O., Gentile, C., & Sridharan, K. (2010). Robust selective sampling from single and multiple teachers. In COLT 2010.
- (2010) COLT 2010
- Dekel, O.¹ Gentile, C.² Sridharan, K.³

12
- 56449101965
- Confidence-weighted linear classification
- Dredze, M., Crammer, K., & Pereira, F. (2008). Confidence-weighted linear classification. In ICML 2008.
- (2008) ICML 2008
- Dredze, M.¹ Crammer, K.² Pereira, F.³

13
- 56449092085
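- Efficient projections onto the ℓ1-ball for learning in high dimensions
- Duchi, J., Shalev-Shwartz, S., Singer, Y., & Chandra, T. (2008). Efficient projections onto the ℓ1-ball for learning in high dimensions. In ICML 2008 (pp. 272-279).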
- (2008) ICML 2008 , pp. 272-279
- Duchi, J.¹ Shalev-Shwartz, S.² Singer, Y.³ Chandra, T.⁴

14
- 85162453290
- Newtron: An efficient bandit algorithm for online multiclass prediction
- Hazan, E., & Kale, S. (2011). Newtron: an efficient bandit algorithm for online multiclass prediction. In NIPS 2011.
- (2011) NIPS 2011
- Hazan, E.¹ Kale, S.²

15
- 84942484786
- Ridge regression: Biased estimation for nonorthogonal problems
- 0202.17205 10.1080/00401706.1970.10488634
- Hoerl, A., & Kennard, R. (1970). Ridge regression: biased estimation for nonorthogonal problems. Technometrics, 12, 55-67.
- (1970) Technometrics , vol.12 , pp. 55-67
- Hoerl, A.¹ Kennard, R.²

16
- 84874692021
- On the generalization ability of online strongly convex programming algorithm
- Kakade, S., & Tewari, A. (2008). On the generalization ability of online strongly convex programming algorithm. In NIPS 2008.
- (2008) NIPS 2008
- Kakade, S.¹ Tewari, A.²

17
- 56449104477
- Efficient bandit algorithms for online multiclass prediction
- Kakade, S., Shalev-Shwartz, S., & Tewari, A. (2008). Efficient bandit algorithms for online multiclass prediction. In ICML 2008.
- (2008) ICML 2008
- Kakade, S.¹ Shalev-Shwartz, S.² Tewari, A.³

18
- 44949085753
- The Vocal Joystick data collection effort and vowel corpus
- Pittsburgh, PA
- Kilanski, K., Malkin, J., Li, X., Wright, R., & Bilmes, J. (2006). The Vocal Joystick data collection effort and vowel corpus. In Interspeech, Pittsburgh, PA.
- (2006) Interspeech
- Kilanski, K.¹ Malkin, J.² Li, X.³ Wright, R.⁴ Bilmes, J.⁵

19
- 77956144722
- The epoch-greedy algorithm for contextual multi-armed bandits
- Langford, J., & Zhang, T. (2007). The epoch-greedy algorithm for contextual multi-armed bandits. In NIPS 2007.
- (2007) NIPS 2007
- Langford, J.¹ Zhang, T.²

20
- 84876811202
- RCV1: A new benchmark collection for text categorization research
- Lewis, D. D., Yang, Y., Rose, T. G., & Li, F. (2004). RCV1: a new benchmark collection for text categorization research. Journal of Machine Learning Research, 5, 361-397.
- (2004) Journal of Machine Learning Research , vol.5 , pp. 361-397
- Lewis, D.D.¹ Yang, Y.² Rose, T.G.³ Li, F.⁴

21
- 70450205580
- How to loose confidence: Probabilistic linear machines for multiclass classification
- Lin, H., Bilmes, J., & Crammer, K. (2009). How to loose confidence: probabilistic linear machines for multiclass classification. In INTERSPEECH (pp. 2559-2562).
- (2009) INTERSPEECH , pp. 2559-2562
- Lin, H.¹ Bilmes, J.² Crammer, K.³

22
- 77953179734
- Efficient Euclidean projections in linear time
- Liu, J., & Ye, J. (2009). Efficient Euclidean projections in linear time. In ICML 2009 (p. 83).
- (2009) ICML 2009 , p. 83
- Liu, J.¹ Ye, J.²

23
- 84898452145
- Showing relevant ads via Lipschitz context multi-armed bandits
- Lu, T., Pal, D., & Pal, M. (2010). Showing relevant ads via Lipschitz context multi-armed bandits. In AISTATS 2010.
- (2010) AISTATS 2010
- Lu, T.¹ Pal, D.² Pal, M.³

24
- 80053440857
- Nonparametric bandits with covariates
- Rigollet, P., & Zeevi, A. (2010). Nonparametric bandits with covariates. In COLT 2010.
- (2010) COLT 2010
- Rigollet, P.¹ Zeevi, A.²

25
- 11144273669
- The perceptron: A probabilistic model for information storage and organization in the brain
- 1529895 10.1037/h0042519
- Rosenblatt, F. (1958). The perceptron: A probabilistic model for information storage and organization in the brain. Psychological Review, 65, 386-407.
- (1958) Psychological Review , vol.65 , pp. 386-407
- Rosenblatt, F.¹

26
- 77958006662
- The New York Times annotated corpus
- Linguistic Data Consortium, Philadelphia
- Sandhaus, E. (2008). The New York Times annotated corpus. Philadelphia: Linguistic Data Consortium.
- (2008) The New York Times Annotated Corpus
- Sandhaus, E.¹

27
- 85162058047
- Online linear regression and its application to model-based reinforcement learning
- Strehl, A., & Littman, M. (2008). Online linear regression and its application to model-based reinforcement learning. In NIPS 2008.
- (2008) NIPS 2008
- Strehl, A.¹ Littman, M.²

28
- 80052674910
- Learning to trade off between exploration and exploitation in multiclass bandit prediction
- Valizadegan, H., Jin, R., & Wang, S. (2011). Learning to trade off between exploration and exploitation in multiclass bandit prediction. In KDD 2011.
- (2011) KDD 2011
- Valizadegan, H.¹ Jin, R.² Wang, S.³

29
- 79958846996
- Exploring compact reinforcement-learning representations with linear regression
- Walsh, T. J., Szita, I., Diuk, C., & Littman, M. L. (2009). Exploring compact reinforcement-learning representations with linear regression. In UAI 2008 & Rutgers Univ. Tech. Rep.
- (2009) UAI 2008 & Rutgers Univ. Tech. Rep
- Walsh, T.J.¹ Szita, I.² Diuk, C.³ Littman, M.L.⁴

30
- 15844389867
- Bandit problems with side observation
- 2123095 10.1109/TAC.2005.844079
- Wang, C., Kulkarni, S., & Poor, V. (2005). Bandit problems with side observation. IEEE Transactions on Automatic Control, 50, 338-355.
- (2005) IEEE Transactions on Automatic Control , vol.50 , pp. 338-355
- Wang, C.¹ Kulkarni, S.² Poor, V.³

31
- 80052680363
- A potential-based framework for online multi-class learning with partial feedback
- Wang, S., Jin, R., & Valizadegan, H. (2010). A potential-based framework for online multi-class learning with partial feedback. In AISTATS 2010.
- (2010) AISTATS 2010
- Wang, S.¹ Jin, R.² Valizadegan, H.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.