-
1
-
-
0242647875
-
A learner-independent evaluation of the usefulness of statistical phrases for automated text categorization
-
A. G. Chin, editor, Idea Group Publishing, Hershey, US
-
M. F. Caropreso, S. Matwin, and F. Sebastiani. A learner-independent evaluation of the usefulness of statistical phrases for automated text categorization. In A. G. Chin, editor, Text Databases and Document Management: Theory and Practice, pages 78-102. Idea Group Publishing, Hershey, US, 2001.
-
(2001)
Text Databases and Document Management: Theory and Practice
, pp. 78-102
-
-
Caropreso, M.F.1
Matwin, S.2
Sebastiani, F.3
-
2
-
-
84989525001
-
Indexing by latent semantic analysis
-
S. C. Deerwester, S. T. Dumais, T. K. Landauer, G. W. Furnas, and R. A. Harshman. Indexing by latent semantic analysis. Journal of the American Society of Information Science, 41(6):391-407, 1990.
-
(1990)
Journal of the American Society of Information Science
, vol.41
, Issue.6
, pp. 391-407
-
-
Deerwester, S.C.1
Dumais, S.T.2
Landauer, T.K.3
Furnas, G.W.4
Harshman, R.A.5
-
3
-
-
34547439207
-
The Wikipedia XML corpus
-
L. Denoyer and P. Gallinari. The Wikipedia XML corpus. SIGIR Forum, 40(1):64-69, 2006.
-
(2006)
SIGIR Forum
, vol.40
, Issue.1
, pp. 64-69
-
-
Denoyer, L.1
Gallinari, P.2
-
5
-
-
85105809948
-
Inductive learning algorithms and representations for text categorization
-
New York, NY, USA, ACM
-
S. Dumais, J. Platt, D. Heckerman, and M. Sahami. Inductive learning algorithms and representations for text categorization. In CIKM'98: Proceedings of the 7th international conference on Information and knowledge management, pages 148-155, New York, NY, USA, 1998. ACM.
-
(1998)
CIKM'98: Proceedings of the 7th International Conference on Information and Knowledge Management
, pp. 148-155
-
-
Dumais, S.1
Platt, J.2
Heckerman, D.3
Sahami, M.4
-
6
-
-
50949133669
-
Liblinear: A library for large linear classification
-
R.-E. Fan, K.-W. Chang, C.-J. Hsieh, X.-R. Wang, and C.-J. Lin. Liblinear: A library for large linear classification. Journal of Machine Learning Research, 9:1871-1874, 2008.
-
(2008)
Journal of Machine Learning Research
, vol.9
, pp. 1871-1874
-
-
Fan, R.-E.1
Chang, K.-W.2
Hsieh, C.-J.3
Wang, X.-R.4
Lin, C.-J.5
-
7
-
-
2942731012
-
An extensive empirical study of feature selection metrics for text classi cation
-
G. Forman. An extensive empirical study of feature selection metrics for text classi cation. Journal of Machine Learning Research, 3:1289-1305, 2003.
-
(2003)
Journal of Machine Learning Research
, vol.3
, pp. 1289-1305
-
-
Forman, G.1
-
10
-
-
33646161449
-
An examination of feature selection frameworks in text categorization
-
AIRS'05: Proceedings of 2nd Asia information retrieval symposium
-
B. C. How and W. T. Kiong. An examination of feature selection frameworks in text categorization. In AIRS'05: Proceedings of 2nd Asia information retrieval symposium, pages 558-564. Lecture notes in computer science, 2005.
-
(2005)
Lecture Notes in Computer Science
, pp. 558-564
-
-
How, B.C.1
Kiong, W.T.2
-
11
-
-
84957069814
-
Text categorization with support vector machines: Learning with many relevant features
-
C. Nédellec and C. Rouveirol, editors, Springer-Verlag, Heidelberg, DE
-
T. Joachims. Text categorization with support vector machines: learning with many relevant features. In C. Nédellec and C. Rouveirol, editors, ECML'98: Proceedings of the 10th European Conference on Machine Learning, pages 137-142. Springer-Verlag, Heidelberg, DE, 1998.
-
(1998)
ECML'98: Proceedings of the 10th European Conference on Machine Learning
, pp. 137-142
-
-
Joachims, T.1
-
12
-
-
0002312061
-
Feature selection and feature extraction for text categorization
-
Defense Advanced Research Projects Agency, Morgan Kaufmann
-
D. D. Lewis. Feature selection and feature extraction for text categorization. In Proceedings of the Speech and Natural Language Workshop, pages 212-217. Defense Advanced Research Projects Agency, Morgan Kaufmann, 1992.
-
(1992)
Proceedings of the Speech and Natural Language Workshop
, pp. 212-217
-
-
Lewis, D.D.1
-
14
-
-
84876811202
-
RCV1: A new benchmark collection for text categorization research
-
D. D. Lewis, Y. Yang, T. G. Rose, and F. Li. RCV1: A new benchmark collection for text categorization research. Journal of Machine Learning Research, 5:361-397, 2004.
-
(2004)
Journal of Machine Learning Research
, vol.5
, pp. 361-397
-
-
Lewis, D.D.1
Yang, Y.2
Rose, T.G.3
Li, F.4
-
15
-
-
0032274556
-
Classification of text documents
-
Y. H. Li and A. K. Jain. Classification of text documents. The Computer Journal, 41:537-546, 1998.
-
(1998)
The Computer Journal
, vol.41
, pp. 537-546
-
-
Li, Y.H.1
Jain, A.K.2
-
17
-
-
0030651099
-
Feature selection, perceptron learning, and a usability case study for text categorization
-
H. T. Ng, W. B. Goh, and K. L. Low. Feature selection, perceptron learning, and a usability case study for text categorization. In SIGIR '97: Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval, pages 67-73, 1997.
-
(1997)
SIGIR '97: Proceedings of the 20th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
, pp. 67-73
-
-
Ng, H.T.1
Goh, W.B.2
Low, K.L.3
-
18
-
-
84948481845
-
An algorithm for suffix stripping
-
M. F. Porter. An algorithm for suffix stripping. Program, 14(3):130-137, 1980.
-
(1980)
Program
, vol.14
, Issue.3
, pp. 130-137
-
-
Porter, M.F.1
-
20
-
-
0016572913
-
A vector space model for automatic indexing
-
G. Salton, A. Wong, and C. S. Yang. A vector space model for automatic indexing. Communations of the ACM, 18(11):613-620, 1975.
-
(1975)
Communations of the ACM
, vol.18
, Issue.11
, pp. 613-620
-
-
Salton, G.1
Wong, A.2
Yang, C.S.3
-
21
-
-
0002442796
-
Machine learning in automated text categorization
-
F. Sebastiani. Machine learning in automated text categorization. ACM Computing Surveys, 34:1-47, 2002.
-
(2002)
ACM Computing Surveys
, vol.34
, pp. 1-47
-
-
Sebastiani, F.1
-
22
-
-
84856043672
-
A mathematical theory of communication
-
C. E. Shannon. A mathematical theory of communication. Bell System Technical Journal, 27:379-423 and 623-656, 1948.
-
(1948)
Bell System Technical Journal
, vol.27
-
-
Shannon, C.E.1
-
23
-
-
0003450542
-
-
Springer-Verlag New York, Inc., New York, NY, USA
-
V. N. Vapnik. The nature of statistical learning theory. Springer-Verlag New York, Inc., New York, NY, USA, 1995.
-
(1995)
The Nature of Statistical Learning Theory
-
-
Vapnik, V.N.1
-
26
-
-
0003141935
-
A comparative study on feature selection in text categorization
-
D. H. Fisher, editor, Morgan Kaufmann Publishers, San Francisco, US
-
Y. Yang and J. O. Pedersen. A comparative study on feature selection in text categorization. In D. H. Fisher, editor, ICML'97: Proceedings of the 14th International Conference on Machine Learning, pages 412-420. Morgan Kaufmann Publishers, San Francisco, US, 1997.
-
(1997)
ICML'97: Proceedings of the 14th International Conference on Machine Learning
, pp. 412-420
-
-
Yang, Y.1
Pedersen, J.O.2
|