-
2
-
-
84861249510
-
-
July
-
Benedetto, D., Caglioti, E., Loreto, V.: On J. Goodman's comment to "Language trees and zipping", http://arxiv.org/abs/cond-mat/0203275, July 2004.
-
(2004)
On J. Goodman's Comment to "Language Trees and Zipping"
-
-
Benedetto, D.1
Caglioti, E.2
Loreto, V.3
-
3
-
-
85038284286
-
Benedetto, Caglioti, and Loreto reply
-
089804
-
Benedetto, D., Caglioti, E., Loreto, V.: Benedetto, Caglioti, and Loreto reply. Physical Review Letters 90, 089804 (2003)
-
(2003)
Physical Review Letters
, vol.90
-
-
Benedetto, D.1
Caglioti, E.2
Loreto, V.3
-
6
-
-
0028911698
-
Gauging similarity with n-grams: Language-independent categorization of text
-
Damashek, M.: Gauging similarity with n-grams: Language-independent categorization of text. Science 267(5199) (1995):843-848
-
(1995)
Science
, vol.267
, Issue.5199
, pp. 843-848
-
-
Damashek, M.1
-
8
-
-
0033894701
-
Text categorization using compression models
-
Frank, E., Chui, C., Witten, I.H.: Text Categorization Using Compression Models. Proc. of DCC-00, IEEE Data Compression Conference (2000) 200-209.
-
(2000)
Proc. of DCC-00, IEEE Data Compression Conference
, pp. 200-209
-
-
Frank, E.1
Chui, C.2
Witten, I.H.3
-
9
-
-
24644482320
-
Using error correcting codes for efficient text classification with a large number of categories
-
Masters Thesis. Center for Automated Learning and Discovery, Carnegie Mellon University
-
Ghani, Rayid: Using Error Correcting Codes for Efficient Text Classification with a Large Number of Categories. KDD project report. Masters Thesis. Center for Automated Learning and Discovery, Carnegie Mellon University (2001)
-
(2001)
KDD Project Report
-
-
Ghani, R.1
-
10
-
-
0035497388
-
A bit of progress in language modeling, extended version
-
October
-
Goodman, Joshua T.: A Bit of Progress in Language Modeling, Extended Version. Computer Speech and Language, October 2001, pages 403-434.
-
(2001)
Computer Speech and Language
, pp. 403-434
-
-
Goodman, J.T.1
-
15
-
-
33646138699
-
Using Markov chains for identification of writers
-
Khmelev, D., Tweedie, F.: Using Markov Chains for Identification of Writers. Literary and Linguistic Computing 16(4) (2001):299-307
-
(2001)
Literary and Linguistic Computing
, vol.16
, Issue.4
, pp. 299-307
-
-
Khmelev, D.1
Tweedie, F.2
-
21
-
-
84861237266
-
LZW source code
-
October
-
Nelson, Mark R.: LZW source code. Dr. Dobb's Journal, October, 1989 (Also available at http://www.dogma.net/markn/articles/lzw/lzw.htm).
-
(1989)
Dr. Dobb's Journal
-
-
Nelson, M.R.1
-
22
-
-
35248883872
-
Combining naive bayes and n-gram language models for text classification
-
Proc. of The 25th European Conference on Information Retrieval Research (ECIR03)
-
Peng, F., Schuurmans, D.: Combining Naive Bayes and n-gram language models for text classification. Proc. of The 25th European Conference on Information Retrieval Research (ECIR03), LNCS 2633 (2003):335-350
-
(2003)
LNCS
, vol.2633
, pp. 335-350
-
-
Peng, F.1
Schuurmans, D.2
-
23
-
-
3843083955
-
Augmenting Naive Bayes classifiers with statistical language models
-
Peng, F., Schuurmans, D., Wang, S.: Augmenting Naive Bayes classifiers with statistical language models. Information Retrieval 7 (2004):317-345.
-
(2004)
Information Retrieval
, vol.7
, pp. 317-345
-
-
Peng, F.1
Schuurmans, D.2
Wang, S.3
-
25
-
-
84861251797
-
-
Version 3.30 (22 Jan). Copyright (c) 1993-2004 Eugene Roshal
-
RAR compression tool by RAR Labs, Inc. (www.rarlab.com). Version 3.30 (22 Jan 2004). Copyright (c) 1993-2004 Eugene Roshal.
-
(2004)
-
-
-
26
-
-
1942484786
-
Tackling the poor assumptions of Naive Bayes text classifiers
-
Rennie, J. D. M., Shih, L., Teevan, J., Karger, D. R.: Tackling the poor assumptions of Naive Bayes text classifiers. Proc. of the Twentieth International Conference on Machine Learning (2003)
-
(2003)
Proc. of the Twentieth International Conference on Machine Learning
-
-
Rennie, J.D.M.1
Shih, L.2
Teevan, J.3
Karger, D.R.4
-
27
-
-
24644432371
-
-
Personal communication
-
Roshal, Eugene (RAR Labs Inc.): Personal communication (2004)
-
(2004)
-
-
Roshal, E.1
-
28
-
-
24644474482
-
Fun with your zip program: Sort through texts, and more
-
April 30
-
Schechter, B.: Fun with your zip program: Sort through texts, and more. New York Times, April 30, 2002.
-
(2002)
New York Times
-
-
Schechter, B.1
-
29
-
-
0035769083
-
Improving the efficiency of PPM algorithm
-
Shkarin, D.: Improving the efficiency of PPM algorithm. Problems of Information Transmission 34(3) (2001):44-54 (In Russian. English description available at http://www.dogma.net/DataCompression/Miscellaneous/PPMII_DCC02.pdf).
-
(2001)
Problems of Information Transmission
, vol.34
, Issue.3
, pp. 44-54
-
-
Shkarin, D.1
-
30
-
-
0002442796
-
Machine learning in automated text categorization
-
Sebastiani, F.: Machine learning in automated text categorization, ACM Computing Surveys 34(1) (2002):1-47
-
(2002)
ACM Computing Surveys
, vol.34
, Issue.1
, pp. 1-47
-
-
Sebastiani, F.1
-
35
-
-
27144441097
-
An evaluation of statistical approaches to text categorization
-
Yang, Yiming: An Evaluation of Statistical Approaches to Text Categorization. Information Retrieval 1(1/2) (1999):67-88.
-
(1999)
Information Retrieval
, vol.1
, Issue.1-2
, pp. 67-88
-
-
Yang, Y.1
-
37
-
-
24644434262
-
-
Personal communication
-
Zhang, Tong: Personal communication (2004).
-
(2004)
-
-
Zhang, T.1
-
38
-
-
0001868572
-
Text categorization based on regularized linear classification methods
-
Zhang, Tong, Oles, Frank J.: Text Categorization Based on Regularized Linear Classification Methods. Information Retrieval 4 (2001):5-31.
-
(2001)
Information Retrieval
, vol.4
, pp. 5-31
-
-
Zhang, T.1
Oles, F.J.2
-
39
-
-
1942420344
-
Modified logistic regression: An approximation to SVM and its applications in large-scale text categorization
-
Zhang, J., Jin, R., Yang, Y., Hauptmann, A.G.: Modified Logistic Regression: An Approximation to SVM and Its Applications in Large-Scale Text Categorization. Proc. of the 20th International Conference on Machine Learning (2003):888-895
-
(2003)
Proc. of the 20th International Conference on Machine Learning
, pp. 888-895
-
-
Zhang, J.1
Jin, R.2
Yang, Y.3
Hauptmann, A.G.4