메뉴 건너뛰기




Volumn 3408, Issue , 2005, Pages 300-314

On compression-based text classification

Author keywords

[No Author keywords available]

Indexed keywords

DATA ACQUISITION; DATA COMPRESSION; INFORMATION RETRIEVAL; PROGRAM DOCUMENTATION; TEXT PROCESSING; WORD PROCESSING;

EID: 24644522810     PISSN: 03029743     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1007/978-3-540-31865-1_22     Document Type: Conference Paper
Times cited : (58)

References (39)
  • 6
    • 0028911698 scopus 로고
    • Gauging similarity with n-grams: Language-independent categorization of text
    • Damashek, M.: Gauging similarity with n-grams: Language-independent categorization of text. Science 267(5199) (1995):843-848
    • (1995) Science , vol.267 , Issue.5199 , pp. 843-848
    • Damashek, M.1
  • 9
    • 24644482320 scopus 로고    scopus 로고
    • Using error correcting codes for efficient text classification with a large number of categories
    • Masters Thesis. Center for Automated Learning and Discovery, Carnegie Mellon University
    • Ghani, Rayid: Using Error Correcting Codes for Efficient Text Classification with a Large Number of Categories. KDD project report. Masters Thesis. Center for Automated Learning and Discovery, Carnegie Mellon University (2001)
    • (2001) KDD Project Report
    • Ghani, R.1
  • 10
    • 0035497388 scopus 로고    scopus 로고
    • A bit of progress in language modeling, extended version
    • October
    • Goodman, Joshua T.: A Bit of Progress in Language Modeling, Extended Version. Computer Speech and Language, October 2001, pages 403-434.
    • (2001) Computer Speech and Language , pp. 403-434
    • Goodman, J.T.1
  • 15
    • 33646138699 scopus 로고    scopus 로고
    • Using Markov chains for identification of writers
    • Khmelev D., Tweedie F.: Using Markov Chains for Identification of Writers. Literary and Linguistic Computing 16(4) (2001):299-307
    • (2001) Literary and Linguistic Computing , vol.16 , Issue.4 , pp. 299-307
    • Khmelev, D.1    Tweedie, F.2
  • 21
    • 84861237266 scopus 로고
    • LZW source code
    • October
    • Nelson, Mark R.: LZW source code. Dr. Dobb's Journal, October, 1989 (Also available at http://www.dogma.net/markn/articles/lzw/lzw.htm).
    • (1989) Dr. Dobb's Journal
    • Nelson, M.R.1
  • 22
    • 35248883872 scopus 로고    scopus 로고
    • Combining naive bayes and n-gram language models for text classification
    • Proc. of The 25th European Conference on Information Retrieval Research (ECIR03)
    • Peng, F., Schuurmans, D.: Combining Naive Bayes and n-gram language models for text classification. Proc. of The 25th European Conference on Information Retrieval Research (ECIR03)LNCS 2633 (2003):335-350
    • (2003) LNCS , vol.2633 , pp. 335-350
    • Peng, F.1    Schuurmans, D.2
  • 23
    • 3843083955 scopus 로고    scopus 로고
    • Augmenting Naive Bayes classifiers with statistical language models
    • Peng, F., Schuurmans, D., Wang, S.: Augmenting Naive Bayes classifiers with statistical language models. Information Retrieval 7 (2004):317-345.
    • (2004) Information Retrieval , vol.7 , pp. 317-345
    • Peng, F.1    Schuurmans, D.2    Wang, S.3
  • 25
    • 84861251797 scopus 로고    scopus 로고
    • Version 3.30 (22 Jan). Copyright (c) 1993-2004 Eugene Roshal
    • RAR compression tool by RAR Labs, Inc. (www.rarlab.com). Version 3.30 (22 Jan 2004). Copyright (c) 1993-2004 Eugene Roshal.
    • (2004)
  • 27
    • 24644432371 scopus 로고    scopus 로고
    • Personal communication
    • Rorshal, Eugene (RAR Labs Inc.): Personal communication (2004)
    • (2004)
    • Rorshal, E.1
  • 28
    • 24644474482 scopus 로고    scopus 로고
    • Fun with your zip program: Sort through texts, and more
    • April 30
    • Schechter, B.: Fun with your zip program: Sort through texts, and more. New York Times, April 30, 2002.
    • (2002) New York Times
    • Schechter, B.1
  • 29
    • 0035769083 scopus 로고    scopus 로고
    • Improving the efficiency of PPM algorithm
    • Shkarin, D.: Improving the efficiency of PPM algorithm. Problems of information transmission 34(3) (2001):44-54 (In Russian. English description available at http://www.dogma.net/DataCompression/Miscellaneous/PPMII_DCC02.pdf) .
    • (2001) Problems of Information Transmission , vol.34 , Issue.3 , pp. 44-54
    • Shkarin, D.1
  • 30
    • 0002442796 scopus 로고    scopus 로고
    • Machine learning in automated text categorization
    • Sebastiani, F.: Machine learning in automated text categorization, ACM Computing Surveys 34(1) (2002):1-47
    • (2002) ACM Computing Surveys , vol.34 , Issue.1 , pp. 1-47
    • Sebastiani, F.1
  • 35
    • 27144441097 scopus 로고    scopus 로고
    • An evaluation of statistical approaches to text categorization
    • Yang, Yiming: An Evaluation of Statistical Approaches to Text Categorization. Information Retrieval, 1(1/2):67-88. (1999).
    • (1999) Information Retrieval , vol.1 , Issue.1-2 , pp. 67-88
    • Yang, Y.1
  • 37
    • 24644434262 scopus 로고    scopus 로고
    • Personal communication
    • Zhang, Tong: Personal communication (2004).
    • (2004)
    • Zhang, T.1
  • 38
    • 0001868572 scopus 로고    scopus 로고
    • Text categorization based on regularized linear classification methods
    • Zhang, Tong, Oles, J. Frank.: Text Categorization Based on Regularized Linear Classification Methods. Information retrieval 4 (2001):5-31.
    • (2001) Information Retrieval , vol.4 , pp. 5-31
    • Zhang, T.1    Frank, O.J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.