Volume 20, Issue 5, 2008, Pages 641-652

Text clustering with feature selection by using statistical data

Author keywords

Chi2 statistic; Feature selection; Performance analysis; Text clustering; Text mining

Indexed keywords

CHI2 STATISTIC; FEATURE SELECTION; PERFORMANCE ANALYSIS; TEXT CLUSTERING; TEXT MINING;

EID: 60249094938     PISSN: 10414347     EISSN: None     Source Type: Journal    
DOI: 10.1109/TKDE.2007.190740     Document Type: Article
Times cited: 176

References (24)
  • 4
    • 70350286850
    • Classic data set, available at ftp://ftp.cs.cornell.edu/pub/smart/.
    • Classic Data Set
  • 5
    • 0013326060
    • Feature Selection for Classification
    • M. Dash and H. Liu, "Feature Selection for Classification," Intelligent Data Analysis, vol.1, no.3, pp. 131-156, 1997.
    • (1997) Intelligent Data Analysis , vol.1 , Issue.3 , pp. 131-156
    • Dash, M.1    Liu, H.2
  • 8
    • 30344466927
    • Feature selection: We've barely scratched the surface
    • G. Forman, "Feature Selection: We've Barely Scratched the Surface," IEEE Intelligent Systems, November, 2005.
    • (2005) IEEE Intelligent Systems
    • Forman, G.1
  • 16
    • 33744584654
    • Induction of Decision Trees
    • J. R. Quinlan, "Induction of Decision Trees," Machine Learning, vol.1, pp. 81-106, 1986.
    • (1986) Machine Learning , vol.1 , pp. 81-106
    • Quinlan, J.R.1
  • 17
    • 70350295596
    • Reuters-21578 Distribution 1.0 available at
    • Reuters-21578 Distribution 1.0, available at http://www.daviddlewis.com/resources/testcollections/reuters21578.
  • 19
    • 0002442796
    • Machine Learning in Automated Text Categorization
    • F. Sebastiani, "Machine Learning in Automated Text Categorization," ACM Computing Surveys, vol.34, no.1, pp. 1-47, 2002.
    • (2002) ACM Computing Surveys , vol.34 , Issue.1 , pp. 1-47
    • Sebastiani, F.1
  • 21
    • 0011214473
    • The Automatic Identification of Stop Words
    • W. J. Wilbur and K. Sirotkin, "The Automatic Identification of Stop Words," Journal of Information Science, vol.18, no.1, pp. 45-55, 1992.
    • (1992) Journal of Information Science , vol.18 , Issue.1 , pp. 45-55
    • Wilbur, W.J.1    Sirotkin, K.2
  • 23
    • 0003141935
    • A comparative study on feature selection in text categorization
    • Y. Yang and J. O. Pedersen, "A Comparative Study on Feature Selection in Text Categorization," Proc. of Int'l Conf. on Machine Learning, pp. 412-420, 1997.
    • (1997) Proc. of Int'l Conf. on Machine Learning , pp. 412-420
    • Yang, Y.1    Pedersen, J.O.2
  • 24
    • 3543085722
    • Empirical and theoretical comparisons of selected criterion functions for document clustering
    • Y. Zhao and G. Karypis, "Empirical and Theoretical Comparisons of Selected Criterion Functions for Document Clustering," Machine Learning, vol.55, no.3, pp. 311-331, 2004.
    • (2004) Machine Learning , vol.55 , Issue.3 , pp. 311-331
    • Zhao, Y.1    Karypis, G.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.