메뉴 건너뛰기




Volumn 45, Issue 1, 2014, Pages 1-10

T-Test feature selection approach based on term frequency for text categorization

Author keywords

Feature selection; Student t test; Term frequency; Text classification

Indexed keywords

CLASSIFICATION (OF INFORMATION); FEATURE EXTRACTION; INFORMATION FILTERING;

EID: 84896987950     PISSN: 01678655     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.patrec.2014.02.013     Document Type: Article
Times cited : (105)

References (40)
  • 1
    • 0002442796 scopus 로고    scopus 로고
    • Machine learning in automated text categorization
    • F. Sebastiani Machine learning in automated text categorization ACM Comput. Surv. 34 1 2002 1 47
    • (2002) ACM Comput. Surv. , vol.34 , Issue.1 , pp. 1-47
    • Sebastiani, F.1
  • 4
    • 84860362416 scopus 로고    scopus 로고
    • Predicting bugs components via mining bug reports
    • Deqing Wang, Hui Zhang, Rui Liu, Mengxiang Lin, and Wenjun Wu Predicting bugs components via mining bug reports J. Softw. 7 5 2012 1149 1154
    • (2012) J. Softw. , vol.7 , Issue.5 , pp. 1149-1154
    • Wang, D.1    Zhang, H.2    Liu, R.3    Lin, M.4    Wu, W.5
  • 5
    • 33744584654 scopus 로고
    • Induction of decision trees
    • R. Quinlan Induction of decision trees Mach. Learn. 1 1 1986 81 106
    • (1986) Mach. Learn. , vol.1 , Issue.1 , pp. 81-106
    • Quinlan, R.1
  • 6
    • 0003141935 scopus 로고    scopus 로고
    • A comparative study on feature selection in text categorization
    • YiMing Yang, Jan O. Pedersen, A comparative study on feature selection in text categorization, in: Proceedings of ICML, 1997, pp. 412-420.
    • (1997) Proceedings of ICML , pp. 412-420
    • Yang, Y.1    Pedersen, J.O.2
  • 7
    • 14344263547 scopus 로고    scopus 로고
    • Centroid-based document classification: Analysis & experimental results
    • E-H. Han, G. Karypis, Centroid-based document classification: analysis & experimental results, in: Proceedings of PKDD, 2000.
    • (2000) Proceedings of PKDD
    • Han, E.-H.1    Karypis, G.2
  • 8
    • 84868624925 scopus 로고    scopus 로고
    • Towards enhancing centroid classifier for text classification - A border-instance approach
    • Deqing Wang, Junjie Wu, Ke Xu, and Mengxiang Lin Towards enhancing centroid classifier for text classification - a border-instance approach Neurocomputing 101 2013 299 308
    • (2013) Neurocomputing , vol.101 , pp. 299-308
    • Wang, D.1    Wu, J.2    Xu, K.3    Lin, M.4
  • 9
    • 34249753618 scopus 로고
    • Support-vector networks
    • C. Cortes, and V. Vapnik Support-vector networks Mach. Learn. 20 1995 273 297
    • (1995) Mach. Learn. , vol.20 , pp. 273-297
    • Cortes, C.1    Vapnik, V.2
  • 10
    • 33745561205 scopus 로고    scopus 로고
    • An introduction to variable and feature selection
    • Isabelle Guyon, and Andre Elisseeff An introduction to variable and feature selection J. Mach. Learn. Res. 3 2003 1157 1182
    • (2003) J. Mach. Learn. Res. , vol.3 , pp. 1157-1182
    • Guyon, I.1    Elisseeff, A.2
  • 11
    • 57049180269 scopus 로고    scopus 로고
    • Feature selection based on the rough set theory and expectation- maximization clustering algorithm
    • F. Fazayeli, L.P. Wang, and J. Mandziuk Feature selection based on the rough set theory and expectation-maximization clustering algorithm Rough Sets Current Trends Comput. 5306 2005 272 282
    • (2005) Rough Sets Current Trends Comput. , vol.5306 , pp. 272-282
    • Fazayeli, F.1    Wang, L.P.2    Mandziuk, J.3
  • 12
    • 67349169465 scopus 로고    scopus 로고
    • Multiclass MTS for simultaneous feature selection and classification
    • Su. Chao-Ton, and Yu-Hsiang Hsiao Multiclass MTS for simultaneous feature selection and classification IEEE Trans. Knowl. Data Eng. 21 2009 192 205
    • (2009) IEEE Trans. Knowl. Data Eng. , vol.21 , pp. 192-205
    • Chao-Ton, Su.1    Hsiao, Y.-H.2
  • 14
    • 0002346866 scopus 로고    scopus 로고
    • Hierarchically classifying documents using very few words
    • D. Koller, M. Sahami, Hierarchically classifying documents using very few words, in: Proceedings of ICML, 1997, pp. 170-178.
    • (1997) Proceedings of ICML , pp. 170-178
    • Koller, D.1    Sahami, M.2
  • 15
    • 0002551285 scopus 로고    scopus 로고
    • Feature selection for unbalanced class distribution and Naive Bayes
    • D. Mladenic, M. Grobelnik, Feature selection for unbalanced class distribution and Naive Bayes, in: Proceedings of ICML, 1999.
    • (1999) Proceedings of ICML
    • Mladenic, D.1    Grobelnik, M.2
  • 16
    • 33646417914 scopus 로고    scopus 로고
    • Weighted average pointwise mutual information for feature selection in text categorization
    • Karl-Michael Schneider, Weighted average pointwise mutual information for feature selection in text categorization, in: Proceedings of The PKDD05, 2005, pp. 252-263.
    • (2005) Proceedings of the PKDD05 , pp. 252-263
    • Schneider, K.-M.1
  • 19
    • 38949156621 scopus 로고    scopus 로고
    • A Modified T-test Feature Selection Method and Its Application on the HapMap Genotype Data
    • DOI 10.1016/S1672-0229(08)60011-X, PII S167202290860011X
    • Nina Zhou, and Liping Wang A modified t-test feature selection method and its application on the hapmap genotype data Geno. Prot. Bioinfo. 5 3-4 2007 242 249 (Pubitemid 351215123)
    • (2007) Genomics, Proteomics and Bioinformatics , vol.5 , Issue.3-4 , pp. 242-249
    • Zhou, N.1    Wang, L.2
  • 20
    • 33746600352 scopus 로고    scopus 로고
    • An efficient semi-unsupervised gene selection method via spectral biclustering
    • DOI 10.1109/TNB.2006.875040, 1637452
    • Bing Liu, Chunru Wan, and L.P. Wang An efficient semi-unsupervised gene selection method via spectral biclustering IEEE Trans. Nano-Biosci. 5 2 2006 110 114 (Pubitemid 44144067)
    • (2006) IEEE Transactions on Nanobioscience , vol.5 , Issue.2 , pp. 110-114
    • Liu, B.1    Wan, C.2    Wang, L.3
  • 23
    • 0036930581 scopus 로고    scopus 로고
    • Relative term-frequency based feature selection for text categorization
    • Stewart M. Yang, Xiaobin Wu, Zhihong Deng, Ming Zhang, Dongqing Yang, Relative term-frequency based feature selection for text categorization, in: Proceedings of ICMLC, 2002.
    • (2002) Proceedings of ICMLC
    • Yang, S.M.1    Wu, X.2    Deng, Z.3    Zhang, M.4    Yang, D.5
  • 25
    • 2942731012 scopus 로고    scopus 로고
    • An extensive empirical study of feature selection metrics for text classification
    • G. Forman An extensive empirical study of feature selection metrics for text classification J. Mach. Learn. Res. 3 2003 1289 1305
    • (2003) J. Mach. Learn. Res. , vol.3 , pp. 1289-1305
    • Forman, G.1
  • 26
    • 85055298348 scopus 로고
    • Accurate methods for the statistics of surprise and coincidence
    • T. Dunning Accurate methods for the statistics of surprise and coincidence Comput. Linguist. 19 1 1993 61 74
    • (1993) Comput. Linguist. , vol.19 , Issue.1 , pp. 61-74
    • Dunning, T.1
  • 27
  • 29
    • 80955181170 scopus 로고    scopus 로고
    • A two-stage feature selection method for text categorization by using information gain, principal component analysis and genetic algorithm
    • Harun Uguz A two-stage feature selection method for text categorization by using information gain, principal component analysis and genetic algorithm Knowl.-Based Syst. 24 7 2011 1024 1032
    • (2011) Knowl.-Based Syst. , vol.24 , Issue.7 , pp. 1024-1032
    • Uguz, H.1
  • 30
    • 77956611003 scopus 로고    scopus 로고
    • Mr2pso: A maximum relevance minimum redundancy feature selection method based on swarm intelligence for support vector machine classification
    • Alper Unler, Alper Murat, and Ratna Babu Chinnam mr2pso: A maximum relevance minimum redundancy feature selection method based on swarm intelligence for support vector machine classification Inf. Sci. 181 20 2011 4625 4641
    • (2011) Inf. Sci. , vol.181 , Issue.20 , pp. 4625-4641
    • Unler, A.1    Murat, A.2    Chinnam, R.B.3
  • 32
    • 34250956889 scopus 로고
    • Eine neue herleitung des exponentialgesetzes in der wahrscheinlichkeitsrechnung
    • J.W. Lindeberg Eine neue herleitung des exponentialgesetzes in der wahrscheinlichkeitsrechnung Math. Z. 15 1922 211 225
    • (1922) Math. Z. , vol.15 , pp. 211-225
    • Lindeberg, J.W.1
  • 33
    • 34250943556 scopus 로고
    • Ueber den zentralen Grenzwertsatz der Wahrscheinlichkeitsrechnung
    • W. Feller Ueber den zentralen Grenzwertsatz der Wahrscheinlichkeitsrechnung Math. Z. 40 1935 521 559
    • (1935) Math. Z. , vol.40 , pp. 521-559
    • Feller, W.1
  • 35
    • 0345399126 scopus 로고
    • The probable error of a mean
    • S. William The probable error of a mean Biometrika 6 1 1908 1 25
    • (1908) Biometrika , vol.6 , Issue.1 , pp. 1-25
    • William, S.1
  • 36
    • 45549117987 scopus 로고
    • Term-weighting approaches in automatic text retrieval
    • G. Salton, and C. Buckley Term-weighting approaches in automatic text retrieval Inf. Process. Manage. 24 5 1988 513 523
    • (1988) Inf. Process. Manage. , vol.24 , Issue.5 , pp. 513-523
    • Salton, G.1    Buckley, C.2
  • 38
    • 37749027001 scopus 로고    scopus 로고
    • Support vector machine training for improved hidden markov modeling
    • A. Sloin, and D. Burshtein Support vector machine training for improved hidden markov modeling IEEE Trans. Signal Process. 56 2008 172 188
    • (2008) IEEE Trans. Signal Process. , vol.56 , pp. 172-188
    • Sloin, A.1    Burshtein, D.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.