메뉴 건너뛰기




Volumn 36, Issue , 2012, Pages 226-235

A novel probabilistic feature selection method for text classification

Author keywords

Dimension reduction; Feature selection; Filter; Pattern recognition; Text classification

Indexed keywords

CLASSIFICATION ACCURACY; CLASSIFICATION ALGORITHM; DATA SETS; DIMENSION REDUCTION; FEATURE SELECTION METHODS; FEATURE SPACE; FILTER; FILTER APPROACH; FILTER-BASED; GINI INDEX; HIGH DIMENSIONALITY; INFORMATION GAIN; PROCESSING TIME; SUCCESS MEASURE; TEXT CLASSIFICATION;

EID: 84867846144     PISSN: 09507051     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.knosys.2012.06.005     Document Type: Article
Times cited : (282)

References (43)
  • 3
    • 36948999941 scopus 로고    scopus 로고
    • University of California, Department of Information and Computer Science Irvine, CA
    • A. Asuncion, and D.J. Newman UCI Machine Learning Repository 2007 University of California, Department of Information and Computer Science Irvine, CA
    • (2007) UCI Machine Learning Repository
    • Asuncion, A.1    Newman, D.J.2
  • 6
    • 58349094507 scopus 로고    scopus 로고
    • Feature selection for text classification with Naive Bayes
    • J. Chen, H. Huang, S. Tian, and Y. Qu Feature selection for text classification with Naive Bayes Expert Systems with Applications 36 3 2009 5432 5435
    • (2009) Expert Systems with Applications , vol.36 , Issue.3 , pp. 5432-5435
    • Chen, J.1    Huang, H.2    Tian, S.3    Qu, Y.4
  • 7
    • 33646156614 scopus 로고    scopus 로고
    • Web page classification based on a support vector machine using a weighted vote schema
    • R.-C. Chen, and C.-H. Hsieh Web page classification based on a support vector machine using a weighted vote schema Expert Systems with Applications 31 2 2006 427 435
    • (2006) Expert Systems with Applications , vol.31 , Issue.2 , pp. 427-435
    • Chen, R.-C.1    Hsieh, C.-H.2
  • 8
    • 78650694104 scopus 로고    scopus 로고
    • Using chi-square statistics to measure similarities for text categorization
    • Y.-T. Chen, and M.C. Chen Using chi-square statistics to measure similarities for text categorization Expert Systems with Applications 38 4 2011 3085 3090
    • (2011) Expert Systems with Applications , vol.38 , Issue.4 , pp. 3085-3090
    • Chen, Y.-T.1    Chen, M.C.2
  • 11
    • 39649105441 scopus 로고    scopus 로고
    • Author identification: Using text sampling to handle the class imbalance problem
    • S. Efstathios Author identification: using text sampling to handle the class imbalance problem Information Processing and Management 44 2 2008 790 799
    • (2008) Information Processing and Management , vol.44 , Issue.2 , pp. 790-799
    • Efstathios, S.1
  • 13
    • 2942731012 scopus 로고    scopus 로고
    • An extensive empirical study of feature selection metrics for text classification
    • G. Forman An extensive empirical study of feature selection metrics for text classification Journal of Machine Learning Research 3 2003 1289 1305
    • (2003) Journal of Machine Learning Research , vol.3 , pp. 1289-1305
    • Forman, G.1
  • 15
    • 48049093694 scopus 로고    scopus 로고
    • Subspace based feature selection for pattern recognition
    • S. Gunal, and R. Edizkan Subspace based feature selection for pattern recognition Information Sciences 178 19 2008 3716 3726
    • (2008) Information Sciences , vol.178 , Issue.19 , pp. 3716-3726
    • Gunal, S.1    Edizkan, R.2
  • 17
    • 67349107478 scopus 로고    scopus 로고
    • The search for optimal feature set in power quality event classification
    • S. Gunal, O.N. Gerek, D.G. Ece, and R. Edizkan The search for optimal feature set in power quality event classification Expert Systems with Applications 36 7 2009 10266 10273
    • (2009) Expert Systems with Applications , vol.36 , Issue.7 , pp. 10266-10273
    • Gunal, S.1    Gerek, O.N.2    Ece, D.G.3    Edizkan, R.4
  • 19
    • 67349246464 scopus 로고    scopus 로고
    • A review of machine learning approaches to spam filtering
    • T.S. Guzella, and W.M. Caminhas A review of machine learning approaches to spam filtering Expert Systems with Applications 36 7 2009 10206 10222
    • (2009) Expert Systems with Applications , vol.36 , Issue.7 , pp. 10206-10222
    • Guzella, T.S.1    Caminhas, W.M.2
  • 20
    • 0036505670 scopus 로고    scopus 로고
    • A comparison of methods for multiclass support vector machines
    • C.-W. Hsu, and C.-J. Lin A comparison of methods for multiclass support vector machines IEEE Transactions on Neural Networks 13 2 2002 415 425
    • (2002) IEEE Transactions on Neural Networks , vol.13 , Issue.2 , pp. 415-425
    • Hsu, C.-W.1    Lin, C.-J.2
  • 22
    • 84957069814 scopus 로고    scopus 로고
    • Text categorization with support vector machines: Learning with many relevant features
    • T. Joachims, Text categorization with support vector machines: learning with many relevant features, in: Proceedings of the 10th European Conference on Machine Learning, 1998, pp. 137-142.
    • (1998) Proceedings of the 10th European Conference on Machine Learning , pp. 137-142
    • Joachims, T.1
  • 23
    • 0036997082 scopus 로고    scopus 로고
    • A decision-tree-based symbolic rule induction system for text categorization
    • D.E. Johnson, F.J. Oles, T. Zhang, and T. Goetz A decision-tree-based symbolic rule induction system for text categorization IBM Systems Journal 41 3 2002 428 437
    • (2002) IBM Systems Journal , vol.41 , Issue.3 , pp. 428-437
    • Johnson, D.E.1    Oles, F.J.2    Zhang, T.3    Goetz, T.4
  • 24
    • 0031381525 scopus 로고    scopus 로고
    • Wrappers for feature subset selection
    • R. Kohavi, and G.H. John Wrappers for feature subset selection Artificial Intelligence 97 1997 273 324
    • (1997) Artificial Intelligence , vol.97 , pp. 273-324
    • Kohavi, R.1    John, G.H.2
  • 25
    • 77953129520 scopus 로고    scopus 로고
    • A comparison study on multiple binary-class SVM methods for unilabel text categorization
    • M.A. Kumar, and M. Gopal A comparison study on multiple binary-class SVM methods for unilabel text categorization Pattern Recognition Letters 31 11 2010 1437 1444
    • (2010) Pattern Recognition Letters , vol.31 , Issue.11 , pp. 1437-1444
    • Kumar, M.A.1    Gopal, M.2
  • 26
    • 23744432473 scopus 로고    scopus 로고
    • Information gain and divergence-based feature selection for machine learning-based text categorization
    • C. Lee, and G.G. Lee Information gain and divergence-based feature selection for machine learning-based text categorization Information Processing and Management 42 1 2006 155 165
    • (2006) Information Processing and Management , vol.42 , Issue.1 , pp. 155-165
    • Lee, C.1    Lee, G.G.2
  • 27
    • 62349118015 scopus 로고    scopus 로고
    • Feature selection with dynamic mutual information
    • H. Liu, J. Sun, L. Liu, and H. Zhang Feature selection with dynamic mutual information Pattern Recognition 42 7 2009 1330 1339
    • (2009) Pattern Recognition , vol.42 , Issue.7 , pp. 1330-1339
    • Liu, H.1    Sun, J.2    Liu, L.3    Zhang, H.4
  • 31
    • 0037375142 scopus 로고    scopus 로고
    • Feature selection on hierarchy of web documents
    • D. Mladenic, and M. Grobelnik Feature selection on hierarchy of web documents Decision Support Systems 35 1 2003 45 87
    • (2003) Decision Support Systems , vol.35 , Issue.1 , pp. 45-87
    • Mladenic, D.1    Grobelnik, M.2
  • 32
    • 58349094495 scopus 로고    scopus 로고
    • Feature selection with a measure of deviations from Poisson in text categorization
    • H. Ogura, H. Amano, and M. Kondo Feature selection with a measure of deviations from Poisson in text categorization Decision Support Systems 36 3 2009 6826 6832
    • (2009) Decision Support Systems , vol.36 , Issue.3 , pp. 6826-6832
    • Ogura, H.1    Amano, H.2    Kondo, M.3
  • 33
    • 78650716116 scopus 로고    scopus 로고
    • A web page classification system based on a genetic algorithm using tagged-terms as features
    • S.A. Ozel A web page classification system based on a genetic algorithm using tagged-terms as features Expert Systems with Applications 38 4 2011 3407 3415
    • (2011) Expert Systems with Applications , vol.38 , Issue.4 , pp. 3407-3415
    • Ozel, S.A.1
  • 34
    • 84948481845 scopus 로고
    • An algorithm for suffix stripping
    • M.F. Porter An algorithm for suffix stripping Program 14 3 1980 130 137
    • (1980) Program , vol.14 , Issue.3 , pp. 130-137
    • Porter, M.F.1
  • 35
    • 35748932917 scopus 로고    scopus 로고
    • A review of feature selection techniques in bioinformatics
    • Y. Saeys, I. Inza, and P. Larranaga A review of feature selection techniques in bioinformatics Bioinformatics 23 19 2007 2507 2517
    • (2007) Bioinformatics , vol.23 , Issue.19 , pp. 2507-2517
    • Saeys, Y.1    Inza, I.2    Larranaga, P.3
  • 38
    • 80955181170 scopus 로고    scopus 로고
    • A two-stage feature selection method for text categorization by using information gain, principal component analysis and genetic algorithm
    • H. Uguz A two-stage feature selection method for text categorization by using information gain, principal component analysis and genetic algorithm Knowledge-Based Systems 24 7 2011 1024 1032
    • (2011) Knowledge-Based Systems , vol.24 , Issue.7 , pp. 1024-1032
    • Uguz, H.1
  • 39
    • 37249061676 scopus 로고    scopus 로고
    • A probabilistic approach to feature selection for multi-class text categorization
    • K. Wu, B.L. Lu, M. Uchiyama, and H. Isahara A probabilistic approach to feature selection for multi-class text categorization Lecture Notes in Computer Science 4491 2007 1310 1317
    • (2007) Lecture Notes in Computer Science , vol.4491 , pp. 1310-1317
    • Wu, K.1    Lu, B.L.2    Uchiyama, M.3    Isahara, H.4
  • 40
    • 79957440082 scopus 로고    scopus 로고
    • A new feature selection algorithm based on binomial hypothesis testing for spam filtering
    • J. Yang, Y. Liu, Z. Liu, X. Zhu, and X. Zhang A new feature selection algorithm based on binomial hypothesis testing for spam filtering Knowledge-Based Systems 24 6 2011 904 914
    • (2011) Knowledge-Based Systems , vol.24 , Issue.6 , pp. 904-914
    • Yang, J.1    Liu, Y.2    Liu, Z.3    Zhu, X.4    Zhang, X.5
  • 43
    • 67349121244 scopus 로고    scopus 로고
    • Combining neural networks and semantic feature space for email classification
    • B. Yu, and D.-h. Zhu Combining neural networks and semantic feature space for email classification Knowledge-Based Systems 22 5 2009 376 381
    • (2009) Knowledge-Based Systems , vol.22 , Issue.5 , pp. 376-381
    • Yu, B.1    Zhu, D.-H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.