Volume 48, Issue 1, 2009, Pages 191-201

On strategies for imbalanced text classification using SVM: A comparative study

Author keywords

Imbalanced text classification; Instance weighting; Resampling; Support Vector Machines; SVM

Indexed keywords

APPLICATION DEVELOPERS; BENCHMARK DATASETS; BEST DECISION; CLASSIFICATION ACCURACY; COMPARATIVE STUDIES; IMBALANCED CLASSIFICATION; IMBALANCED TEXT CLASSIFICATION; INSTANCE WEIGHTING; NEWSGROUPS; PERFORMANCE MEASURE; REAL-WORLD; RESAMPLING; REUTERS-21578; SVM; TEST PHASIS; TEXT CLASSIFICATION; THRESHOLDING; TRAINING EXAMPLE;

EID: 70350565063     PISSN: 01679236     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.dss.2009.07.011     Document Type: Article
Times cited : (245)

References (37)
  • 1
    • 22944452794
    • Applying support vector machines to imbalanced datasets
    • Pisa, Italy
    • Akbani R., Kwek S., and Japkowicz N. Applying support vector machines to imbalanced datasets. Proc. of ECML'04 (Sep. 2004) 39-50 Pisa, Italy
    • (2004) Proc. of ECML'04 , pp. 39-50
    • Akbani, R.1    Kwek, S.2    Japkowicz, N.3
  • 3
    • 36249013642
    • A machine learning approach to web page filtering using content and structure analysis
    • Chau M., and Chen H. A machine learning approach to web page filtering using content and structure analysis. Decision Support Systems 44 2 (2008) 482-494
    • (2008) Decision Support Systems , vol.44 , Issue.2 , pp. 482-494
    • Chau, M.1    Chen, H.2
  • 5
    • 10844295777
    • Multi-class SVM with negative data selection for web page classification
    • Budapest, Hungary
    • Chen C.-M., Lee H.-M., and Kao M.-T. Multi-class SVM with negative data selection for web page classification. Proc. of IEEE Int'l Joint Conf. on Neural Networks vol. 3 (July 2004) 2047-2052 Budapest, Hungary
    • (2004) Proc. of IEEE Int'l Joint Conf. on Neural Networks , vol.3 , pp. 2047-2052
    • Chen, C.-M.1    Lee, H.-M.2    Kao, M.-T.3
  • 7
    • 33749249600
    • The relationship between precision-recall and ROC curves
    • ACM Press, Pittsburgh, Pennsylvania
    • Davis J., and Goadrich M. The relationship between precision-recall and ROC curves. Proc. of ICML'06 (June 2006), ACM Press, Pittsburgh, Pennsylvania 233-240
    • (2006) Proc. of ICML'06 , pp. 233-240
    • Davis, J.1    Goadrich, M.2
  • 8
    • 85105809948
    • Inductive learning algorithms and representations for text categorization
    • Bethesda, Maryland
    • Dumais S.T., Platt J., Heckerman D., and Sahami M. Inductive learning algorithms and representations for text categorization. Proc. of ACM CIKM'98 (Nov. 1998) 148-155 Bethesda, Maryland
    • (1998) Proc. of ACM CIKM'98 , pp. 148-155
    • Dumais, S.T.1    Platt, J.2    Heckerman, D.3    Sahami, M.4
  • 9
    • 33747771325
    • An integrated two-stage model for intelligent information routing
    • Fan W., Gordon M.D., and Pathak P. An integrated two-stage model for intelligent information routing. Decision Support Systems 42 1 (2006) 362-374
    • (2006) Decision Support Systems , vol.42 , Issue.1 , pp. 362-374
    • Fan, W.1    Gordon, M.D.2    Pathak, P.3
  • 10
    • 0345438685
    • ROC graphs: Notes and practical considerations for data mining researchers
    • Technical Report HPL-2003-4, HP Laboratories, Jan
    • Fawcett T. ROC graphs: Notes and practical considerations for data mining researchers. Technical Report HPL-2003-4, HP Laboratories, Jan. 2003. http://www.hpl.hp.com/techreports/2003/HPL-2003-4.html
    • (2003)
    • Fawcett, T.1
  • 11
    • 0242625252
    • Integrating feature and instance selection for text classification
    • Edmonton, Alberta, Canada
    • Fragoudis D., Meretakis D., and Likothanassis S. Integrating feature and instance selection for text classification. Proc. of ACM SIGKDD'02 (July 2002), Edmonton, Alberta, Canada 501-506
    • (2002) Proc. of ACM SIGKDD'02 , pp. 501-506
    • Fragoudis, D.1    Meretakis, D.2    Likothanassis, S.3
  • 12
    • 33845536164
    • The class imbalance problem: a systematic study
    • Japkowicz N., and Stephen S. The class imbalance problem: a systematic study. Intelligent Data Analysis 6 5 (2002) 429-449
    • (2002) Intelligent Data Analysis , vol.6 , Issue.5 , pp. 429-449
    • Japkowicz, N.1    Stephen, S.2
  • 13
    • 84957069814
    • Text categorization with support vector machines: learning with many relevant features
    • Springer-Verlag
    • Joachims T. Text categorization with support vector machines: learning with many relevant features. Proc. of ECML'98 (Apr. 1998) 137-142 Springer-Verlag
    • (1998) Proc. of ECML'98 , pp. 137-142
    • Joachims, T.1
  • 14
    • 31844446804
    • A support vector method for multivariate performance measures
    • Bonn, Germany
    • Joachims T. A support vector method for multivariate performance measures. Proc. of ICML'05 (Aug. 2005) 377-384 Bonn, Germany
    • (2005) Proc. of ICML'05 , pp. 377-384
    • Joachims, T.1
  • 15
    • 0002229304
    • Pairwise classification and support vector machines
    • Schölkopf B., Burges C., and Smola A. (Eds), MIT Press
    • Kreßel U.H.-G. Pairwise classification and support vector machines. In: Schölkopf B., Burges C., and Smola A. (Eds). Advances in kernel methods: support vector learning (1999), MIT Press 255-268
    • (1999) Advances in kernel methods: support vector learning , pp. 255-268
    • Kreßel, U.H.-G.1
  • 16
    • 0001972236
    • Addressing the curse of imbalanced training sets: one-sided selection
    • Kubat M., and Matwin S. Addressing the curse of imbalanced training sets: one-sided selection. Proc. of ICML'97 (July 1997) 179-186
    • (1997) Proc. of ICML'97 , pp. 179-186
    • Kubat, M.1    Matwin, S.2
  • 17
    • 0036161242
    • Text categorization with support vector machines. How to represent texts in input space?
    • Leopold E., and Kindermann J. Text categorization with support vector machines. How to represent texts in input space? Machine Learning 46 1-3 (2002) 423-444
    • (2002) Machine Learning , vol.46 , Issue.1-3 , pp. 423-444
    • Leopold, E.1    Kindermann, J.2
  • 19
    • 84878083672
    • Exploratory under-sampling for class-imbalance learning
    • Hong Kong, China
    • Liu X.-Y., Wu J., and Zhou Z.-H. Exploratory under-sampling for class-imbalance learning. Proc. of ICDM'06 (Dec. 2006) 965-969 Hong Kong, China
    • (2006) Proc. of ICDM'06 , pp. 965-969
    • Liu, X.-Y.1    Wu, J.2    Zhou, Z.-H.3
  • 20
    • 53849085839
    • Imbalanced text classification: a term weighting approach
    • Liu Y., Loh H.T., and Sun A. Imbalanced text classification: a term weighting approach. Expert Systems with Applications 36 1 (2009) 690-701
    • (2009) Expert Systems with Applications , vol.36 , Issue.1 , pp. 690-701
    • Liu, Y.1    Loh, H.T.2    Sun, A.3
  • 21
    • 0003260442
    • Combining statistical learning with a knowledge-based approach - a case study in intensive care monitoring
    • Bled, Slovenia
    • Morik K., Brockhausen P., and Joachims T. Combining statistical learning with a knowledge-based approach - a case study in intensive care monitoring. Proc. of ICML'99 (1999) 268-277 Bled, Slovenia
    • (1999) Proc. of ICML'99 , pp. 268-277
    • Morik, K.1    Brockhausen, P.2    Joachims, T.3
  • 24
    • 0002442796
    • Machine learning in automated text categorization
    • Sebastiani F. Machine learning in automated text categorization. ACM Computing Surveys 34 1 (2002) 1-47
    • (2002) ACM Computing Surveys , vol.34 , Issue.1 , pp. 1-47
    • Sebastiani, F.1
  • 25
    • 18744364452
    • Boosting support vector machines for text classification through parameter-free threshold relaxation
    • New Orleans, LA
    • Shanahan J.G., and Roma N. Boosting support vector machines for text classification through parameter-free threshold relaxation. Proc. of CIKM'03 (2003) 247-254 New Orleans, LA
    • (2003) Proc. of CIKM'03 , pp. 247-254
    • Shanahan, J.G.1    Roma, N.2
  • 26
    • 34548491642
    • An intelligent information agent for document title classification and filtering in document-intensive domains
    • Song D., Lau R.Y.K., Bruza P.D., Wong K.-F., and Chen D.-Y. An intelligent information agent for document title classification and filtering in document-intensive domains. Decision Support Systems 44 1 (2007) 251-265
    • (2007) Decision Support Systems , vol.44 , Issue.1 , pp. 251-265
    • Song, D.1    Lau, R.Y.K.2    Bruza, P.D.3    Wong, K.-F.4    Chen, D.-Y.5
  • 27
    • 33745767534
    • FISA: Feature-based instance selection for imbalanced text classification
    • Singapore
    • Sun A., Lim E.-P., Benatallah B., and Hassan M. FISA: Feature-based instance selection for imbalanced text classification. Proc. of PAKDD'06 (2006) 250-254 Singapore
    • (2006) Proc. of PAKDD'06 , pp. 250-254
    • Sun, A.1    Lim, E.-P.2    Benatallah, B.3    Hassan, M.4
  • 28
    • 1542270212
    • Web classification using support vector machine
    • ACM, McLean, Virginia, USA
    • Sun A., Lim E.-P., and Ng W.-K. Web classification using support vector machine. Proc. of WIDM'02 (2002), ACM, McLean, Virginia, USA 96-99
    • (2002) Proc. of WIDM'02 , pp. 96-99
    • Sun, A.1    Lim, E.-P.2    Ng, W.-K.3
  • 32
    • 20844441675
    • KBA: kernel boundary alignment considering imbalanced data distribution
    • Wu G., and Chang E.Y. KBA: kernel boundary alignment considering imbalanced data distribution. IEEE Transactions on Knowledge and Data Engineering (TKDE) 17 6 (June 2005) 786-795
    • (2005) IEEE Transactions on Knowledge and Data Engineering (TKDE) , vol.17 , Issue.6 , pp. 786-795
    • Wu, G.1    Chang, E.Y.2
  • 33
    • 0034785186
    • A study of thresholding strategies for text categorization
    • New Orleans, USA
    • Yang Y. A study of thresholding strategies for text categorization. Proc. of SIGIR'01 (2001) 137-145 New Orleans, USA
    • (2001) Proc. of SIGIR'01 , pp. 137-145
    • Yang, Y.1
  • 34
    • 85024373635
    • A re-examination of text categorization methods
    • Berkeley, USA
    • Yang Y., and Liu X. A re-examination of text categorization methods. Proc. of ACM SIGIR'99 (Aug. 1999) 42-49 Berkeley, USA
    • (1999) Proc. of ACM SIGIR'99 , pp. 42-49
    • Yang, Y.1    Liu, X.2
  • 35
    • 33846986488
    • An unsupervised learning approach to resolving the data imbalanced issue in supervised learning problems in functional genomics
    • Yoon K., and Kwek S. An unsupervised learning approach to resolving the data imbalanced issue in supervised learning problems in functional genomics. Proc. of International Conference on Hybrid Intelligent Systems (2005) 303-308
    • (2005) Proc. of International Conference on Hybrid Intelligent Systems , pp. 303-308
    • Yoon, K.1    Kwek, S.2
  • 36
    • 70350564445
    • Automatic online news monitoring and classification for syndromic surveillance
    • Zhang Y., Dang Y., Chen H., Thurmond M., and Larson C. Automatic online news monitoring and classification for syndromic surveillance. Decision Support Systems 47 4 (2009) 508-517
    • (2009) Decision Support Systems , vol.47 , Issue.4 , pp. 508-517
    • Zhang, Y.1    Dang, Y.2    Chen, H.3    Thurmond, M.4    Larson, C.5
  • 37
    • 16644402628
    • Feature selection for text categorization on imbalanced data
    • Zheng Z., Wu X., and Srihari R. Feature selection for text categorization on imbalanced data. SIGKDD Explorations Newsletter 6 1 (2004) 80-89
    • (2004) SIGKDD Explorations Newsletter , vol.6 , Issue.1 , pp. 80-89
    • Zheng, Z.1    Wu, X.2    Srihari, R.3


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.