메뉴 건너뛰기




Volumn 2, Issue 1, 2013, Pages 73-84

A novel framework for class imbalance learning using intelligent under-sampling

Author keywords

CILIUS; Class imbalance; Classification; Weighted sampling

Indexed keywords

BOTTLES; CLASSIFICATION (OF INFORMATION); LEARNING SYSTEMS; NEURAL NETWORKS; TREES (MATHEMATICS);

EID: 84900488591     PISSN: 21926352     EISSN: 21926360     Source Type: Journal    
DOI: 10.1007/s13748-012-0038-2     Document Type: Article
Times cited : (13)

References (51)
  • 4
    • 20844458491 scopus 로고    scopus 로고
    • Mining with rarity: a unifying framework
    • Weiss, G. M.: Mining with rarity: a unifying framework. ACM SIGKDD Explor. Newslett. 6(1), 7-19 (2004).
    • (2004) ACM SIGKDD Explor. Newslett. , vol.6 , Issue.1 , pp. 7-19
    • Weiss, G.M.1
  • 6
    • 41549103711 scopus 로고    scopus 로고
    • Ground-level ozone prediction by support vector machine approach with a cost-sensitive classification scheme
    • Lu, W.-Z., Wang, D.: Ground-level ozone prediction by support vector machine approach with a cost-sensitive classification scheme. Sci. Total. Environ. 395(2-3), 109-116 (2008).
    • (2008) Sci. Total. Environ. , vol.395 , Issue.2-3 , pp. 109-116
    • Lu, W.-Z.1    Wang, D.2
  • 7
    • 33646142788 scopus 로고    scopus 로고
    • Evaluation of neural networks and data mining methods on a credit assessment task for class imbalance problem
    • Huang, Y.-M., Hung, C.-M., Jiau, H. C.: Evaluation of neural networks and data mining methods on a credit assessment task for class imbalance problem. Nonlinear Anal. R. World Appl. 7(4), 720-747 (2006).
    • (2006) Nonlinear Anal. R. World Appl. , vol.7 , Issue.4 , pp. 720-747
    • Huang, Y.-M.1    Hung, C.-M.2    Jiau, H.C.3
  • 9
    • 40649126091 scopus 로고    scopus 로고
    • Training neural network classifiers for medical decision making: the effects of imbalanced datasets on classification performance
    • Mazurowski, M. A., Habas, P. A., Zurada, J. M., Lo, J. Y., Baker, J. A., Tourassi, G. D.: Training neural network classifiers for medical decision making: the effects of imbalanced datasets on classification performance. Neural Netw. 21(2-3), 427-436 (2008).
    • (2008) Neural Netw. , vol.21 , Issue.2-3 , pp. 427-436
    • Mazurowski, M.A.1    Habas, P.A.2    Zurada, J.M.3    Lo, J.Y.4    Baker, J.A.5    Tourassi, G.D.6
  • 11
    • 34548604303 scopus 로고    scopus 로고
    • Comparison of different strategies of utilizing fuzzy clustering in structure identification
    • Kiliç, K., Türksen, I. B.: Comparison of different strategies of utilizing fuzzy clustering in structure identification. Inf. Sci. 177(23), 5153-5162 (2007).
    • (2007) Inf. Sci. , vol.177 , Issue.23 , pp. 5153-5162
    • Kiliç, K.1    Türksen, I.B.2
  • 13
    • 40649114452 scopus 로고    scopus 로고
    • Robust BMPM training based on second-order cone programming and its application in medical diagnosis
    • 2008. Berlin/Heidelberg, Germany: Springer, vol. 4654, pp. 303-312
    • X. Peng and I. King, "Robust BMPM training based on second-order cone programming and its application in medical diagnosis", Neural Netw., vol. 21, no. 2-3, pp. 450-457, 2008. Berlin/Heidelberg, Germany: Springer, vol. 4654, pp. 303-312, (2007).
    • (2007) Neural Netw. , vol.21 , Issue.2-3 , pp. 450-457
    • Peng, X.1    King, I.2
  • 14
    • 77953089698 scopus 로고    scopus 로고
    • FSVM-CIL: fuzzy support vector machines for class imbalance learning
    • Batuwita, R., Palade, V.: FSVM-CIL: fuzzy support vector machines for class imbalance learning. IEEE Trans. Fuzzy Syst. 18(3), 558-571 (2010).
    • (2010) IEEE Trans. Fuzzy Syst. , vol.18 , Issue.3 , pp. 558-571
    • Batuwita, R.1    Palade, V.2
  • 15
    • 33845536164 scopus 로고    scopus 로고
    • The class imbalance problem: a systematic study
    • Japkowicz, N., Stephen, S.: The class imbalance problem: a systematic study. Intell. Data Anal. 6, 429-450 (2002).
    • (2002) Intell. Data Anal. , vol.6 , pp. 429-450
    • Japkowicz, N.1    Stephen, S.2
  • 19
    • 20844458491 scopus 로고    scopus 로고
    • Mining with rarity: a unifying framework
    • Weiss, G.: Mining with rarity: a unifying framework. SIGKDD Explor. Newslett. 6(1), 7-19 (2004).
    • (2004) SIGKDD Explor. Newslett. , vol.6 , Issue.1 , pp. 7-19
    • Weiss, G.1
  • 20
    • 0346586663 scopus 로고    scopus 로고
    • SMOTE: synthetic minority over-sampling technique
    • Chawla, N., Bowyer, K., Kegelmeyer, P.: SMOTE: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321-357 (2002).
    • (2002) J. Artif. Intell. Res. , vol.16 , pp. 321-357
    • Chawla, N.1    Bowyer, K.2    Kegelmeyer, P.3
  • 22
    • 27144540575 scopus 로고    scopus 로고
    • Class imbalances versus small disjuncts
    • Jo, T., Japkowicz, N.: Class imbalances versus small disjuncts. ACM SIGKDD Explor. Newslett. 6(1), 40-49 (2004).
    • (2004) ACM SIGKDD Explor. Newslett. , vol.6 , Issue.1 , pp. 40-49
    • Jo, T.1    Japkowicz, N.2
  • 24
    • 80055010583 scopus 로고    scopus 로고
    • Extract minimum positive and maximum negative features for imbalanced binary classification
    • Wang, J., You, J., Li, Q., Xu, Y.: Extract minimum positive and maximum negative features for imbalanced binary classification. Pattern Recognit. 45, 1136-1145 (2012).
    • (2012) Pattern Recognit , vol.45 , pp. 1136-1145
    • Wang, J.1    You, J.2    Li, Q.3    Xu, Y.4
  • 25
    • 80255133264 scopus 로고    scopus 로고
    • An experimental comparison of classification algorithms for imbalanced credit scoring data sets
    • Brown, I., Mues, C.: An experimental comparison of classification algorithms for imbalanced credit scoring data sets. Expert Syst. Appl. 39, 3446-3453 (2012).
    • (2012) Expert Syst. Appl. , vol.39 , pp. 3446-3453
    • Brown, I.1    Mues, C.2
  • 26
    • 80052414830 scopus 로고    scopus 로고
    • Evolutionary-based selection of generalized instances for imbalanced classification
    • Garcì, S., Derrac, J., Triguero, I., Carmona, C. J., Herrera, F.: Evolutionary-based selection of generalized instances for imbalanced classification. Knowl. Based Syst. 25, 3-12 (2012).
    • (2012) Knowl. Based Syst. , vol.25 , pp. 3-12
    • Garcì, S.1    Derrac, J.2    Triguero, I.3    Carmona, C.J.4    Herrera, F.5
  • 27
    • 80255137251 scopus 로고    scopus 로고
    • Dynamic classifier ensemble model for customer classification with imbalanced class distribution
    • Xiao, J., Xie, L., He, C., Jiang, X.: Dynamic classifier ensemble model for customer classification with imbalanced class distribution. Expert Syst. Appl. 39, 3668-3675 (2012).
    • (2012) Expert Syst. Appl. , vol.39 , pp. 3668-3675
    • Xiao, J.1    Xie, L.2    He, C.3    Jiang, X.4
  • 28
    • 84856964446 scopus 로고    scopus 로고
    • Analysis of preprocessing vs. cost-sensitive learning for imbalanced classification. Open problems on intrinsic data characteristics
    • López, V., Fernández, A., Moreno-Torres, J. G., Herrera, F.: Analysis of preprocessing vs. cost-sensitive learning for imbalanced classification. Open problems on intrinsic data characteristics. Expert Syst. Appl. 39, 6585-6608 (2012).
    • (2012) Expert Syst. Appl. , vol.39 , pp. 6585-6608
    • López, V.1    Fernández, A.2    Moreno-Torres, J.G.3    Herrera, F.4
  • 29
    • 84929403541 scopus 로고    scopus 로고
    • The research of imbalanced data set of sample sampling method based on K-means cluster and genetic algorithm
    • Yong, Y.: The research of imbalanced data set of sample sampling method based on K-means cluster and genetic algorithm. Energy Procedia 17, 164-170 (2012).
    • (2012) Energy Procedia , vol.17 , pp. 164-170
    • Yong, Y.1
  • 31
    • 80052394779 scopus 로고    scopus 로고
    • On the effectiveness of preprocessing methods when dealing with different levels of class imbalance
    • Garcia, V., Sanchez, J. S., Mollineda, R. A.: On the effectiveness of preprocessing methods when dealing with different levels of class imbalance. Knowl. Based Syst. 25, 13-21 (2012).
    • (2012) Knowl. Based Syst. , vol.25 , pp. 13-21
    • Garcia, V.1    Sanchez, J.S.2    Mollineda, R.A.3
  • 32
    • 77956395103 scopus 로고    scopus 로고
    • Analysis of an evolutionary RBFN design algorithm, CO2RBFN, for imbalanced data sets
    • Pérez-Godoy, M. D., Fernández, A., Rivera, A. J., del Jesus, M. J.: Analysis of an evolutionary RBFN design algorithm, CO2RBFN, for imbalanced data sets. Pattern Recognit. Lett. 31, 2375-2388 (2010).
    • (2010) Pattern Recognit. Lett. , vol.31 , pp. 2375-2388
    • Pérez-Godoy, M.D.1    Fernández, A.2    Rivera, A.J.3    del Jesus, M.J.4
  • 33
    • 77952554315 scopus 로고    scopus 로고
    • A learning method for the class imbalance problem with medical data sets
    • Li, D. C., Liu, C. W., Hu, S. C.: A learning method for the class imbalance problem with medical data sets. Comput. Biol. Med. 40, 509-518 (2010).
    • (2010) Comput. Biol. Med. , vol.40 , pp. 509-518
    • Li, D.C.1    Liu, C.W.2    Hu, S.C.3
  • 34
    • 79951944700 scopus 로고    scopus 로고
    • Exploiting probabilistic topic models to improve text categorization under class imbalance
    • Che, E., Lin, Y., Xiong, H., Luo, Q., Ma, H.: Exploiting probabilistic topic models to improve text categorization under class imbalance. Inf. Process. Manag. 47, 202-214 (2011).
    • (2011) Inf. Process. Manag. , vol.47 , pp. 202-214
    • Che, E.1    Lin, Y.2    Xiong, H.3    Luo, Q.4    Ma, H.5
  • 35
    • 75149159107 scopus 로고    scopus 로고
    • On the 2-tuples based genetic tuning performance for fuzzy rule based classification systems in imbalanced data-sets
    • Fernández, A., del Jesus, M. J., Herrera, F.: On the 2-tuples based genetic tuning performance for fuzzy rule based classification systems in imbalanced data-sets. Inf. Sci. 180, 1268-1291 (2010).
    • (2010) Inf. Sci. , vol.180 , pp. 1268-1291
    • Fernández, A.1    del Jesus, M.J.2    Herrera, F.3
  • 36
    • 0005256939 scopus 로고    scopus 로고
    • Fuzzy Algorithms with Applications to Image Processing and Pattern Recognition
    • Chi, Z., Yan, H., Pham, T.: Fuzzy Algorithms with Applications to Image Processing and Pattern Recognition. World Scientific (1996).
    • (1996) World Scientific
    • Chi, Z.1    Yan, H.2    Pham, T.3
  • 37
    • 17444427096 scopus 로고    scopus 로고
    • Hybridization of fuzzy GBML approaches for pattern classification problems
    • Ishibuchi, H., Yamamoto, T., Nakashima, T.: Hybridization of fuzzy GBML approaches for pattern classification problems. IEEE Trans. Syst. Man Cybern. B 35(2), 359-365 (2005).
    • (2005) IEEE Trans. Syst. Man Cybern. B , vol.35 , Issue.2 , pp. 359-365
    • Ishibuchi, H.1    Yamamoto, T.2    Nakashima, T.3
  • 38
    • 58349098976 scopus 로고    scopus 로고
    • Handling class imbalance in customer churn prediction
    • Burez, J., van den Poel, D.: Handling class imbalance in customer churn prediction. Expert Syst. Appl. 36, 4626-4636 (2009).
    • (2009) Expert Syst. Appl. , vol.36 , pp. 4626-4636
    • Burez, J.1    van den Poel, D.2
  • 39
    • 79151476746 scopus 로고    scopus 로고
    • Bayesian decision theory for support vector machines: imbalance measurement and feature optimization
    • Hsu, C. C., Wang, K. S., Chang, S. H.: Bayesian decision theory for support vector machines: imbalance measurement and feature optimization. Expert Syst. Appl. 38, 4698-4704 (2011).
    • (2011) Expert Syst. Appl. , vol.38 , pp. 4698-4704
    • Hsu, C.C.1    Wang, K.S.2    Chang, S.H.3
  • 40
    • 64049115860 scopus 로고    scopus 로고
    • On the influence of an adaptive inference system in fuzzy rule based classification systems for imbalanced data-sets
    • Fernández, A., del Jesus, M. J., Herrera, F.: On the influence of an adaptive inference system in fuzzy rule based classification systems for imbalanced data-sets. Expert Syst. Appl. 36, 9805-9812 (2009).
    • (2009) Expert Syst. Appl. , vol.36 , pp. 9805-9812
    • Fernández, A.1    del Jesus, M.J.2    Herrera, F.3
  • 41
    • 82355169734 scopus 로고    scopus 로고
    • The effect of class imbalance on case selection for case-based classifiers: an empirical study in the context of medical decision support
    • Malof, J. M., Mazurowski, M. A., Tourassi, G. D.: The effect of class imbalance on case selection for case-based classifiers: an empirical study in the context of medical decision support. Neural Netw. 25, 141-145 (2012).
    • (2012) Neural Netw , vol.25 , pp. 141-145
    • Malof, J.M.1    Mazurowski, M.A.2    Tourassi, G.D.3
  • 44
    • 0022471098 scopus 로고
    • Learning representations by back-propagating errors
    • doi: 10. 1038/323533a0
    • Rumelhart, D. E., Hinton, Geoffrey E., Williams, Ronald J., Learning representations by back-propagating errors. Nature 323(6088), 533-536 (1986). doi: 10. 1038/323533a0.
    • (1986) Nature , vol.323 , Issue.6088 , pp. 533-536
    • Rumelhart, D.E.1    Hinton, G.E.2    Williams Ronald, J.3
  • 45
    • 84863387880 scopus 로고    scopus 로고
    • (School of Information and Computer Science) University of California, Irvine (2007, online)
    • Asuncion, A., Newmann, D.: UCI Repository of Machine Learning Database (School of Information and Computer Science) University of California, Irvine (2007, online). http://www. ics. uci. edu/~mlearn/MLRepository. html.
    • UCI Repository of Machine Learning Database
    • Asuncion, A.1    Newmann, D.2
  • 47
    • 46849096083 scopus 로고    scopus 로고
    • A study of the behaviour of linguistic fuzzy rule based classification systems in the framework of imbalanced data-sets
    • Fernández, A., García, S., del Jesus, M. J., Herrera, F.: A study of the behaviour of linguistic fuzzy rule based classification systems in the framework of imbalanced data-sets. Fuzzy Sets Syst. 159(18), 2378-2398 (2008).
    • (2008) Fuzzy Sets Syst. , vol.159 , Issue.18 , pp. 2378-2398
    • Fernández, A.1    García, S.2    del Jesus, M.J.3    Herrera, F.4
  • 48
    • 64549120231 scopus 로고    scopus 로고
    • A study of statistical techniques and performance measures for genetics-based machine learning: accuracy and interpretability
    • doi: 10. 1007/s00500-008-0392-y
    • García, S., Fernández, A., Luengo, J., Herrera, F.: A study of statistical techniques and performance measures for genetics-based machine learning: accuracy and interpretability. Soft Comput. (2009). doi: 10. 1007/s00500-008-0392-y.
    • (2009) Soft Comput
    • García, S.1    Fernández, A.2    Luengo, J.3    Herrera, F.4
  • 49
    • 55549116330 scopus 로고    scopus 로고
    • Evolutionary rule-based systems for imbalanced datasets
    • Orriols-Puig, A., Bernadó-Mansilla, E.: Evolutionary rule-based systems for imbalanced datasets. Soft Comput. 13(3), 213-225 (2009).
    • (2009) Soft Comput. , vol.13 , Issue.3 , pp. 213-225
    • Orriols-Puig, A.1    Bernadó-Mansilla, E.2
  • 50
    • 70349270458 scopus 로고    scopus 로고
    • A study on the use of non-parametric tests for analyzing the evolutionary algorithms' behaviour: a case study on the CEC'2005 special session on real parameter optimization
    • doi: 10. 1007/s10732-008-9080-4
    • García, S., Molina, D., Lozano, M., Herrera, F.: A study on the use of non-parametric tests for analyzing the evolutionary algorithms' behaviour: a case study on the CEC'2005 special session on real parameter optimization. J Heurist. 15, 617-644 (2009). doi: 10. 1007/s10732-008-9080-4.
    • (2009) J Heurist. , vol.15 , pp. 617-644
    • García, S.1    Molina, D.2    Lozano, M.3    Herrera, F.4
  • 51
    • 60249094201 scopus 로고    scopus 로고
    • A study on the use of statistical tests for experimentation with neural networks: analysis of parametric test conditions and non-parametric tests
    • Luengo, J., García, S., Herrera, F.: A study on the use of statistical tests for experimentation with neural networks: analysis of parametric test conditions and non-parametric tests. Expert Syst. Appl. 36(2009), 7798-7808 (2009).
    • (2009) Expert Syst. Appl. , vol.36 , Issue.2009 , pp. 7798-7808
    • Luengo, J.1    García, S.2    Herrera, F.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.