메뉴 건너뛰기




Volumn 28, Issue 1, 2016, Pages 238-251

To Combat Multi-Class Imbalanced Problems by Means of Over-Sampling Techniques

Author keywords

Mahalanobis distance; Multi class imbalance problems; over sampling techniques

Indexed keywords

ARTIFICIAL INTELLIGENCE; LEARNING SYSTEMS; SAMPLING;

EID: 84961631662     PISSN: 10414347     EISSN: None     Source Type: Journal    
DOI: 10.1109/TKDE.2015.2458858     Document Type: Conference Paper
Times cited : (319)

References (57)
  • 3
    • 27144531570 scopus 로고    scopus 로고
    • A study of the behavior of several methods for balancing machine learning training data
    • G. E. A. P. A. Batista, R. C. Prati, and W. C. Monard, "A study of the behavior of several methods for balancing machine learning training data," ACM Sigkdd Explorations Newslett., vol. 6, no. 1, pp. 20-29, 2004.
    • (2004) ACM Sigkdd Explorations Newslett. , vol.6 , Issue.1 , pp. 20-29
    • Batista, G.E.A.P.A.1    Prati, R.C.2    Monard, W.C.3
  • 4
    • 0031191630 scopus 로고    scopus 로고
    • The use of the area under the roc curve in the evaluation of machine learning algorithms
    • A. P. Bradley, "The use of the area under the roc curve in the evaluation of machine learning algorithms," Pattern Recognit., vol. 30, no. 7, pp. 1145-1159, 1997.
    • (1997) Pattern Recognit. , vol.30 , Issue.7 , pp. 1145-1159
    • Bradley, A.P.1
  • 5
    • 0030211964 scopus 로고    scopus 로고
    • Bagging predictors
    • L. Breiman, "Bagging predictors," Mach. Learn., vol. 24, no. 2, pp. 123-140, 1996.
    • (1996) Mach. Learn. , vol.24 , Issue.2 , pp. 123-140
    • Breiman, L.1
  • 8
    • 27144549260 scopus 로고    scopus 로고
    • Editorial: Special issue on learning from imbalanced data sets
    • N. V. Chawla, N. Japkowicz, and A. Kotcz, "Editorial: Special issue on learning from imbalanced data sets," ACM Sigkdd Explorations Newslett., vol. 6, no. 1, pp. 1-6, 2004.
    • (2004) ACM Sigkdd Explorations Newslett. , vol.6 , Issue.1 , pp. 1-6
    • Chawla, N.V.1    Japkowicz, N.2    Kotcz, A.3
  • 10
    • 70449457525 scopus 로고    scopus 로고
    • Sera: Selectively recursive approach towards nonstationary imbalanced stream data mining
    • S. Chen and H. He, "Sera: Selectively recursive approach towards nonstationary imbalanced stream data mining," in Proc. Int. Joint Conf. Neural Netw., 2009, pp. 522-529.
    • (2009) Proc. Int. Joint Conf. Neural Netw. , pp. 522-529
    • Chen, S.1    He, H.2
  • 11
    • 79952737601 scopus 로고    scopus 로고
    • Towards incremental learning of nonstationary imbalanced data stream: A multiple selectively recursive approach
    • S. Chen and H. He, "Towards incremental learning of nonstationary imbalanced data stream: A multiple selectively recursive approach," Evolving Syst., vol. 2, no. 1, pp. 35-50, 2011.
    • (2011) Evolving Syst. , vol.2 , Issue.1 , pp. 35-50
    • Chen, S.1    He, H.2
  • 12
    • 79959429637 scopus 로고    scopus 로고
    • Musera: Multiple selectively recursive approach towards imbalanced stream data mining
    • S. Chen, H. He, K. Li, and S. Desai, "Musera: Multiple selectively recursive approach towards imbalanced stream data mining," in Proc. Int. Joint Conf. Neural Netw., 2010, pp. 1-8.
    • (2010) Proc. Int. Joint Conf. Neural Netw. , pp. 1-8
    • Chen, S.1    He, H.2    Li, K.3    Desai, S.4
  • 14
    • 85149612939 scopus 로고
    • Fast effective rule induction in machine learning
    • W. W. Cohen, "Fast effective rule induction in machine learning," in Proc. 12th Int. Conf. Mach. Learn., 1995, pp. 115-123.
    • (1995) Proc. 12th Int. Conf. Mach. Learn. , pp. 115-123
    • Cohen, W.W.1
  • 15
    • 35548978022 scopus 로고    scopus 로고
    • Fast linear algebra is stable
    • J. Demmel, I. Dumitriu, and O. Holtz, "Fast linear algebra is stable," Numerische Mathematik, vol. 108, no. 1, pp. 59-91, 2007.
    • (2007) Numerische Mathematik , vol.108 , Issue.1 , pp. 59-91
    • Demmel, J.1    Dumitriu, I.2    Holtz, O.3
  • 16
    • 29644438050 scopus 로고    scopus 로고
    • Statistical comparisons of classifiers over multiple data sets
    • J. Demšar, "Statistical comparisons of classifiers over multiple data sets," The J. Mach. Learn. Res., vol. 7, pp. 1-30, 2006.
    • (2006) The J. Mach. Learn. Res. , vol.7 , pp. 1-30
    • Demšar, J.1
  • 17
    • 84943987463 scopus 로고
    • Multiple comparisons among means
    • O. J. Dunn, "Multiple comparisons among means," J. Am. Statist. Assoc., vol. 56, no. 293, pp. 52-64, 1961.
    • (1961) J. Am. Statist. Assoc. , vol.56 , Issue.293 , pp. 52-64
    • Dunn, O.J.1
  • 19
    • 84874667219 scopus 로고    scopus 로고
    • Analysing the classification of imbalanced data-sets with multiple classes: Binarization techniques and ad-hoc approaches
    • A. Fernández, V. López, M. Galar, M. D. Jesus, and F. Herrera, "Analysing the classification of imbalanced data-sets with multiple classes: Binarization techniques and ad-hoc approaches," Knowl.-Based Syst., vol. 42, pp. 97-110, 2013.
    • (2013) Knowl.-Based Syst. , vol.42 , pp. 97-110
    • Fernández, A.1    López, V.2    Galar, M.3    Jesus, M.D.4    Herrera, F.5
  • 20
    • 79953050208 scopus 로고    scopus 로고
    • A dynamic over-sampling procedure based on sensitivity for multi-class problems
    • F. Fernández-Navarro, C. Hervás-Martínez, and P. Antonio Gutiérrez, "A dynamic over-sampling procedure based on sensitivity for multi-class problems," Pattern Recognit., vol. 44, no. 8, pp. 1821-1833, 2011.
    • (2011) Pattern Recognit. , vol.44 , Issue.8 , pp. 1821-1833
    • Fernández-Navarro, F.1    Hervás-Martínez, C.2    Antonio Gutiérrez, P.3
  • 22
    • 0002978642 scopus 로고    scopus 로고
    • Experiments with a new boosting algorithm
    • Y. Freund and R. E. Schapire, "Experiments with a new boosting algorithm," in Proc. Int. Conf.Mach. Learn., 1996, vol. 96, pp. 148-156.
    • (1996) Proc. Int. Conf.Mach. Learn. , vol.96 , pp. 148-156
    • Freund, Y.1    Schapire, R.E.2
  • 23
    • 84944811700 scopus 로고
    • The use of ranks to avoid the assumption of normality implicit in the analysis of variance
    • M. Friedman, "The use of ranks to avoid the assumption of normality implicit in the analysis of variance," J. Am. Statist. Assoc., vol. 32, no. 200, pp. 675-701, 1937.
    • (1937) J. Am. Statist. Assoc. , vol.32 , Issue.200 , pp. 675-701
    • Friedman, M.1
  • 24
    • 80052394779 scopus 로고    scopus 로고
    • On the effectiveness of preprocessing methods when dealing with different levels of class imbalance
    • V. García, J. S. Sánchez, and R. A. Mollineda, "On the effectiveness of preprocessing methods when dealing with different levels of class imbalance," Knowl.-Based Syst., vol. 25, no. 1, pp. 13-21, 2012.
    • (2012) Knowl.-Based Syst. , vol.25 , Issue.1 , pp. 13-21
    • García, V.1    Sánchez, J.S.2    Mollineda, R.A.3
  • 25
    • 27144501672 scopus 로고    scopus 로고
    • Borderline-SMOTE: A new over-sampling method in imbalanced data sets learning
    • H. Han, W.-Y. Wang, and B.-H. Mao, "Borderline-SMOTE: A new over-sampling method in imbalanced data sets learning," in Proc. Int. Conf. Adv. Intell. Comput., 2005, pp. 878-887.
    • (2005) Proc. Int. Conf. Adv. Intell. Comput. , pp. 878-887
    • Han, H.1    Wang, W.-Y.2    Mao, B.-H.3
  • 26
    • 0003562954 scopus 로고    scopus 로고
    • A simple generalisation of the area under the roc curve for multiple class classification problems
    • D. J. Hand and R. J. Till, "A simple generalisation of the area under the roc curve for multiple class classification problems," Mach. Learn., vol. 45, no. 2, pp. 171-186, 2001.
    • (2001) Mach. Learn. , vol.45 , Issue.2 , pp. 171-186
    • Hand, D.J.1    Till, R.J.2
  • 27
    • 0020083498 scopus 로고
    • The meaning and use of the area under a receiver operating characteristic (roc) curve
    • J. A. Hanley and B. J. McNeil, "The meaning and use of the area under a receiver operating characteristic (roc) curve," Radiology, vol. 143, no. 1, pp. 29-36, 1982.
    • (1982) Radiology , vol.143 , Issue.1 , pp. 29-36
    • Hanley, J.A.1    McNeil, B.J.2
  • 28
    • 84931162639 scopus 로고
    • The condensed nearest neighbor rule (corresp.)
    • May
    • P. E. Hart, "The condensed nearest neighbor rule (corresp.)," IEEE Trans. Inf. Theory, vol. 14, no. 3, pp. 515-516, May 1968.
    • (1968) IEEE Trans. Inf. Theory , vol.14 , Issue.3 , pp. 515-516
    • Hart, P.E.1
  • 29
    • 0032355984 scopus 로고    scopus 로고
    • Classification by pairwise coupling
    • T. Hastie, R. Tibshirani, et al., "Classification by pairwise coupling," The Ann. Statist., vol. 26, no. 2, pp. 451-471, 1998.
    • (1998) The Ann. Statist. , vol.26 , Issue.2 , pp. 451-471
    • Hastie, T.1    Tibshirani, R.2
  • 31
    • 68549133155 scopus 로고    scopus 로고
    • Learning from imbalanced data
    • Sep.
    • H. He and E. A. Garcia, "Learning from imbalanced data," IEEE Trans. Knowl. Data Eng., vol. 21, no. 9, pp. 1263-1284, Sep. 2009.
    • (2009) IEEE Trans. Knowl. Data Eng. , vol.21 , Issue.9 , pp. 1263-1284
    • He, H.1    Garcia, E.A.2
  • 32
    • 0001750957 scopus 로고
    • Approximations of the critical region of the fbietkan statistic
    • R. L. Iman and J. M. Davenport, "Approximations of the critical region of the fbietkan statistic," Commun. Statist.-Theory Methods, vol. 9, no. 6, pp. 571-595, 1980.
    • (1980) Commun. Statist.-Theory Methods , vol.9 , Issue.6 , pp. 571-595
    • Iman, R.L.1    Davenport, J.M.2
  • 33
    • 51649085353 scopus 로고    scopus 로고
    • Evaluating boosting algorithms to classify rare classes: Comparison and improvements
    • M. V. Joshi, V. Kumar, and R. C. Agarwal, "Evaluating boosting algorithms to classify rare classes: Comparison and improvements," in Proc. IEEE Int. Conf. DataMining, 2001, pp. 257-264.
    • (2001) Proc. IEEE Int. Conf. DataMining , pp. 257-264
    • Joshi, M.V.1    Kumar, V.2    Agarwal, R.C.3
  • 36
    • 0001972236 scopus 로고    scopus 로고
    • Addressing the curse of imbalanced training sets: One-sided selection
    • M. Kubat, S. Matwin, et al., "Addressing the curse of imbalanced training sets: One-sided selection," in Proc. 14th Int. Conf. Mach. Learn., 1997, vol. 97, pp. 179-186.
    • (1997) Proc. 14th Int. Conf. Mach. Learn. , vol.97 , pp. 179-186
    • Kubat, M.1    Matwin, S.2
  • 37
    • 44949084506 scopus 로고    scopus 로고
    • Classification of weld flaws with imbalanced class data
    • T. W. Liao, "Classification of weld flaws with imbalanced class data," Expert Syst. Appl., vol. 35, no. 3, pp. 1041-1052, 2008.
    • (2008) Expert Syst. Appl. , vol.35 , Issue.3 , pp. 1041-1052
    • Liao, T.W.1
  • 38
    • 84875898112 scopus 로고    scopus 로고
    • Dynamic sampling approach to training neural networks for multiclass imbalance classification
    • Apr.
    • M. Lin, K. Tang, and X. Yao, "Dynamic sampling approach to training neural networks for multiclass imbalance classification," IEEE Trans. Neural Netw. Learn. Syst., vol. 24, no. 4, pp. 647-660, Apr. 2013.
    • (2013) IEEE Trans. Neural Netw. Learn. Syst. , vol.24 , Issue.4 , pp. 647-660
    • Lin, M.1    Tang, K.2    Yao, X.3
  • 39
    • 0002975203 scopus 로고
    • On the generalized distance in statistics
    • P. C. Mahalanobis, "On the generalized distance in statistics," in Proc. Nat. Instit. Sci., 1936, vol. 2, pp. 49-55.
    • (1936) Proc. Nat. Instit. Sci. , vol.2 , pp. 49-55
    • Mahalanobis, P.C.1
  • 40
    • 55549116330 scopus 로고    scopus 로고
    • Evolutionary rulebased systems for imbalanced data sets
    • A. Orriols-Puig and E. Bernadó-Mansilla, "Evolutionary rulebased systems for imbalanced data sets," Soft Comput., vol. 13, no. 3, pp. 213-225, 2009.
    • (2009) Soft Comput. , vol.13 , Issue.3 , pp. 213-225
    • Orriols-Puig, A.1    Bernadó-Mansilla, E.2
  • 41
    • 9444270977 scopus 로고    scopus 로고
    • Class imbalances versus class overlapping: An analysis of a learning system behavior
    • R. C. Prati, G. E. A. P. A. Batista, and M. C. Monard, "Class imbalances versus class overlapping: An analysis of a learning system behavior," in Proc. Adv. Artif. Intell., 2004, pp. 312-321.
    • (2004) Proc. Adv. Artif. Intell. , pp. 312-321
    • Prati, R.C.1    Batista, G.E.A.P.A.2    Monard, M.C.3
  • 42
    • 84861433287 scopus 로고    scopus 로고
    • A pruning-based approach for searching precise and generalized region for synthetic minority over-sampling
    • K. Puntumapon and K. Waiyamai, "A pruning-based approach for searching precise and generalized region for synthetic minority over-sampling," in Proc. 16th Pacific-Asia Conf. Adv. Knowl. Discovery Data Mining, 2012, pp. 371-382.
    • (2012) Proc. 16th Pacific-Asia Conf. Adv. Knowl. Discovery Data Mining , pp. 371-382
    • Puntumapon, K.1    Waiyamai, K.2
  • 44
    • 56749117943 scopus 로고    scopus 로고
    • In defense of one-vs-all classification
    • R. Rifkin and A. Klautau, "In defense of one-vs-all classification," The J. Mach. Learn. Res., no. 5, pp. 101-141, 2004.
    • (2004) The J. Mach. Learn. Res. , Issue.5 , pp. 101-141
    • Rifkin, R.1    Klautau, A.2
  • 47
    • 67049152595 scopus 로고    scopus 로고
    • Boosting for learning multiple classes with imbalanced class distribution
    • Y. Sun, M. S. Kamel, and Y. Wang, "Boosting for learning multiple classes with imbalanced class distribution," in Proc. 6th Int. Conf. Data Mining, 2006, pp. 592-602.
    • (2006) Proc. 6th Int. Conf. Data Mining , pp. 592-602
    • Sun, Y.1    Kamel, M.S.2    Wang, Y.3
  • 48
    • 80055040243 scopus 로고    scopus 로고
    • Towards maximizing the area under the ROC Curve for multi-class classification problems
    • K. Tang, R. Wang, and T. Chen, "Towards maximizing the area under the ROC Curve for multi-class classification problems," in Proc. 25th AAAI Conf. Artif. Intell., 2011, pp. 483-488.
    • (2011) Proc. 25th AAAI Conf. Artif. Intell. , pp. 483-488
    • Tang, K.1    Wang, R.2    Chen, T.3
  • 49
    • 79951771270 scopus 로고    scopus 로고
    • Imbalanced classification using support vector machine ensemble
    • J. Tian, H. Gu, and W. Liu, "Imbalanced classification using support vector machine ensemble," Neural Comput. Appl., vol. 20, no. 2, pp. 203-209, 2011.
    • (2011) Neural Comput. Appl. , vol.20 , Issue.2 , pp. 203-209
    • Tian, J.1    Gu, H.2    Liu, W.3
  • 50
    • 0017024036 scopus 로고
    • Two modifications of CNN
    • Nov.
    • I. Tomek, "Two modifications of CNN," IEEE Trans. Syst., Man Cybern., vol. 6, no. 11, pp. 769-772, Nov. 1976.
    • (1976) IEEE Trans. Syst., Man Cybern. , vol.6 , Issue.11 , pp. 769-772
    • Tomek, I.1
  • 51
    • 33746812429 scopus 로고    scopus 로고
    • Imbalanced data set learning with synthetic samples
    • Workshop, Ottawa, Canada, Jun.
    • B. X. Wang and N. Japkowicz, "Imbalanced data set learning with synthetic samples," in Proc. IRIS Mach. Learn. Workshop, Ottawa, Canada, Jun. 2004.
    • (2004) Proc. IRIS Mach. Learn.
    • Wang, B.X.1    Japkowicz, N.2
  • 52
    • 84864153221 scopus 로고    scopus 로고
    • Multiclass imbalance problems: Analysis and potential solutions
    • Aug.
    • S. Wang and X. Yao, "Multiclass imbalance problems: Analysis and potential solutions," IEEE Trans. Syst., Man Cybern., vol. 42, no. 4, pp. 1119-1130, Aug. 2012.
    • (2012) IEEE Trans. Syst., Man Cybern. , vol.42 , Issue.4 , pp. 1119-1130
    • Wang, S.1    Yao, X.2
  • 57
    • 31344442851 scopus 로고    scopus 로고
    • Training cost-sensitive neural networks with methods addressing the class imbalance problem
    • Jan.
    • Z.-H. Zhou and X.-Y. Liu, "Training cost-sensitive neural networks with methods addressing the class imbalance problem," IEEE Trans. Knowl. Data Eng., vol. 18, no. 1, pp. 63-77, Jan. 2006.
    • (2006) IEEE Trans. Knowl. Data Eng. , vol.18 , Issue.1 , pp. 63-77
    • Zhou, Z.-H.1    Liu, X.-Y.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.