메뉴 건너뛰기




Volumn , Issue , 2005, Pages 24-33

Wrapper-based computation and evaluation of sampling methods for imbalanced datasets

Author keywords

cost sensitive learning and evaluation; imbalanced datasets; SMOTE; under sampling; wrapper

Indexed keywords

AVERAGE COST; CLASSIFICATION ACCURACY; COST CURVES; COST-SENSITIVE LEARNING; DATA SETS; EVALUATION FUNCTION; LARGE SCALE SIMULATIONS; REGIONS OF INTEREST; SAMPLING METHOD; SYNTHETIC GENERATION; TEST EXAMPLES; TRUE POSITIVE RATES; UNDER-SAMPLING; WRAPPER APPROACH;

EID: 51149119276     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1089827.1089830     Document Type: Conference Paper
Times cited : (22)

References (34)
  • 7
    • 27144531570 scopus 로고    scopus 로고
    • A study of the behavior of several methods for balancing machine learning training data
    • G. E. A. P. A. Batista, R. C. Prati, and M. C. Monard. A study of the behavior of several methods for balancing machine learning training data. SIGKDD Explorations, 6(1), 2004.
    • (2004) SIGKDD Explorations , vol.6 , Issue.1
    • Batista, G.E.A.P.A.1    Prati, R.C.2    Monard, M.C.3
  • 8
    • 0003408496 scopus 로고    scopus 로고
    • Department of Information and Computer Sciences, University of California, Irvine
    • C. Blake and C. Merz. UCI Repository of Machine Learning Databases. Department of Information and Computer Sciences, University of California, Irvine, 1998.
    • (1998) UCI Repository of Machine Learning Databases
    • Blake, C.1    Merz, C.2
  • 11
  • 12
    • 68549121111 scopus 로고    scopus 로고
    • C4.5 and imbalanced datasets: Investigating the effect of ampling method, probabilistic estimate, and decision tree structure
    • N. V. Chawla. C4.5 and imbalanced datasets: Investigating the effect of ampling method, probabilistic estimate, and decision tree structure. In Proceedings of the ICML'03 Workshop on Class Imbalances, 2003.
    • Proceedings of the ICML'03 Workshop on Class Imbalances, 2003
    • Chawla, N.V.1
  • 13
    • 27144549260 scopus 로고    scopus 로고
    • Editorial: Learning from Imbalanced Datasets
    • N. V. Chawla. Editorial: Learning from Imbalanced Datasets. SIGKDD Explorations, 6(1), 2004.
    • (2004) SIGKDD Explorations , vol.6 , Issue.1
    • Chawla, N.V.1
  • 19
  • 20
    • 0003704318 scopus 로고    scopus 로고
    • Department of Information and Computer Sciences, University of California, Irvine
    • S. Hettich and S. D. Bay. The UCI KDD Archive [http://kdd.ics.uci.edu]. Department of Information and Computer Sciences, University of California, Irvine, 1998.
    • (1998) The UCI KDD Archive
    • Hettich, S.1    Bay, S.D.2
  • 22
    • 33845536164 scopus 로고    scopus 로고
    • The Class Imbalance Problem: A Systematic Study
    • N. Japkowicz and S. Stephen. The Class Imbalance Problem: A Systematic Study. Intelligent Data Analysis, 6(5):203-231, 2002.
    • (2002) Intelligent Data Analysis , vol.6 , Issue.5 , pp. 203-231
    • Japkowicz, N.1    Stephen, S.2
  • 23
    • 0031381525 scopus 로고    scopus 로고
    • Wrappers for feature subset selection
    • R. Kohavi and G. H. John. Wrappers for feature subset selection. Artificial Intelligence, 97(1-2):273-324, 1997.
    • (1997) Artificial Intelligence , vol.97 , Issue.1-2 , pp. 273-324
    • Kohavi, R.1    John, G.H.2
  • 24
    • 0031998121 scopus 로고    scopus 로고
    • Machine Learning for the Detection of Oil Spills in Satellite Radar Images
    • M. Kubat, R. Holte, and S. Matwin. Machine Learning for the Detection of Oil Spills in Satellite Radar Images. Machine Learning, 30:195-215, 1998.
    • (1998) Machine Learning , vol.30 , pp. 195-215
    • Kubat, M.1    Holte, R.2    Matwin, S.3
  • 25
  • 29
    • 85101511266 scopus 로고    scopus 로고
    • Analysis and Visualization of Classifier Performance: Comparison under Imprecise Class and Cost Distributions
    • New Port Beach, CA, AAAI Press
    • F. Provost and T. Fawcett. Analysis and Visualization of Classifier Performance: Comparison under Imprecise Class and Cost Distributions. In Proceedings of the Third International Conference on Knowledge Discovery and Data Mining, pages 43-48, New Port Beach, CA, 1997. AAAI Press.
    • (1997) Proceedings of the Third International Conference on Knowledge Discovery and Data Mining , pp. 43-48
    • Provost, F.1    Fawcett, T.2
  • 32
    • 1442275185 scopus 로고    scopus 로고
    • Learning when Training Data are Costly: The Effect of Class Distribution on Tree Induction
    • G. Weiss and F. Provost. Learning when Training Data are Costly: The Effect of Class Distribution on Tree Induction. Journal of Artificial Intelligence Research, 19:315-354, 2003.
    • (2003) Journal of Artificial Intelligence Research , vol.19 , pp. 315-354
    • Weiss, G.1    Provost, F.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.