메뉴 건너뛰기




Volumn , Issue , 2012, Pages 695-704

Class probability estimates are unreliable for imbalanced data (and how to fix them)

Author keywords

Class imbalance; Probability estimates

Indexed keywords

ASYMMETRIC COSTS; BAYESIAN APPROACHES; CLASS IMBALANCE; CLASS PROBABILITIES; CLASSIFICATION SYSTEM; IMBALANCED DATA; POSTERIOR DISTRIBUTIONS; PROBABILITY ESTIMATE; RARE EVENT;

EID: 84874066190     PISSN: 15504786     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICDM.2012.115     Document Type: Conference Paper
Times cited : (69)

References (29)
  • 1
    • 0030211964 scopus 로고    scopus 로고
    • Bagging predictors
    • Leo Breiman. Bagging predictors. Machine Learning, 24(2):123-140, 1996.
    • (1996) Machine Learning , vol.24 , Issue.2 , pp. 123-140
    • Breiman, L.1
  • 3
    • 0003010182 scopus 로고
    • Verification of forecasts expressed in terms of probability
    • G.W. Brier. Verification of forecasts expressed in terms of probability. Monthly weather review, 78(1):1-3, 1950.
    • (1950) Monthly Weather Review , vol.78 , Issue.1 , pp. 1-3
    • Brier, G.W.1
  • 6
    • 44649133282 scopus 로고    scopus 로고
    • Analyzing pets on imbalanced datasets when training and testing class distributions differ
    • D. Cieslak and N. Chawla. Analyzing pets on imbalanced datasets when training and testing class distributions differ. Advances in Knowledge Discovery and Data Mining, pages 519-526, 2008.
    • (2008) Advances in Knowledge Discovery and Data Mining , pp. 519-526
    • Cieslak, D.1    Chawla, N.2
  • 10
    • 0002178053 scopus 로고
    • Bias reduction of maximum likelihood estimates
    • D. Firth. Bias reduction of maximum likelihood estimates. Biometrika, 80(1):27-38, 1993.
    • (1993) Biometrika , vol.80 , Issue.1 , pp. 27-38
    • Firth, D.1
  • 11
    • 2942598504 scopus 로고    scopus 로고
    • Variable selection in data mining: Building a predictive model for bankruptcy
    • D.P. Foster and R.A. Stine. Variable selection in data mining: Building a predictive model for bankruptcy. Journal of the American Statistical Association, 99(466):303-313, 2004.
    • (2004) Journal of the American Statistical Association , vol.99 , Issue.466 , pp. 303-313
    • Foster, D.P.1    Stine, R.A.2
  • 14
    • 0032355984 scopus 로고    scopus 로고
    • Classification by pairwise coupling
    • T. Hastie and R. Tibshirani. Classification by pairwise coupling. The annals of statistics, 26(2):451-471, 1998.
    • (1998) The Annals of Statistics , vol.26 , Issue.2 , pp. 451-471
    • Hastie, T.1    Tibshirani, R.2
  • 16
    • 33845536164 scopus 로고    scopus 로고
    • The class imbalance problem: A systematic study
    • Nathalie Japkowicz and Shaju Stephen. The class imbalance problem: A systematic study. Intelligent Data Analysis, 6(5):429-449, 2002.
    • (2002) Intelligent Data Analysis , vol.6 , Issue.5 , pp. 429-449
    • Japkowicz, N.1    Stephen, S.2
  • 17
    • 4544259831 scopus 로고    scopus 로고
    • Logistic regression in rare events data
    • G. King and L. Zeng. Logistic regression in rare events data. Political analysis, 9(2):137-163, 2001.
    • (2001) Political Analysis , vol.9 , Issue.2 , pp. 137-163
    • King, G.1    Zeng, L.2
  • 18
    • 34548160247 scopus 로고    scopus 로고
    • A note on platts probabilistic outputs for support vector machines
    • H.T. Lin, C.J. Lin, and R.C. Weng. A note on platts probabilistic outputs for support vector machines. Machine learning, 68(3):267-276, 2007.
    • (2007) Machine Learning , vol.68 , Issue.3 , pp. 267-276
    • Lin, H.T.1    Lin, C.J.2    Weng, R.C.3
  • 22
    • 0003243224 scopus 로고    scopus 로고
    • Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods
    • J. Platt et al. Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. Advances in large margin classifiers, 10(3):61-74, 1999.
    • (1999) Advances in Large Margin Classifiers , vol.10 , Issue.3 , pp. 61-74
    • Platt, J.1
  • 26
    • 0022411849 scopus 로고
    • Small sample estimation of log odds ratios from logistic regression and fourfold tables
    • SD Walter. Small sample estimation of log odds ratios from logistic regression and fourfold tables. Statistics in medicine, 4(4):437-444, 1985.
    • (1985) Statistics in Medicine , vol.4 , Issue.4 , pp. 437-444
    • Walter, S.D.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.