Volume 42, Issue 5, 2005, Pages 695-707

An experimental investigation of the impact of aggregation on the performance of data mining with logistic regression

Author keywords

Aggregation; Area under the ROC curve; Data mining; Data warehouse; DJIA; Logistic regression; Model assessment; Model performance; Prediction; Predictive modeling; ROC

Indexed keywords

DATA PROCESSING; DATA WAREHOUSES; DECISION MAKING; FINANCE; MARKETING; MATHEMATICAL MODELS; REGRESSION ANALYSIS

EID: 18544363237     PISSN: 03787206     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.im.2004.04.005     Document Type: Article
Times cited: 12

References (49)
  • 1
    • 0022500965
    • The use of relative operating characteristic (ROC) curves in test performance evaluation
    • J.R. Beck, and E.K. Shultz The use of relative operating characteristic (ROC) curves in test performance evaluation Archives of Pathology and Laboratory Medicine 110 1986 13 20
    • (1986) Archives of Pathology and Laboratory Medicine , vol.110 , pp. 13-20
    • Beck, J.R.1    Shultz, E.K.2
  • 2
    • 0042632458
    • Theory and support for process frameworks of knowledge discovery and data mining from ERP systems
    • E. Bendoly Theory and support for process frameworks of knowledge discovery and data mining from ERP systems Information & Management 40 2003 639 647
    • (2003) Information & Management , vol.40 , pp. 639-647
    • Bendoly, E.1
  • 4
    • 0031191630
    • The use of the area under the ROC curve in the evaluation of machine learning algorithms
    • A.P. Bradley The use of the area under the ROC curve in the evaluation of machine learning algorithms Pattern Recognition 30 7 1997 1145 1159
    • (1997) Pattern Recognition , vol.30 , Issue.7 , pp. 1145-1159
    • Bradley, A.P.1
  • 7
    • 1942482069
    • Learning with non-uniform class and cost distributions: Effects and a distributed multi-classifier approach
    • P.K. Chan, S.J. Stolfo, Learning with non-uniform class and cost distributions: effects and a distributed multi-classifier approach, Work Notes KDD-98 Workshop on Distributed Data Mining (1998) 1-9.
    • (1998) Work Notes KDD-98 Workshop on Distributed Data Mining , pp. 1-9
    • Chan, P.K.1    Stolfo, S.J.2
  • 8
    • 0344811122
    • An overview of data warehousing and OLAP technology
    • S. Chaudhuri, and U. Dayal An overview of data warehousing and OLAP technology SIGMOD Record 26 1 1997 1 10
    • (1997) SIGMOD Record , vol.26 , Issue.1 , pp. 1-10
    • Chaudhuri, S.1    Dayal, U.2
  • 10
    • 0002658844
    • Maintenance of materialized views: Problems, techniques, and applications
    • A. Gupta, and I.S. Mumick Maintenance of materialized views: problems, techniques, and applications IEEE Data Engineering Bulletin 18 2 1995
    • (1995) IEEE Data Engineering Bulletin , vol.18 , Issue.2
    • Gupta, A.1    Mumick, I.S.2
  • 11
    • 0020083498
    • The meaning and use of the area under a receiver operating characteristic (ROC) curve
    • J.A. Hanley, and B.J. McNeil The meaning and use of the area under a receiver operating characteristic (ROC) curve Radiology 143 1 April 1982 29 36
    • (1982) Radiology , vol.143 , Issue.1 , pp. 29-36
    • Hanley, J.A.1    McNeil, B.J.2
  • 12
    • 0024443297
    • Receiver operating characteristics (ROC) methodology: The state of the art
    • J.A. Hanley Receiver operating characteristics (ROC) methodology: the state of the art Critical Reviews in Diagnostic Imaging 29 3 1989 307 335
    • (1989) Critical Reviews in Diagnostic Imaging , vol.29 , Issue.3 , pp. 307-335
    • Hanley, J.A.1
  • 13
    • 0020524559
    • A method of comparing the areas under receiver operating characteristic curves derived from the same cases
    • J.A. Hanley, and B.J. McNeil A method of comparing the areas under receiver operating characteristic curves derived from the same cases Radiology 148 3 1983 839 843
    • (1983) Radiology , vol.148 , Issue.3 , pp. 839-843
    • Hanley, J.A.1    McNeil, B.J.2
  • 14
    • 0013331361
    • Real-world data is dirty: Data cleansing and the merge/purge problem
    • M. Hernandez, and S. Stolfo Real-world data is dirty: data cleansing and the merge/purge problem Data Mining and Knowledge Discovery 2 1 1998 9 37
    • (1998) Data Mining and Knowledge Discovery , vol.2 , Issue.1 , pp. 9-37
    • Hernandez, M.1    Stolfo, S.2
  • 15
    • 0002940527
    • Data mining for customer service support
    • S.C. Hui, and G. Jha Data mining for customer service support Information & Management 38 2000 1 13
    • (2000) Information & Management , vol.38 , pp. 1-13
    • Hui, S.C.1    Jha, G.2
  • 16
    • 0030286485
    • The data warehouse and data mining
    • W.H. Inmon The data warehouse and data mining Communications of the ACM 39 11 Nov 1996 49 50
    • (1996) Communications of the ACM , vol.39 , Issue.11 , pp. 49-50
    • Inmon, W.H.1
  • 18
    • 2242481419
    • Learning from imbalanced data sets: A comparison of various strategies
    • Technical Report WS-00-05, July
    • N. Japkowicz, Learning from imbalanced data sets: a comparison of various strategies, AAAI 2000 Workshop on Learning from Imbalanced Data Sets, Technical Report WS-00-05, July 2000.
    • (2000) AAAI 2000 Workshop on Learning from Imbalanced Data Sets
    • Japkowicz, N.1
  • 22
    • 70449977947
    • Individual differences and decision-making using various levels of aggregation of information
    • A.L. Lederer, and J.R. Smith Individual differences and decision-making using various levels of aggregation of information Journal of Management Information Systems 5 3 1988-1989 53 69
    • (1988) Journal of Management Information Systems , vol.5 , Issue.3 , pp. 53-69
    • Lederer, A.L.1    Smith, J.R.2
  • 23
    • 0003253665
    • Special issue on materialized views and data warehousing
    • D. Lomet, J. Widom (Eds.), Special issue on materialized views and data warehousing, IEEE Data Engineering Bulletin 18 (2) 1995.
    • (1995) IEEE Data Engineering Bulletin , vol.18 , Issue.2
    • Lomet, D.1    Widom, J.2
  • 26
    • 0021203016
    • Statistical approaches to the analysis of receiver operating characteristic (ROC) curves
    • B.J. McNeil, and J.A. Hanley Statistical approaches to the analysis of receiver operating characteristic (ROC) curves Medical Decision Making 4 2 1984 137 150
    • (1984) Medical Decision Making , vol.4 , Issue.2 , pp. 137-150
    • McNeil, B.J.1    Hanley, J.A.2
  • 28
    • 0018079655
    • Basic principles of ROC analysis
    • C.E. Metz Basic principles of ROC analysis Seminars in Nuclear Medicine 8 4 1978 283 298
    • (1978) Seminars in Nuclear Medicine , vol.8 , Issue.4 , pp. 283-298
    • Metz, C.E.1
  • 29
    • 0031962305
    • Statistical comparison of two ROC-curve estimates obtained from partially-paired datasets
    • C.E. Metz, B.A. Herman, and C.A. Roe Statistical comparison of two ROC-curve estimates obtained from partially-paired datasets Medical Decision Making 18 1 January-March 1998 110 121
    • (1998) Medical Decision Making , vol.18 , Issue.1 , pp. 110-121
    • Metz, C.E.1    Herman, B.A.2    Roe, C.A.3
  • 30
    • 0024511244
    • Some practical issues of experimental design and data analysis in radiological ROC studies
    • C.E. Metz Some practical issues of experimental design and data analysis in radiological ROC studies Investigative Radiology 24 1989 234 245
    • (1989) Investigative Radiology , vol.24 , pp. 234-245
    • Metz, C.E.1
  • 31
    • 0022470978
    • ROC methodology in radiological imaging
    • C.E. Metz ROC methodology in radiological imaging Investigative Radiology 21 9 1986 720 733
    • (1986) Investigative Radiology , vol.21 , Issue.9 , pp. 720-733
    • Metz, C.E.1
  • 32
    • 0000962918
    • The effects of modes of information presentation on decision-making
    • A.R. Montazemi, and S. Wang The effects of modes of information presentation on decision-making Journal of Management Information Systems 5 3 1988-89 101 127
    • (1988) Journal of Management Information Systems , vol.5 , Issue.3 , pp. 101-127
    • Montazemi, A.R.1    Wang, S.2
  • 35
    • 0002534234
    • On comparing classifiers: A critique of current research and methods
    • S. Salzberg On comparing classifiers: a critique of current research and methods Data Mining and Knowledge Discovery 1 1999 1 12
    • (1999) Data Mining and Knowledge Discovery , vol.1 , pp. 1-12
    • Salzberg, S.1
  • 36
    • 0037411136
    • Technology and knowledge: Bridging a generating gap
    • I. Spiegler Technology and knowledge: bridging a generating gap Information & Management 40 2003 533 539
    • (2003) Information & Management , vol.40 , pp. 533-539
    • Spiegler, I.1
  • 40
    • 0024385165
    • On the statistical analysis of ROC curves
    • M.L. Thompson, and W. Zucchini On the statistical analysis of ROC curves Statistics in Medicine 8 1989 1277 1290
    • (1989) Statistics in Medicine , vol.8 , pp. 1277-1290
    • Thompson, M.L.1    Zucchini, W.2
  • 43
    • 0034894378
    • An empirical analysis of data requirements for financial forecasting with neural networks
    • S. Walczak An empirical analysis of data requirements for financial forecasting with neural networks Journal of Management Information Systems 17 4 Spring 2001 203 222
    • (2001) Journal of Management Information Systems , vol.17 , Issue.4 , pp. 203-222
    • Walczak, S.1
  • 45
    • 0003790115
    • The effect of class distribution on classifier learning: An empirical study
    • Department of Computer Science, Rutgers University, 2 August
    • G.M. Weiss, F. Provost, The effect of class distribution on classifier learning: an empirical study, Technical Report ML-TR-44, Department of Computer Science, Rutgers University, 2 August 2001, pp. 1-26.
    • (2001) Technical Report , vol.ML-TR-44 , pp. 1-26
    • Weiss, G.M.1    Provost, F.2
  • 46
    • 0003790115
    • The effect of class distribution on classifier learning
    • Department of Computer Science, Rutgers University, 11 January
    • G.M. Weiss, F. Provost, The effect of class distribution on classifier learning, Technical Report ML-TR-43, Department of Computer Science, Rutgers University, 11 January 2001, pp. 1-6.
    • (2001) Technical Report , vol.ML-TR-43 , pp. 1-6
    • Weiss, G.M.1    Provost, F.2
  • 48
    • 0141572640
    • Research issues in data warehousing
    • Ulm, Germany
    • M.C. Wu, and A.P. Buchmann Research issues in data warehousing Proceedings of the BTW'97 Ulm, Germany 1997 61 82
    • (1997) Proceedings of the BTW'97 , pp. 61-82
    • Wu, M.C.1    Buchmann, A.P.2
  • 49
    • 0027457620
    • Receiver-operating characteristic (ROC) plots: A fundamental evaluation tool in clinical medicine
    • M.H. Zweig, and G. Campbell Receiver-operating characteristic (ROC) plots: a fundamental evaluation tool in clinical medicine Clinical Chemistry 39 4 1993 561 577
    • (1993) Clinical Chemistry , vol.39 , Issue.4 , pp. 561-577
    • Zweig, M.H.1    Campbell, G.2


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.