Volume 103, Issue 8-9, 2003, Pages 611-621

Data mining and the impact of missing data

Author keywords

Data handling; Database management systems; Information gathering; Information retrieval

Indexed keywords

ALGORITHMS; DATA ACQUISITION; DATA HANDLING; DATABASE SYSTEMS; INFORMATION RETRIEVAL; MATHEMATICAL MODELS; MATRIX ALGEBRA; PATTERN RECOGNITION; STATISTICAL METHODS

EID: 0344687410     PISSN: 02635577     EISSN: None     Source Type: Journal    
DOI: 10.1108/02635570310497657     Document Type: Article
Times cited: 137

References (59)
  • 2
    • 0000828942
    • Missing observations in multivariate statistics I: Review of the literature
    • Afifi, A. and Elashoff, R. (1966), "Missing observations in multivariate statistics I: review of the literature", Journal of the American Statistical Association, Vol. 61, pp. 595-604.
    • (1966) Journal of the American Statistical Association , vol.61 , pp. 595-604
    • Afifi, A.1    Elashoff, R.2
  • 3
    • 0032954507
    • Applications of multiple imputation in medical studies: From AIDS to NHANES
    • Barnard, J. and Meng, X. (1999), "Applications of multiple imputation in medical studies: from AIDS to NHANES", Statistical Methods in Medical Research, Vol. 8, pp. 17-36.
    • (1999) Statistical Methods in Medical Research , vol.8 , pp. 17-36
    • Barnard, J.1    Meng, X.2
  • 5
    • 0345075738
    • The art and science of customer relationship
    • Berry, M. and Linoff, G. (2000), "The art and science of customer relationship", Industrial Management & Data Systems, Vol. 100 No. 5, pp. 245-6.
    • (2000) Industrial Management & Data Systems , vol.100 , Issue.5 , pp. 245-246
    • Berry, M.1    Linoff, G.2
  • 8
    • 0001913059
    • Multiple imputation of industry and occupation codes in census public-use samples using Bayesian logistic regression
    • Clogg, C., Rubin, D., Schenker, N., Schultz, B. and Weidman, L. (1991), "Multiple imputation of industry and occupation codes in census public-use samples using Bayesian logistic regression", Journal of the American Statistical Association, Vol. 86 No. 413, pp. 68-78.
    • (1991) Journal of the American Statistical Association , vol.86 , Issue.413 , pp. 68-78
    • Clogg, C.1    Rubin, D.2    Schenker, N.3    Schultz, B.4    Weidman, L.5
  • 9
    • 0345507244
    • Datamining for the masses
    • Darling, C.B. (1997), "Datamining for the masses", Datamation, Vol. 52, pp. 5.
    • (1997) Datamation , vol.52 , pp. 5
    • Darling, C.B.1
  • 11
    • 0003100723
    • Incomplete data in sample surveys
    • in Madow, W.G., Olkin, I. and Rubin, D. (Eds); Academic Press, New York, NY
    • Dempster, A. and Rubin, D. (1983), "Incomplete data in sample surveys", in Madow, W.G., Olkin, I. and Rubin, D. (Eds), Sample Surveys Vol. II: Theory and Annotated Bibliography, Academic Press, New York, NY, pp. 3-10.
    • (1983) Sample Surveys Vol. II: Theory and Annotated Bibliography , pp. 3-10
    • Dempster, A.1    Rubin, D.2
  • 12
    • 0002629270
    • Maximum likelihood from incomplete data via the EM algorithm (with discussion)
    • Dempster, A., Laird, N. and Rubin, D. (1977), "Maximum likelihood from incomplete data via the EM algorithm (with discussion)", Journal of the Royal Statistical Society, Vol. B39, pp. 1-38.
    • (1977) Journal of the Royal Statistical Society , vol.B39 , pp. 1-38
    • Dempster, A.1    Laird, N.2    Rubin, D.3
  • 13
    • 4243828610
    • Informative dropout in longitudinal data analysis (with discussion)
    • Diggle, P. and Kenward, M. (1994), "Informative dropout in longitudinal data analysis (with discussion)", Applied Statistics, Vol. 43, pp. 49-94.
    • (1994) Applied Statistics , vol.43 , pp. 49-94
    • Diggle, P.1    Kenward, M.2
  • 22
    • 84910339393
    • An evaluation of model-dependent and probability-sampling inferences in sample surveys
    • Hansen, M., Madow, W. and Tepping, J. (1983), "An evaluation of model-dependent and probability-sampling inferences in sample surveys", Journal of the American Statistical Association, Vol. 78, pp. 776-807.
    • (1983) Journal of the American Statistical Association , vol.78 , pp. 776-807
    • Hansen, M.1    Madow, W.2    Tepping, J.3
  • 23
    • 0000996385
    • The analysis of incomplete data
    • Hartley, H. and Hocking, R. (1971), "The analysis of incomplete data", Biometrics, Vol. 27, pp. 783-808.
    • (1971) Biometrics , vol.27 , pp. 783-808
    • Hartley, H.1    Hocking, R.2
  • 24
    • 0003217726
    • Using multiple imputations to handle nonresponse in sample surveys
    • in Madow, W. G., Olkin, I. and Rubin, D. (Eds); Academic Press, New York, NY
    • Herzog, T. and Rubin, D. (1983), "Using multiple imputations to handle nonresponse in sample surveys", in Madow, W. G., Olkin, I. and Rubin, D. (Eds), Incomplete Data in Sample Surveys, Volume 2: Theory and Bibliographies, Academic Press, New York, NY, pp. 209-45.
    • (1983) Incomplete Data in Sample Surveys, Volume 2: Theory and Bibliographies , pp. 209-245
    • Herzog, T.1    Rubin, D.2
  • 25
    • 77955249236
    • Treatment of missing data
    • D.C. Howell personal Web site
    • Howell, D.C. (1998), "Treatment of missing data", D.C. Howell personal Web site, available at: www.uvm.edu/~dhowell/StatPages/More_Stuff/Missing_Data/ Missing.html/
    • (1998)
    • Howell, D.C.1
  • 27
    • 0001621722
    • Towards a theory of self-administered questionnaire design
    • in Lyberg, L. et al. (Eds); John Wiley Company, New York, NY
    • Jenkins, C.R. and Dillman, D.A. (1997), "Towards a theory of self-administered questionnaire design", in Lyberg, L. et al. (Eds), Survey Measurement and Process Quality, John Wiley Company, New York, NY.
    • (1997) Survey Measurement and Process Quality
    • Jenkins, C.R.1    Dillman, D.A.2
  • 31
    • 33644684594
    • The curse of the missing data
    • Y. Kim personal Web site
    • Kim, Y. (2001), "The curse of the missing data", Y. Kim personal Web site, available at: http://209.68.11:8080/2ndMoment/978476655/addPostingForm/
    • (2001)
    • Kim, Y.1
  • 32
    • 0035780739
    • A review of data mining techniques
    • Lee, S. and Siau, K. (2001), "A review of data mining techniques", Industrial Management & Data Systems, Vol. 101 No. 1, pp. 41-6.
    • (2001) Industrial Management & Data Systems , vol.101 , Issue.1 , pp. 41-46
    • Lee, S.1    Siau, K.2
  • 33
    • 0345075736
    • Hypothesis testing in multiple imputation - With emphasis on mixed-up frequencies in contingency tables
    • PhD thesis, The University of Chicago, Chicago, IL
    • Li, K. (1985), "Hypothesis testing in multiple imputation - with emphasis on mixed-up frequencies in contingency tables", PhD thesis, The University of Chicago, Chicago, IL.
    • (1985)
    • Li, K.1
  • 36
    • 84950452119
    • Modeling the drop-out mechanism in repeated-measures studies
    • Little, R. (1995), "Modeling the drop-out mechanism in repeated-measures studies", Journal of the American Statistical Association, Vol. 90, pp. 1112-21.
    • (1995) Journal of the American Statistical Association , vol.90 , pp. 1112-1121
    • Little, R.1
  • 38
    • 84965572693
    • The analysis of social science data with missing values
    • Little, R. and Rubin, D. (1989), "The analysis of social science data with missing values", Sociological Methods and Research, Vol. 18, pp. 292-326.
    • (1989) Sociological Methods and Research , vol.18 , pp. 292-326
    • Little, R.1    Rubin, D.2
  • 39
    • 0042486047
    • Data warehousing, technology assessment and management
    • Ma, C., Chou, D. and Yen, D. (2000), "Data warehousing, technology assessment and management", Industrial Management & Data Systems, Vol. 100, No. 3, pp. 125-35.
    • (2000) Industrial Management & Data Systems , vol.100 , Issue.3 , pp. 125-135
    • Ma, C.1    Chou, D.2    Yen, D.3
  • 42
    • 24444442352
    • Cross-sectional imputation and longitudinal editing procedures in the survey of income and program participation
    • Technical report, Institute of Social Research, University of Michigan, Ann Arbor, MI
    • Pennell, S. (1993) "Cross-sectional imputation and longitudinal editing procedures in the survey of income and program participation", Technical report, Institute of Social Research, University of Michigan, Ann Arbor, MI.
    • (1993)
    • Pennell, S.1
  • 44
    • 0002608867
    • Multiple imputations in sample surveys - A phenomenological Bayesian approach to nonresponse
    • US Department of Commerce, Washington, DC
    • Rubin, D. (1978), "Multiple imputations in sample surveys - a phenomenological Bayesian approach to nonresponse", Imputation and Editing of Faulty or Missing Survey Data, US Department of Commerce, Washington, DC, pp. 1-23.
    • (1978) Imputation and Editing of Faulty or Missing Survey Data , pp. 1-23
    • Rubin, D.1
  • 45
    • 84940952808
    • Statistical matching using file concatenation with adjusted weights and multiple imputations
    • Rubin, D. (1986), "Statistical matching using file concatenation with adjusted weights and multiple imputations", Journal of Business and Economic Statistics, Vol. 4, pp. 87-94.
    • (1986) Journal of Business and Economic Statistics , vol.4 , pp. 87-94
    • Rubin, D.1
  • 47
    • 0030539070
    • Multiple imputation after 18+ years (with discussion)
    • Rubin, D. (1996), "Multiple imputation after 18+ years (with discussion)", Journal of the American Statistical Association, Vol. 91, pp. 473-89.
    • (1996) Journal of the American Statistical Association , vol.91 , pp. 473-489
    • Rubin, D.1
  • 48
    • 84950918760
    • Multiple imputation for interval estimation from simple random sample with ignorable nonresponse
    • Rubin, D. and Schenker, N. (1986) "Multiple imputation for interval estimation from simple random sample with ignorable nonresponse", Journal of the American Statistical Association, Vol. 81, pp. 366-74.
    • (1986) Journal of the American Statistical Association , vol.81 , pp. 366-374
    • Rubin, D.1    Schenker, N.2
  • 49
    • 0000165453
    • Imputation in surveys: Coping with reality
    • Sande, L. (1982) "Imputation in surveys: coping with reality", The American Statistician, Vol. 36, pp. 145-52.
    • (1982) The American Statistician , vol.36 , pp. 145-152
    • Sande, L.1
  • 50
    • 0003366296
    • Hot-deck imputation procedures
    • in Madow, W.G. and Olkin, I. (Eds); Academic Press, New York, NY
    • Sande, L. (1983) "Hot-deck imputation procedures", in Madow, W.G. and Olkin, I. (Eds), Incomplete Data in Sample Surveys, Vol. 3, Proceedings of the Symposium, Academic Press, New York, NY, pp. 339-49.
    • (1983) Incomplete Data in Sample Surveys, Vol. 3, Proceedings of the Symposium , pp. 339-349
    • Sande, L.1
  • 53
    • 0032219074
    • Multiple imputation for multivariate missing-data problems: A data analyst's perspective
    • Schafer, J. and Olsen, M. (1998), "Multiple imputation for multivariate missing-data problems: a data analyst's perspective", Multivariate Behavioral Research, Vol. 33, pp. 545-71.
    • (1998) Multivariate Behavioral Research , vol.33 , pp. 545-571
    • Schafer, J.1    Olsen, M.2
  • 54
    • 0345507242
    • General FAQ #25: Handling missing or incomplete data
    • Statistical Services of University of Texas
    • Statistical Services of University of Texas (2000) "General FAQ #25: handling missing or incomplete data", available at: www.utexas.edu/cc/faqs/stat/general/gen25.html
    • (2000)
  • 55
    • 0033616909
    • Multiple imputation of missing blood pressure covariates in survival analysis
    • van Buuren, S., Boshuizen, H. and Knook, D. (1999), "Multiple imputation of missing blood pressure covariates in survival analysis", Statistics in Medicine, Vol. 18, pp. 681-94.
    • (1999) Statistics in Medicine , vol.18 , pp. 681-694
    • Van Buuren, S.1    Boshuizen, H.2    Knook, D.3
  • 56
    • 0035780821
    • Managing dirty data in organizations using ERP: Lessons from a case study
    • Vosburg, J. and Kumar, A. (2001) "Managing dirty data in organizations using ERP: lessons from a case study", Industrial Management & Data Systems, Vol. 101 No. 1, pp. 21-31.
    • (2001) Industrial Management & Data Systems , vol.101 , Issue.1 , pp. 21-31
    • Vosburg, J.1    Kumar, A.2
  • 58
    • 0004204162
    • Academic Press, San Francisco, CA
    • Witten, I. and Frank, E. (2000), Data Mining, Academic Press, San Francisco, CA.
    • (2000) Data Mining
    • Witten, I.1    Frank, E.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.