메뉴 건너뛰기




Volumn 20, Issue 3, 2005, Pages 231-238

How to lie with bad data

Author keywords

Accuracy; Data consistency; Data mining; Data profiling; Data quality; Data rectification; Data ware housing; Distortion; Missing values; Record linkage

Indexed keywords


EID: 26444582963     PISSN: 08834237     EISSN: None     Source Type: Journal    
DOI: 10.1214/088342305000000269     Document Type: Article
Times cited : (51)

References (32)
  • 1
    • 1842559788 scopus 로고    scopus 로고
    • Reproducibility of SELDI-TOF protein patterns in serum: Comparing datasets from different experiments
    • BAGGERLY, K. A, MORRIS, J. S. and COOMBES, K. R. (2004). Reproducibility of SELDI-TOF protein patterns in serum: Comparing datasets from different experiments. Bioinformatics 20 777-785.
    • (2004) Bioinformatics , vol.20 , pp. 777-785
    • Baggerly, K.A.1    Morris, J.S.2    Coombes, K.R.3
  • 2
    • 0025090128 scopus 로고
    • Some sources of error in the coding of birth weight
    • BRUNSKILL, A. J. (1990). Some sources of error in the coding of birth weight. American J. Public Health 80 72-73.
    • (1990) American J. Public Health , vol.80 , pp. 72-73
    • Brunskill, A.J.1
  • 3
    • 2942535913 scopus 로고    scopus 로고
    • Proteomics and cancer: Running before we can walk?
    • CHECK, E. (2004). Proteomics and cancer: Running before we can walk? Nature 429 496-497.
    • (2004) Nature , vol.429 , pp. 496-497
    • Check, E.1
  • 4
    • 0013768045 scopus 로고
    • The case of the Indians and the teen-age widows
    • COALE, A. J. and STEPHAN, F. F. (1962). The case of the Indians and the teen-age widows. J. Amer. Statist. Assoc. 57 338-347.
    • (1962) J. Amer. Statist. Assoc. , vol.57 , pp. 338-347
    • Coale, A.J.1    Stephan, F.F.2
  • 5
    • 26444576680 scopus 로고    scopus 로고
    • Data mining: A view from down in the pit
    • DE VEAUX, R. D. (2002). Data mining: A view from down in the pit. Stats (34) 3-9.
    • (2002) Stats , Issue.34 , pp. 3-9
    • De Veaux, R.D.1
  • 6
    • 26444586966 scopus 로고    scopus 로고
    • Using data mining techniques to harvest information in clinical trials
    • New York
    • DE VEAUX, R. D., DONAHUE, R. and SMALL, R. D. (2002). Using data mining techniques to harvest information in clinical trials. Presentation at Joint Statistical Meetings, New York.
    • (2002) Joint Statistical Meetings
    • De Veaux, R.D.1    Donahue, R.2    Small, R.D.3
  • 7
    • 17744404763 scopus 로고
    • Modeling of topographic effects on Antarctic sea-ice using multivariate adaptive regression splines
    • DE VEAUX, R. D., GORDON, A., COMISO, J. and BACHERER, N. E. (1993). Modeling of topographic effects on Antarctic sea-ice using multivariate adaptive regression splines. J. Geophysical Research - Oceans 98 20, 307-20, 320.
    • (1993) J. Geophysical Research - Oceans , vol.98
    • De Veaux, R.D.1    Gordon, A.2    Comiso, J.3    Bacherer, N.E.4
  • 8
    • 17444364649 scopus 로고    scopus 로고
    • Reject inference in credit operations
    • (E. Mays, ed.). Glenlake Publishing, Chicago
    • HAND, D. J. (2001). Reject inference in credit operations. In Handbook of Credit Scoring (E. Mays, ed.) 225-240. Glenlake Publishing, Chicago.
    • (2001) Handbook of Credit Scoring , pp. 225-240
    • Hand, D.J.1
  • 9
    • 26444606587 scopus 로고    scopus 로고
    • Academic obsessions and classification realities: Ignoring practicalities in supervised classification
    • (D. Banks, L. House, F. R. McMorris, P. Arabie and W. Gaul, eds.). Springer, Berlin
    • HAND, D. J. (2004a). Academic obsessions and classification realities: Ignoring practicalities in supervised classification. In Classification, Clustering and Data Mining Applications (D. Banks, L. House, F. R. McMorris, P. Arabie and W. Gaul, eds.) 209-232. Springer, Berlin.
    • (2004) Classification, Clustering and Data Mining Applications , pp. 209-232
    • Hand, D.J.1
  • 16
    • 0006185571 scopus 로고    scopus 로고
    • Data quality in the practice of consumer product management: Evidence from the field
    • KLEIN, B. D. (1998). Data quality in the practice of consumer product management: Evidence from the field. Data Quality 4(1).
    • (1998) Data Quality , vol.4 , Issue.1
    • Klein, B.D.1
  • 17
    • 0000043085 scopus 로고
    • Statistics in society: Problems unsolved and unformulated
    • KRUSKAL, W. (1981). Statistics in society: Problems unsolved and unformulated. J. Amer. Statist. Assoc. 76 505-515.
    • (1981) J. Amer. Statist. Assoc. , vol.76 , pp. 505-515
    • Kruskal, W.1
  • 18
    • 0022544742 scopus 로고
    • Data quality and due process in large interorganizational record systems
    • LAUDON, K. C. (1986). Data quality and due process in large interorganizational record systems. Communications of the ACM 29 4-11.
    • (1986) Communications of the ACM , vol.29 , pp. 4-11
    • Laudon, K.C.1
  • 21
    • 0042277408 scopus 로고
    • Introduction to the TDQM research program
    • Total Data Quality Management Research Program
    • MADNICK, S. E. and WANG, R. Y. (1992). Introduction to the TDQM research program. Working Paper 92-01, Total Data Quality Management Research Program.
    • (1992) Working Paper , vol.92 , Issue.1
    • Madnick, S.E.1    Wang, R.Y.2
  • 22
    • 0020125408 scopus 로고
    • Estimating and improving the quality of information in a MIS
    • MOREY, R. C. (1982). Estimating and improving the quality of information in a MIS. Communications of the ACM 25 337-342.
    • (1982) Communications of the ACM , vol.25 , pp. 337-342
    • Morey, R.C.1
  • 23
    • 85001748501 scopus 로고
    • My data, right or wrong
    • PERCY, T. (1986). My data, right or wrong. Datamation 32(11) 123-124.
    • (1986) Datamation , vol.32 , Issue.11 , pp. 123-124
    • Percy, T.1
  • 25
    • 26444448232 scopus 로고    scopus 로고
    • Modeling database error rates
    • PIERCE, E. (1997). Modeling database error rates. Data Quality 3(1). Available at www.dataquality.com/dqsep97.htm.
    • (1997) Data Quality , vol.3 , Issue.1
    • Pierce, E.1
  • 26
    • 26444585814 scopus 로고    scopus 로고
    • PRICEWATERHOUSECOOPERS (2004). The Tech Spotlight 22. Available at www.pwc.com/extweb/manissue.nsf/docid/ 2D6E2F57E06E022F85256B8F006F389A.
    • (2004) The Tech Spotlight , vol.22
  • 28
    • 26444494753 scopus 로고
    • Estimating the errors remaining in a data set: Techniques for quality control
    • STRAYHORN, J. M. (1990). Estimating the errors remaining in a data set: Techniques for quality control. Amer. Statist. 44 14-18.
    • (1990) Amer. Statist. , vol.44 , pp. 14-18
    • Strayhorn, J.M.1
  • 29
    • 84989967016 scopus 로고    scopus 로고
    • Curbstoning IQ and the 2000 presidential election
    • WAINER, H. (2004). Curbstoning IQ and the 2000 presidential election. Chance 17(4) 43-46.
    • (2004) Chance , vol.17 , Issue.4 , pp. 43-46
    • Wainer, H.1
  • 30
    • 0010108621 scopus 로고
    • Data base error trapping and prediction
    • WEST, M. and WINKLER, R. L. (1991). Data base error trapping and prediction. J. Amer. Statist. Assoc. 86 987-996.
    • (1991) J. Amer. Statist. Assoc. , vol.86 , pp. 987-996
    • West, M.1    Winkler, R.L.2
  • 32
    • 0012343292 scopus 로고
    • Responsibility for raw data
    • WOLINS, L. (1962). Responsibility for raw data. American Psychologist 17 657-658.
    • (1962) American Psychologist , vol.17 , pp. 657-658
    • Wolins, L.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.