메뉴 건너뛰기




Volumn 2, Issue 1, 1998, Pages 9-37

Real-world data is dirty: Data cleansing and the merge/purge problem

Author keywords

Data cleaning; Data cleansing; Duplicate elimination; Semantic integration

Indexed keywords


EID: 0013331361     PISSN: 13845810     EISSN: None     Source Type: Journal    
DOI: 10.1023/A:1009761603038     Document Type: Article
Times cited : (662)

References (23)
  • 1
    • 27144457080 scopus 로고    scopus 로고
    • ACM. SIGMOD record, December 1991
    • ACM. SIGMOD record, December 1991.
  • 3
    • 0023023948 scopus 로고
    • A Comparative Analysis of Methodologies for Database Schema Integration
    • December
    • Batini, C., Lenzerini, M. and Navathe, S. A Comparative Analysis of Methodologies for Database Schema Integration. ACM Computing Surverys, 18(4):323-364, December 1986.
    • (1986) ACM Computing Surverys , vol.18 , Issue.4 , pp. 323-364
    • Batini, C.1    Lenzerini, M.2    Navathe, S.3
  • 4
    • 0020763652 scopus 로고
    • Duplicate Record Elimination in Large Data Files
    • June
    • Bitton, D. and DeWitt, D. J. Duplicate Record Elimination in Large Data Files. ACM Transactions on Database Systems, 8(2):255-265, June 1983.
    • (1983) ACM Transactions on Database Systems , vol.8 , Issue.2 , pp. 255-265
    • Bitton, D.1    DeWitt, D.J.2
  • 5
    • 0020129484 scopus 로고
    • A fuzzy representation of data for relational databases
    • Buckles, B.P. and Petry, F. E. A fuzzy representation of data for relational databases. Fuzzy Sets and Systems, 7:213-226, 1982. Generally regarded as the paper that originated Fuzzy Databases.
    • (1982) Fuzzy Sets and Systems , vol.7 , pp. 213-226
    • Buckles, B.P.1    Petry, F.E.2
  • 7
    • 0002589728 scopus 로고
    • Probability Scoring for Spelling Correction
    • Church, K. W. and Gale, W. A. Probability Scoring for Spelling Correction. Statistics and Computing, 1:93-103, 1991.
    • (1991) Statistics and Computing , vol.1 , pp. 93-103
    • Church, K.W.1    Gale, W.A.2
  • 8
    • 27144521522 scopus 로고
    • Analyzing Foster Childrens' Foster Home Payments Database
    • Piatetsky-Shapiro, ed.
    • Clark, T. K. Analyzing Foster Childrens' Foster Home Payments Database. In KDD Nuggets 95:7 (http://info.gte.com/kdd/nuggets/95/), Piatetsky-Shapiro, ed., 1995.
    • (1995) KDD Nuggets , vol.95 , pp. 7
    • Clark, T.K.1
  • 9
    • 0002032320 scopus 로고
    • A Comparative Review of Selected Methods for Learning from Examples
    • R. Michalski, J. Carbonell, and T. Mitchell, editors, Morgan Kaufmann Publishers, Inc.
    • Dietterich, T. and Michalski, R. A Comparative Review of Selected Methods for Learning from Examples. In R. Michalski, J. Carbonell, and T. Mitchell, editors, Machine Learning, volume 1, pages 41-81. Morgan Kaufmann Publishers, Inc., 1983.
    • (1983) Machine Learning , vol.1 , pp. 41-81
    • Dietterich, T.1    Michalski, R.2
  • 10
    • 0017010058 scopus 로고
    • Clustering Techniques: The User's Dilema
    • Dubes, R. and Jain, A. Clustering Techniques: The User's Dilema. Pattern Recognition, 8:247-260, 1976.
    • (1976) Pattern Recognition , vol.8 , pp. 247-260
    • Dubes, R.1    Jain, A.2
  • 11
    • 0002283033 scopus 로고    scopus 로고
    • From Data Mining to Knowledge Discovery in Databases
    • Fall
    • Fayyad, U., Piatetsky-Shapiro, G. and Smyth, P. From Data Mining to Knowledge Discovery in Databases. AI Magazine, 17(3), Fall 1996.
    • (1996) AI Magazine , vol.17 , Issue.3
    • Fayyad, U.1    Piatetsky-Shapiro, G.2    Smyth, P.3
  • 13
    • 0003932959 scopus 로고
    • OPS5 User's Manual
    • Carnegie Mellon University, July
    • Forgy, C. L. OPS5 User's Manual. Technical Report CMU-CS-81-135, Carnegie Mellon University, July 1981.
    • (1981) Technical Report , vol.CMU-CS-81-135
    • Forgy, C.L.1
  • 17
    • 0026979939 scopus 로고
    • Techniques for Automatically Correcting Words in Text
    • Kukich, K. Techniques for Automatically Correcting Words in Text. ACM Computing Surveys, 24(4):377-439, 1992.
    • (1992) ACM Computing Surveys , vol.24 , Issue.4 , pp. 377-439
    • Kukich, K.1
  • 21
    • 84976776121 scopus 로고
    • Automatic spelling correction in scientific and scholarly text
    • Pollock, J. J. and Zamora, A. Automatic spelling correction in scientific and scholarly text. ACM Computing Surveys, 27(4):358-368, 1987.
    • (1987) ACM Computing Surveys , vol.27 , Issue.4 , pp. 358-368
    • Pollock, J.J.1    Zamora, A.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.