메뉴 건너뛰기




Volumn 13, Issue 11, 2002, Pages 2076-2082

Research on data quality and data cleaning: A survey

Author keywords

Data cleaning; Data cleaning framework; Data integration; Data quality; Duplicate record

Indexed keywords

CLASSIFICATION (OF INFORMATION); DATA MINING; MANAGEMENT INFORMATION SYSTEMS;

EID: 0036879367     PISSN: 10009825     EISSN: None     Source Type: Journal    
DOI: None     Document Type: Article
Times cited : (85)

References (24)
  • 3
    • 0002490026 scopus 로고    scopus 로고
    • Data cleaning: Problems and current approaches
    • Rahm, E., Do, H.H. Data cleaning: problems and current approaches. IEEE Data Engineering Bulletin, 2000, 23(4): 3-13.
    • (2000) IEEE Data Engineering Bulletin , vol.23 , Issue.4 , pp. 3-13
    • Rahm, E.1    Do, H.H.2
  • 5
    • 0013331361 scopus 로고    scopus 로고
    • Real-world data is dirty: Data cleansing and the merge/purge problem
    • Hernandez, M.A., Stolfo, S.J. Real-World data is dirty: data cleansing and the merge/purge problem. Data Mining and Knowledge Discovery, 1998, 2(1): 9-37.
    • (1998) Data Mining and Knowledge Discovery , vol.2 , Issue.1 , pp. 9-37
    • Hernandez, M.A.1    Stolfo, S.T.2
  • 6
    • 84947925307 scopus 로고    scopus 로고
    • Cleansing data for mining and warehousing
    • Bench-Capon T., Soda G. and Tjoa A.M. (ed.), Florence: Springer
    • Lee, M.L., Ling, T.W., Lu H.J., et al. Cleansing data for mining and warehousing. In: Bench-Capon, T., Soda, G., Tjoa, A.M., eds. Database and Expert Systems Applications. Florence: Springer, 1999. 751-760.
    • (1999) Database and Expert Systems Applications , pp. 751-760
    • Lee, M.L.1    Ling, T.W.2    Lu, H.J.3
  • 7
    • 0002089617 scopus 로고    scopus 로고
    • Matching algorithm within a duplicate detection system
    • Monge A.E. Matching algorithm within a duplicate detection system. IEEE Data Engineering Bulletin, 2000, 23(4): 14-20.
    • (2000) IEEE Data Engineering Bulletin , vol.23 , Issue.4 , pp. 14-20
    • Monge, A.E.1
  • 12
    • 0003108406 scopus 로고    scopus 로고
    • Using schema matching to simplify heterogeneous data translation
    • Gupta A., Shmueli O. and Widom J. (ed.), New York; Morgan Kaufmann
    • Milo, T., Zohar, S. Using schema matching to simplify heterogeneous data translation. In: Gupta, A., Shmueli, O., Widom, J., eds. Proceedings of the 24th International Conference on Very Large Data Bases. New York; Morgan Kaufmann, 1998. 122-133.
    • (1998) Proceedings of the 24th International Conference on Very Large Data Bases , pp. 122-133
    • Milo, T.1    Zohar, S.2
  • 14
    • 0013117789 scopus 로고    scopus 로고
    • Telcordia's database reconciliation and data quality analysis tool
    • Abbadi A.E., Brodie M.L. and Chakravarthy S. (ed.), Cairo: Morgan Kaufmann
    • Caruso, F., Cochinwala, M., Ganapathy, U., et al. Telcordia's database reconciliation and data quality analysis tool. In; Abbadi, A.E., Brodie, M.L., Chakravarthy, S., et al., eds. Proceedings of the 26th International Conference on Very Large Data Bases. Cairo: Morgan Kaufmann, 2000. 615-618.
    • (2000) Proceedings of the 26th International Conference on Very Large Data Bases , pp. 615-618
    • Caruso, F.1    Cochinwala, M.2    Ganapathy, U.3
  • 15
    • 62449209904 scopus 로고    scopus 로고
    • Data cleaning and integration
    • Galhardas, H. Data cleaning and integration. 2001. http://aravel.inria.fr/-galharda/cleaning.html.
    • (2001)
    • Galhardas, H.1
  • 19
    • 0002356707 scopus 로고    scopus 로고
    • Automatically extracting structure from free text addresses
    • Borkar, V., Deshmukh, K., Sarawagi, S. Automatically extracting structure from free text addresses. IEEE Data Engineering Bulletin, 2000, 23(4): 27-32.
    • (2000) IEEE Data Engineering Bulletin , vol.23 , Issue.4 , pp. 27-32
    • Borkar, V.1    Deshmukh, K.2    Sarawagi, S.3
  • 21
    • 0032091575 scopus 로고    scopus 로고
    • Integration of heterogeneous databases without common domains using queries based on textual similarity
    • Haas L. and Tiwary A. (ed.), Seattle: ACM Press
    • Cohen, W. Integration of heterogeneous databases without common domains using queries based on textual similarity. In: Haas, L., Tiwary, A., eds. Proceedings of International Conference on Management of Data. Seattle: ACM Press, 1998. 201-212.
    • (1998) Proceedings of International Conference on Management of Data , pp. 201-212
    • Cohen, W.1
  • 22
    • 0034841126 scopus 로고    scopus 로고
    • An efficient approach for detecting approximately duplicate database records
    • Chinese source
    • Qiu, Yue-feng, Tian, Zeng-ping, Ji, Wen-yun, et al. An efficient approach for detecting approximately duplicate database records, Chinese Journal of Computers, 2001, 24(1): 69-77 (in Chinese).
    • (2001) Chinese Journal of Computers , vol.24 , Issue.1 , pp. 69-77
    • Qiu, Y.-F.1    Tian, Z.-P.2    Ji, W.-Y.3
  • 23
    • 0013117790 scopus 로고    scopus 로고
    • A synthetical approach for detecting approximately duplicate database records of multi-language data
    • Chinese source
    • Yu, Rong-hua, Tian, Zeng-ping, Zhou, Ao-ying. A synthetical approach for detecting approximately duplicate database records of multi-language data. Computer Science, 2002, 29(1): 118-121 (in Chinese).
    • (2002) Computer Science , vol.29 , Issue.1 , pp. 118-121
    • Yu, R.-H.1    Tian, Z.-P.2    Zhou, A.-Y.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.