SCOPUS 정보 검색 플랫폼

Ruan Jian Xue Bao/Journal of Software

Volumn 13, Issue 11, 2002, Pages 2076-2082

Research on data quality and data cleaning: A survey

(2) Guo, Zhi Mao a Zhou, Ao Ying a

a FUDAN UNIVERSITY (China)

Author keywords

Data cleaning; Data cleaning framework; Data integration; Data quality; Duplicate record

Indexed keywords

CLASSIFICATION (OF INFORMATION); DATA MINING; MANAGEMENT INFORMATION SYSTEMS;

DATA CLEARING; DATA INTEGRATION; DATA QUALITY; DUPLICATE RECORDS; MEASUREMENT METRICS;

DATA PROCESSING;

EID: 0036879367 PISSN: 10009825 EISSN: None Source Type: Journal
DOI: None Document Type: Article

Times cited : (83)

References (24)

1
- 0013073636
- Towards improving data quality
- Sarda N.L. (ed.), Delhi
- Aebi, D., Perrochon, L. Towards improving data quality. In; Sarda, N.L., ed. Proceedings of the International Conference on Information Systems and Management of Data. Delhi, 1993. 273-281.
- (1993) Proceedings of the International Conference on Information Systems and Management of Data , pp. 273-281
- Aebi, D.¹ Perrochon, L.²

2
- 0027228754
- Data quality requirements analysis and modeling
- Vienna: IEEE Computer Society
- Wang, R.Y., Kon, H.B., Madnick, S.E. Data quality requirements analysis and modeling. In: Proceedings of the 9th International Conference on Data Engineering. Vienna: IEEE Computer Society, 1993. 670-677.
- (1993) Proceedings of the 9th International Conference on Data Engineering , pp. 670-677
- Wang, R.Y.¹ Kion, H.B.² Madnick, S.E.³

3
- 0002490026
- Data cleaning: Problems and current approaches
- Rahm, E., Do, H.H. Data cleaning: problems and current approaches. IEEE Data Engineering Bulletin, 2000, 23(4): 3-13.
- (2000) IEEE Data Engineering Bulletin , vol.23 , Issue.4 , pp. 3-13
- Rahm, E.¹ Do, H.H.²

4
- 85012212427
- AJAX: An extensible data cleaning tool
- Chen W.D., Naughton J.F. and Bernstein P.A. (ed.), Texas: ACM
- Galhardas, H., Florescu, D., Shasha, D., et al. AJAX: an extensible data cleaning tool. In: Chen, W.D., Naughton, J.F., Bernstein, P.A., eds. Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data. Texas: ACM, 2000. 590.
- (2000) Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data , pp. 590
- Galhardas, H.¹ Florscu, D.² Shasha, D.³

5
- 0013331361
- Real-world data is dirty: Data cleansing and the merge/purge problem
- Hernandez, M.A., Stolfo, S.J. Real-World data is dirty: data cleansing and the merge/purge problem. Data Mining and Knowledge Discovery, 1998, 2(1): 9-37.
- (1998) Data Mining and Knowledge Discovery , vol.2 , Issue.1 , pp. 9-37
- Hernandez, M.A.¹ Stolfo, S.T.²

6
- 84947925307
- Cleansing data for mining and warehousing
- Bench-Capon T., Soda G. and Tjoa A.M. (ed.), Florence: Springer
- Lee, M.L., Ling, T.W., Lu H.J., et al. Cleansing data for mining and warehousing. In: Bench-Capon, T., Soda, G., Tjoa, A.M., eds. Database and Expert Systems Applications. Florence: Springer, 1999. 751-760.
- (1999) Database and Expert Systems Applications , pp. 751-760
- Lee, M.L.¹ Ling, T.W.² Lu, H.J.³

7
- 0002089617
- Matching algorithm within a duplicate detection system
- Monge A.E. Matching algorithm within a duplicate detection system. IEEE Data Engineering Bulletin, 2000, 23(4): 14-20.
- (2000) IEEE Data Engineering Bulletin , vol.23 , Issue.4 , pp. 14-20
- Monge, A.E.¹

8
- 85018108837
- The field matching problem: Algorithms and applications
- Simoudis E., Han J.W. and Fayyad U. (ed.), Oregon; AAAI Press
- Monge, A.E., Elkan, C. The field matching problem: algorithms and applications. In: Simoudis, E., Han, J.W., Fayyad, U., eds. Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining. Oregon; AAAI Press, 1996. 267-270.
- (1996) Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining , pp. 267-270
- Monge, A.E.¹ Elkan, C.²

9
- 0002082857
- An efficient algorithm for mining association rules in large databases
- Dayal U., Gray P. and Nishio S. (ed.), Zurich: Morgan Kaufmann
- Savasere, A., Omiecinski, E., Navathe, S.B. An efficient algorithm for mining association rules in large databases. In: Dayal, U., Gray, P., Nishio, S., eds. Proceedings of the 21st International Conference on Very Large Data Bases. Zurich: Morgan Kaufmann, 1995. 432-444.
- (1995) Proceedings of the 21st International Conference on Very Large Data Bases , pp. 432-444
- Savasere, A.¹ Omiecinski, E.² Navathe, S.B.³

10
- 0002880407
- Mining generalized association rules
- Dayal U., Gray P. and Nishio S. (ed.), Zurich: Morgan Kaufmann
- Srikant, R., Agrawal, R. Mining Generalized Association Rules. In: Dayal, U., Gray, P., Nishio, S., eds. Proceedings of the 21st International Conference on Very Large Data Bases. Zurich: Morgan Kaufmann, 1995. 407-419.
- (1995) Proceedings of the 21st International Conference on Very Large Data Bases , pp. 407-419
- Srikant, R.¹ Agrawal, R.²

11
- 0002296248
- Tools for data translation and integration
- Abiteboul, S., Cluet, S., Milo, T., et al. Tools for data translation and integration. IEEE Data Engineering Bulletin, 1999, 22(1): 3-8.
- (1999) IEEE Data Engineering Bulletin , vol.22 , Issue.1 , pp. 3-8
- Abiteboul, S.¹ Cluet, S.² Milo, T.³

12
- 0003108406
- Using schema matching to simplify heterogeneous data translation
- Gupta A., Shmueli O. and Widom J. (ed.), New York; Morgan Kaufmann
- Milo, T., Zohar, S. Using schema matching to simplify heterogeneous data translation. In: Gupta, A., Shmueli, O., Widom, J., eds. Proceedings of the 24th International Conference on Very Large Data Bases. New York; Morgan Kaufmann, 1998. 122-133.
- (1998) Proceedings of the 24th International Conference on Very Large Data Bases , pp. 122-133
- Milo, T.¹ Zohar, S.²

13
- 0034592786
- IntelliClean: A knowledge-based intelligent data cleaner
- Boston: ACM Press
- Lee, M.L., Ling, T.W., Low, W.L. IntelliClean: a knowledge-based intelligent data cleaner. In: Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Boston: ACM Press, 2000. 290-294.
- (2000) Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining , pp. 290-294
- Lee, M.L.¹ Ling, T.W.² Low, W.L.³

14
- 0013117789
- Telcordia's database reconciliation and data quality analysis tool
- Abbadi A.E., Brodie M.L. and Chakravarthy S. (ed.), Cairo: Morgan Kaufmann
- Caruso, F., Cochinwala, M., Ganapathy, U., et al. Telcordia's database reconciliation and data quality analysis tool. In; Abbadi, A.E., Brodie, M.L., Chakravarthy, S., et al., eds. Proceedings of the 26th International Conference on Very Large Data Bases. Cairo: Morgan Kaufmann, 2000. 615-618.
- (2000) Proceedings of the 26th International Conference on Very Large Data Bases , pp. 615-618
- Caruso, F.¹ Cochinwala, M.² Ganapathy, U.³

15
- 62449209904
- Data cleaning and integration
- Galhardas, H. Data cleaning and integration. 2001. http://aravel.inria.fr/-galharda/cleaning.html.
- (2001)
- Galhardas, H.¹

16
- 0344756845
- Declarative data cleaning: Language, model and algorithms
- Apers P., Atzeni P. and Ceri S. (ed.), Roma: Morgan Kaufmann
- Galhardas, H., Floescu, D., Shasha, D., et al. Declarative data cleaning: language, model and algorithms. In: Apers, P., Atzeni, P., Ceri, S., et al. eds. Proceedings of the 27th International Conference on Very Large Data Bases. Roma: Morgan Kaufmann, 2001. 371-380.
- (2001) Proceedings of the 27th International Conference on Very Large Data Bases , pp. 371-380
- Galhardas, H.¹ Floescu, D.² Shasha, D.³

17
- 84944315993
- Potter's wheel: An interactive data cleaning system
- Apers P., Atzeni P. and Ceri S. (ed.), Roma: Morgan Kaufmann
- Raman, V., Hellersterin, J. Potter's wheel: an interactive data cleaning system. In: Apers, P., Atzeni, P., Ceri, S., et al. eds. Proceedings of the 27th International Conference on Very Large Data Bases. Roma: Morgan Kaufmann, 2001. 381-390.
- (2001) Proceedings of the 27th International Conference on Very Large Data Bases , pp. 381-390
- Raman, V.¹ Hellersterin, J.²

18
- 8444224881
- Data quality mining: Making a virtue of necessity
- Santa Barbara
- Hipp, J., Guntzer, U., Grimmer, U. Data quality mining: making a virtue of necessity. In: Workshop on Research Issues in Data Mining and Knowledge Discovery. Santa Barbara, 2001.
- (2001) Workshop on Research Issues in Data Mining and Knowledge Discovery
- Hipp, J.¹ Guntzer, U.² Grimmer, U.³

19
- 0002356707
- Automatically extracting structure from free text addresses
- Borkar, V., Deshmukh, K., Sarawagi, S. Automatically extracting structure from free text addresses. IEEE Data Engineering Bulletin, 2000, 23(4): 27-32.
- (2000) IEEE Data Engineering Bulletin , vol.23 , Issue.4 , pp. 27-32
- Borkar, V.¹ Deshmukh, K.² Sarawagi, S.³

20
- 0001927734
- Independent, open enterprise data integration
- Hellerstein, J., Stonebraker, M., Caccia, R. Independent, open enterprise data integration. IEEE Data Engineering Bulletin, 1999, 22(1): 43-49.
- (1999) IEEE Data Engineering Bulletin , vol.22 , Issue.1 , pp. 43-49
- Hellerstein, J.¹ Stonebraker, M.² Caccia, R.³

21
- 0032091575
- Integration of heterogeneous databases without common domains using queries based on textual similarity
- Haas L. and Tiwary A. (ed.), Seattle: ACM Press
- Cohen, W. Integration of heterogeneous databases without common domains using queries based on textual similarity. In: Haas, L., Tiwary, A., eds. Proceedings of International Conference on Management of Data. Seattle: ACM Press, 1998. 201-212.
- (1998) Proceedings of International Conference on Management of Data , pp. 201-212
- Cohen, W.¹

22
- 0034841126
- An efficient approach for detecting approximately duplicate database records
- Chinese source
- Qiu, Yue-feng, Tian, Zeng-ping, Ji, Wen-yun, et al. An efficient approach for detecting approximately duplicate database records, Chinese Journal of Computers, 2001, 24(1): 69-77 (in Chinese).
- (2001) Chinese Journal of Computers , vol.24 , Issue.1 , pp. 69-77
- Qiu, Y.-F.¹ Tian, Z.-P.² Ji, W.-Y.³

23
- 0013117790
- A synthetical approach for detecting approximately duplicate database records of multi-language data
- Chinese source
- Yu, Rong-hua, Tian, Zeng-ping, Zhou, Ao-ying. A synthetical approach for detecting approximately duplicate database records of multi-language data. Computer Science, 2002, 29(1): 118-121 (in Chinese).
- (2002) Computer Science , vol.29 , Issue.1 , pp. 118-121
- Yu, R.-H.¹ Tian, Z.-P.² Zhou, A.-Y.³

24
- 84970907213
- Keys for XML
- Hong Kong: ACM Press
- Buneman, P., Davidson, S., Fan, W., et al. Keys for XML. In: Proceedings of the 10th International World wide Web Conference. Hong Kong: ACM Press, 2001. 201-210.
- (2001) Proceedings of the 10th International World Wide Web Conference , pp. 201-210
- Buneman, P.¹ Davidson, S.² Fan, W.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.