메뉴 건너뛰기




Volumn 4244 LNCS, Issue , 2006, Pages 136-164

Unsupervised duplicate detection using sample non-duplicates

Author keywords

[No Author keywords available]

Indexed keywords

DATA REDUCTION; ERROR ANALYSIS; PROBLEM SOLVING; SET THEORY;

EID: 38549144127     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/11890591_5     Document Type: Conference Paper
Times cited : (9)

References (35)
  • 3
    • 0013331361 scopus 로고    scopus 로고
    • Real-world data is dirty: Data cleansing and the merge/purge problem
    • Hernandez, M.A., Stolfo, S.J.: Real-world data is dirty: Data cleansing and the merge/purge problem. Data Mining and Knowledge Discovery 2 (1998) 9-37
    • (1998) Data Mining and Knowledge Discovery , vol.2 , pp. 9-37
    • Hernandez, M.A.1    Stolfo, S.J.2
  • 5
    • 38549128513 scopus 로고    scopus 로고
    • Monge, A., Elkan, C.: An efficient domain independent algorithm for detecting approximately duplicate database records. In: In Proceedings of the SIGMOD Workshop on Data Mining and Knowledge Discovery. (1997)
    • Monge, A., Elkan, C.: An efficient domain independent algorithm for detecting approximately duplicate database records. In: In Proceedings of the SIGMOD Workshop on Data Mining and Knowledge Discovery. (1997)
  • 7
    • 0002940254 scopus 로고
    • Using the em algorithm for weight computation in the fellegisunter model of record linkage
    • American Statistical Association
    • Winkler, W.E.: Using the em algorithm for weight computation in the fellegisunter model of record linkage. In: Proceedings of the Section on Survey Research Methods, American Statistical Association. (1988) 667-671
    • (1988) Proceedings of the Section on Survey Research Methods , pp. 667-671
    • Winkler, W.E.1
  • 9
    • 2942702984 scopus 로고
    • Improved decision rules in the fellegi-sunter model of record linkage
    • American Statistical Association
    • Winkler, W.E.: Improved decision rules in the fellegi-sunter model of record linkage. In: Proceedings of the Section on Survey Research Methods, American Statistical Association. (1993) 274-279
    • (1993) Proceedings of the Section on Survey Research Methods , pp. 274-279
    • Winkler, W.E.1
  • 14
    • 2342566765 scopus 로고    scopus 로고
    • Learning to combine trained distance metrics for duplicate detection in databases
    • 02-296, Artificial Intelligence Laboratory, University of Texas at Austin, Austin, TX
    • Bilenko, M., Mooney, R.J.: Learning to combine trained distance metrics for duplicate detection in databases. Technical Report AI 02-296, Artificial Intelligence Laboratory, University of Texas at Austin, Austin, TX (2002)
    • (2002) Technical Report AI
    • Bilenko, M.1    Mooney, R.J.2
  • 15
    • 0035545848 scopus 로고    scopus 로고
    • Learning object identification rules for information integration
    • Tejada, S., Knoblock, C.A., Minton, S.: Learning object identification rules for information integration. Information Systems Journal 26 (2001) 635-656
    • (2001) Information Systems Journal , vol.26 , pp. 635-656
    • Tejada, S.1    Knoblock, C.A.2    Minton, S.3
  • 19
    • 29844452555 scopus 로고    scopus 로고
    • Reference reconciliation in complex information spaces
    • Dong, X., Halevy, A.Y., Madhavan, J.: Reference reconciliation in complex information spaces. In: SIGMOD Conference. (2005) 85-96
    • (2005) SIGMOD Conference , pp. 85-96
    • Dong, X.1    Halevy, A.Y.2    Madhavan, J.3
  • 20
    • 70449590442 scopus 로고    scopus 로고
    • Deduplication and group detection using links
    • Workshop on Link Analysis and Group Detection, 2004
    • Bhattacharya, I., Getoor, L.: Deduplication and group detection using links. In: Proceedings of the KDD-2004 Workshop on Link Analysis and Group Detection. (2004)
    • Proceedings of the KDD-2004
    • Bhattacharya, I.1    Getoor, L.2
  • 26
    • 84936824188 scopus 로고
    • Word association norms, mutual information and lexicography
    • Church, K.W., Hanks, P.: Word association norms, mutual information and lexicography. Computational Linguistics 16 (1990) 22-29
    • (1990) Computational Linguistics , vol.16 , pp. 22-29
    • Church, K.W.1    Hanks, P.2
  • 35
    • 0001116877 scopus 로고
    • Binary codes capable of correcting insertions and reversals
    • Levenshtein, V.I.: Binary codes capable of correcting insertions and reversals. Soviet Physics Doklady 10 (1966) 707-710
    • (1966) Soviet Physics Doklady , vol.10 , pp. 707-710
    • Levenshtein, V.I.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.