메뉴 건너뛰기




Volumn , Issue , 2003, Pages 39-48

Adaptive duplicate detection using learnable string similarity measures

Author keywords

Data cleaning; Distance metric learning; Record linkage; String edit distance; SVM applications; Trained similarity measures

Indexed keywords

DATA CLEANING; DISTANCE METRIC LEARNING; EDIT DISTANCE; SIMILARITY MEASURE; SVM APPLICATIONS;

EID: 77952372966     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/956750.956759     Document Type: Conference Paper
Times cited : (760)

References (27)
  • 2
    • 2342566765 scopus 로고    scopus 로고
    • Learning to combine trained distance metrics for duplicate detection in databases
    • Artificial Intelligence Laboratory, University of Texas at Austin, Austin, TX, Feb
    • M. Bilenko and R. J. Mooney. Learning to combine trained distance metrics for duplicate detection in databases. Technical Report AI 02-296, Artificial Intelligence Laboratory, University of Texas at Austin, Austin, TX, Feb. 2002.
    • (2002) Technical Report AI 02-296
    • Bilenko, M.1    Mooney, R.J.2
  • 5
    • 0027652468 scopus 로고
    • Substructure discovery using minimum description length and background knowledge
    • D. J. Cook and L. B. Holder. Substructure discovery using minimum description length and background knowledge. Journal of Artificial Intelligence Research, 1:231-255, 1994.
    • (1994) Journal of Artificial Intelligence Research , vol.1 , pp. 231-255
    • Cook, D.J.1    Holder, L.B.2
  • 11
    • 0002714543 scopus 로고    scopus 로고
    • Making large-scale SVM learning practical
    • In B. Schölkopf, C. J. C. Burges, and A. J. Smola, editors, MIT Press
    • T. Joachims. Making large-scale SVM learning practical. In B. Schölkopf, C. J. C. Burges, and A. J. Smola, editors, Advances in Kernel Methods - Support Vector Learning, pages 169-184. MIT Press, 1999.
    • (1999) Advances in Kernel Methods - Support Vector Learning , pp. 169-184
    • Joachims, T.1
  • 17
    • 0014757386 scopus 로고
    • A general method applicable to the search for similarities in the amino acid sequences of two proteins
    • S. B. Needleman and C. D. Wunsch. A general method applicable to the search for similarities in the amino acid sequences of two proteins. Journal of Molecular Biology, 48:443-453, 1970.
    • (1970) Journal of Molecular Biology , vol.48 , pp. 443-453
    • Needleman, S.B.1    Wunsch, C.D.2
  • 19
    • 0003243224 scopus 로고    scopus 로고
    • Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods
    • In A. J. Smola, P. Bartlett, B. Schölkopf, and D. Schuurmans, editors, MIT Press
    • J. C. Platt. Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. In A. J. Smola, P. Bartlett, B. Schölkopf, and D. Schuurmans, editors, Advances in Large Margin Classifiers, pages 185-208. MIT Press, 1999.
    • (1999) Advances in Large Margin Classifiers , pp. 185-208
    • Platt, J.C.1
  • 20
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • L. R. Rabiner. A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE, 77 (2):257-286, 1989.
    • (1989) Proceedings of the IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.R.1
  • 25
    • 0012866045 scopus 로고    scopus 로고
    • The state of record linkage and current research problems
    • Statistical Research Division, U. S. Bureau of the Census, Wachington, DC
    • W. E. Winkler. The state of record linkage and current research problems. Technical report, Statistical Research Division, U. S. Bureau of the Census, Wachington, DC, 1999.
    • (1999) Technical Report
    • Winkler, W.E.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.