메뉴 건너뛰기




Volumn , Issue , 2007, Pages 201-209

Canonicalization of database records using adaptive similarity measures

Author keywords

Data cleaning; Data mining; Information extraction

Indexed keywords

DATA CLEANING; INFORMATION EXTRACTION;

EID: 36849045251     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1281192.1281217     Document Type: Conference Paper
Times cited : (21)

References (15)
  • 1
    • 2342566765 scopus 로고    scopus 로고
    • Learning to combine trained distance metrics for duplicate detection in databases
    • Technical Report AI-02-296, University of Texas at Austin
    • M. Bilenko and R. J. Mooney. Learning to combine trained distance metrics for duplicate detection in databases. Technical Report AI-02-296, University of Texas at Austin, 2002.
    • (2002)
    • Bilenko, M.1    Mooney, R.J.2
  • 3
    • 85127836544 scopus 로고    scopus 로고
    • Discriminative training methods for hidden markov models: Theory and experiments with perceptron algorithms
    • M. Collins. Discriminative training methods for hidden markov models: Theory and experiments with perceptron algorithms. In EMNLP, 2002.
    • (2002) EMNLP
    • Collins, M.1
  • 5
    • 85027780074 scopus 로고    scopus 로고
    • Creating probabilistic databases from information extraction models
    • R. Gupta and S. Sarawagi. Creating probabilistic databases from information extraction models. In VLDB, 2006.
    • (2006) VLDB
    • Gupta, R.1    Sarawagi, S.2
  • 6
    • 0000390142 scopus 로고
    • Binary codes capable of correcting deletions, insertions and reversals
    • V. Levenshtein. Binary codes capable of correcting deletions, insertions and reversals. Doklady Akademii Nauk SSR, 163(4):845-848, 1965.
    • (1965) Doklady Akademii Nauk SSR , vol.163 , Issue.4 , pp. 845-848
    • Levenshtein, V.1
  • 7
    • 33646887390 scopus 로고    scopus 로고
    • D. C. Liu and J. Nocedal. On the limited memory BFGS method for large scale optimization. Math. Programming, 45(3, (Ser. B)):503-528, 1989.
    • D. C. Liu and J. Nocedal. On the limited memory BFGS method for large scale optimization. Math. Programming, 45(3, (Ser. B)):503-528, 1989.
  • 8
    • 84859899973 scopus 로고    scopus 로고
    • G. Mann and D. Yarowsky. Multi-field information extraction and cross-document fusion. In ACL, 2005.
    • G. Mann and D. Yarowsky. Multi-field information extraction and cross-document fusion. In ACL, 2005.
  • 9
    • 44849098451 scopus 로고    scopus 로고
    • A conditional random field for discriminatively-trained finite-state string edit distance
    • A. McCallum, K. Bellare, and F. Pereira. A conditional random field for discriminatively-trained finite-state string edit distance. In Conference on Uncertainty in AI, 2005.
    • (2005) Conference on Uncertainty in AI
    • McCallum, A.1    Bellare, K.2    Pereira, F.3
  • 10
    • 33646765912 scopus 로고    scopus 로고
    • Conditional models of identity uncertainty with application to noun coreference
    • L. K. Saul, Y. Weiss, and L. Bottou, editors, MIT Press, Cambridge, MA
    • A. McCallum and B. Wellner. Conditional models of identity uncertainty with application to noun coreference. In L. K. Saul, Y. Weiss, and L. Bottou, editors, Advances in Neural Information Processing Systems 17. MIT Press, Cambridge, MA, 2005.
    • (2005) Advances in Neural Information Processing Systems 17
    • McCallum, A.1    Wellner, B.2
  • 12
    • 4244074850 scopus 로고    scopus 로고
    • Learning string edit distance
    • Technical Report CS-TR-532-96, Princeton University
    • E. S. Ristad and P. N. Yianilos. Learning string edit distance. Technical Report CS-TR-532-96, Princeton University, 1997.
    • (1997)
    • Ristad, E.S.1    Yianilos, P.N.2
  • 13
    • 0035545848 scopus 로고    scopus 로고
    • Learning object identification rules for information integration
    • S. Tejada, C. A. Knoblock, and S. Minton. Learning object identification rules for information integration. Information Systems, 26(8):607-633, 2001.
    • (2001) Information Systems , vol.26 , Issue.8 , pp. 607-633
    • Tejada, S.1    Knoblock, C.A.2    Minton, S.3
  • 14
    • 67651206429 scopus 로고    scopus 로고
    • Learning field compatibilities to extract database records from unstructured text
    • M. Wick, A. Culotta, and A. McCallum. Learning field compatibilities to extract database records from unstructured text. In EMNLP, 2006.
    • (2006) EMNLP
    • Wick, M.1    Culotta, A.2    McCallum, A.3
  • 15
    • 36848998962 scopus 로고    scopus 로고
    • J. J. Zhu and L. H. Unger. String edit analysis for merging databases. In KDD, 2000.
    • J. J. Zhu and L. H. Unger. String edit analysis for merging databases. In KDD, 2000.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.