SCOPUS 정보 검색 플랫폼

Volumn 4, Issue 10, 2011, Pages 622-633

Entity matching: How similar is similar

Author keywords

[No Author keywords available]

Indexed keywords

REDUNDANCY;

DATA CLEANING; ENTITY MATCHING; HIGH-ACCURACY; INDEX STRUCTURE; OPTIMIZATION TECHNIQUES; REAL APPLICATIONS; SIMILARITY FUNCTIONS; SYNTHETIC DATASETS;

OPTIMIZATION;

EID: 84863541462 PISSN: None EISSN: 21508097 Source Type: Conference Proceeding
DOI: 10.14778/2021017.2021020 Document Type: Article

Times cited : (137)

References (22)

3
- 85104914015
- Efficient exact set-similarity joins
- A. Arasu, V. Ganti, and R. Kaushik. Efficient exact set-similarity joins. In VLDB, pages 918-929, 2006.
- (2006) VLDB , pp. 918-929
- Arasu, A.¹ Ganti, V.² Kaushik, R.³

4
- 5444258997
- A comparison of fast blocking methods for record linkage
- R. Baxter, P. Christen, and T. Churches. A comparison of fast blocking methods for record linkage. In Proceedings of the 2003 ACM SIGKDD Workshop on Data Cleaning, Record Linkage, and Object Consolidation, pages 25-27, 2003.
- (2003) Proceedings of the 2003 ACM SIGKDD Workshop on Data Cleaning, Record Linkage, and Object Consolidation , pp. 25-27
- Baxter, R.¹ Christen, P.² Churches, T.³

5
- 77952372966
- Adaptive duplicate detection using learnable string similarity measures
- M. Bilenko and R. J. Mooney. Adaptive duplicate detection using learnable string similarity measures. In KDD, pages 39-48, 2003.
- (2003) KDD , pp. 39-48
- Bilenko, M.¹ Mooney, R.J.²

6
- 85011029434
- Example-driven design of efficient record matching queries
- S. Chaudhuri, B.-C. Chen, V. Ganti, and R. Kaushik. Example-driven design of efficient record matching queries. In VLDB, pages 327-338, 2007.
- (2007) VLDB , pp. 327-338
- Chaudhuri, S.¹ Chen, B.-C.² Ganti, V.³ Kaushik, R.⁴

7
- 33749597967
- A primitive operator for similarity joins in data cleaning
- S. Chaudhuri, V. Ganti, and R. Kaushik. A primitive operator for similarity joins in data cleaning. In ICDE, pages 5-16, 2006.
- (2006) ICDE , pp. 5-16
- Chaudhuri, S.¹ Ganti, V.² Kaushik, R.³

8
- 11144240583
- A comparison of string distance metrics for name-matching tasks
- W. W. Cohen, P. Ravikumar, and S. E. Fienberg. A comparison of string distance metrics for name-matching tasks. In IIWEB, pages 73-78, 2003.
- (2003) IIWEB , pp. 73-78
- Cohen, W.W.¹ Ravikumar, P.² Fienberg, S.E.³

9
- 0242540438
- Learning to match and cluster large high-dimensional data sets for data integration
- W. W. Cohen and J. Richman. Learning to match and cluster large high-dimensional data sets for data integration. In KDD, pages 475-480, 2002.
- (2002) KDD , pp. 475-480
- Cohen, W.W.¹ Richman, J.²

10
- 33845667955
- Duplicate record detection: A survey
- A. K. Elmagarmid, P. G. Ipeirotis, and V. S. Verykios. Duplicate record detection: A survey. TKDE, 19(1):1-16, 2007.
- (2007) TKDE , vol.19 , Issue.1 , pp. 1-16
- Elmagarmid, A.K.¹ Ipeirotis, P.G.² Verykios, V.S.³

11
- 84865086832
- Reasoning about record matching rules
- W. Fan, X. Jia, J. Li, and S. Ma. Reasoning about record matching rules. PVLDB, 2(1):407-418, 2009.
- (2009) PVLDB , vol.2 , Issue.1 , pp. 407-418
- Fan, W.¹ Jia, X.² Li, J.³ Ma, S.⁴

12
- 84947399464
- A theory for record linkage. Journal of the American Statistical Association
- I. P. Fellegi and A. B. Sunter. A theory for record linkage. Journal of the American Statistical Association, 64(328):1183-1210, 1969.
- (1969) , vol.64 , Issue.328 , pp. 1183-1210
- Fellegi, I.P.¹ Sunter, A.B.²

13
- 0344756845
- Declarative data cleaning: Language, model, and algorithms
- H. Galhardas, D. Florescu, D. Shasha, E. Simon, and C.-A. Saita. Declarative data cleaning: Language, model, and algorithms. In VLDB, pages 371-380, 2001.
- (2001) VLDB , pp. 371-380
- Galhardas, H.¹ Florescu, D.² Shasha, D.³ Simon, E.⁴ Saita, C.-A.⁵

14
- 84976856849
- The merge/purge problem for large databases
- M. A. Herńandez and S. J. Stolfo. The merge/purge problem for large databases. In SIGMOD, pages 127-138, 1995.
- (1995) SIGMOD , pp. 127-138
- Herńandez, M.A.¹ Stolfo, S.J.²

15
- 0003777592
- editor. PWS Publishing Company
- D. S. Hochbaum, editor. Approximation algorithms for NP-hard problems. PWS Publishing Company, 1997.
- (1997) Approximation algorithms for NP-hard problems
- Hochbaum, D.S.¹

16
- 84950419860
- Advances in record-linkage methodology as applied to matching the 1985 census of tampa, florida
- M. A. Jaro. Advances in record-linkage methodology as applied to matching the 1985 census of tampa, florida. Journal of the American Statistical Association, 84(406):414-420, 1989.
- (1989) Journal of the American Statistical Association , vol.84 , Issue.406 , pp. 414-420
- Jaro, M.A.¹

17
- 34250670467
- Record linkage: similarity measures and algorithms
- N. Koudas, S. Sarawagi, and D. Srivastava. Record linkage: similarity measures and algorithms. In SIGMOD, pages 802-803, 2006.
- (2006) SIGMOD , pp. 802-803
- Koudas, N.¹ Sarawagi, S.² Srivastava, D.³

18
- 0027189241
- Entity identification in database integration
- E. Lim, J. Srivastava, S. Prabhakar, and J. Richardson. Entity identification in database integration. In ICDE, pages 294-301, 1993.
- (1993) ICDE , pp. 294-301
- Lim, E.¹ Srivastava, J.² Prabhakar, S.³ Richardson, J.⁴

19
- 0034592784
- Efficient clustering of high-dimensional data sets with application to reference matching
- A. McCallum, K. Nigam, and L. H. Ungar. Efficient clustering of high-dimensional data sets with application to reference matching. In KDD, pages 169-178, 2000.
- (2000) KDD , pp. 169-178
- McCallum, A.¹ Nigam, K.² Ungar, L.H.³

20
- 0242456811
- Interactive deduplication using active learning
- S. Sarawagi and A. Bhamidipaty. Interactive deduplication using active learning. In KDD, pages 269-278, 2002.
- (2002) KDD , pp. 269-278
- Sarawagi, S.¹ Bhamidipaty, A.²

21
- 0242456803
- Learning domain-independent string transformation weights for high accuracy object identification
- S. Tejada, C. A. Knoblock, and S. Minton. Learning domain-independent string transformation weights for high accuracy object identification. In KDD, pages 350-359, 2002.
- (2002) KDD , pp. 350-359
- Tejada, S.¹ Knoblock, C.A.² Minton, S.³

22
- 2942741943
- Technical report, Series RRS2002/05, U.S. Bureau of the Census
- W. E. Winkler. Methods for record linkage and bayesian networks. Technical report, Series RRS2002/05, U.S. Bureau of the Census, 2002.
- (2002) Methods for record linkage and bayesian networks
- Winkler, W.E.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.