-
1
-
-
0034592763
-
The IGrid index: Reversing the dimensionality curse for similarity in-dexing in high dimensional space
-
(SIGKDD00), Boston
-
Aggarwal, C.C. & Yu, P.S. (2000), The IGrid index: Reversing the dimensionality curse for similarity in-dexing in high dimensional space, in 'ACM Inter-national Conference on Knowledge Discovery and Data Mining' (SIGKDD'00), Boston, pp. 119-129.
-
(2000)
ACM Inter-national Conference on Knowledge Discovery and Data Mining
, pp. 119-129
-
-
Aggarwal, C.C.1
Yu, P.S.2
-
2
-
-
33845363891
-
A fast linkage de-tection scheme for multi-source information inte-gration
-
(WIRI05), Tokyo
-
Aizawa, A. & Oyama, K. (2005), A fast linkage de-tection scheme for multi-source information inte-gration, in 'Web Information Retrieval and Inte-gration' (WIRI'05), Tokyo, pp. 30-39.
-
(2005)
Web Information Retrieval and Inte-gration
, pp. 30-39
-
-
Aizawa, A.1
Oyama, K.2
-
3
-
-
5444258997
-
A comparison of fast blocking methods for record linkage
-
Washington DC
-
Baxter, R., Christen, P. & Churches, T. (2003), A comparison of fast blocking methods for record linkage, in 'ACM SIGKDD'03 Workshop on Data Cleaning, Record Linkage and Object Consolida-tion', Washington DC, pp. 25-27.
-
(2003)
ACM SIGKDD03 Workshop on Data Cleaning, Record Linkage and Object Consolida-tion
, pp. 25-27
-
-
Baxter, R.1
Christen, P.2
Churches, T.3
-
4
-
-
35348849154
-
Scaling up all pairs similarity search
-
(WWW07), Banff, Canada
-
Bayardo, R.J., Ma, Y. & Srikant, R. (2007), Scaling up all pairs similarity search, in 'International Con-ference on World Wide Web' (WWW'07), Banff, Canada, pp. 131-140.
-
(2007)
International Con-ference on World Wide Web
, pp. 131-140
-
-
Bayardo, R.J.1
Ma, Y.2
Srikant, R.3
-
5
-
-
38349149499
-
Query-time entity resolution
-
Bhattacharya, I. & Getoor, L. (2007), 'Query-time entity resolution', Journal of Artificial Intelligence Research, 30, 621-657.
-
(2007)
Journal of Artificial Intelligence Research
, vol.30
, pp. 621-657
-
-
Bhattacharya, I.1
Getoor, L.2
-
6
-
-
84878049861
-
Adaptive blocking: Learning to scale up record linkage
-
(ICDM06), Hong Kong
-
Bilenko, M., Kamath, B, & Mooney, R.J. (2006), Adaptive blocking: Learning to scale up record linkage, in 'IEEE International Conference on Data Mining' (ICDM'06), Hong Kong, pp. 87-96.
-
(2006)
IEEE International Conference on Data Mining
, pp. 87-96
-
-
Bilenko, M.1
Kamath, B.2
Mooney, R.J.3
-
7
-
-
78449293191
-
A comparison of personal name matching: Techniques and practical issues
-
(MCD06), held at IEEE ICDM06, Hong Kong
-
Christen, P. (2006), A comparison of personal name matching: Techniques and practical issues, in 'Workshop on Mining Complex Data' (MCD'06), held at IEEE ICDM'06, Hong Kong.
-
(2006)
Workshop on Mining Complex Data
-
-
Christen, P.1
-
8
-
-
65449178105
-
Febrl - An open source data cleaning, deduplication and record linkage system with a graphical user interface
-
(SIGKDD08), Las Vegas
-
Christen, P. (2008), Febrl - An open source data cleaning, deduplication and record linkage system with a graphical user interface, in 'ACM Inter-national Conference on Knowledge Discovery and Data Mining' (SIGKDD'08), Las Vegas, pp. 1065- 1068.
-
(2008)
ACM Inter-national Conference on Knowledge Discovery and Data Mining
, pp. 1065-1068
-
-
Christen, P.1
-
9
-
-
33846428121
-
Quality and com-plexity measures for data linkage and deduplica-tion
-
F. Guillet & H. Hamilton, eds
-
Christen, P. & Goiser, K. (2007), Quality and com-plexity measures for data linkage and deduplica-tion, in F. Guillet & H. Hamilton, eds, 'Qual-ity Measures in Data Mining', Springer Studies in Computational Intelligence, Vol. 43, pp. 127-151.
-
(2007)
Qual-ity Measures in Data Mining, Springer Studies in Computational Intelligence
, vol.43
, pp. 127-151
-
-
Christen, P.1
Goiser, K.2
-
10
-
-
11144240583
-
A comparison of string distance metrics for name-matching tasks
-
(IIWeb03), Aca-pulco
-
Cohen W.W., Ravikumar P. & Fienberg S.E. (2003), A comparison of string distance metrics for name-matching tasks, in 'IJCAI'03 Workshop on Infor-mation Integration on the Web' (IIWeb'03), Aca-pulco, pp. 73-78.
-
(2003)
IJCAI03 Workshop on Infor-mation Integration on the Web
, pp. 73-78
-
-
Cohen, W.W.1
Ravikumar, P.2
Fienberg, S.E.3
-
11
-
-
0242540438
-
Learning to match and cluster large high-dimensional data sets for data integration
-
(SIGKDD02), Edmonton
-
Cohen, W.W. & Richman, J. (2002), Learning to match and cluster large high-dimensional data sets for data integration, in 'ACM International Con-ference on Knowledge Discovery and Data Mining' (SIGKDD'02), Edmonton, pp. 475-480.
-
(2002)
ACM International Con-ference on Knowledge Discovery and Data Mining
, pp. 475-480
-
-
Cohen, W.W.1
Richman, J.2
-
12
-
-
12244271239
-
On-line duplicate detection: Signature reliability in a dynamic retrieval environment
-
(CIKM03), New Orleans
-
Conrad, J.G., Guo, X.S. & Schriber, C.P. (2003), On-line duplicate detection: Signature reliability in a dynamic retrieval environment, in 'ACM Confer-ence on Information and Knowledge Management' (CIKM'03), New Orleans, pp. 443-452.
-
(2003)
ACM Confer-ence on Information and Knowledge Management
, pp. 443-452
-
-
Conrad, J.G.1
Guo, X.S.2
Schriber, C.P.3
-
13
-
-
29844452555
-
Refer-ence reconciliation in complex information spaces
-
(SIGMOD05), Baltimore
-
Dong, X., Halevy, A., & Madhavan, J. (2005), Refer-ence reconciliation in complex information spaces, in 'ACM International Conference on Management of Data' (SIGMOD'05), Baltimore, pp. 85-96.
-
(2005)
ACM International Conference on Management of Data
, pp. 85-96
-
-
Dong, X.1
Halevy, A.2
Madhavan, J.3
-
14
-
-
0036203458
-
TAILOR: A record linkage toolbox
-
(ICDE02), San Jose
-
Elfeky, M.G., Verykios, V.S. & Elmagarmid, A.K. (2002), TAILOR: A record linkage toolbox, in 'International Conference on Data Engineering' (ICDE'02), San Jose, pp. 17-28.
-
(2002)
International Conference on Data Engineering
, pp. 17-28
-
-
Elfeky, M.G.1
Verykios, V.S.2
Elmagarmid, A.K.3
-
15
-
-
33845667955
-
Duplicate record detection: A survey
-
Elmagarmid, A.K., Ipeirotis, P.G. & Verykios, V.S. (2007), 'Duplicate record detection: A survey', IEEE Transactions on Knowledge and Data Engi-neering (TKDE), 19(1), 1-16.
-
(2007)
IEEE Transactions on Knowledge and Data Engi-neering (TKDE)
, vol.19
, Issue.1
, pp. 1-16
-
-
Elmagarmid, A.K.1
Ipeirotis, P.G.2
Verykios, V.S.3
-
16
-
-
84947399464
-
A theory for record linkage
-
Fellegi, I.P. & Sunter, A.B. (1969), 'A theory for record linkage', Journal of the American Statisti-cal Society, 64(328), 1183-1210.
-
(1969)
Journal of the American Statisti-cal Society
, vol.64
, Issue.328
, pp. 1183-1210
-
-
Fellegi, I.P.1
Sunter, A.B.2
-
17
-
-
84976856849
-
The merge/purge problem for large databases
-
(SIGMOD95), San Jose
-
Hernandez, M.A. & Stolfo, S.J. (1995), The merge/purge problem for large databases, in 'ACM International Conference on Management of Data' (SIGMOD'95), San Jose, pp. 127-138.
-
(1995)
ACM International Conference on Management of Data
, pp. 127-138
-
-
Hernandez, M.A.1
Stolfo, S.J.2
-
18
-
-
37149056535
-
Decision models for record linkage
-
Springer LNCS 3755
-
Gu, L. & Baxter, R. (2006), Decision models for record linkage, in 'Selected Papers from AusDM', Springer LNCS 3755, pp. 146-160.
-
(2006)
Selected Papers from AusDM
, pp. 146-160
-
-
Gu, L.1
Baxter, R.2
-
19
-
-
84943425383
-
Efficient record linkage in large data sets
-
(DASFAA03), Tokyo
-
Jin, L., Li, C. & Mehrotra, S. (2003), Efficient record linkage in large data sets, in 'International Confer-ence on Database Systems for Advanced Applica-tions' (DASFAA'03), Tokyo, pp. 137-146.
-
(2003)
International Confer-ence on Database Systems for Advanced Applica-tions
, pp. 137-146
-
-
Jin, L.1
Li, C.2
Mehrotra, S.3
-
20
-
-
33745266392
-
Domain-independent data cleaning via analysis of entity-relationship graph
-
Kalashnikov, D.V. & Mehrotra, S. (2006), 'Domain-independent data cleaning via analysis of entity-relationship graph', ACM Transactions on Database Systems (TODS), 31(2), 716-767.
-
(2006)
ACM Transactions on Database Systems (TODS)
, vol.31
, Issue.2
, pp. 716-767
-
-
Kalashnikov, D.V.1
Mehrotra, S.2
-
22
-
-
84870482019
-
-
Technical report, University of Potsdam, Germany
-
Weis, M. & Naumann, F. (2007), 'Space and time scalability of duplicate detection in graph data', Technical report, University of Potsdam, Germany.
-
(2007)
Space and time scalability of duplicate detection in graph data
-
-
Weis, M.1
Naumann, F.2
-
25
-
-
36348961379
-
Adaptive sorted neighborhood methods for efficient record linkage
-
(JCDL07), Vancouver
-
Yan, S., Lee, D., Kan, M.Y., & Giles, L.C. (2007), Adaptive sorted neighborhood methods for efficient record linkage, in 'ACM/IEEE-CS Joint Confer-ence on Digital Libraries' (JCDL'07'), Vancouver, pp. 185-194.
-
(2007)
ACM/IEEE-CS Joint Confer-ence on Digital Libraries
, pp. 185-194
-
-
Yan, S.1
Lee, D.2
Kan, M.Y.3
Giles, L.C.4
-
26
-
-
84893853717
-
LinkClus: Effi-cient clustering via heterogeneous semantic links
-
(VLDB06), Seoul
-
Yin, X., Han, J. & Yu, P.S. (2006), LinkClus: Effi-cient clustering via heterogeneous semantic links, in 'International Conference on Very Large Data Bases' (VLDB'06), Seoul, pp. 427-438.
-
(2006)
International Conference on Very Large Data Bases
, pp. 427-438
-
-
Yin, X.1
Han, J.2
Yu, P.S.3
-
27
-
-
33747729581
-
Inverted files for text search engines
-
(CSUR)
-
Zobel, J. & Moffat, A. (2006), 'Inverted files for text search engines', ACM Computing Surveys (CSUR), 38(2).
-
(2006)
ACM Computing Surveys
, vol.38
, Issue.2
-
-
Zobel, J.1
Moffat, A.2
|