-
1
-
-
33845363891
-
A fast linkage detection scheme for multi-source information integration
-
Tokyo
-
Aizawa, A., Oyama, K.: A fast linkage detection scheme for multi-source information integration. In: WIRI, Tokyo (2005)
-
(2005)
WIRI
-
-
Aizawa, A.1
Oyama, K.2
-
2
-
-
84878049861
-
Adaptive blocking: Learning to scale up record linkage
-
Hong Kong
-
Bilenko, M., Kamath, B., Mooney, R.J.: Adaptive blocking: learning to scale up record linkage. In: IEEE ICDM, Hong Kong (2006)
-
(2006)
IEEE ICDM
-
-
Bilenko, M.1
Kamath, B.2
Mooney, R.J.3
-
3
-
-
84881036145
-
Leveraging unlabeled data to scale blocking for record linkage
-
Barcelona
-
Cao, Y., Chen, Z., Zhu, J., Yue, P., Lin, C.Y., Yu, Y.: Leveraging unlabeled data to scale blocking for record linkage. In: IJCAI, Barcelona (2011)
-
(2011)
IJCAI
-
-
Cao, Y.1
Chen, Z.2
Zhu, J.3
Yue, P.4
Lin, C.Y.5
Yu, Y.6
-
5
-
-
84920595044
-
A survey of indexing techniques for scalable record linkage and deduplication
-
Christen, P.: A survey of indexing techniques for scalable record linkage and deduplication. IEEE Transactions on Knowledge and Data Engineering 24(9) (2012)
-
(2012)
IEEE Transactions on Knowledge and Data Engineering
, vol.24
, Issue.9
-
-
Christen, P.1
-
6
-
-
84871075183
-
An automatic blocking mechanism for large-scale de-duplication tasks
-
Hawaii
-
Das Sarma, A., Jain, A., Machanavajjhala, A., Bohannon, P.: An automatic blocking mechanism for large-scale de-duplication tasks. In: ACM CIKM, Hawaii (2012)
-
(2012)
ACM CIKM
-
-
Das Sarma, A.1
Jain, A.2
Machanavajjhala, A.3
Bohannon, P.4
-
8
-
-
33845667955
-
Duplicate record detection: A survey
-
Elmagarmid, A.K., Ipeirotis, P.G., Verykios, V.S.: Duplicate record detection: A survey. IEEE Transactions on Knowledge and Data Engineering 19(1) (2007)
-
(2007)
IEEE Transactions on Knowledge and Data Engineering
, vol.19
, Issue.1
-
-
Elmagarmid, A.K.1
Ipeirotis, P.G.2
Verykios, V.S.3
-
10
-
-
84945538473
-
A machine learning approach to create blocking criteria for record linkage
-
Giang, P.H.: A machine learning approach to create blocking criteria for record linkage. Health Care Management Science (2014)
-
(2014)
Health Care Management Science
-
-
Giang, P.H.1
-
11
-
-
84976856849
-
The merge/purge problem for large databases
-
San Jose
-
Hernandez, M.A., Stolfo, S.J.: The merge/purge problem for large databases. In: ACM SIGMOD, San Jose (1995)
-
(1995)
ACM SIGMOD
-
-
Hernandez, M.A.1
Stolfo, S.J.2
-
12
-
-
84894647271
-
An unsupervised algorithm for learning blocking schemes
-
Dallas
-
Kejriwal, M., Miranker, D.P.: An unsupervised algorithm for learning blocking schemes. In: IEEE ICDM, Dallas (2013)
-
(2013)
IEEE ICDM
-
-
Kejriwal, M.1
Miranker, D.P.2
-
13
-
-
77952280581
-
HARRA: Fast iterative hashed record linkage for large-scale data collections
-
Lausanne, Switzerland
-
Kim, H., Lee, D.: HARRA: fast iterative hashed record linkage for large-scale data collections. In: ICDT, Lausanne, Switzerland (2010)
-
(2010)
ICDT
-
-
Kim, H.1
Lee, D.2
-
14
-
-
80455148340
-
Evaluation of entity resolution approaches on real-world match problems
-
Köpcke, H., Thor, A., Rahm, E.: Evaluation of entity resolution approaches on real-world match problems. VLDB Endowment 3(1–2) (2010)
-
(2010)
VLDB Endowment
, vol.3
, Issue.1-2
-
-
Köpcke, H.1
Thor, A.2
Rahm, E.3
-
15
-
-
84901276901
-
Noise-tolerant approximate blocking for dynamic real-time entity resolution
-
In: Tseng, V.S., Ho, T.B., Zhou, Z.-H., Chen, A.L.P., Kao, H.-Y. (eds.), Springer, Heidelberg
-
Liang, H., Wang, Y., Christen, P., Gayler, R.: Noise-tolerant approximate blocking for dynamic real-time entity resolution. In: Tseng, V.S., Ho, T.B., Zhou, Z.-H., Chen, A.L.P., Kao, H.-Y. (eds.) PAKDD 2014, Part II. LNCS (LNAI), vol. 8444, pp. 449–460. Springer, Heidelberg (2014)
-
(2014)
PAKDD 2014, Part II. LNCS (LNAI
, vol.8444
, pp. 449-460
-
-
Liang, H.1
Wang, Y.2
Christen, P.3
Gayler, R.4
-
16
-
-
84874283130
-
Typimatch: Type-specific unsupervised learning of keys and key values for heterogeneous web data integration
-
Rome
-
Ma, Y., Tran, T.: Typimatch: type-specific unsupervised learning of keys and key values for heterogeneous web data integration. In: ACM WSDM, Rome (2013)
-
(2013)
ACM WSDM
-
-
Ma, Y.1
Tran, T.2
-
17
-
-
0034592784
-
Efficient clustering of high-dimensional data sets with application to reference matching
-
Boston
-
McCallum, A., Nigam, K., Ungar, L.: Efficient clustering of high-dimensional data sets with application to reference matching. In: ACM SIGKDD, Boston (2000)
-
(2000)
ACM SIGKDD
-
-
McCallum, A.1
Nigam, K.2
Ungar, L.3
-
18
-
-
36348932551
-
Learning blocking schemes for record linkage
-
Boston
-
Michelson, M., Knoblock, C.A.: Learning blocking schemes for record linkage. In: AAAI, Boston (2006)
-
(2006)
AAAI
-
-
Michelson, M.1
Knoblock, C.A.2
-
19
-
-
84937597822
-
Forest-based dynamic sorted neighborhood indexing for real-time entity resolution
-
Shanghai
-
Ramadan, B., Christen, P.: Forest-based dynamic sorted neighborhood indexing for real-time entity resolution. In: ACM CIKM, Shanghai (2014)
-
(2014)
ACM CIKM
-
-
Ramadan, B.1
Christen, P.2
-
20
-
-
84904155169
-
Dynamic sorted neighborhood indexing for real-time entity resolution
-
In: Wang, H., Sharaf, M.A. (eds.), Springer, Heidelberg
-
Ramadan, B., Christen, P., Liang, H.: Dynamic sorted neighborhood indexing for real-time entity resolution. In: Wang, H., Sharaf, M.A. (eds.) ADC 2014. LNCS, vol. 8506, pp. 1–12. Springer, Heidelberg (2014)
-
(2014)
ADC 2014. LNCS
, vol.8506
, pp. 1-12
-
-
Ramadan, B.1
Christen, P.2
Liang, H.3
-
21
-
-
84892875650
-
Dynamic similarity-aware inverted indexing for real-time entity resolution
-
In: Li, J., Cao, L., Wang, C., Tan, K.C., Liu, B., Pei, J., Tseng, V.S. (eds.), Springer, Heidelberg
-
Ramadan, B., Christen, P., Liang, H., Gayler, R.W., Hawking, D.: Dynamic similarity-aware inverted indexing for real-time entity resolution. In: Li, J., Cao, L., Wang, C., Tan, K.C., Liu, B., Pei, J., Tseng, V.S. (eds.) PAKDD 2013 Workshops. LNCS (LNAI), vol. 7867, pp. 47–58. Springer, Heidelberg (2013)
-
(2013)
PAKDD 2013 Workshops. LNCS (LNAI)
, vol.7867
, pp. 47-58
-
-
Ramadan, B.1
Christen, P.2
Liang, H.3
Gayler, R.W.4
Hawking, D.5
-
22
-
-
84937477990
-
Geco: An online personal data generator and corruptor
-
New York
-
Tran, K.N., Vatsalan, D., Christen, P.: Geco: an online personal data generator and corruptor. In: ACM CIKM, New York (2013)
-
(2013)
ACM CIKM
-
-
Tran, K.N.1
Vatsalan, D.2
Christen, P.3
-
23
-
-
84945565021
-
Automatic blocking key selection for duplicate detection based on unigram combinations
-
Istanbul
-
Vogel, T., Naumann, F.: Automatic blocking key selection for duplicate detection based on unigram combinations. In: VLDB Workshops, Istanbul (2012)
-
(2012)
VLDB Workshops
-
-
Vogel, T.1
Naumann, F.2
-
24
-
-
70849098813
-
Entity resolution with iterative blocking
-
Providence
-
Whang, S.E., Menestrina, D., Koutrika, G., Theobald, M., Garcia-Molina, H.: Entity resolution with iterative blocking. In: ACM SIGMOD, Providence (2009)
-
(2009)
ACM SIGMOD
-
-
Whang, S.E.1
Menestrina, D.2
Koutrika, G.3
Theobald, M.4
Garcia-Molina, H.5
|