-
1
-
-
84856415685
-
Operation clean data
-
Aug.
-
M. Wheatley, "Operation Clean Data," CIO Asia Magazine, http:// www.cio-asia.com, Aug. 2004.
-
(2004)
CIO Asia Magazine
-
-
Wheatley, M.1
-
2
-
-
34250670467
-
Record linkage: Similarity measures and algorithms
-
DOI 10.1145/1142473.1142599, SIGMOD 2006 - Proceedings of the ACM SIGMOD International Conference on Management of Data
-
N. Koudas, S. Sarawagi, and D. Srivastava, "Record Linkage: Similarity Measures and Algorithms," Proc. ACM SIGMOD Int'l Conf. Management of Data, pp. 802-803, 2006. (Pubitemid 46946588)
-
(2006)
Proceedings of the ACM SIGMOD International Conference on Management of Data
, pp. 802-803
-
-
Koudas, N.1
Sarawagi, S.2
Srivastava, D.3
-
3
-
-
1142279457
-
Robust and efficient fuzzy match for online data cleaning
-
S. Chaudhuri, K. Ganjam, V. Ganti, and R. Motwani, "Robust and Efficient Fuzzy Match for Online Data Cleaning," Proc. ACM SIGMOD Int'l Conf. Management of Data, pp. 313-324, 2003.
-
(2003)
Proc. ACM SIGMOD Int'l Conf. Management of Data
, pp. 313-324
-
-
Chaudhuri, S.1
Ganjam, K.2
Ganti, V.3
Motwani, R.4
-
5
-
-
84947399464
-
A theory for record linkage
-
I.P. Fellegi and A.B. Sunter, "A Theory for Record Linkage," J. Am. Statistical Assoc., vol. 66, no. 1, pp. 1183-1210, 1969.
-
(1969)
J. Am. Statistical Assoc.
, vol.66
, Issue.1
, pp. 1183-1210
-
-
Fellegi, I.P.1
Sunter, A.B.2
-
6
-
-
0038208065
-
A Bayesian decision model for cost optimal record matching
-
DOI 10.1007/s00778-002-0072-y
-
V.S. Verykios, G.V. Moustakides, and M.G. Elfeky, "A Bayesian Decision Model for Cost Optimal Record Matching," The Very Large Databases J., vol. 12, no. 1, pp. 28-40, 2003. (Pubitemid 36752332)
-
(2003)
VLDB Journal
, vol.12
, Issue.1
, pp. 28-40
-
-
Verykios, V.S.1
Moustakides, G.V.2
Elfeky, M.G.3
-
7
-
-
84856482017
-
Is you data dirty? and Does that matter?
-
R. Bell and F. Dravis, "Is You Data Dirty? and Does that Matter?," Accenture Whiter Paper, http://www.accenture.com, 2006.
-
(2006)
Accenture Whiter Paper
-
-
Bell, R.1
Dravis, F.2
-
10
-
-
36448967787
-
A combined component approach for finding collection-adapted ranking functions based on genetic programming
-
DOI 10.1145/1277741.1277810, Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07
-
H.M. de Almeida, M.A. Gonçalves, M. Cristo, and P. Calado, "A Combined Component Approach for Finding Collection-Adapted Ranking Functions Based on Genetic Programming," Proc. 30th Ann. Int'l ACM SIGIR Conf. Research and Development in Information Retrieval, pp. 399-406, 2007. (Pubitemid 350164986)
-
(2007)
Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07
, pp. 399-406
-
-
De Almeida, H.M.1
Goncalves, M.A.2
Cristo, M.3
Calado, P.4
-
11
-
-
56949100364
-
An evolutionary approach for combining different sources of evidence in search engines
-
T.P.C. Silva, E.S. de Moura, J.M.B. Cavalcanti, A.S. da Silva, M.G. de Carvalho, and M.A. Gonçalves, "An Evolutionary Approach for Combining Different Sources of Evidence in Search Engines," Information Systems, vol. 34, no. 2, pp. 276-289, 2009.
-
(2009)
Information Systems
, vol.34
, Issue.2
, pp. 276-289
-
-
Silva, T.P.C.1
De Moura, E.S.2
Cavalcanti, J.M.B.3
Da Silva, A.S.4
De Carvalho, M.G.5
Gonçalves, M.A.6
-
12
-
-
33745780991
-
Intelligent GP fusion from multiple sources for text classification
-
CIKM'05 - Proceedings of the 14th ACM International Conference on Information and Knowledge Management
-
B. Zhang, Y. Chen, W. Fan, E.A. Fox, M. Gonçalves, M. Cristo, and P. Calado, "Intelligent gp Fusion from Multiple Sources for Text Classification," Proc. 14th ACM Int'l Conf. Information and Knowledge Management, pp. 477-484, 2005. (Pubitemid 44022220)
-
(2005)
International Conference on Information and Knowledge Management, Proceedings
, pp. 477-484
-
-
Zhang, B.1
Chen, Y.2
Fan, W.3
Fox, E.A.4
Goncalves, M.5
Cristo, M.6
Calado, P.7
-
13
-
-
53449089815
-
A genetic programming framework for content-based image retrieval
-
R.d.S. Torres, A.X. Falcao, M.A. Gonçalves, J.P. Papa, B. Zhang, W. Fan, and E.A. Fox, "A Genetic Programming Framework for Content-Based Image Retrieval," Pattern Recognition, vol. 42, no. 2, pp. 283-292, 2009.
-
(2009)
Pattern Recognition
, vol.42
, Issue.2
, pp. 283-292
-
-
-
14
-
-
33750344115
-
Learning to advertise
-
A. Lacerda, M. Cristo, M.A. Gonçalves, W. Fan, N. Ziviani, and B. Ribeiro-Neto, "Learning to Advertise," Proc. 29th Ann. Int'l ACM SIGIR Conf. Research and Development in Information Retrieval, pp. 549-556, 2006.
-
(2006)
Proc. 29th Ann. Int'l ACM SIGIR Conf. Research and Development in Information Retrieval
, pp. 549-556
-
-
Lacerda, A.1
Cristo, M.2
Gonçalves, M.A.3
Fan, W.4
Ziviani, N.5
Ribeiro-Neto, B.6
-
15
-
-
34247190122
-
Learning to deduplicate
-
DOI 10.1145/1141753.1141760, 6th ACM/IEEE-CS Joint Conference on Digital Libraries 2006: Opening Information Horizons, JCDL '06
-
M.G. de Carvalho, M.A. Gonçalves, A.H.F. Laender, and A.S. da Silva, "Learning to Deduplicate," Proc. Sixth ACM/IEEE CS Joint Conf. Digital Libraries, pp. 41-50, 2006. (Pubitemid 46613746)
-
(2006)
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries
, vol.2006
, pp. 41-50
-
-
De Carvalho, M.G.1
Goncalves, M.A.2
Laender, A.H.F.3
Da Silva, A.S.4
-
16
-
-
56749179708
-
Replica identification using genetic programming
-
M.G. de Carvalho, A.H.F. Laender, M.A. Gonçalves, and A.S. da Silva, "Replica Identification Using Genetic Programming," Proc. 23rd Ann. ACM Symp. Applied Computing (SAC), pp. 1801-1806, 2008.
-
(2008)
Proc. 23rd Ann. ACM Symp. Applied Computing (SAC)
, pp. 1801-1806
-
-
De Carvalho, A.S.1
Laender, A.H.F.2
Gonçalves, M.A.3
Da Silva, M.A.4
-
17
-
-
2342447399
-
Adaptive name matching in information integration
-
Sept./Oct.
-
M. Bilenko, R. Mooney, W. Cohen, P. Ravikumar, and S. Fienberg, "Adaptive Name Matching in Information Integration," IEEE Intelligent Systems, vol. 18, no. 5, pp. 16-23, Sept./Oct. 2003.
-
(2003)
IEEE Intelligent Systems
, vol.18
, Issue.5
, pp. 16-23
-
-
Bilenko, M.1
Mooney, R.2
Cohen, W.3
Ravikumar, P.4
Fienberg, S.5
-
19
-
-
0032652968
-
Autonomous citation matching
-
S. Lawrence, C.L. Giles, and K.D. Bollacker, "Autonomous Citation Matching," Proc. Third Int'l Conf. Autonomous Agents, pp. 392-393, 1999.
-
(1999)
Proc. Third Int'l Conf. Autonomous Agents
, pp. 392-393
-
-
Lawrence, S.1
Giles, C.L.2
Bollacker, C.L.3
-
20
-
-
0032640910
-
Digital libraries and autonomous citation indexing
-
June
-
S. Lawrence, L. Giles, and K. Bollacker, "Digital Libraries and Autonomous Citation Indexing," Computer, vol. 32, no. 6, pp. 67-71, June 1999.
-
(1999)
Computer
, vol.32
, Issue.6
, pp. 67-71
-
-
Lawrence, S.1
Giles, L.2
Bollacker, K.3
-
21
-
-
33845667955
-
Duplicate record detection: A survey
-
DOI 10.1109/TKDE.2007.250581
-
A.K. Elmagarmid, P.G. Ipeirotis, and V.S. Verykios, "Duplicate Record Detection: A Survey," IEEE Trans. Knowledge and Data Eng., vol. 19, no. 1, pp. 1-16, Jan. 2007. (Pubitemid 44955773)
-
(2007)
IEEE Transactions on Knowledge and Data Engineering
, vol.19
, Issue.1
, pp. 1-16
-
-
Elmagarmid, A.K.1
Ipeirotis, P.G.2
Verykios, V.S.3
-
23
-
-
0000666461
-
Data integration using similarity joins and a word-based information representation language
-
W.W. Cohen, "Data Integration Using Similarity Joins and a Word-Based Information Representation Language," ACM Trans. Information Systems, vol. 18, no. 3, pp. 288-321, 2000.
-
(2000)
ACM Trans. Information Systems
, vol.18
, Issue.3
, pp. 288-321
-
-
Cohen, W.W.1
-
25
-
-
0001592068
-
Automatic linkage of vital records
-
Oct.
-
H.B. Newcombe, J.M. Kennedy, S. Axford, and A. James, "Automatic Linkage of Vital Records," Science, vol. 130, no. 3381, pp. 954-959, Oct. 1959.
-
(1959)
Science
, vol.130
, Issue.3381
, pp. 954-959
-
-
Newcombe, H.B.1
Kennedy, J.M.2
Axford, S.3
James, A.4
-
26
-
-
84856482021
-
-
Freely Extensible Biomedical Record Linkage
-
"Freely Extensible Biomedical Record Linkage," http:// sourceforge.net/projects/febrl, 2011.
-
(2011)
-
-
-
28
-
-
0035545848
-
Learning object identification rules for information integration
-
DOI 10.1016/S0306-4379(01)00042-4, Data Extraction, Cleaning and Reconciliation
-
S. Tejada, C.A. Knoblock, and S. Minton, "Learning Object Identification Rules for Information Integration," Information Systems, vol. 26, no. 8, pp. 607-633, 2001. (Pubitemid 33046273)
-
(2001)
Information Systems
, vol.26
, Issue.8
, pp. 607-633
-
-
Tejada, S.1
Knoblock, C.A.2
Minton, S.3
-
29
-
-
79953162324
-
Merging the results of approximate match operations
-
S. Guha, N. Koudas, A. Marathe, and D. Srivastava, "Merging the Results of Approximate Match Operations," Proc. 30th Int'l Conf. Very Large Data Bases, pp. 636-647, 2004.
-
(2004)
Proc. 30th Int'l Conf. Very Large Data Bases
, pp. 636-647
-
-
Guha, S.1
Koudas, N.2
Marathe, A.3
Srivastava, D.4
-
30
-
-
0042312958
-
Genetic programming's continued evolution
-
ch. 1, MIT Press
-
P.J. Angeline, "Genetic Programming's Continued Evolution," Advances in Genetic Programming, vol. 2, ch. 1, MIT Press, 1996.
-
(1996)
Advances in Genetic Programming
, vol.2
-
-
Angeline, P.J.1
-
31
-
-
0242456803
-
Learning domain-independent string transformation weights for high accuracy object identification
-
S. Tejada, C.A. Knoblock, and S. Minton, "Learning Domain-Independent String Transformation Weights for High Accuracy Object Identification," Proc. Eighth ACM SIGKDD Int'l Conf. Knowledge Discovery and Data Mining, pp. 350-359, 2002.
-
(2002)
Proc. Eighth ACM SIGKDD Int'l Conf. Knowledge Discovery and Data Mining
, pp. 350-359
-
-
Tejada, S.1
Knoblock, C.A.2
Minton, S.3
-
32
-
-
26444478506
-
Probabilistic data generation for deduplication and data linkage
-
Intelligent Data Engineering and Automated Learning - IDEAL 2005: 6th International Conference. Proceedings
-
P. Christen, "Probabilistic Data Generation for Deduplication and Data Linkage," Intelligent Data Eng. and Automated Learning, pp. 109-116, Springer, 2005. (Pubitemid 41431651)
-
(2005)
Lecture Notes in Computer Science
, vol.3578
, pp. 109-116
-
-
Christen, P.1
|