-
1
-
-
0031988304
-
The impact of poor data quality on the typical enterprise
-
Redman T. The impact of poor data quality on the typical enterprise [J]. Communications of the ACM, 1998, 41(2): 79-82
-
(1998)
Communications of the ACM
, vol.41
, Issue.2
, pp. 79-82
-
-
Redman, T.1
-
2
-
-
39049191963
-
Missing prenatal records at a birth center: A communication problem quantified
-
Maryland: American Medical Informatics Association
-
Miller D W, Yeast J D, Evans R L. Missing prenatal records at a birth center: A communication problem quantified [C]//Proc of AMIA Annual Symp Proceedings. Maryland: American Medical Informatics Association, 2005: 535-539
-
(2005)
Proc of AMIA Annual Symp Proceedings
, pp. 535-539
-
-
Miller, D.W.1
Yeast, J.D.2
Evans, R.L.3
-
3
-
-
48249116542
-
Gartner warns firms of 'dirty data'
-
Swartz N. Gartner warns firms of 'dirty data' [J]. Information Management Journal, 2007, 41(3): 6
-
(2007)
Information Management Journal
, vol.41
, Issue.3
, pp. 6
-
-
Swartz, N.1
-
5
-
-
78649853923
-
Data Warehousing Special Report: Data quality and the bottom line
-
Applications Development Trends
-
Eckerson W. Data Warehousing Special Report: Data quality and the bottom line [R]. Applications Development Trends, 2002
-
(2002)
-
-
Eckerson, W.1
-
7
-
-
84892646568
-
Credit card statistics, industry facts, debt statistics
-
2013-04-20
-
Woolsey B, Schulz M. Credit card statistics, industry facts, debt statistics [OL]. [2013-04-20]. http://www.creditcards.com/credit-card-news/credit-card-industry-facts-personal-debt-statistics-1276.php
-
-
-
Woolsey, B.1
Schulz, M.2
-
8
-
-
0003578571
-
Enterprise information portals
-
New York: Merrill Lynch
-
Shilakes C, Tylman J. Enterprise information portals [R]. New York: Merrill Lynch, 1998
-
(1998)
-
-
Shilakes, C.1
Tylman, J.2
-
9
-
-
0002490026
-
Data cleaning: Problems and current approaches
-
Rahm E, Do H H. Data cleaning: Problems and current approaches [J]. IEEE Data Engineering Bulletin, 2000, 23(4): 3-13
-
(2000)
IEEE Data Engineering Bulletin
, vol.23
, Issue.4
, pp. 3-13
-
-
Rahm, E.1
Do, H.H.2
-
10
-
-
77954322933
-
Integrating conflicting data: The role of source dependence
-
Dong X L, Berti-Equille L, Srivastava D. Integrating conflicting data: The role of source dependence [J]. Proceedings of the VLDB Endowment, 2009, 2(1): 550-561
-
(2009)
Proceedings of the VLDB Endowment
, vol.2
, Issue.1
, pp. 550-561
-
-
Dong, X.L.1
Berti-Equille, L.2
Srivastava, D.3
-
11
-
-
77954323674
-
Truth discovery and copying detection in a dynamic world
-
Dong X L, Berti-Equille L, Srivastava D. Truth discovery and copying detection in a dynamic world [J]. Proceedings of the VLDB Endowment, 2009, 2(1): 562-573
-
(2009)
Proceedings of the VLDB Endowment
, vol.2
, Issue.1
, pp. 562-573
-
-
Dong, X.L.1
Berti-Equille, L.2
Srivastava, D.3
-
12
-
-
84859258624
-
Global detection of complex copying relationships between sources
-
Dong X L, Berti-Equille L, Hu Yifan, et al. Global detection of complex copying relationships between sources [J]. Proceedings of the VLDB Endowment, 2010, 3(1/2): 1358-1369
-
(2010)
Proceedings of the VLDB Endowment
, vol.3
, Issue.1-2
, pp. 1358-1369
-
-
Dong, X.L.1
Berti-Equille, L.2
Hu, Y.3
-
13
-
-
84865557019
-
Solomon: Seeking the truth via copying detection
-
Dong X L, Berti-Equille L, Hu Yifan, et al. Solomon: Seeking the truth via copying detection [J]. Proceedings of the VLDB Endowment, 2010, 3(1/2): 1617-1620
-
(2010)
Proceedings of the VLDB Endowment
, vol.3
, Issue.1-2
, pp. 1617-1620
-
-
Dong, X.L.1
Berti-Equille, L.2
Hu, Y.3
-
14
-
-
84863067746
-
Data fusion: resolving data conflicts for integration
-
Dong X L, Naumann F. Data fusion: resolving data conflicts for integration [J]. Proceedings of the VLDB Endowment, 2009, 2(2): 1654-1655
-
(2009)
Proceedings of the VLDB Endowment
, vol.2
, Issue.2
, pp. 1654-1655
-
-
Dong, X.L.1
Naumann, F.2
-
15
-
-
70350212979
-
Sampling based (ε, δ)-approximate aggregation algorithm in sensor networks
-
Piscataway, NJ: IEEE
-
Cheng Siyao, Li Jianzhong. Sampling based (ε, δ)-approximate aggregation algorithm in sensor networks [C]//Proc of IEEE ICDCS'09. Piscataway, NJ: IEEE, 2009: 273-280
-
(2009)
Proc of IEEE ICDCS'09
, pp. 273-280
-
-
Cheng, S.1
Li, J.2
-
16
-
-
84855744805
-
(ε, δ)-approximate aggregation algorithms in dynamic sensor networks
-
Li Jianzhong, Cheng Siyao. (ε, δ)-approximate aggregation algorithms in dynamic sensor networks [J]. IEEE Trans on Parallel and Distributed Systems, 2012, 23(3): 385-396
-
(2012)
IEEE Trans on Parallel and Distributed Systems
, vol.23
, Issue.3
, pp. 385-396
-
-
Li, J.1
Cheng, S.2
-
17
-
-
84883127209
-
O(ε)-approximation to physical world by sensor networks
-
Piscataway, NJ: IEEE
-
Cheng Siyao, Li Jianzhong, Cai Zhipeng. O(ε)-approximation to physical world by sensor networks [C]//Proc of IEEE INFOCOM'13. Piscataway, NJ: IEEE, 2013: 3184-3192
-
(2013)
Proc of IEEE INFOCOM'13
, pp. 3184-3192
-
-
Cheng, S.1
Li, J.2
Cai, Z.3
-
18
-
-
84861622956
-
Location aware peak value queries in sensor networks
-
Piscataway, NJ: IEEE
-
Cheng Siyao, Li Jianzhong, Liu Yu. Location aware peak value queries in sensor networks [C]//Proc of IEEE INFOCOM'12. Piscataway, NJ: IEEE, 2012: 486-494
-
(2012)
Proc of IEEE INFOCOM'12
, pp. 486-494
-
-
Cheng, S.1
Li, J.2
Liu, Y.3
-
19
-
-
34548731840
-
Conditional functional dependencies for data cleaning
-
Piscataway, NJ: IEEE
-
Bohannon P, Fan Wenfei, Geerts F, et al. Conditional functional dependencies for data cleaning [C]//Proc of IEEE ICDE'07. Piscataway, NJ: IEEE, 2007: 746-755
-
(2007)
Proc of IEEE ICDE'07
, pp. 746-755
-
-
Bohannon, P.1
Fan, W.2
Geerts, F.3
-
21
-
-
52649161210
-
Increasing the expressivity of conditional functional dependencies without extra complexity
-
Piscataway, NJ: IEEE
-
Bravo L, Fan Wenfei, Geerts F, et al. Increasing the expressivity of conditional functional dependencies without extra complexity [C]//Proc of IEEE ICDE'08. Piscataway, NJ: IEEE, 2008: 516-525
-
(2008)
Proc of IEEE ICDE'08
, pp. 516-525
-
-
Bravo, L.1
Fan, W.2
Geerts, F.3
-
22
-
-
79953184736
-
Propagating functional dependencies with conditions
-
Fan Wenfei, Ma Shuai, Hu Yanli, et al. Propagating functional dependencies with conditions [J]. Proceedings of the VLDB Endowment, 2008, 1(1): 391-407
-
(2008)
Proceedings of the VLDB Endowment
, vol.1
, Issue.1
, pp. 391-407
-
-
Fan, W.1
Ma, S.2
Hu, Y.3
-
24
-
-
80052369619
-
Sequential dependencies
-
Golab L, Karloff H, Korn F, et al. Sequential dependencies [J]. Proceedings of the VLDB Endowment, 2009, 2(1): 574-585
-
(2009)
Proceedings of the VLDB Endowment
, vol.2
, Issue.1
, pp. 574-585
-
-
Golab, L.1
Karloff, H.2
Korn, F.3
-
25
-
-
67649655745
-
Metric functional dependencies
-
Piscataway, NJ: IEEE
-
Koudas N, Saha A, Srivastava D, et al. Metric functional dependencies [C]//Proc of IEEE ICDE'09. Piscataway, NJ: IEEE, 2009: 1275-1278
-
(2009)
Proc of IEEE ICDE'09
, pp. 1275-1278
-
-
Koudas, N.1
Saha, A.2
Srivastava, D.3
-
27
-
-
79953230060
-
Discovering conditional functional dependencies
-
Fan Wenfei, Geerts F, Li Jianzhong, et al. Discovering conditional functional dependencies [J]. IEEE Trans on Knowledge and Data Engineering, 2011, 23(5): 683-698
-
(2011)
IEEE Trans on Knowledge and Data Engineering
, vol.23
, Issue.5
, pp. 683-698
-
-
Fan, W.1
Geerts, F.2
Li, J.3
-
28
-
-
57449119633
-
Checks and balances: Monitoring data quality problems in network traffic databases
-
San Francisco, CA: Morgan Kaufmann
-
Korn F, Muthukrishnan S, Zhu Y. Checks and balances: Monitoring data quality problems in network traffic databases [C]//Proc of the 29th Int Conf on Very Large Databases. San Francisco, CA: Morgan Kaufmann, 2003: 536-547
-
(2003)
Proc of the 29th Int Conf on Very Large Databases
, pp. 536-547
-
-
Korn, F.1
Muthukrishnan, S.2
Zhu, Y.3
-
29
-
-
31644450821
-
Enhancing data analysis with noise removal
-
Xiong Hui, Pandey G, Steinbach M, et al. Enhancing data analysis with noise removal [J]. IEEE Trans on Knowledge and Data Engineering, 2006, 18(3): 304-319
-
(2006)
IEEE Trans on Knowledge and Data Engineering
, vol.18
, Issue.3
, pp. 304-319
-
-
Xiong, H.1
Pandey, G.2
Steinbach, M.3
-
32
-
-
0021513522
-
Incomplete information in relational databases
-
Imieliński T, Lipski Jr W. Incomplete information in relational databases [J]. Journal of the ACM (JACM), 1984, 31(4): 761-791
-
(1984)
Journal of the ACM (JACM)
, vol.31
, Issue.4
, pp. 761-791
-
-
Imieliński, T.1
Lipski Jr., W.2
-
34
-
-
0004919827
-
Closed world databases opened through null values
-
San Francisco, CA: Morgan Kaufmann
-
Gottlob G, Zicari R. Closed world databases opened through null values [C]//Proc of the 14th Int Conf on Very Large Databases. San Francisco, CA: Morgan Kaufmann, 1988: 50-61
-
(1988)
Proc of the 14th Int Conf on Very Large Databases
, pp. 50-61
-
-
Gottlob, G.1
Zicari, R.2
-
37
-
-
33745203463
-
Representing and querying XML with incomplete information
-
Abiteboul S, Segoufin L, Vianu V. Representing and querying XML with incomplete information [J]. ACM Trans on Database Systems (TODS), 2006, 31(1): 208-254
-
(2006)
ACM Trans on Database Systems (TODS)
, vol.31
, Issue.1
, pp. 208-254
-
-
Abiteboul, S.1
Segoufin, L.2
Vianu, V.3
-
38
-
-
78650722109
-
XML with incomplete information
-
Barceló P, Libkin L, Poggi A, et al. XML with incomplete information [J]. Journal of the ACM (JACM), 2010, 58(1): 1-62
-
(2010)
Journal of the ACM (JACM)
, vol.58
, Issue.1
, pp. 1-62
-
-
Barceló, P.1
Libkin, L.2
Poggi, A.3
-
39
-
-
61349087255
-
Cleaning uncertain data with quality guarantees
-
Cheng R, Chen J, Xie X. Cleaning uncertain data with quality guarantees [J]. Proceedings of the VLDB Endowment, 2008, 1(1): 722-735
-
(2008)
Proceedings of the VLDB Endowment
, vol.1
, Issue.1
, pp. 722-735
-
-
Cheng, R.1
Chen, J.2
Xie, X.3
-
40
-
-
14744293228
-
Minimal-change integrity maintenance using tuple deletions
-
Chomicki J, Marcinkowski J. Minimal-change integrity maintenance using tuple deletions [J]. Information and Computation, 2005, 197(1): 90-121
-
(2005)
Information and Computation
, vol.197
, Issue.1
, pp. 90-121
-
-
Chomicki, J.1
Marcinkowski, J.2
-
41
-
-
0001500141
-
Temporal constraints: A survey
-
Schwalb E, Vila L. Temporal constraints: A survey [J]. Constraints, 1998, 3(2/3): 129-149
-
(1998)
Constraints
, vol.3
, Issue.2-3
, pp. 129-149
-
-
Schwalb, E.1
Vila, L.2
-
42
-
-
80051951228
-
Recognizing patterns in streams with imprecise timestamps
-
Zhang Haopeng, Diao Yanlei, Immerman N. Recognizing patterns in streams with imprecise timestamps [J]. Proceedings of the VLDB Endowment, 2010, 3(1): 244-255
-
(2010)
Proceedings of the VLDB Endowment
, vol.3
, Issue.1
, pp. 244-255
-
-
Zhang, H.1
Diao, Y.2
Immerman, N.3
-
43
-
-
85025410371
-
On the semantics of "now" in databases
-
Clifford J, Dyreson C, Isakowitz T, et al. On the semantics of "now" in databases [J]. ACM Trans on Database Systems (TODS), 1997, 22(2): 171-214
-
(1997)
ACM Trans on Database Systems (TODS)
, vol.22
, Issue.2
, pp. 171-214
-
-
Clifford, J.1
Dyreson, C.2
Isakowitz, T.3
-
44
-
-
84878497828
-
Determining the currency of data
-
Fan Wenfei, Geerts F, Wijsen J. Determining the currency of data [J]. ACM Trans on Database Systems (TODS), 2012, 37(4): 1-46
-
(2012)
ACM Trans on Database Systems (TODS)
, vol.37
, Issue.4
, pp. 1-46
-
-
Fan, W.1
Geerts, F.2
Wijsen, J.3
-
45
-
-
0001592068
-
Automatic linkage of vital records
-
Newcombe H B, Kennedy J M, Axford S J, et al. Automatic linkage of vital records [J]. Science, 1959, 130(3381): 954-959
-
(1959)
Science
, vol.130
, Issue.3381
, pp. 954-959
-
-
Newcombe, H.B.1
Kennedy, J.M.2
Axford, S.J.3
-
47
-
-
84976856849
-
The merge/purge problem for large databases
-
Hernández M A, Stolfo S J. The merge/purge problem for large databases [J]. Proc of ACM SIGMOD Record, 1995, 24(2): 127-138
-
(1995)
Proc of ACM SIGMOD Record
, vol.24
, Issue.2
, pp. 127-138
-
-
Hernández, M.A.1
Stolfo, S.J.2
-
48
-
-
0013331361
-
Real-world data is dirty: Data cleansing and the merge/purge problem
-
Hernández M A, Stolfo S J. Real-world data is dirty: Data cleansing and the merge/purge problem [J]. Data Mining and Knowledge Discovery, 1998, 2(1): 9-37
-
(1998)
Data Mining and Knowledge Discovery
, vol.2
, Issue.1
, pp. 9-37
-
-
Hernández, M.A.1
Stolfo, S.J.2
-
49
-
-
33845667955
-
Duplicate record detection: A survey
-
Elmagarmid A K, Ipeirotis P G, Verykios V S. Duplicate record detection: A survey [J]. IEEE Trans on Knowledge and Data Engineering, 2007, 19(1): 1-16
-
(2007)
IEEE Trans on Knowledge and Data Engineering
, vol.19
, Issue.1
, pp. 1-16
-
-
Elmagarmid, A.K.1
Ipeirotis, P.G.2
Verykios, V.S.3
-
50
-
-
77949817550
-
A survey of entity resolution and record linkage methodologies
-
Brizan D G, Tansel A U. A survey of entity resolution and record linkage methodologies [J]. Communications of the IIMA, 2006, 6(3): 41-50
-
(2006)
Communications of the IIMA
, vol.6
, Issue.3
, pp. 41-50
-
-
Brizan, D.G.1
Tansel, A.U.2
-
52
-
-
0030083481
-
Entity identification in database integration
-
Lim E P, Srivastava J, Prabhakar S, et al. Entity identification in database integration [J]. Information Sciences, 1996, 89(1): 1-38
-
(1996)
Information Sciences
, vol.89
, Issue.1
, pp. 1-38
-
-
Lim, E.P.1
Srivastava, J.2
Prabhakar, S.3
-
53
-
-
52649137537
-
Transformation-based framework for record matching
-
Piscataway, NJ: IEEE
-
Arasu A, Chaudhuri S, Kaushik R. Transformation-based framework for record matching [C]//Proc of IEEE ICDE'08. Piscataway, NJ: IEEE, 2008: 40-49
-
(2008)
Proc of IEEE ICDE'08
, pp. 40-49
-
-
Arasu, A.1
Chaudhuri, S.2
Kaushik, R.3
-
55
-
-
77954696920
-
Learning string transformations from examples
-
Arasu A, Chaudhuri S, Kaushik R. Learning string transformations from examples [J]. Proceedings of the VLDB Endowment, 2009, 2(1): 514-525
-
(2009)
Proceedings of the VLDB Endowment
, vol.2
, Issue.1
, pp. 514-525
-
-
Arasu, A.1
Chaudhuri, S.2
Kaushik, R.3
-
56
-
-
67649649597
-
Large-scale deduplication with constraints using Dedupalog
-
Piscataway, NJ: IEEE
-
Arasu A, Ré C, Suciu D. Large-scale deduplication with constraints using Dedupalog [C]//Proc of IEEE ICDE'09. Piscataway, NJ: IEEE, 2009: 952-963
-
(2009)
Proc of IEEE ICDE'09
, pp. 952-963
-
-
Arasu, A.1
Ré, C.2
Suciu, D.3
-
57
-
-
71349088450
-
Generic entity resolution with negative rules
-
Whang S E, Benjelloun O, Garcia-Molina H. Generic entity resolution with negative rules [J]. The International Journal on Very Large Databases, 2009, 18(6): 1261-1277
-
(2009)
The International Journal on Very Large Databases
, vol.18
, Issue.6
, pp. 1261-1277
-
-
Whang, S.E.1
Benjelloun, O.2
Garcia-Molina, H.3
-
59
-
-
84865086832
-
Reasoning about record matching rules
-
Fan Wenfei, Jia Xibei, Li Jianzhong, et al. Reasoning about record matching rules [J]. Proceedings of the VLDB Endowment, 2009, 2(1): 407-418
-
(2009)
Proceedings of the VLDB Endowment
, vol.2
, Issue.1
, pp. 407-418
-
-
Fan, W.1
Jia, X.2
Li, J.3
-
60
-
-
79960451998
-
Dynamic constraints for record matching
-
Fan Wenfei, Gao Hong, Jia Xibei, et al. Dynamic constraints for record matching [J]. The VLDB Journal, 2011, 20(4): 495-520
-
(2011)
The VLDB Journal
, vol.20
, Issue.4
, pp. 495-520
-
-
Fan, W.1
Gao, H.2
Jia, X.3
-
64
-
-
79957793038
-
Graph homomorphism revisited for graph matching
-
Fan Wenfei, Li Jianzhong, Ma Shuai, et al. Graph homomorphism revisited for graph matching [J]. Proceedings of the VLDB Endowment, 2010, 3(1/2): 1161-1172
-
(2010)
Proceedings of the VLDB Endowment
, vol.3
, Issue.1-2
, pp. 1161-1172
-
-
Fan, W.1
Li, J.2
Ma, S.3
-
65
-
-
79960006256
-
Graph pattern matching: From intractable to polynomial time
-
Fan Wenfei, Li Jianzhong, Ma Shuai, et al. Graph pattern matching: From intractable to polynomial time [J]. Proceedings of the VLDB Endowment, 2010, 3(1/2): 264-275
-
(2010)
Proceedings of the VLDB Endowment
, vol.3
, Issue.1-2
, pp. 264-275
-
-
Fan, W.1
Li, J.2
Ma, S.3
-
66
-
-
79960022349
-
Incremental graph pattern matching
-
New York: ACM
-
Fan Wenfei, Li Jianzhong, Luo Jizhou, et al. Incremental graph pattern matching [C]//Proc of ACM SIGMOD. New York: ACM, 2011: 925-936
-
(2011)
Proc of ACM SIGMOD
, pp. 925-936
-
-
Fan, W.1
Li, J.2
Luo, J.3
-
69
-
-
0004043396
-
An efficient domain-independent algorithm for detecting approximately duplicate database records
-
Berlin: Springer
-
Monge A, Elkan C. An efficient domain-independent algorithm for detecting approximately duplicate database records [C]//Proc of Research Issues on Data Mining and Knowledge Discovery. Berlin: Springer, 1997: 1-7
-
(1997)
Proc of Research Issues on Data Mining and Knowledge Discovery
, pp. 1-7
-
-
Monge, A.1
Elkan, C.2
-
70
-
-
0000666461
-
Data integration using similarity joins and a word-based information representation language
-
Cohen W W. Data integration using similarity joins and a word-based information representation language [J]. ACM Trans on Information Systems (TOIS), 2000, 18(3): 288-321
-
(2000)
ACM Trans on Information Systems (TOIS)
, vol.18
, Issue.3
, pp. 288-321
-
-
Cohen, W.W.1
-
72
-
-
26444550791
-
Robust identification of fuzzy duplicates
-
Piscataway, NJ: IEEE
-
Chaudhuri S, Ganti V, Motwani R. Robust identification of fuzzy duplicates [C]//Proc of IEEE ICDE'05. Piscataway, NJ: IEEE, 2005: 865-876
-
(2005)
Proc of IEEE ICDE'05
, pp. 865-876
-
-
Chaudhuri, S.1
Ganti, V.2
Motwani, R.3
-
73
-
-
79953162324
-
Merging the results of approximate match operations
-
San Francisco, CA: Morgan Kaufmann
-
Guha S, Koudas N, Marathe A, et al. Merging the results of approximate match operations [C]//Proc of the 30th Int Conf on Very Large Databases. San Francisco, CA: Morgan Kaufmann, 2004: 636-647
-
(2004)
Proc of the 30th Int Conf on Very Large Databases
, pp. 636-647
-
-
Guha, S.1
Koudas, N.2
Marathe, A.3
-
75
-
-
84878044770
-
Entity resolution with Markov logic
-
Piscataway, NJ: IEEE
-
Singla P, Domingos P. Entity resolution with Markov logic [C]//Proc of IEEE ICDM'06. Piscataway, NJ: IEEE, 2006: 572-582
-
(2006)
Proc of IEEE ICDM'06
, pp. 572-582
-
-
Singla, P.1
Domingos, P.2
-
76
-
-
52649127789
-
Approximate joins for data-centric XML
-
Piscataway, NJ: IEEE
-
Augsten N, Bohlen M, Dyreson C, et al. Approximate joins for data-centric XML [C]//Proc of IEEE ICDE'08. Piscataway, NJ: IEEE, 2008: 814-823
-
(2008)
Proc of IEEE ICDE'08
, pp. 814-823
-
-
Augsten, N.1
Bohlen, M.2
Dyreson, C.3
-
78
-
-
70849115286
-
Efficient approximate entity extraction with edit distance constraints
-
New York: ACM
-
Wang Wei, Xiao Chuan, Lin Xuemin, et al. Efficient approximate entity extraction with edit distance constraints [C]//Proc of the 35th SIGMOD Int Conf on Management of Data. New York: ACM, 2009: 759-770
-
(2009)
Proc of the 35th SIGMOD Int Conf on Management of Data
, pp. 759-770
-
-
Wang, W.1
Xiao, C.2
Lin, X.3
-
81
-
-
79959944062
-
Interaction between record matching and data repairing
-
New York: ACM
-
Fan Wenfei, Li Jianzhong, Ma Shuai, et al. Interaction between record matching and data repairing [C]//Proc of the 2011 Int Conf on Management of Data. New York: ACM, 2011: 469-480
-
(2011)
Proc of the 2011 Int Conf on Management of Data
, pp. 469-480
-
-
Fan, W.1
Li, J.2
Ma, S.3
-
82
-
-
70349313097
-
Analyses and validation of conditional dependencies with built-in predicates
-
Berlin: Springer
-
Chen W, Fan W, Ma S. Analyses and validation of conditional dependencies with built-in predicates [C]//Proc of DEXA'09. Berlin: Springer, 2009: 576-591
-
(2009)
Proc of DEXA'09
, pp. 576-591
-
-
Chen, W.1
Fan, W.2
Ma, S.3
-
83
-
-
46649106686
-
Conditional functional dependencies for capturing data inconsistencies
-
Fan Wenfei, Geerts F, Jia Xibei, et al. Conditional functional dependencies for capturing data inconsistencies [J]. ACM Trans on Database Systems (TODS), 2008, 33(2): 1-48
-
(2008)
ACM Trans on Database Systems (TODS)
, vol.33
, Issue.2
, pp. 1-48
-
-
Fan, W.1
Geerts, F.2
Jia, X.3
-
84
-
-
77952749687
-
Detecting inconsistencies in distributed data
-
Piscataway, NJ: IEEE
-
Fan Wenfei, Geerts F, Ma Shuai, et al. Detecting inconsistencies in distributed data [C]//Proc of IEEE ICDE'10. Piscataway, NJ: IEEE, 2010: 64-75
-
(2010)
Proc of IEEE ICDE'10
, pp. 64-75
-
-
Fan, W.1
Geerts, F.2
Ma, S.3
-
85
-
-
84864198280
-
Incremental detection of inconsistencies in distributed data
-
Piscataway, NJ: IEEE
-
Fan W, Li J, Tang N, et al. Incremental detection of inconsistencies in distributed data [C]//Proc of IEEE ICDE'12. Piscataway, NJ: IEEE, 2012: 318-329
-
(2012)
Proc of IEEE ICDE'12
, pp. 318-329
-
-
Fan, W.1
Li, J.2
Tang, N.3
-
86
-
-
72649102401
-
Mining document collections to facilitate accurate approximate entity matching
-
Chaudhuri S, Ganti V, Xin D. Mining document collections to facilitate accurate approximate entity matching [J]. Proceedings of the VLDB Endowment, 2009, 2(1): 395-406
-
(2009)
Proceedings of the VLDB Endowment
, vol.2
, Issue.1
, pp. 395-406
-
-
Chaudhuri, S.1
Ganti, V.2
Xin, D.3
-
87
-
-
67649669734
-
A latent topic model for complete entity resolution
-
Piscataway, NJ: IEEE
-
Shu Liangcai, Long Bo, Meng Weiyi. A latent topic model for complete entity resolution [C]//Proc of IEEE ICDE'09. Piscataway, NJ: IEEE, 2009: 880-891
-
(2009)
Proc of IEEE ICDE'09
, pp. 880-891
-
-
Shu, L.1
Long, B.2
Meng, W.3
-
88
-
-
65449139594
-
Automatic record linkage using seeded nearest neighbor and support vector machine classification
-
New York: ACM
-
Christen P. Automatic record linkage using seeded nearest neighbor and support vector machine classification [C]//Proc of the 14th ACM SIGKDD Int Conf on Knowledge Discovery and Data Mining. New York: ACM, 2008: 151-159
-
(2008)
Proc of the 14th ACM SIGKDD Int Conf on Knowledge Discovery and Data Mining
, pp. 151-159
-
-
Christen, P.1
-
91
-
-
83055169894
-
Large-scale collective entity matching
-
Rastogi V, Dalvi N, Garofalakis M. Large-scale collective entity matching [J]. Proceedings of the VLDB Endowment, 2011, 4(4): 208-218
-
(2011)
Proceedings of the VLDB Endowment
, vol.4
, Issue.4
, pp. 208-218
-
-
Rastogi, V.1
Dalvi, N.2
Garofalakis, M.3
-
93
-
-
77952280581
-
HARRA: Fast iterative hashed record linkage for large-scale data collections
-
New York: ACM
-
Kim H, Lee D. HARRA: Fast iterative hashed record linkage for large-scale data collections [C]//Proc of the 13th Int Conf on Extending Database Technology. New York: ACM, 2010: 525-536
-
(2010)
Proc of the 13th Int Conf on Extending Database Technology
, pp. 525-536
-
-
Kim, H.1
Lee, D.2
-
94
-
-
84876687819
-
Data partitioning for parallel entity matching
-
Kirsten T, Kolb L, Hartung M, et al. Data partitioning for parallel entity matching [J]. Proceedings of the VLDB Endowment, 2010, 3(2): 1-8
-
(2010)
Proceedings of the VLDB Endowment
, vol.3
, Issue.2
, pp. 1-8
-
-
Kirsten, T.1
Kolb, L.2
Hartung, M.3
-
95
-
-
84878049861
-
Adaptive blocking: Learning to scale up record linkage
-
Piscataway, NJ: IEEE
-
Bilenko M, Kamath B, Mooney R J. Adaptive blocking: Learning to scale up record linkage [C]//Proc of IEEE ICDM'06. Piscataway, NJ: IEEE, 2006: 87-96
-
(2006)
Proc of IEEE ICDM'06
, pp. 87-96
-
-
Bilenko, M.1
Kamath, B.2
Mooney, R.J.3
-
97
-
-
5444258997
-
A comparison of fast blocking methods for record linkage
-
New York: ACM
-
Baxter R, Christen P, Churches T. A comparison of fast blocking methods for record linkage [C]//Proc of ACM SIGKDD Workshop. New York: ACM, 2003: 25-27
-
(2003)
Proc of ACM SIGKDD Workshop
, pp. 25-27
-
-
Baxter, R.1
Christen, P.2
Churches, T.3
-
102
-
-
33749597967
-
A primitive operator for similarity joins in data cleaning
-
Piscataway, NJ: IEEE
-
Chaudhuri S, Ganti V, Kaushik R. A primitive operator for similarity joins in data cleaning [C]//Proc of IEEE ICDE'06. Piscataway, NJ: IEEE, 2006: 5-5
-
(2006)
Proc of IEEE ICDE'06
, pp. 5-5
-
-
Chaudhuri, S.1
Ganti, V.2
Kaushik, R.3
-
103
-
-
67649641448
-
Space-constrained gram-based indexing for efficient approximate string search
-
Piscataway, NJ: IEEE
-
Behm A, Ji S, Li C, et al. Space-constrained gram-based indexing for efficient approximate string search [C]//Proc of IEEE ICDE'09. Piscataway, NJ: IEEE, 2009: 604-615
-
(2009)
Proc of IEEE ICDE'09
, pp. 604-615
-
-
Behm, A.1
Ji, S.2
Li, C.3
-
104
-
-
80052344031
-
Efficient similarity joins for near-duplicate detection
-
Xiao Chuan, Wang Wei, Lin Xuemin, et al. Efficient similarity joins for near-duplicate detection [J]. ACM Trans on Database Systems (TODS), 2011, 36(3): 15
-
(2011)
ACM Trans on Database Systems (TODS)
, vol.36
, Issue.3
, pp. 15
-
-
Xiao, C.1
Wang, W.2
Lin, X.3
-
105
-
-
79959936774
-
Reference-based alignment in large sequence databases
-
Papapetrou P, Athitsos V, Kollios G, et al. Reference-based alignment in large sequence databases [J]. Proceedings of the VLDB Endowment, 2009, 2(1): 205-216
-
(2009)
Proceedings of the VLDB Endowment
, vol.2
, Issue.1
, pp. 205-216
-
-
Papapetrou, P.1
Athitsos, V.2
Kollios, G.3
-
106
-
-
57149130672
-
Cost-based variable-length-gram selection for string collections to support approximate queries efficiently
-
New York: ACM
-
Yang Xiaochun, Wang Bin, Li Chen. Cost-based variable-length-gram selection for string collections to support approximate queries efficiently [C]//Proc of the 2008 ACM SIGMOD Int Conf on Management of Data. New York: ACM, 2008: 353-364
-
(2008)
Proc of the 2008 ACM SIGMOD Int Conf on Management of Data
, pp. 353-364
-
-
Yang, X.1
Wang, B.2
Li, C.3
-
107
-
-
85011032600
-
VGRAM: Improving performance of approximate queries on string collections using variable-length grams
-
San Francisco, CA: Morgan Kaufmann
-
Li Chen, Wang Bin, Yang Xiaochun. VGRAM: Improving performance of approximate queries on string collections using variable-length grams [C]//Proc of the 33rd Int Conf on Very Large Databases. San Francisco, CA: Morgan Kaufmann, 2007: 303-314
-
(2007)
Proc of the 33rd Int Conf on Very Large Databases
, pp. 303-314
-
-
Li, C.1
Wang, B.2
Yang, X.3
-
108
-
-
52649086729
-
Efficient merging and filtering algorithms for approximate string searches
-
Piscataway, NJ: IEEE
-
Li Chen, Lu Jiaheng, Lu Yiming. Efficient merging and filtering algorithms for approximate string searches [C]//Proc of IEEE ICDE'08. Piscataway, NJ: IEEE, 2008: 257-266
-
(2008)
Proc of IEEE ICDE'08
, pp. 257-266
-
-
Li, C.1
Lu, J.2
Lu, Y.3
-
109
-
-
52649161208
-
A fast similarity join algorithm using graphics processing units
-
Piscataway, NJ: IEEE
-
Lieberman M D, Sankaranarayanan J, Samet H. A fast similarity join algorithm using graphics processing units [C]//Proc of IEEE ICDE'08. Piscataway, NJ: IEEE, 2008: 1111-1120
-
(2008)
Proc of IEEE ICDE'08
, pp. 1111-1120
-
-
Lieberman, M.D.1
Sankaranarayanan, J.2
Samet, H.3
-
110
-
-
14644439871
-
Fast detection of XML structural similarity
-
Flesca S, Manco G, Masciari E, et al. Fast detection of XML structural similarity [J]. IEEE Trans on Knowledge and Data Engineering, 2005, 17(2): 160-175
-
(2005)
IEEE Trans on Knowledge and Data Engineering
, vol.17
, Issue.2
, pp. 160-175
-
-
Flesca, S.1
Manco, G.2
Masciari, E.3
-
111
-
-
77952779390
-
Hashing tree-structured data: Methods and applications
-
Piscataway, NJ: IEEE
-
Tatikonda S, Parthasarathy S. Hashing tree-structured data: Methods and applications [C]//Proc of IEEE ICDE'10. Piscataway, NJ: IEEE, 2010: 429-440
-
(2010)
Proc of IEEE ICDE'10
, pp. 429-440
-
-
Tatikonda, S.1
Parthasarathy, S.2
-
112
-
-
74049138802
-
Development and user experiences of an open source data cleaning, deduplication and record linkage system
-
Christen P. Development and user experiences of an open source data cleaning, deduplication and record linkage system [J]. ACM SIGKDD Explorations Newsletter, 2009, 11(1): 39-48
-
(2009)
ACM SIGKDD Explorations Newsletter
, vol.11
, Issue.1
, pp. 39-48
-
-
Christen, P.1
-
113
-
-
58149472338
-
Swoosh: a generic approach to entity resolution
-
Benjelloun O, Garcia-Molina H, Menestrina D, et al. Swoosh: a generic approach to entity resolution [J]. The International Journal on Very Large Databases, 2009, 18(1): 255-276
-
(2009)
The International Journal on Very Large Databases
, vol.18
, Issue.1
, pp. 255-276
-
-
Benjelloun, O.1
Garcia-Molina, H.2
Menestrina, D.3
-
114
-
-
85011029434
-
Example-driven design of efficient record matching queries
-
San Francisco, CA: Morgan Kaufmann
-
Chaudhuri S, Chen B C, Ganti V, et al. Example-driven design of efficient record matching queries [C]//Proc of the 33rd Int Conf on Very Large Databases. San Francisco, CA: Morgan Kaufmann, 2007: 327-338
-
(2007)
Proc of the 33rd Int Conf on Very Large Databases
, pp. 327-338
-
-
Chaudhuri, S.1
Chen, B.C.2
Ganti, V.3
-
119
-
-
80455148340
-
Evaluation of entity resolution approaches on real-world match problems
-
Köpcke H, Thor A, Rahm E. Evaluation of entity resolution approaches on real-world match problems [J]. Proceedings of the VLDB Endowment, 2010, 3(1/2): 484-493
-
(2010)
Proceedings of the VLDB Endowment
, vol.3
, Issue.1-2
, pp. 484-493
-
-
Köpcke, H.1
Thor, A.2
Rahm, E.3
-
120
-
-
80052419079
-
Comparative evaluation of entity resolution approaches with FEVER
-
Köpcke H, Thor A, Rahm E. Comparative evaluation of entity resolution approaches with FEVER [J]. Proceedings of the VLDB Endowment, 2009, 2(2): 1574-1577
-
(2009)
Proceedings of the VLDB Endowment
, vol.2
, Issue.2
, pp. 1574-1577
-
-
Köpcke, H.1
Thor, A.2
Rahm, E.3
-
121
-
-
72649095071
-
Frameworks for entity matching: A comparison
-
Köpcke H, Rahm E. Frameworks for entity matching: A comparison [J]. Data & Knowledge Engineering, 2010, 69(2): 197-210
-
(2010)
Data & Knowledge Engineering
, vol.69
, Issue.2
, pp. 197-210
-
-
Köpcke, H.1
Rahm, E.2
-
122
-
-
79960270026
-
Evaluating entity resolution results
-
Menestrina D, Whang S E, Garcia-Molina H. Evaluating entity resolution results [J]. Proceedings of the VLDB Endowment, 2010, 3(1/2): 208-219
-
(2010)
Proceedings of the VLDB Endowment
, vol.3
, Issue.1-2
, pp. 208-219
-
-
Menestrina, D.1
Whang, S.E.2
Garcia-Molina, H.3
-
123
-
-
29844436973
-
A cost-based model and effective heuristic for repairing constraints by value modification
-
New York: ACM
-
Bohannon P, Fan Wenfei, Flaster M, et al. A cost-based model and effective heuristic for repairing constraints by value modification [C]//Proc of the 2005 ACM SIGMOD Int Conf on Management of Data. New York: ACM, 2005: 143-154
-
(2005)
Proc of the 2005 ACM SIGMOD Int Conf on Management of Data
, pp. 143-154
-
-
Bohannon, P.1
Fan, W.2
Flaster, M.3
-
124
-
-
84959912087
-
Improving data quality: Consistency and accuracy
-
San Francisco, CA: Morgan Kaufmann
-
Cong Gao, Fan Wenfei, Geerts F, et al. Improving data quality: Consistency and accuracy [C]//Proc of the 33rd Int Conf on Very Large Databases. San Francisco, CA: Morgan Kaufmann, 2007: 315-326
-
(2007)
Proc of the 33rd Int Conf on Very Large Databases
, pp. 315-326
-
-
Cong, G.1
Fan, W.2
Geerts, F.3
-
125
-
-
80052917068
-
Sampling the repairs of functional dependency violations under hard constraints
-
Beskales G, Ilyas I F, Golab L. Sampling the repairs of functional dependency violations under hard constraints [J]. Proceedings of the VLDB Endowment, 2010, 3(1/2): 197-207
-
(2010)
Proceedings of the VLDB Endowment
, vol.3
, Issue.1-2
, pp. 197-207
-
-
Beskales, G.1
Ilyas, I.F.2
Golab, L.3
-
126
-
-
77954736322
-
Consistent query answers in inconsistent probabilistic databases
-
New York: ACM
-
Lian Xiang, Chen Lei, Song Shaoxu. Consistent query answers in inconsistent probabilistic databases [C]//Proc of the 2010 Int Conf on Management of Data. New York: ACM, 2010: 303-314
-
(2010)
Proc of the 2010 Int Conf on Management of Data
, pp. 303-314
-
-
Lian, X.1
Chen, L.2
Song, S.3
-
127
-
-
52649155017
-
A sampling-based approach to information recovery
-
Piscataway, NJ: IEEE
-
Xie Junyi, Yang Jun, Chen Yuguo, et al. A sampling-based approach to information recovery [C]//Proc of IEEE ICDE'08. Piscataway, NJ: IEEE, 2008: 476-485
-
(2008)
Proc of IEEE ICDE'08
, pp. 476-485
-
-
Xie, J.1
Yang, J.2
Chen, Y.3
-
130
-
-
34249872509
-
In-network outlier cleaning for data collection in sensor networks
-
New York: VLDB Endowment
-
Zhuang Yongzhen, Chen Lei. In-network outlier cleaning for data collection in sensor networks [C]//Proc of VLDB Workshop on CleanDB. New York: VLDB Endowment, 2006: 41-48
-
(2006)
Proc of VLDB Workshop on CleanDB
, pp. 41-48
-
-
Zhuang, Y.1
Chen, L.2
-
132
-
-
77954695997
-
Modeling and querying possible repairs in duplicate detection
-
Beskales G, Soliman M A, Ilyas I F, et al. Modeling and querying possible repairs in duplicate detection [J]. Proceedings of the VLDB Endowment, 2009, 2(1): 598-609
-
(2009)
Proceedings of the VLDB Endowment
, vol.2
, Issue.1
, pp. 598-609
-
-
Beskales, G.1
Soliman, M.A.2
Ilyas, I.F.3
-
133
-
-
33749588820
-
Clean answers over dirty databases: A probabilistic approach
-
Piscataway, NJ: IEEE
-
Andritsos P, Fuxman A, Miller R J. Clean answers over dirty databases: A probabilistic approach [C]//Proc of IEEE ICDE'06. Piscataway, NJ: IEEE, 2006: 30-30
-
(2006)
Proc of IEEE ICDE'06
, pp. 30-30
-
-
Andritsos, P.1
Fuxman, A.2
Miller, R.J.3
-
135
-
-
72649086387
-
Framework for evaluating clustering algorithms in duplicate detection
-
Hassanzadeh O, Chiang F, Lee H C, et al. Framework for evaluating clustering algorithms in duplicate detection [J]. Proceedings of the VLDB Endowment, 2009, 2(1): 1282-1293
-
(2009)
Proceedings of the VLDB Endowment
, vol.2
, Issue.1
, pp. 1282-1293
-
-
Hassanzadeh, O.1
Chiang, F.2
Lee, H.C.3
-
137
-
-
79960023714
-
Record linkage with uniqueness constraints and erroneous values
-
Guo Songtao, Dong X L, Srivastava D, et al. Record linkage with uniqueness constraints and erroneous values [J]. Proceedings of the VLDB Endowment, 2010, 3(1/2): 417-428
-
(2010)
Proceedings of the VLDB Endowment
, vol.3
, Issue.1-2
, pp. 417-428
-
-
Guo, S.1
Dong, X.L.2
Srivastava, D.3
-
140
-
-
33745628835
-
ConQuer: A system for efficient querying over inconsistent databases
-
San Francisco, CA: Morgan Kaufmann
-
Fuxman A, Fuxman D, Miller R J. ConQuer: A system for efficient querying over inconsistent databases [C]//Proc of the 31st Int Conf on Very Large Databases. San Francisco, CA: Morgan Kaufmann, 2005: 1354-1357
-
(2005)
Proc of the 31st Int Conf on Very Large Databases
, pp. 1354-1357
-
-
Fuxman, A.1
Fuxman, D.2
Miller, R.J.3
-
141
-
-
0024941096
-
Integrity=validity+completeness
-
Motro A. Integrity=validity+completeness [J]. ACM Trans on Database Systems (TODS), 1989, 14(4): 480-502
-
(1989)
ACM Trans on Database Systems (TODS)
, vol.14
, Issue.4
, pp. 480-502
-
-
Motro, A.1
-
142
-
-
0002842314
-
Obtaining complete answers from incomplete databases
-
San Francisco, CA: Morgan Kaufmann
-
Levy A. Obtaining complete answers from incomplete databases [C]//Proc of the 22nd Int Conf on Very Large Databases. San Francisco, CA: Morgan Kaufmann, 1996: 402-412
-
(1996)
Proc of the 22nd Int Conf on Very Large Databases
, pp. 402-412
-
-
Levy, A.1
-
144
-
-
67649637305
-
Resolution-aware query answering for business intelligence
-
Piscataway, NJ: IEEE
-
Sismanis Y, Wang L, Fuxman A, et al. Resolution-aware query answering for business intelligence [C]//Proc of IEEE ICDE'09. Piscataway, NJ: IEEE, 2009: 976-987
-
(2009)
Proc of IEEE ICDE'09
, pp. 976-987
-
-
Sismanis, Y.1
Wang, L.2
Fuxman, A.3
-
149
-
-
77955171415
-
Mining frequent subgraph patterns from uncertain graph data
-
Zou Zhaonian, Li Jianzhong, Gao Hong, et al. Mining frequent subgraph patterns from uncertain graph data [J]. IEEE Trans on Knowledge and Data Engineering, 2010, 22(9): 1203-1218
-
(2010)
IEEE Trans on Knowledge and Data Engineering
, vol.22
, Issue.9
, pp. 1203-1218
-
-
Zou, Z.1
Li, J.2
Gao, H.3
-
151
-
-
84869506153
-
Mining frequent subgraphs over uncertain graph databases under probabilistic semantics
-
Li Jianzhong, Zou Zhaonian, Gao Hong. Mining frequent subgraphs over uncertain graph databases under probabilistic semantics [J]. The VLDB Journal, 2012, 21(6): 753-777
-
(2012)
The VLDB Journal
, vol.21
, Issue.6
, pp. 753-777
-
-
Li, J.1
Zou, Z.2
Gao, H.3
-
152
-
-
77952764293
-
Finding top-k maximal cliques in an uncertain graph
-
Piscataway, NJ: IEEE
-
Zou Zhaonian, Li Jianzhong, Gao Hong, et al. Finding top-k maximal cliques in an uncertain graph [C]//Proc of IEEE ICDE'10. Piscataway, NJ: IEEE, 2010: 649-652
-
(2010)
Proc of IEEE ICDE'10
, pp. 649-652
-
-
Zou, Z.1
Li, J.2
Gao, H.3
-
153
-
-
84874024305
-
Reliable clustering on uncertain graphs
-
Piscataway, NJ: IEEE
-
Liu Lin, Jin Ruoming, Aggarwal C C, et al. Reliable clustering on uncertain graphs [C]//Proc of IEEE ICDM'12. Piscataway, NJ: IEEE, 2012: 459-468
-
(2012)
Proc of IEEE ICDM'12
, pp. 459-468
-
-
Liu, L.1
Jin, R.2
Aggarwal, C.C.3
-
154
-
-
80052652443
-
Distance-constraint reachability computation in uncertain graphs
-
Jin Ruoming, Liu Lin, Ding Bolin, et al. Distance-constraint reachability computation in uncertain graphs [J]. Proceedings of the VLDB Endowment, 2011, 4(9): 551-562
-
(2011)
Proceedings of the VLDB Endowment
, vol.4
, Issue.9
, pp. 551-562
-
-
Jin, R.1
Liu, L.2
Ding, B.3
-
155
-
-
0035295154
-
Toward virtual community knowledge evolution
-
Bieber M, Engelbart D, Furuta R, et al. Toward virtual community knowledge evolution [J]. Journal of Management Information Systems, 2002, 18(4): 11-35
-
(2002)
Journal of Management Information Systems
, vol.18
, Issue.4
, pp. 11-35
-
-
Bieber, M.1
Engelbart, D.2
Furuta, R.3
-
158
-
-
80053318389
-
Rule induction for uncertain data
-
Qin Biao, Xia Yuni, Prabhakar S. Rule induction for uncertain data [J]. Knowledge and Information Systems, 2011, 29(1): 103-130
-
(2011)
Knowledge and Information Systems
, vol.29
, Issue.1
, pp. 103-130
-
-
Qin, B.1
Xia, Y.2
Prabhakar, S.3