-
1
-
-
84976512265
-
Temporal rules discovery for web data cleaning
-
Z. Abedjan, C. Akcora, M. Ouzzani, P. Papotti, and M. Stonebraker. Temporal rules discovery for web data cleaning. PVLDB, 9(4):336 -347, 2015.
-
(2015)
PVLDB
, vol.9
, Issue.4
, pp. 336 -347
-
-
Abedjan, Z.1
Akcora, C.2
Ouzzani, M.3
Papotti, P.4
Stonebraker, M.5
-
2
-
-
84937966957
-
Profiling relational data: a survey
-
Z. Abedjan, L. Golab, and F. Naumann. Profiling relational data: a survey. VLDB Journal, 24(4):557-581, 2015.
-
(2015)
VLDB Journal
, vol.24
, Issue.4
, pp. 557-581
-
-
Abedjan, Z.1
Golab, L.2
Naumann, F.3
-
3
-
-
85013643396
-
DataXFormer: A robust data transformation system
-
Z. Abedjan, J. Morcos, I. F. Ilyas, P. Papotti, M. Ouzzani, and M. Stonebraker. DataXFormer: A robust data transformation system. In ICDE, 2016.
-
(2016)
ICDE
-
-
Abedjan, Z.1
Morcos, J.2
Ilyas, I.F.3
Papotti, P.4
Ouzzani, M.5
Stonebraker, M.6
-
4
-
-
84975824359
-
Messing-Up with BART: Error Generation for Evaluating Data Cleaning Algorithms
-
P. C. Arocena, B. Glavic, G. Mecca, R. J. Miller, P. Papotti, and D. Santoro. Messing-Up with BART: Error Generation for Evaluating Data Cleaning Algorithms. PVLDB, 9(2):36-47, 2015.
-
(2015)
PVLDB
, vol.9
, Issue.2
, pp. 36-47
-
-
Arocena, P.C.1
Glavic, B.2
Mecca, G.3
Miller, R.J.4
Papotti, P.5
Santoro, D.6
-
5
-
-
68049121093
-
Anomaly detection: A survey
-
July
-
V. Chandola, A. Banerjee, and V. Kumar. Anomaly detection: A survey. ACM Comput. Surv., 41(3):15:1-15:58, July 2009.
-
(2009)
ACM Comput. Surv.
, vol.41
, Issue.3
, pp. 1-58
-
-
Chandola, V.1
Banerjee, A.2
Kumar, V.3
-
6
-
-
84881365460
-
Holistic data cleaning: Putting violations into context
-
X. Chu, I. F. Ilyas, and P. Papotti. Holistic data cleaning: Putting violations into context. In ICDE, 2013.
-
(2013)
ICDE
-
-
Chu, X.1
Ilyas, I.F.2
Papotti, P.3
-
7
-
-
84957586399
-
Katara: A data cleaning system powered by knowledge bases and crowdsourcing
-
X. Chu, J. Morcos, I. F. Ilyas, M. Ouzzani, P. Papotti, N. Tang, and Y. Ye. Katara: A data cleaning system powered by knowledge bases and crowdsourcing. In SIGMOD, 2015.
-
(2015)
SIGMOD
-
-
Chu, X.1
Morcos, J.2
Ilyas, I.F.3
Ouzzani, M.4
Papotti, P.5
Tang, N.6
Ye, Y.7
-
8
-
-
84880546390
-
Nadeef: A commodity data cleaning system
-
M. Dallachiesa, A. Ebaid, A. Eldawy, A. Elmagarmid, I. F. Ilyas, M. Ouzzani, and N. Tang. Nadeef: A commodity data cleaning system. In SIGMOD, 2013.
-
(2013)
SIGMOD
-
-
Dallachiesa, M.1
Ebaid, A.2
Eldawy, A.3
Elmagarmid, A.4
Ilyas, I.F.5
Ouzzani, M.6
Tang, N.7
-
9
-
-
84873193579
-
Statistical distortion: Consequences of data cleaning
-
T. Dasu and J. M. Loh. Statistical distortion: Consequences of data cleaning. PVLDB, 5(11):1674-1683, 2012.
-
(2012)
PVLDB
, vol.5
, Issue.11
, pp. 1674-1683
-
-
Dasu, T.1
Loh, J.M.2
-
12
-
-
84969569127
-
Rayyan: a systematic reviews web app for exploring and filtering searches for eligible studies for cochrane reviews
-
John Wiley & Sons, Sept.
-
A. Elmagarmid, Z. Fedorowicz, H. Hammady, I. Ilyas, M. Khabsa, and O. Mourad. Rayyan: a systematic reviews web app for exploring and filtering searches for eligible studies for cochrane reviews. In Abstracts of the 22nd Cochrane Colloquium, page 9. John Wiley & Sons, Sept. 2014.
-
(2014)
Abstracts of the 22nd Cochrane Colloquium
, pp. 9
-
-
Elmagarmid, A.1
Fedorowicz, Z.2
Hammady, H.3
Ilyas, I.4
Khabsa, M.5
Mourad, O.6
-
13
-
-
33845667955
-
Duplicate record detection: A survey
-
A. K. Elmagarmid, P. G. Ipeirotis, and V. S. Verykios. Duplicate record detection: A survey. IEEE Transactions on Knowledge and Data Engineering (TKDE), 19(1):1-16, 2007.
-
(2007)
IEEE Transactions on Knowledge and Data Engineering (TKDE)
, vol.19
, Issue.1
, pp. 1-16
-
-
Elmagarmid, A.K.1
Ipeirotis, P.G.2
Verykios, V.S.3
-
14
-
-
84975799236
-
Foundations of Data Quality Management
-
W. Fan and F. Geerts. Foundations of Data Quality Management. Morgan & Claypool, 2012.
-
(2012)
Morgan & Claypool
-
-
Fan, W.1
Geerts, F.2
-
15
-
-
84858614433
-
Towards certain fixes with editing rules and master data
-
W. Fan, J. Li, S. Ma, N. Tang, and W. Yu. Towards certain fixes with editing rules and master data. VLDB Journal, 21(2):213-238, 2012.
-
(2012)
VLDB Journal
, vol.21
, Issue.2
, pp. 213-238
-
-
Fan, W.1
Li, J.2
Ma, S.3
Tang, N.4
Yu, W.5
-
17
-
-
77951521102
-
Quantitative data cleaning for large databases
-
J. M. Hellerstein. Quantitative data cleaning for large databases, 2008.
-
(2008)
-
-
Hellerstein, J.M.1
-
19
-
-
84958053976
-
Trends in cleaning relational data: Consistency and deduplication
-
I. F. Ilyas and X. Chu. Trends in cleaning relational data: Consistency and deduplication. Foundations and Trends in Databases, 5(4):281-393, 2015.
-
(2015)
Foundations and Trends in Databases
, vol.5
, Issue.4
, pp. 281-393
-
-
Ilyas, I.F.1
Chu, X.2
-
21
-
-
85013637311
-
Wrangler: Interactive visual specification of data transformation scripts
-
S. Kandel, A. Paepcke, J. Hellerstein, and J. Heer. Wrangler: Interactive visual specification of data transformation scripts. New York, NY, USA, 2011.
-
(2011)
New York, NY, USA
-
-
Kandel, S.1
Paepcke, A.2
Hellerstein, J.3
Heer, J.4
-
22
-
-
84867627474
-
Enterprise data analysis and visualization: An interview study
-
S. Kandel, A. Paepcke, J. M. Hellerstein, and J. Heer. Enterprise data analysis and visualization: An interview study. IEEE Trans. Vis. Comput. Graph., 18(12):2917-2926, 2012.
-
(2012)
IEEE Trans. Vis. Comput. Graph.
, vol.18
, Issue.12
, pp. 2917-2926
-
-
Kandel, S.1
Paepcke, A.2
Hellerstein, J.M.3
Heer, J.4
-
23
-
-
84949872769
-
Bigdansing: A system for big data cleansing
-
Z. Khayyat, I. F. Ilyas, A. Jindal, S. Madden, M. Ouzzani, P. Papotti, J.-A. Quiané-Ruiz, N. Tang, and S. Yin. Bigdansing: A system for big data cleansing. In SIGMOD, pages 1215-1230, 2015.
-
(2015)
SIGMOD
, pp. 1215-1230
-
-
Khayyat, Z.1
Ilyas, I.F.2
Jindal, A.3
Madden, S.4
Ouzzani, M.5
Papotti, P.6
Quiané-Ruiz, J.-A.7
Tang, N.8
Yin, S.9
-
24
-
-
0037240183
-
A taxonomy of dirty data
-
Jan.
-
W. Kim, B.-J. Choi, E.-K. Hong, S.-K. Kim, and D. Lee. A taxonomy of dirty data. Data Min. Knowl. Discov., 7(1):81-99, Jan. 2003.
-
(2003)
Data Min. Knowl. Discov.
, vol.7
, Issue.1
, pp. 81-99
-
-
Kim, W.1
Choi, B.-J.2
Hong, E.-K.3
Kim, S.-K.4
Lee, D.5
-
25
-
-
77951101246
-
On Approximating Optimum Repairs for Functional Dependency Violations
-
S. Kolahi and L. V. S. Lakshmanan. On Approximating Optimum Repairs for Functional Dependency Violations. In ICDT, 2009.
-
(2009)
ICDT
-
-
Kolahi, S.1
Lakshmanan, L.V.S.2
-
27
-
-
85013647857
-
Outlier detection in heterogeneous datasets using automatic tuple expansion
-
Technical Report MIT-CSAIL-TR-2016-002, CSAIL, MIT, 32 Vassar Street, Cambridge MA 02139, February
-
C. Pit-Claudel, Z. Mariet, R. Harding, and S. Madden. Outlier detection in heterogeneous datasets using automatic tuple expansion. Technical Report MIT-CSAIL-TR-2016-002, CSAIL, MIT, 32 Vassar Street, Cambridge MA 02139, February 2016.
-
(2016)
-
-
Claudel-Pit, C.1
Mariet, Z.2
Harding, R.3
Madden, S.4
-
28
-
-
84976473806
-
Combining quantitative and logical data cleaning
-
N. Prokoshyna, J. Szlichta, F. Chiang, R. J. Miller, and D. Srivastava. Combining quantitative and logical data cleaning. PVLDB, 9(4):300-311, 2015.
-
(2015)
PVLDB
, vol.9
, Issue.4
, pp. 300-311
-
-
Prokoshyna, N.1
Szlichta, J.2
Chiang, F.3
Miller, R.J.4
Srivastava, D.5
-
29
-
-
0002490026
-
Data cleaning: Problems and current approaches
-
E. Rahm and H.-H. Do. Data cleaning: Problems and current approaches. IEEE Data Engineering Bulletin, 23(4):3-13, 2000.
-
(2000)
IEEE Data Engineering Bulletin
, vol.23
, Issue.4
, pp. 3-13
-
-
Rahm, E.1
Do, H.-H.2
-
30
-
-
85084016251
-
Data curation at scale: The Data Tamer system
-
M. Stonebraker, D. Bruckner, I. F. Ilyas, G. Beskales, M. Cherniack, S. Zdonik, A. Pagan, and S. Xu. Data curation at scale: The Data Tamer system. In CIDR, 2013.
-
(2013)
CIDR
-
-
Stonebraker, M.1
Bruckner, D.2
Ilyas, I.F.3
Beskales, G.4
Cherniack, M.5
Zdonik, S.6
Pagan, A.7
Xu, S.8
-
31
-
-
35148867982
-
Yago: a core of semantic knowledge
-
F. M. Suchanek, G. Kasneci, and G. Weikum. Yago: a core of semantic knowledge. In WWW, pages 697-706, 2007.
-
(2007)
WWW
, pp. 697-706
-
-
Suchanek, F.M.1
Kasneci, G.2
Weikum, G.3
-
32
-
-
84952767080
-
Seedb: Effcient data-driven visualization recommendations to support visual analytics
-
Sept.
-
M. Vartak, S. Rahman, S. Madden, A. Parameswaran, and N. Polyzotis. Seedb: Effcient data-driven visualization recommendations to support visual analytics. PVLDB, 8(13):2182-2193, Sept. 2015.
-
(2015)
PVLDB
, vol.8
, Issue.13
, pp. 2182-2193
-
-
Vartak, M.1
Rahman, S.2
Madden, S.3
Parameswaran, A.4
Polyzotis, N.5
-
33
-
-
84904293819
-
Towards dependable data repairing with fixing rules
-
J. Wang and N. Tang. Towards dependable data repairing with fixing rules. In SIGMOD, pages 457-468, 2014.
-
(2014)
SIGMOD
, pp. 457-468
-
-
Wang, J.1
Tang, N.2
-
34
-
-
84881115711
-
Scorpion: Explaining away outliers in aggregate queries
-
June
-
E. Wu and S. Madden. Scorpion: Explaining away outliers in aggregate queries. PVLDB, 6(8):553-564, June 2013.
-
(2013)
PVLDB
, vol.6
, Issue.8
, pp. 553-564
-
-
Wu, E.1
Madden, S.2
|