메뉴 건너뛰기




Volumn , Issue , 2014, Pages 1294-1297

Data quality: The other face of Big Data

Author keywords

[No Author keywords available]

Indexed keywords

QUALITY MANAGEMENT; REPAIR; SEMANTICS;

EID: 84901781647     PISSN: 10844627     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICDE.2014.6816764     Document Type: Conference Paper
Times cited : (175)

References (70)
  • 1
    • 35048821930 scopus 로고    scopus 로고
    • Correctors for xml data
    • U. Boobna and M. de Rougemont: Correctors for XML data. XSym 2004: 97-111.
    • (2004) XSym , pp. 97-111
    • Boobna, U.1    De Rougemont, M.2
  • 2
    • 29844436973 scopus 로고    scopus 로고
    • A cost-based model and effective heuristic for repairing constraints by value modification
    • P. Bohannon, M. Flaster, W. Fan and R. Rastogi: A cost-based model and effective heuristic for repairing constraints by value modification. SIGMOD 2005: 143-154.
    • (2005) SIGMOD , pp. 143-154
    • Bohannon, P.1    Flaster, M.2    Fan, W.3    Rastogi, R.4
  • 4
    • 85011024333 scopus 로고    scopus 로고
    • Extending dependencies with conditions
    • L. Bravo, W. Fan and S. Ma: Extending dependencies with conditions, VLDB 2007: 243-254.
    • (2007) VLDB , pp. 243-254
    • Bravo, L.1    Fan, W.2    Ma, S.3
  • 5
    • 80052917068 scopus 로고    scopus 로고
    • Sampling the repairs of functional dependency violations under hard constraints
    • G. Beskales, I. F. Ilyas and L. Golab: Sampling the repairs of functional dependency violations under hard constraints. PVLDB 3(1): 197-207 (2010).
    • (2010) PVLDB , vol.3 , Issue.1 , pp. 197-207
    • Beskales, G.1    Ilyas, I.F.2    Golab, L.3
  • 6
    • 84881320841 scopus 로고    scopus 로고
    • On the relative trust between inconsistent data and inaccurate constraints
    • G. Beskales, I. F. Ilyas, L. Golab and A. Galiullin: On the relative trust between inconsistent data and inaccurate constraints. ICDE 2013: 541-552.
    • (2013) ICDE , pp. 541-552
    • Beskales, G.1    Ilyas, I.F.2    Golab, L.3    Galiullin, A.4
  • 7
    • 79952344949 scopus 로고    scopus 로고
    • Lakshmanan: Data cleaning and query answering with matching dependencies and matching functions
    • L. Bertossi, S. Kolahi and Laks V.S. Lakshmanan: Data cleaning and query answering with matching dependencies and matching functions. ICDT 2011: 268-279.
    • (2011) ICDT , pp. 268-279
    • Bertossi, L.1    Kolahi, S.2    Laks, V.S.3
  • 8
    • 84893833429 scopus 로고    scopus 로고
    • Inference of concise dtds from xml data
    • G. J. Bex, F. Neven, T. Schwentick and K. Tuyls: Inference of concise DTDs from XML data. VLDB 2006: 115-126.
    • (2006) VLDB , pp. 115-126
    • Bex, G.J.1    Neven, F.2    Schwentick, T.3    Tuyls, K.4
  • 9
    • 84882718707 scopus 로고    scopus 로고
    • Inferring xml schema definitions from xml data
    • G. J. Bex, F. Neven and S. Vansummeren: Inferring XML schema definitions from XML data. VLDB 2007: 998-1009.
    • (2007) VLDB , pp. 998-1009
    • Bex, G.J.1    Neven, F.2    Vansummeren, S.3
  • 11
    • 85048982694 scopus 로고    scopus 로고
    • Improving data quality: Consistency and accuracy
    • G. Cong, W. Fan, F. Geerts, X. Jia and S. Ma: Improving data quality: consistency and accuracy. VLDB 2007: 315326.
    • (2007) VLDB , pp. 315326
    • Cong, G.1    Fan, W.2    Geerts, F.3    Jia, X.4    Ma, S.5
  • 12
    • 84880523644 scopus 로고    scopus 로고
    • Determining the relative accuracy of attributes
    • Y. Cao, W. Fan and W. Yu: Determining the relative accuracy of attributes. SIGMOD 2013: 565-576.
    • (2013) SIGMOD , pp. 565-576
    • Cao, Y.1    Fan, W.2    Yu, W.3
  • 14
    • 84881365460 scopus 로고    scopus 로고
    • Holistic data cleaning: Putting violations into context
    • X. Chu, I. F. Ilyas and P. Papotti: Holistic data cleaning: putting violations into context. ICDE 2013: 458-469.
    • (2013) ICDE , pp. 458-469
    • Chu, X.1    Ilyas, I.F.2    Papotti, P.3
  • 15
    • 84891066910 scopus 로고    scopus 로고
    • Discovering denial constraints
    • X. Chu, I.F. Ilyas and P. Papotti: Discovering Denial Constraints. PVLDB 6(13): 1498-1509 (2013).
    • (2013) PVLDB , vol.6 , Issue.13 , pp. 1498-1509
    • Chu, X.1    Ilyas, I.F.2    Papotti, P.3
  • 16
    • 74549188261 scopus 로고    scopus 로고
    • Discovering data quality rules
    • F. Chiang and R. J. Miller: Discovering data quality rules. PVLDB 1(1): 1166-1177 (2008).
    • (2008) PVLDB , vol.1 , Issue.1 , pp. 1166-1177
    • Chiang, F.1    Miller, R.J.2
  • 17
    • 79957823829 scopus 로고    scopus 로고
    • A unified model for data and constraint repair
    • F. Chiang and R. J. Miller: A unified model for data and constraint repair. ICDE 2011: 446-457.
    • (2011) ICDE , pp. 446-457
    • Chiang, F.1    Miller, R.J.2
  • 19
    • 0036366837 scopus 로고    scopus 로고
    • Mining database structure; Or, how to build a data quality browser
    • T. Dasu, T. Johnson, S. Muthukrishnan and V. Shkapenyuk: Mining database structure; or, how to build a data quality browser, SIGMOD 2002: 240-251.
    • (2002) SIGMOD , pp. 240-251
    • Dasu, T.1    Johnson, T.2    Muthukrishnan, S.3    Shkapenyuk, V.4
  • 20
    • 84858614392 scopus 로고    scopus 로고
    • Data quality and the bottom line: Achieving business success through a commitment to high quality data
    • W. W. Eckerson: Data quality and the bottom line: achieving business success through a commitment to high quality data. Data Warehousing Institute, 2002.
    • (2002) Data Warehousing Institute
    • Eckerson, W.W.1
  • 21
    • 79957860515 scopus 로고    scopus 로고
    • Discovery of complex glitch patterns: A novel approach to quantitative data cleaning
    • L. Berti-Equille, T. Dasu and D. Srivastava: Discovery of complex glitch patterns: a novel approach to quantitative data cleaning. ICDE 2011: 733-744.
    • (2011) ICDE , pp. 733-744
    • Berti-Equille, L.1    Dasu, T.2    Srivastava, D.3
  • 22
    • 46649108915 scopus 로고    scopus 로고
    • Repair localization for query answering from inconsistent databases
    • T. Eiter, M. Fink, G. Greco and D. Lembo: Repair localization for query answering from inconsistent databases. TODS 33(2) (2008).
    • (2008) TODS , vol.33 , Issue.2
    • Eiter, T.1    Fink, M.2    Greco, G.3    Lembo, D.4
  • 23
    • 84865627282 scopus 로고    scopus 로고
    • Data quality: Theory and practice
    • W. Fan: Data quality: theory and practice. WAIM 2012: 1-16.
    • (2012) WAIM , pp. 1-16
    • Fan, W.1
  • 24
    • 29844448776 scopus 로고    scopus 로고
    • ConQuer: Efficient management of inconsistent databases
    • A. Fuxman, E. Fazli and R. J. Miller: ConQuer: efficient management of inconsistent databases. SIGMOD 2005: 155-166.
    • (2005) SIGMOD , pp. 155-166
    • Fuxman, A.1    Fazli, E.2    Miller, R.J.3
  • 25
    • 77954691044 scopus 로고    scopus 로고
    • Capturing missing tuples and missing values
    • W. Fan and F. Geerts: Capturing missing tuples and missing values. PODS 2010: 169-178.
    • (2010) PODS , pp. 169-178
    • Fan, W.1    Geerts, F.2
  • 27
    • 77958064296 scopus 로고    scopus 로고
    • Semandaq: A data quality system based on conditional functional dependencies
    • W. Fan, F. Geerts and X. Jia: Semandaq: a data quality system based on conditional functional dependencies. PVLDB 1(2) 1460-1463 (2008).
    • (2008) PVLDB , vol.1 , Issue.2 , pp. 1460-1463
    • Fan, W.1    Geerts, F.2    Jia, X.3
  • 28
    • 84866696440 scopus 로고    scopus 로고
    • A revival of integrity constraints for data cleaning
    • W. Fan, F. Geerts and X. Jia: A revival of integrity constraints for data cleaning. PVLDB 1(2): 1522-1523 (2008).
    • (2008) PVLDB , vol.1 , Issue.2 , pp. 1522-1523
    • Fan, W.1    Geerts, F.2    Jia, X.3
  • 29
    • 33750901630 scopus 로고    scopus 로고
    • Towards correcting input data errors probabilistically using integrity constraints
    • N. Khoussainova, M. Balazinska and D. Suciu: Towards correcting input data errors probabilistically using integrity constraints. MobiDE 2006: 43-50.
    • (2006) MobiDE , pp. 43-50
    • Khoussainova, N.1    Balazinska, M.2    Suciu, D.3
  • 30
    • 46649106686 scopus 로고    scopus 로고
    • Conditional functional dependencies for capturing data inconsistencies
    • W. Fan, F. Geerts, X. Jia and A. Kementsietsidis: Conditional functional dependencies for capturing data inconsistencies. ACM TODS 33(2) (2008).
    • (2008) ACM TODS , vol.33 , Issue.2
    • Fan, W.1    Geerts, F.2    Jia, X.3    Kementsietsidis, A.4
  • 31
    • 79953230060 scopus 로고    scopus 로고
    • Discovering conditional functional dependencies
    • W. Fan, F. Geerts, J. Li and M. Xiong: Discovering conditional functional dependencies. IEEE Trans. Knowl. Data Eng. 23(5): 683-698 (2011).
    • (2011) IEEE Trans. Knowl. Data Eng. , vol.23 , Issue.5 , pp. 683-698
    • Fan, W.1    Geerts, F.2    Li, J.3    Xiong, M.4
  • 32
    • 77952749687 scopus 로고    scopus 로고
    • Detecting inconsistencies in distributed data
    • W. Fan, F. Geerts, S. Ma and H. Müller: Detecting inconsistencies in distributed data: ICDE 2010: 64-75.
    • (2010) ICDE , pp. 64-75
    • Fan, W.1    Geerts, F.2    Ma, S.3    Müller, H.4
  • 33
    • 84881326725 scopus 로고    scopus 로고
    • Inferring data currency and consistency for conflict resolution
    • W. Fan, F. Geerts, N. Tang and W. Yu: Inferring data currency and consistency for conflict resolution. ICDE 2013: 470-481.
    • (2013) ICDE , pp. 470-481
    • Fan, W.1    Geerts, F.2    Tang, N.3    Yu, W.4
  • 35
    • 84858615261 scopus 로고    scopus 로고
    • Towards certain xes with editing rules and master data
    • W. Fan, J. Li, S. Ma, N. Tang and W. Yu: Towards certain xes with editing rules and master data. PVLDB 3(1) 173184 (2010).
    • (2010) PVLDB , vol.3 , Issue.1 , pp. 173184
    • Fan, W.1    Li, J.2    Ma, S.3    Tang, N.4    Yu, W.5
  • 36
    • 79959944062 scopus 로고    scopus 로고
    • Interaction between record matching and data repairing
    • W. Fan, J. Li, S. Ma, N. Tang and W. Yu: Interaction between record matching and data repairing. SIGMOD 2011: 469-480.
    • (2011) SIGMOD , pp. 469-480
    • Fan, W.1    Li, J.2    Ma, S.3    Tang, N.4    Yu, W.5
  • 37
    • 84863765052 scopus 로고    scopus 로고
    • CerFix: A system for cleaning data with certain fixes
    • W. Fan, J. Li, S. Ma, N. Tang and W. Yu: CerFix: a system for cleaning data with certain fixes. PVLDB 4(12) 1375-1378 (2011).
    • (2011) PVLDB , vol.4 , Issue.12 , pp. 1375-1378
    • Fan, W.1    Li, J.2    Ma, S.3    Tang, N.4    Yu, W.5
  • 38
    • 84864198280 scopus 로고    scopus 로고
    • Incremental detection of inconsistencies in distributed data
    • W. Fan, J. Li, N. Tang and W. Yu: Incremental detection of inconsistencies in distributed data. ICDE 2012: 318-329.
    • (2012) ICDE , pp. 318-329
    • Fan, W.1    Li, J.2    Tang, N.3    Yu, W.4
  • 40
    • 0344756845 scopus 로고    scopus 로고
    • Declarative data cleaning: Language, model, and algorithms
    • H. Galhardas, D. Florescu, D. Shasha, E. Simon and C. Saita: Declarative data cleaning: language, model, and algorithms. VLDB 2001: 371-380.
    • (2001) VLDB , pp. 371-380
    • Galhardas, H.1    Florescu, D.2    Shasha, D.3    Simon, E.4    Saita, C.5
  • 42
    • 84858397744 scopus 로고    scopus 로고
    • Data auditor: Exploring data quality and semantics using pattern tableaux
    • L. Golab, H. Karloff, F. Korn and D. Srivastava: Data Auditor: exploring data quality and semantics using pattern tableaux. PVLDB 3(2) 1641-1644 (2010).
    • (2010) PVLDB , vol.3 , Issue.2 , pp. 1641-1644
    • Golab, L.1    Karloff, H.2    Korn, F.3    Srivastava, D.4
  • 44
    • 70349846180 scopus 로고    scopus 로고
    • On generating near-optimal tableaux for conditional functional dependencies
    • L. Golab, H. J. Karloff, F. Korn, D. Srivastava and B. Yu: On generating near-optimal tableaux for conditional functional dependencies. PVLDB 1(1): 376-390 (2008).
    • (2008) PVLDB , vol.1 , Issue.1 , pp. 376-390
    • Golab, L.1    Karloff, H.J.2    Korn, F.3    Srivastava, D.4    Yu, B.5
  • 46
    • 84871079115 scopus 로고    scopus 로고
    • Efficient and effective analysis of data quality using pattern tableaux
    • L. Golab, F. Korn and D. Srivastava: Efficient and effective analysis of data quality using pattern tableaux. IEEE Data Eng. Bull. 2011: 26-33.
    • (2011) IEEE Data Eng. Bull. , pp. 26-33
    • Golab, L.1    Korn, F.2    Srivastava, D.3
  • 47
    • 83055166031 scopus 로고    scopus 로고
    • The quality of the xml web
    • S. Grijzenhout and M. Marx: The quality of the XML web. CIKM 2011: 1719-1724.
    • (2011) CIKM , pp. 1719-1724
    • Grijzenhout, S.1    Marx, M.2
  • 48
    • 84882696854 scopus 로고    scopus 로고
    • The llunatic data-cleaning framework
    • F. Geerts, G. Mecca, P. Papotti, and D. Santoro: The LLUNATIC data-cleaning framework, PVLDB 6(9): 625-636 (2013).
    • (2013) PVLDB , vol.6 , Issue.9 , pp. 625-636
    • Geerts, F.1    Mecca, G.2    Papotti, P.3    Santoro, D.4
  • 49
    • 84894556628 scopus 로고    scopus 로고
    • Quantitative data cleaning for large databases
    • J. Hellerstein: Quantitative data cleaning for large databases. UNECE 2008.
    • (2008) UNECE
    • Hellerstein, J.1
  • 51
    • 3142708793 scopus 로고    scopus 로고
    • CORDS: Automatic discovery of correlations and soft functional dependencies
    • I. F. Ilyas, V. Markl, P. J. Haas, P. Brown and A. Aboulnaga: CORDS: automatic discovery of correlations and soft functional dependencies. SIGMOD 2004: 647-658.
    • (2004) SIGMOD , pp. 647-658
    • Ilyas, I.F.1    Markl, V.2    Haas, P.J.3    Brown, P.4    Aboulnaga, A.5
  • 52
    • 84863578888 scopus 로고    scopus 로고
    • Profiler: Integrated statistical analysis and visualization for data quality assessment
    • S. Kandel, R. Parikh, A. Paepcke, J. M. Hellerstein and J. Heer: Profiler: integrated statistical analysis and visualization for data quality assessment. AVI 2012: 547-554.
    • (2012) AVI , pp. 547-554
    • Kandel, S.1    Parikh, R.2    Paepcke, A.3    Hellerstein, J.M.4    Heer, J.5
  • 53
    • 84882637776 scopus 로고    scopus 로고
    • On repairing structural problems in semi-structured data
    • F. Korn, B. Saha, D. Srivastava and S. Ying: On repairing structural problems in semi-structured Data. PVLDB 6(9): 601-612 (2013).
    • (2013) PVLDB , vol.6 , Issue.9 , pp. 601-612
    • Korn, F.1    Saha, B.2    Srivastava, D.3    Ying, S.4
  • 54
    • 77954753210 scopus 로고    scopus 로고
    • Recognizing well-parenthesized expressions in the streaming model
    • F. Magniez, C. Mathieu and A. Nayak: Recognizing well-parenthesized expressions in the streaming model. STOC 2010: 261-270.
    • (2010) STOC , pp. 261-270
    • Magniez, F.1    Mathieu, C.2    Nayak, A.3
  • 55
    • 84871048809 scopus 로고    scopus 로고
    • Completeness of queries over sql databases
    • W. Nutt and S. Razniewski: Completeness of queries over SQL databases, CIKM 2012: 902-911.
    • (2012) CIKM , pp. 902-911
    • Nutt, W.1    Razniewski, S.2
  • 56
    • 84901780490 scopus 로고    scopus 로고
    • Incomplete databases: Missing records and missing values
    • W. Nutt, S. Razniewski and G. Vegliach: Incomplete databases: missing records and missing values. DASFAA Workshops 2012: 298-310.
    • (2012) DASFAA Workshops , pp. 298-310
    • Nutt, W.1    Razniewski, S.2    Vegliach, G.3
  • 57
    • 0031988304 scopus 로고    scopus 로고
    • The impact of poor data quality on the typical enterprise
    • T. Redman: The impact of poor data quality on the typical enterprise. Commun. ACM 1998.
    • (1998) Commun. ACM
    • Redman, T.1
  • 58
    • 84944315993 scopus 로고    scopus 로고
    • Potter's wheel: An interactive data cleaning system
    • V. Raman and J. Hellerstein: Potter's Wheel: an interactive data cleaning system. VLDB 2001: 381-390.
    • (2001) VLDB , pp. 381-390
    • Raman, V.1    Hellerstein, J.2
  • 59
    • 84863773454 scopus 로고    scopus 로고
    • Completeness of queries over incomplete databases
    • S. Razniewski and W. Nutt: Completeness of queries over incomplete databases. PVLDB 4(11): 749-760 (2011).
    • (2011) PVLDB , vol.4 , Issue.11 , pp. 749-760
    • Razniewski, S.1    Nutt, W.2
  • 60
    • 33644539174 scopus 로고    scopus 로고
    • Finding an optimum edit script between an xml document and a dtd
    • N. Suzuki: Finding an optimum edit script between an XML document and a DTD. SAC 2005: 647-653.
    • (2005) SAC , pp. 647-653
    • Suzuki, N.1
  • 62
    • 74549201636 scopus 로고    scopus 로고
    • Discovering matching dependencies
    • S. Song and L. Chen: Discovering matching dependencies. CIKM 2009: 1421-1424.
    • (2009) CIKM , pp. 1421-1424
    • Song, S.1    Chen, L.2
  • 63
    • 84901764650 scopus 로고    scopus 로고
    • Handling the four 'v's of big data: Volume, velocity, variety, and veracity
    • J. Tee: Handling the four 'V's of big data: volume, velocity, variety, and veracity. TheServerSide.com 2013.
    • (2013) TheServerSide.com
    • Tee, J.1
  • 65
    • 84865032011 scopus 로고    scopus 로고
    • Constant-memory validation of streaming xml documents against dtds
    • L. Segoufin and C. Sirangelo: Constant-memory validation of streaming XML documents against DTDs. ICDT 2007: 299-313.
    • (2007) ICDT , pp. 299-313
    • Segoufin, L.1    Sirangelo, C.2
  • 66
    • 0036036851 scopus 로고    scopus 로고
    • Validating streaming xml documents
    • L. Segoufin and V. Vianu: Validating streaming XML documents. PODS 2002: 53-64.
    • (2002) PODS , pp. 53-64
    • Segoufin, L.1    Vianu, V.2
  • 67
    • 84880515658 scopus 로고    scopus 로고
    • Don't be scared: Use scalable automatic repairing with maximal likelihood and bounded changes
    • M. Yakout, L. Berti-Equille and A. K. Elmagarmid: Don't be SCAREd: use SCalable Automatic REpairing with maximal likelihood and bounded changes. SIGMOD 2013: 553-564.
    • (2013) SIGMOD , pp. 553-564
    • Yakout, M.1    Berti-Equille, L.2    Elmagarmid, A.K.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.