메뉴 건너뛰기




Volumn 2, Issue 4, 2012, Pages

Improving data quality by source analysis

Author keywords

Conflict resolution; Data cleaning; Quality assessment; Semantic distance measure

Indexed keywords

ALTERNATIVE APPROACH; CONFLICT RESOLUTION; DATA CLEANING; DATA PATTERNS; DATA PROPERTIES; DATA QUALITY; DATA SETS; DATA SOURCE; DATA VALUES; EXPERT USERS; HIGH QUALITY; HOTSPOTS; INTEGRATED SYSTEMS; INTEGRITY CONSTRAINTS; NP COMPLETE; OVERLAPPING DATA; OVERLAPPING PATTERNS; POTENTIAL DEPENDENCY; QUALITY ASSESSMENT; QUALITY OF DATA; RULE MINING ALGORITHMS; SEMANTIC DISTANCE MEASURE; SOURCE ANALYSIS;

EID: 84859023521     PISSN: 19361955     EISSN: 19361963     Source Type: Journal    
DOI: 10.1145/2107536.2107538     Document Type: Article
Times cited : (11)

References (63)
  • 4
    • 34547844156 scopus 로고    scopus 로고
    • Manual curation is not sufficient for annotation of genomic databases
    • DOI 10.1093/bioinformatics/btm229
    • BAUMGARTNER, W. A., COHEN, K. B., FOX, L. M., ACQUAAH-MENSAH, G., AND HUNTER, L. 2007. Manual curation is not sufficient for annotation of genomic databases. Bioinformatics 23, 13, i41-i48. (Pubitemid 47244384)
    • (2007) Bioinformatics , vol.23 , Issue.13
    • Baumgartner Jr., W.A.1    Cohen, K.B.2    Fox, L.M.3    Acquaah-Mensah, G.4    Hunter, L.5
  • 5
    • 23044527560 scopus 로고    scopus 로고
    • Detecting group differences: Mining contrast sets
    • BAY, S. D. AND PAZZANI, M. J. 2001. Detecting group differences: Mining contrast sets. Data Min. Knowl. Discov. 5, 3, 213-246.
    • (2001) Data Min. Knowl. Discov , vol.5 , Issue.3 , pp. 213-246
    • Bay, S.D.1    Pazzani, M.J.2
  • 13
    • 0033119399 scopus 로고    scopus 로고
    • Errors in genome annotation
    • DOI 10.1016/S0168-9525(99)01706-0, PII S0168952599017060
    • BRENNER, S. E. 1999. Errors in genome annotation. Trends Genet. 15, 4, 132-133. (Pubitemid 29155430)
    • (1999) Trends in Genetics , vol.15 , Issue.4 , pp. 132-133
    • Brenner, S.E.1
  • 16
    • 0032946483 scopus 로고    scopus 로고
    • Molecular biology database list
    • DOI 10.1093/nar/27.1.1
    • BURKS, C. 1999. Molecular biology database list. Nucl. Acids Res. 27, 1, 1-9. (Pubitemid 29209390)
    • (1999) Nucleic Acids Research , vol.27 , Issue.1 , pp. 1-9
    • Burks, C.1
  • 18
    • 74549188261 scopus 로고    scopus 로고
    • Discovering data quality rules
    • CHIANG, F. AND MILLER, R. J. 2008. Discovering data quality rules. Proc. VLDB Endow. 1, 1, 1166-1177.
    • (2008) Proc. VLDB Endow , vol.1 , Issue.1 , pp. 1166-1177
    • Chiang, F.1    Miller, R.J.2
  • 19
    • 14744293228 scopus 로고    scopus 로고
    • Minimal-change integrity maintenance using tuple deletions
    • DOI 10.1016/j.ic.2004.04.007, PII S0890540105000179
    • CHOMICKI, J. AND MARCINKOWSKI, J. 2005. Minimal-change integrity maintenance using tuple deletions. Inf. Comput. 197, 1/2, 90-121. (Pubitemid 40330051)
    • (2005) Information and Computation , vol.197 , Issue.1-2 , pp. 90-121
    • Chomicki, J.1    Marcinkowski, J.2
  • 23
    • 77954322933 scopus 로고    scopus 로고
    • Integrating conflicting data:The role of source dependence
    • DONG, X. L., BERTI-EQUILLE, L., AND SRIVASTAVA, D. 2009. Integrating conflicting data: The role of source dependence. Proc. VLDB Endow. 2, 1, 550-561.
    • (2009) Proc. VLDB Endow , vol.2 , Issue.1 , pp. 550-561
    • Dong, X.L.1    Berti-Equille, L.2    Srivastava, D.3
  • 26
    • 46649106686 scopus 로고    scopus 로고
    • Conditional functional dependencies for capturing data inconsistencies
    • FAN, W., GEERTS, F., JIA, X., AND KEMENTSIETSIDIS, A. 2008. Conditional functional dependencies for capturing data inconsistencies. ACM Trans. Datab. Syst. 33, 2.
    • (2008) ACM Trans. Datab. Syst , vol.33 , Issue.2
    • Fan, W.1    Geerts, F.2    Jia, X.3    Kementsietsidis, A.4
  • 28
    • 0035545935 scopus 로고    scopus 로고
    • Discovering and reconciling value conflicts for numerical data integration
    • DOI 10.1016/S0306-4379(01)00043-6, Data Extraction, Cleaning and Reconciliation
    • FAN, W.,LU, H.,MADNICK, S. E., AND CHEUNG, D. 2001. Discovering and reconciling value conflicts for numerical data integration. Inf. Syst. 26, 8, 635-656. (Pubitemid 33046274)
    • (2001) Information Systems , vol.26 , Issue.8 , pp. 635-656
    • Fan, W.1    Lu, H.2    Madnick, S.E.3    Cheung, D.4
  • 31
    • 58149185138 scopus 로고    scopus 로고
    • Nucleic acids research annual database issue and the NAR online molecular biology database collection in 2009
    • GALPERIN, M. Y. AND COCHRANE, G. R. 2009. Nucleic acids research annual database issue and the NAR online molecular biology database collection in 2009. Nucl. Acids Res. 37, suppl1, D1-4.
    • (2009) Nucl. Acids Res , vol.37 , Issue.SUPPL.1
    • Galperin, M.Y.1    Cochrane, G.R.2
  • 32
    • 70349846180 scopus 로고    scopus 로고
    • On generating near-optimal tableaux for conditional functional dependencies
    • GOLAB, L., KARLOFF, H., KORN, F., SRIVASTAVA, D., AND YU, B. 2008. On generating near-optimal tableaux for conditional functional dependencies. Proc. VLDB Endow. 1, 1, 376-390.
    • (2008) Proc. VLDB Endow , vol.1 , Issue.1 , pp. 376-390
    • Golab, L.1    Karloff, H.2    Korn, F.3    Srivastava, D.4    Yu, B.5
  • 33
    • 84943817322 scopus 로고
    • Error detecting and error correcting codes
    • HAMMING, R. 1950. Error detecting and error correcting codes. Bell Syst. Techn. J. 26, 2, 147-160.
    • (1950) Bell Syst.Techn.J , vol.26 , Issue.2 , pp. 147-160
    • Hamming, R.1
  • 34
    • 5444231422 scopus 로고    scopus 로고
    • Integration of biological sources:Current systems and challenges ahead
    • HERNANDEZ, T. AND KAMBHAMPATI, S. 2004. Integration of biological sources: current systems and challenges ahead. SIGMOD Rec. 33, 3, 51-60.
    • (2004) SIGMOD Rec , vol.33 , Issue.3 , pp. 51-60
    • Hernandez, T.1    Kambhampati, S.2
  • 36
    • 7244245762 scopus 로고    scopus 로고
    • Finishing the euchromatic sequence of the human genome
    • International Human Genome Sequencing Consortium. 7011
    • INTERNATIONAL HUMAN GENOME SEQUENCING CONSORTIUM. 2004. Finishing the euchromatic sequence of the human genome. Nature 431, 7011, 931-945.
    • (2004) Nature , vol.431 , pp. 931-945
  • 38
    • 0001173632 scopus 로고
    • An automatic method of solving discrete programming problems
    • LAND, A. H. AND DOIG, A. G. 1960. An automatic method of solving discrete programming problems. Econometrica 28, 3, 497-520.
    • (1960) Econometrica , vol.28 , Issue.3 , pp. 497-520
    • Land, A.H.1    Doig, A.G.2
  • 40
    • 0001116877 scopus 로고
    • Binary codes capable of correcting deletions, insertions, and reversals
    • LEVENSHTEIN, V. I. 1966. Binary codes capable of correcting deletions, insertions, and reversals. Soviet Physics Doklady 10, 8, 707-710.
    • (1966) Soviet Physics Doklady , vol.10 , Issue.8 , pp. 707-710
    • Levenshtein, V.I.1
  • 50
    • 0035657983 scopus 로고    scopus 로고
    • A survey of approaches to automatic schema matching
    • DOI 10.1007/s007780100057
    • RAHM, E. AND BERNSTEIN, P. A. 2001. A survey of approaches to automatic schema matching. VLDB J. 10, 4, 334-350. (Pubitemid 33570972)
    • (2001) VLDB Journal , vol.10 , Issue.4 , pp. 334-350
    • Rahm, E.1    Bernstein, P.A.2
  • 51
    • 0002490026 scopus 로고    scopus 로고
    • Data cleaning:Problems and current approaches
    • RAHM, E. AND DO, H. H. 2000. Data cleaning: Problems and current approaches. IEEE Data Engin. Bull. 23, 4, 3-13.
    • (2000) IEEE Data Engin. Bull , vol.23 , Issue.4 , pp. 3-13
    • Rahm, E.1    Do, H.H.2
  • 53
    • 0038745614 scopus 로고    scopus 로고
    • Integrating biological databases
    • DOI 10.1038/nrg1065
    • STEIN, L. D. 2003. Integrating biological databases. Nat. Rev. Genet. 4, 5, 337-345. (Pubitemid 36538358)
    • (2003) Nature Reviews Genetics , vol.4 , Issue.5 , pp. 337-345
    • Stein, L.D.1
  • 54
    • 25444452664 scopus 로고    scopus 로고
    • Columba:An integrated database of proteins, structures, and annotations
    • TRISSL, S., ROTHER, K., MÜLLER, H., STEINKE, T., AND ET AL., I. K. 2005. Columba: An integrated database of proteins, structures, and annotations. BMC Bioinformatics 6, 81.
    • (2005) BMC Bioinformatics , vol.6 , pp. 81
    • Trissl, S.1    Rother, K.2    Müller, H.3    Steinke, T.4    Et, Al.I.K.5
  • 63
    • 0028762437 scopus 로고
    • The reference library system sharing biological material and experimental data
    • ZEHETNER, G. AND LEHRACH, H. 1994. The reference library system sharing biological material and experimental data. Nature 367, 489-491.
    • (1994) Nature , vol.367 , pp. 489-491
    • Zehetner, G.1    Lehrach, H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.