Volume , Issue , 2015, Pages 2654-2660

Data quality issues in big data

Author keywords

big data; biological data; Data quality; information quality

Indexed keywords

DATA INTEGRATION; DATA REDUCTION;

EID: 84963755775     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/BigData.2015.7364065     Document Type: Conference Paper
Times cited : (27)

References (39)
  • 1
    • 84963765837
    • Garbage in, garbage out
    • accessed: 13 September
    • M. Quinion, "Garbage in, garbage out," World Wide Words, http://www.worldwidewords.org/qa/qagar1.htm, accessed: 13 September 2015.
    • (2015) World Wide Words
    • Quinion, M.1
  • 3
    • 84926153367
    • Big data: Promises and problems
    • Mar.
    • V. Gudivada, R. Baeza-Yates, and V. Raghavan, "Big data: Promises and problems," IEEE Computer, vol. 48, no. 3, pp. 20-23, Mar. 2015.
    • (2015) IEEE Computer , vol.48 , Issue.3 , pp. 20-23
    • Gudivada, V.1    Baeza-Yates, R.2    Raghavan, V.3
  • 6
    • 84873437606
    • 20 years of data quality research: Themes, trends and synergies
    • Darlinghurst, Australia, Australia: Australian Computer Society, Inc.
    • S. Sadiq, N. K. Yeganeh, and M. Indulska, "20 years of data quality research: Themes, trends and synergies," in Proceedings of the Twenty-Second Australasian Database Conference-Volume 115. Darlinghurst, Australia, Australia: Australian Computer Society, Inc., 2011, pp. 153-162.
    • (2011) Proceedings of the Twenty-Second Australasian Database Conference , vol.115 , pp. 153-162
    • Sadiq, S.1    Yeganeh, N.K.2    Indulska, M.3
  • 7
    • 84871564567
    • Research into information quality: A study of the state of the art in IQ and its consolidation
    • MIT, Cambridge, MA, USA, November 10-12, 2006
    • L. F. R. Lima, A. C. G. Macada, and L. M. Vargas, "Research into information quality: A study of the state of the art in IQ and its consolidation," in Proceedings of the 11th International Conference on Information Quality, MIT, Cambridge, MA, USA, November 10-12, 2006, pp. 146-158.
    • (2006) Proceedings of the 11th International Conference on Information Quality , pp. 146-158
    • Lima, L.F.R.1    Macada, A.C.G.2    Vargas, L.M.3
  • 8
    • 78049459123
    • Overview and framework for data and information quality research
    • Jun.
    • S. E. Madnick, R. Y. Wang et al., "Overview and framework for data and information quality research," J. Data and Information Quality, vol. 1, no. 1, pp. 2:1-2:22, Jun. 2009.
    • (2009) J. Data and Information Quality , vol.1 , Issue.1 , pp. 2:1-2:22
    • Madnick, S.E.1    Wang, R.Y.2
  • 9
    • 77950474555
    • Incorporating quality aspects in sensor data streams
    • New York, NY, USA: ACM
    • A. Klein, "Incorporating quality aspects in sensor data streams," in Proceedings of the ACM First Ph.D. Workshop in CIKM. New York, NY, USA: ACM, 2007, pp. 77-84.
    • (2007) Proceedings of the ACM First Ph.D. Workshop in CIKM , pp. 77-84
    • Klein, A.1
  • 10
    • 78651567442
    • Incorporating domain-specific information quality constraints into database queries
    • Sep.
    • S. M. Embury, P. Missier et al., "Incorporating domain-specific information quality constraints into database queries," J. Data and Information Quality, vol. 1, no. 2, pp. 11:1-11:31, Sep. 2009.
    • (2009) J. Data and Information Quality , vol.1 , Issue.2 , pp. 11:1-11:31
    • Embury, S.M.1    Missier, P.2
  • 12
    • 85024248734
    • Utility-driven assessment of data quality
    • May
    • A. Even and G. Shankaranarayanan, "Utility-driven assessment of data quality," SIGMIS Database, vol. 38, no. 2, pp. 75-93, May 2007.
    • (2007) SIGMIS Database , vol.38 , Issue.2 , pp. 75-93
    • Even, A.1    Shankaranarayanan, G.2
  • 13
    • 77957834030
    • Dual assessment of data quality in customer databases
    • Dec.
    • A. Even and G. Shankaranarayanan, "Dual assessment of data quality in customer databases," J. Data and Information Quality, vol. 1, no. 3, pp. 15:1-15:29, Dec. 2009.
    • (2009) J. Data and Information Quality , vol.1 , Issue.3 , pp. 15:1-15:29
    • Even, A.1    Shankaranarayanan, G.2
  • 14
    • 84907021481
    • Process-driven data quality management: A critical review on the application of process modeling languages
    • Sep.
    • P. Glowalla and A. Sunyaev, "Process-driven data quality management: A critical review on the application of process modeling languages," J. Data and Information Quality, vol. 5, no. 1-2, pp. 7:1-7:30, Sep. 2014.
    • (2014) J. Data and Information Quality , vol.5 , Issue.1-2 , pp. 7:1-7:30
    • Glowalla, P.1    Sunyaev, A.2
  • 16
    • 67649292737
    • Unlocking the secrets of the genome
    • 06
    • S. E. Celniker, L. A. L. Dillon et al., "Unlocking the secrets of the genome," Nature, vol. 459, no. 7249, pp. 927-930, 06 2009.
    • (2009) Nature , vol.459 , Issue.7249 , pp. 927-930
    • Celniker, S.E.1    Dillon, L.A.L.2
  • 18
    • 84871812701
    • Data integration in bioinformatics: Current efforts and challenges
    • M. A. Mahdavi, Ed. InTech
    • Z. Zhang, V. B. Bajic et al., "Data integration in bioinformatics: Current efforts and challenges," in Bioinformatics-Trends and Methodologies, M. A. Mahdavi, Ed. InTech, 2011.
    • (2011) Bioinformatics-Trends and Methodologies
    • Zhang, Z.1    Bajic, V.B.2
  • 20
    • 84891789990
    • The 2014 Nucleic Acids Research database issue and an updated NAR online molecular biology database collection
    • X. M. Fernández-Suárez, D. J. Rigden, and M. Y. Galperin, "The 2014 Nucleic Acids Research database issue and an updated NAR online molecular biology database collection," Nucleic Acids Research, vol. 42, no. D1, pp. D1-D6, 2014.
    • (2014) Nucleic Acids Research , vol.42 , Issue.D1 , pp. D1-D6
    • Fernández-Suárez, X.M.1    Rigden, D.J.2    Galperin, M.Y.3
  • 21
    • 84925614976
    • Biological databases for human research
    • D. Zou, L. Ma et al., "Biological databases for human research," Genomics, Proteomics & Bioinformatics, vol. 13, no. 1, pp. 55-63, 2015.
    • (2015) Genomics, Proteomics & Bioinformatics , vol.13 , Issue.1 , pp. 55-63
    • Zou, D.1    Ma, L.2
  • 22
    • 77956664073
    • The biopax community standard for pathway data sharing
    • 09
    • E. Demir, M. P. Cary et al., "The biopax community standard for pathway data sharing," Nat Biotech, vol. 28, no. 9, pp. 935-942, 09 2010.
    • (2010) Nat Biotech , vol.28 , Issue.9 , pp. 935-942
    • Demir, E.1    Cary, M.P.2
  • 23
    • 84859350703
    • The dbcls biohackathon: Standardization and interoperability for bioinformatics web services and workflows
    • T. Katayama, K. Arakawa et al., "The dbcls biohackathon: standardization and interoperability for bioinformatics web services and workflows," Journal of Biomedical Semantics, vol. 1, no. 1, p. 8, 2010.
    • (2010) Journal of Biomedical Semantics , vol.1 , Issue.1 , pp. 8
    • Katayama, T.1    Arakawa, K.2
  • 24
    • 38049144316
    • Biomedical ontologies: A functional perspective
    • D. L. Rubin, N. H. Shah, and N. F. Noy, "Biomedical ontologies: a functional perspective," Briefings in Bioinformatics, vol. 9, no. 1, pp. 75-90, 2008.
    • (2008) Briefings in Bioinformatics , vol.9 , Issue.1 , pp. 75-90
    • Rubin, D.L.1    Shah, N.H.2    Noy, N.F.3
  • 25
    • 67849128700
    • Bioportal: Ontologies and integrated data resources at the click of a mouse
    • N. F. Noy, N. H. Shah et al., "Bioportal: ontologies and integrated data resources at the click of a mouse," Nucleic Acids Research, vol. 37, no. suppl 2, pp. W170-W173, 2009.
    • (2009) Nucleic Acids Research , vol.37 , Issue.SUPPL. 2 , pp. W170-W173
    • Noy, N.F.1    Shah, N.H.2
  • 26
    • 0034069495
    • Gene ontology: Tool for the unification of biology
    • 05
    • M. Ashburner, C. A. Ball et al., "Gene ontology: tool for the unification of biology," Nat Genet, vol. 25, no. 1, pp. 25-29, 05 2000.
    • (2000) Nat Genet , vol.25 , Issue.1 , pp. 25-29
    • Ashburner, M.1    Ball, C.A.2
  • 27
    • 22044453632
    • Atlas-A data warehouse for integrative bioinformatics
    • S. Shah, Y. Huang et al., "Atlas-A data warehouse for integrative bioinformatics," BMC Bioinformatics, vol. 6, no. 1, p. 34, 2005.
    • (2005) BMC Bioinformatics , vol.6 , Issue.1 , pp. 34
    • Shah, S.1    Huang, Y.2
  • 28
    • 33645958298
    • Biowarehouse: A bioinformatics database warehouse toolkit
    • T. Lee, Y. Pouliot et al., "Biowarehouse: a bioinformatics database warehouse toolkit," BMC Bioinformatics, vol. 7, no. 1, p. 170, 2006.
    • (2006) BMC Bioinformatics , vol.7 , Issue.1 , pp. 170
    • Lee, T.1    Pouliot, Y.2
  • 29
    • 43349101291
    • Interoperability with moby 1.0-it is better than sharing your toothbrush!
    • M. Wilkinson, M. Senger et al., "Interoperability with moby 1.0-it is better than sharing your toothbrush!" Brief Bioinform, vol. 9, pp. 220-231, 2008.
    • (2008) Brief Bioinform , vol.9 , pp. 220-231
    • Wilkinson, M.1    Senger, M.2
  • 30
    • 21044459901
    • Evolution of web services in bioinformatics
    • P. Neerincx and J. Leunissen, "Evolution of web services in bioinformatics," Briefings in Bioinformatics, vol. 6, no. 2, pp. 178-188, 2005. [Online]. Available: http://bib.oxfordjournals.org/content/6/2/178.abstract
    • (2005) Briefings in Bioinformatics , vol.6 , Issue.2 , pp. 178-188
    • Neerincx, P.1    Leunissen, J.2
  • 31
    • 67449132007
    • Biological knowledge management: The emerging role of the semantic web technologies
    • E. Antezana, M. Kuiper, and V. Mironov, "Biological knowledge management: the emerging role of the semantic web technologies," Briefings in Bioinformatics, vol. 10, no. 4, pp. 392-407, 2009.
    • (2009) Briefings in Bioinformatics , vol.10 , Issue.4 , pp. 392-407
    • Antezana, E.1    Kuiper, M.2    Mironov, V.3
  • 32
    • 52949143458
    • Semiautomatic web service composition for the life sciences using the biomoby semantic web framework
    • M. DiBernardo, R. Pottinger, and M. Wilkinson, "Semiautomatic web service composition for the life sciences using the biomoby semantic web framework," Journal of Biomedical Informatics, vol. 41, no. 5, pp. 837-847, 2008.
    • (2008) Journal of Biomedical Informatics , vol.41 , Issue.5 , pp. 837-847
    • DiBernardo, M.1    Pottinger, R.2    Wilkinson, M.3
  • 33
    • 48249158190
    • Bio2rdf: Towards a mashup to build bioinformatics knowledge systems
    • Oct.
    • F. Belleau, M.-A. Nolin et al., "Bio2rdf: Towards a mashup to build bioinformatics knowledge systems," J. of Biomedical Informatics, vol. 41, no. 5, pp. 706-716, Oct. 2008.
    • (2008) J. of Biomedical Informatics , vol.41 , Issue.5 , pp. 706-716
    • Belleau, F.1    Nolin, M.-A.2
  • 34
    • 48749128658
    • Hcls 2.0/3.0: Health care and life sciences data mashup using web 2.0/3.0
    • 10
    • K.-H. Cheung, K. Y. Yip et al., "Hcls 2.0/3.0: Health care and life sciences data mashup using web 2.0/3.0," Journal of biomedical informatics, vol. 41, no. 5, pp. 694-705, 10 2008.
    • (2008) Journal of Biomedical Informatics , vol.41 , Issue.5 , pp. 694-705
    • Cheung, K.-H.1    Yip, K.Y.2
  • 35
    • 34249860439
    • Advancing translational research with the semantic web
    • A. Ruttenberg, T. Clark et al., "Advancing translational research with the semantic web," BMC Bioinformatics, vol. 8, no. Suppl 3, p. S2, 2007.
    • (2007) BMC Bioinformatics , vol.8 , Issue.SUPPL. 3 , pp. S2
    • Ruttenberg, A.1    Clark, T.2
  • 36
    • 33847132279
    • Key biology databases go wiki
    • 02
    • J. Giles, "Key biology databases go wiki," Nature, vol. 445, no. 7129, pp. 691-691, 02 2007.
    • (2007) Nature , vol.445 , Issue.7129 , pp. 691
    • Giles, J.1
  • 37
    • 52949139763
    • Big data: Open-source format needed to aid wiki collaboration
    • 09
    • T.-L. Lee, "Big data: open-source format needed to aid wiki collaboration," Nature, vol. 455, no. 7212, pp. 461-461, 09 2008.
    • (2008) Nature , vol.455 , Issue.7212 , pp. 461
    • Lee, T.-L.1
  • 38
    • 84869438587
    • Improving the data quality of drug databases using conditional dependencies and ontologies
    • Oct.
    • O. Cure, "Improving the data quality of drug databases using conditional dependencies and ontologies," J. Data and Information Quality, vol. 4, no. 1, pp. 3:1-3:21, Oct. 2012.
    • (2012) J. Data and Information Quality , vol.4 , Issue.1 , pp. 3:1-3:21
    • Cure, O.1
  • 39
    • 84869403973
    • Creating a general (family) practice epidemiological database in Ireland-data quality issue management
    • Oct.
    • C. Collins and K. Janssens, "Creating a general (family) practice epidemiological database in Ireland-data quality issue management," J. Data and Information Quality, vol. 4, no. 1, pp. 2:1-2:9, Oct. 2012.
    • (2012) J. Data and Information Quality , vol.4 , Issue.1 , pp. 2:1-2:9
    • Collins, C.1    Janssens, K.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.