Volume 9, Issue 3, 2014, Pages 315-326

Trends in genome compression

Author keywords

Genome compression; Read compression; Survey

Indexed keywords

ARTICLE; COST; DATA PROCESSING; GENE SEQUENCE; GENOME; HIGH THROUGHPUT SEQUENCING; HUMAN GENOME; PRIORITY JOURNAL; ALGORITHM; BOOK; COMPRESSION; REFERENCE DATABASE; SINGLE NUCLEOTIDE POLYMORPHISM; STATISTICAL CONCEPTS;

EID: 84904764719     PISSN: 15748936     EISSN: None     Source Type: Journal    
DOI: 10.2174/1574893609666140516010143     Document Type: Article
Times cited : (46)

References (71)
  • 1
    • 79951493627
    • On the future of genomic data
    • Kahn SD. On the future of genomic data. Science 2011; 331(6018): 728-729.
    • (2011) Science , vol.331 , Issue.6018 , pp. 728-729
    • Kahn, S.D.1
  • 2
    • 77952586314
    • The 1000 Genomes Project: New opportunities for research and social challenges
    • doi: 10.1186/gm124
    • Via M, Gignoux C, Burchard EG. The 1000 Genomes Project: new opportunities for research and social challenges. Genome Med 2010; 2(1): 3. doi: 10.1186/gm124.
    • (2010) Genome Med , vol.2 , Issue.1 , pp. 3
    • Via, M.1    Gignoux, C.2    Burchard, E.G.3
  • 3
    • 77951115122
    • International network of cancer genome projects
    • Hudson TJ, Anderson W, Artez A, et al. International network of cancer genome projects. Nature 2010; 464(7291): 993-998.
    • (2010) Nature , vol.464 , Issue.7291 , pp. 993-998
    • Hudson, T.J.1    Anderson, W.2    Artez, A.3
  • 4
    • 79951811877
    • Big data, but are we ready?
    • Trelles O, Prins P, Snir M, et al. Big data, but are we ready? Nat Rev Genet 2011; 12(3): 224.
    • (2011) Nat Rev Genet , vol.12 , Issue.3 , pp. 224
    • Trelles, O.1    Prins, P.2    Snir, M.3
  • 5
  • 6
    • 77954526823
    • The case for cloud computing in genome informatics
    • Stein L. The case for cloud computing in genome informatics. Genome Biol 2010, 11(5): 207.
    • (2010) Genome Biol , vol.11 , Issue.5 , pp. 207
    • Stein, L.1
  • 8
    • 0000100455
    • A new challenge for compression algorithms: Genetic sequences
    • Grumbach S, Tahi F. A new challenge for compression algorithms: genetic sequences. Inform Process Manag 1994, 30(6): 875-886.
    • (1994) Inform Process Manag , vol.30 , Issue.6 , pp. 875-886
    • Grumbach, S.1    Tahi, F.2
  • 12
    • 0021405335
    • Data compression using adaptive coding and partial string matching
    • Cleary JG, Witten IH. Data compression using adaptive coding and partial string matching. IEEE T Comm 1984, 32: 396-402.
    • (1984) IEEE T Comm , vol.32 , pp. 396-402
    • Cleary, J.G.1    Witten, I.H.2
  • 17
    • 0017493286
    • A universal algorithm for sequential data compression
    • Ziv J, Lempel A. A universal algorithm for sequential data compression. IEEE T Inform Theory 1977, 23(3): 337-343.
    • (1977) IEEE T Inform Theory , vol.23 , Issue.3 , pp. 337-343
    • Ziv, J.1    Lempel, A.2
  • 18
    • 84930881609
    • Run-length encodings
    • Golomb SW. Run-length encodings. IEEE T Inform Theory 1966, 12: 399-401.
    • (1966) IEEE T Inform Theory , vol.12 , pp. 399-401
    • Golomb, S.W.1
  • 19
    • 79955714647
    • On the usefulness of Fibonacci compression codes
    • Klein ST, Ben-Nissan MK. On the usefulness of Fibonacci compression codes. Computer J 2010, 53(6): 701-716.
    • (2010) Computer J , vol.53 , Issue.6 , pp. 701-716
    • Klein, S.T.1    Ben-Nissan, M.K.2
  • 20
    • 84938015047
    • A method for the construction of minimum-redundancy codes
    • Huffman DA. A method for the construction of minimum-redundancy codes. Proceedings of the Institute of Radio Engineers 1952, 40(9): 1098-1101.
    • (1952) Proceedings of the Institute of Radio Engineers , vol.40 , Issue.9 , pp. 1098-1101
    • Huffman, D.A.1
  • 21
    • 0041619492
    • Is Huffman coding dead?
    • Bookstein A, Klein ST. Is Huffman coding dead? Computing 1993, 50: 279-296.
    • (1993) Computing , vol.50 , pp. 279-296
    • Bookstein, A.1    Klein, S.T.2
  • 24
    • 0023536787
    • Data compression using dynamic Markov modelling
    • Cormack G, Horspool N. Data compression using dynamic Markov modelling. Comput J 1987, 30: 541-550.
    • (1987) Comput J , vol.30 , pp. 541-550
    • Cormack, G.1    Horspool, N.2
  • 26
    • 79952580139
    • Compression of DNA sequence reads in FASTQ format
    • Deorowicz S, Grabowski S. Compression of DNA sequence reads in FASTQ format. Bioinformatics 2011, 27(6): 860-862.
    • (2011) Bioinformatics , vol.27 , Issue.6 , pp. 860-862
    • Deorowicz, S.1    Grabowski, S.2
  • 27
    • 79952395270
    • Cancer genomics: From discovery science to personalized medicine
    • Chin L, Andersen JN, Futreal PA. Cancer genomics: from discovery science to personalized medicine. Nat Med 2011, 17(3): 297-303.
    • (2011) Nat Med , vol.17 , Issue.3 , pp. 297-303
    • Chin, L.1    Andersen, J.N.2    Futreal, P.A.3
  • 28
    • 84868670481
    • Adaptive efficient compression of genomes
    • Wandelt S, Leser U. Adaptive efficient compression of genomes. Algorithm Mol Bio 2012, 7: 30.
    • (2012) Algorithm Mol Bio , vol.7 , pp. 30
    • Wandelt, S.1    Leser, U.2
  • 29
    • 84894514001
    • FRESCO: Referential Compression of Highly-Similar sequences
    • (to appear)
    • Wandelt S, Leser U. FRESCO: Referential Compression of Highly-Similar sequences. IEEE ACM T Comput Bi 2013, (to appear).
    • (2013) IEEE ACM T Comput Bi
    • Wandelt, S.1    Leser, U.2
  • 30
    • 0037805644
    • Biotechnological prospects from metagenomics
    • Schloss P, Handelsman J. Biotechnological prospects from metagenomics. Curr Opin Biotechn 2003, 14(3): 303-310.
    • (2003) Curr Opin Biotechn , vol.14 , Issue.3 , pp. 303-310
    • Schloss, P.1    Handelsman, J.2
  • 31
    • 84890458860
    • Differential direct coding: A compression algorithm for nucleotide sequence data
    • Vey G. Differential direct coding: a compression algorithm for nucleotide sequence data. The Journal of Biological Databases and Curation 2009.
    • (2009) The Journal of Biological Databases and Curation
    • Vey, G.1
  • 33
    • 84881510889
    • A biological sequence compression based on cross chromosomal similarities using variable length LUT
    • Bharti RK, Verma A, Singh RK. A biological sequence compression based on cross chromosomal similarities using variable length LUT. Intl J Biomet Bioinform 2011, 4: 217-223.
    • (2011) Intl J Biomet Bioinform , vol.4 , pp. 217-223
    • Bharti, R.K.1    Verma, A.2    Singh, R.K.3
  • 36
    • 84877943508
    • An efficient horizontal and vertical method for online DNA sequence compression
    • Mishra KN, Aaggarwal A, Abdelhadi E, et al. An efficient horizontal and vertical method for online DNA sequence compression. Intl J Comput Appl 2010, 3(1): 39-46.
    • (2010) Intl J Comput Appl , vol.3 , Issue.1 , pp. 39-46
    • Mishra, K.N.1    Aaggarwal, A.2    Abdelhadi, E.3
  • 37
    • 79959701435
    • DNABIT Compress - Genome compression algorithm
    • Rajeswari P, Apparao A. DNABIT Compress - Genome compression algorithm. Bioinformation 2011, 5(8): 350-360.
    • (2011) Bioinformation , vol.5 , Issue.8 , pp. 350-360
    • Rajeswari, P.1    Apparao, A.2
  • 38
    • 81455132689
    • Iterative dictionary construction for compression of large DNA data sets
    • Kuruppu S, Beresford-Smith B, Conway T, et al. Iterative dictionary construction for compression of large DNA data sets. IEEE ACM T Comput Bi 2012, 9(1): 137-149.
    • (2012) IEEE ACM T Comput Bi , vol.9 , Issue.1 , pp. 137-149
    • Kuruppu, S.1    Beresford-Smith, B.2    Conway, T.3
  • 42
    • 80052957011
    • Compressing the human genome using exclusively Markov models
    • Pratas D, Pinho AJ. Compressing the human genome using exclusively Markov models. Adv Intel Soft Comput 2011; 213-220.
    • (2011) Adv Intel Soft Comput , pp. 213-220
    • Pratas, D.1    Pinho, A.J.2
  • 46
    • 67649855126
    • Data structures and compression algorithms for genomic sequence data
    • Brandon MC, Wallace DC, Baldi P. Data structures and compression algorithms for genomic sequence data. Bioinformatics 2009, 25(14): 1731-1738.
    • (2009) Bioinformatics , vol.25 , Issue.14 , pp. 1731-1738
    • Brandon, M.C.1    Wallace, D.C.2    Baldi, P.3
  • 47
    • 58349097721
    • Human genomes as email attachments
    • Christley S, Lu Y, Li C, et al. Human genomes as email attachments. Bioinformatics 2009, 25(2): 274-275.
    • (2009) Bioinformatics , vol.25 , Issue.2 , pp. 274-275
    • Christley, S.1    Lu, Y.2    Li, C.3
  • 48
    • 79954595666
    • A novel compression tool for efficient storage of genome resequencing data
    • Wang C, Zhang D. A novel compression tool for efficient storage of genome resequencing data. Nucleic Acids Res 2011, 39(7): e45.
    • (2011) Nucleic Acids Res , vol.39 , Issue.7
    • Wang, C.1    Zhang, D.2
  • 52
    • 84873187741
    • GReEn: A tool for efficient compression of genome resequencing data
    • Pinho AJ, Pratas D, Garcia SP. GReEn: a tool for efficient compression of genome resequencing data. Nucleic Acids Res 2011.
    • (2011) Nucleic Acids Res
    • Pinho, A.J.1    Pratas, D.2    Garcia, S.P.3
  • 53
    • 80053956723
    • DNA data compression based on the whole genome sequence
    • Kim JD, Kim JH. DNA data compression based on the whole genome sequence. J Convergence Inform Technol 2009, 4(3): 82-85.
    • (2009) J Convergence Inform Technol , vol.4 , Issue.3 , pp. 82-85
    • Kim, J.D.1    Kim, J.H.2
  • 60
    • 43149107930
    • Quality scores and SNP detection in sequencing-by-synthesis systems
    • Brockman W, Alvarez P, Young S, et al. Quality scores and SNP detection in sequencing-by-synthesis systems. Genome Res 2008, 18(5): 763-770.
    • (2008) Genome Res , vol.18 , Issue.5 , pp. 763-770
    • Brockman, W.1    Alvarez, P.2    Young, S.3
  • 62
    • 45649084526
    • Data compression and genomes: A two-dimensional life domain map
    • Menconi G, Benci V, Buiatti M. Data compression and genomes: a two-dimensional life domain map. J Theor Biol 2008, 253(2): 281-288.
    • (2008) J Theor Biol , vol.253 , Issue.2 , pp. 281-288
    • Menconi, G.1    Benci, V.2    Buiatti, M.3
  • 63
    • 80054701916
    • DNA sequence compression using adaptive particle swarm optimization-based memetic algorithm
    • Zhu Z, Zhou J, Ji Z, et al. DNA sequence compression using adaptive particle swarm optimization-based memetic algorithm. IEEE T Evolut Comput 2011, 15(5): 643-658.
    • (2011) IEEE T Evolut Comput , vol.15 , Issue.5 , pp. 643-658
    • Zhu, Z.1    Zhou, J.2    Ji, Z.3
  • 65
    • 84869232795
    • Transformations for the compression of FASTQ quality scores of next generation sequencing data
    • Wan R, Anh VN, Asai K. Transformations for the compression of FASTQ quality scores of next generation sequencing data. Bioinformatics 2011.
    • (2011) Bioinformatics
    • Wan, R.1    Anh, V.N.2    Asai, K.3
  • 66
    • 77955886068
    • G-SQZ: Compact encoding of genomic sequence and quality data
    • Tembe W, Lowey J, Suh E. G-SQZ: compact encoding of genomic sequence and quality data. Bioinformatics 2010, 26(17): 2192-2194.
    • (2010) Bioinformatics , vol.26 , Issue.17 , pp. 2192-2194
    • Tembe, W.1    Lowey, J.2    Suh, E.3
  • 67
    • 84873027492
    • Integrating human genome database into electronic health record with sequence alignment and compression mechanism
    • Chen WH, Lu YW, Lai FP, et al. Integrating human genome database into electronic health record with sequence alignment and compression mechanism. J Med Syst 2011, 36(3): 2587-2597.
    • (2011) J Med Syst , vol.36 , Issue.3 , pp. 2587-2597
    • Chen, W.H.1    Lu, Y.W.2    Lai, F.P.3
  • 68
    • 77957765256
    • Data structures and compression algorithms for high throughput sequencing technologies
    • Daily K, Rigor R, Christley S, et al. Data structures and compression algorithms for high throughput sequencing technologies. BMC Bioinformatics 2010, 11(1): 514.
    • (2010) BMC Bioinformatics , vol.11 , Issue.1 , pp. 514
    • Daily, K.1    Rigor, R.2    Christley, S.3
  • 70
    • 79955554401
    • Efficient storage of high throughput DNA sequencing data using reference-based compression
    • Fritz MH, Leinonen R, Cochrane G, et al. Efficient storage of high throughput DNA sequencing data using reference-based compression. Genome Res 2011, 21(5): 734-740.
    • (2011) Genome Res , vol.21 , Issue.5 , pp. 734-740
    • Fritz, M.H.1    Leinonen, R.2    Cochrane, G.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.