Volume 28, Issue 11, 2012, Pages 1415-1419

Large-scale compression of genomic sequence databases with the Burrows-Wheeler transform

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHM; ARTICLE; COMPUTER SIMULATION; DNA SEQUENCE; ESCHERICHIA COLI; GENETICS; GENOMICS; HUMAN; HUMAN GENOME; INFORMATION PROCESSING; METHODOLOGY; NUCLEIC ACID DATABASE;

EID: 84861760100     PISSN: 13674803     EISSN: 14602059     Source Type: Journal    
DOI: 10.1093/bioinformatics/bts173     Document Type: Article
Times cited : (127)

References (22)
  • 2
    • 79960099867
    • Lightweight BWT construction for very large string collections
    • Bauer, M.J. et al. (2011) Lightweight BWT construction for very large string collections. In CPM 2011. Vol. 6661 of LNCS, Springer, pp. 219-231.
    • (2011) CPM 2011 , vol.6661 , pp. 219-231
    • Bauer, M.J.1
  • 3
    • 84876410333
    • Lightweight algorithms for constructing and inverting the BWT of string collections
    • 10.1016/j.tcs.2012.02.002
    • Bauer, M.J., Cox, A.J. and Rosone, G. (2013) Lightweight algorithms for constructing and inverting the BWT of string collections. Theoretical Computer Science, 483, 134-148, doi:10.1016/j.tcs.2012.02.002.
    • (2013) Theoretical Computer Science , vol.483 , pp. 134-148
    • Bauer, M.J.1    Cox, A.J.2    Rosone, G.3
  • 5
    • 0036947893
    • DNACompress: fast and effective DNA sequence compression
    • Chen, X. et al. (2002) DNACompress: fast and effective DNA sequence compression. Bioinformatics, 18, 1696-1698.
    • (2002) Bioinformatics , vol.18 , pp. 1696-1698
    • Chen, X.1
  • 6
    • 79952580139
    • Compression of genomic sequences in FASTQ format
    • Deorowicz, S. and Grabowski, S. (2011) Compression of genomic sequences in FASTQ format. Bioinformatics, 27, 860-862.
    • (2011) Bioinformatics , vol.27 , pp. 860-862
    • Deorowicz, S.1    Grabowski, S.2
  • 7
    • 80053447840
    • Phased whole-genome genetic risk in a family quartet using a major allele reference sequence
    • Dewey, F.E. et al. (2011) Phased whole-genome genetic risk in a family quartet using a major allele reference sequence. PLoS Genet., 7, e1002280.
    • (2011) PLoS Genet , vol.7
    • Dewey, F.E.1
  • 9
    • 30544432152
    • Indexing compressed text
    • Ferragina, P. and Manzini, G. (2005) Indexing compressed text. J. ACM, 52, 552-581.
    • (2005) J. ACM , vol.52 , pp. 552-581
    • Ferragina, P.1    Manzini, G.2
  • 10
    • 34250171723
    • Compressed representations of sequences and full-text indexes
    • Article 20
    • Ferragina, P. et al. (2007) Compressed representations of sequences and full-text indexes. ACM Trans. Algor., 3(2), Article 20.
    • (2007) ACM Trans. Algor. , vol.3 , Issue.2
    • Ferragina, P.1
  • 11
    • 79955554401
    • Efficient storage of high throughput DNA sequencing data using reference-based compression
    • Fritz, M.H. et al. (2011) Efficient storage of high throughput DNA sequencing data using reference-based compression. Genome Res., 21, 734-740.
    • (2011) Genome Res , vol.21 , pp. 734-740
    • Fritz, M.H.1
  • 12
    • 67649170975
    • Textual data compression in computational biology: a synopsis
    • Giancarlo, R. et al. (2009) Textual data compression in computational biology: a synopsis. Bioinformatics, 25, 1575-1586.
    • (2009) Bioinformatics , vol.25 , pp. 1575-1586
    • Giancarlo, R.1
  • 13
    • 0000100455
    • A new challenge for compression algorithms: genetic sequences
    • Grumbach, S. and Tahi, F. (1994) A new challenge for compression algorithms: genetic sequences. Inf. Process. Manage., 30, 875-886.
    • (1994) Inf. Process. Manage. , vol.30 , pp. 875-886
    • Grumbach, S.1    Tahi, F.2
  • 14
    • 78650275807
    • Compressing genomic sequence fragments using SlimGene
    • Kozanitis, C. et al. (2010) Compressing genomic sequence fragments using SlimGene. In RECOMB. Vol. 6044 of LNCS, Springer, pp. 310-324.
    • (2010) RECOMB , vol.6044 , pp. 310-324
    • Kozanitis, C.1
  • 15
    • 67649884743
    • Fast and accurate short read alignment with Burrows-Wheeler transform
    • Li, H. and Durbin, R. (2009) Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics, 25, 1754-1760.
    • (2009) Bioinformatics , vol.25 , pp. 1754-1760
    • Li, H.1    Durbin, R.2
  • 16
    • 26444516518
    • An extension of the Burrows Wheeler transform and applications to sequence comparison and data compression
    • Mantaci, S. et al. (2005) An extension of the Burrows Wheeler transform and applications to sequence comparison and data compression. In CPM 2005. Vol. 3537 of LNCS, Springer, pp. 178-189.
    • (2005) CPM 2005 , vol.3537 , pp. 178-189
    • Mantaci, S.1
  • 17
    • 0027194328
    • Discovering simple DNA sequences by the algorithmic significance method
    • Milosavljevic, A. and Jurka, J. (1993) Discovering simple DNA sequences by the algorithmic significance method. Comput. Appl. Biosci. CABIOS, 9, 407-411.
    • (1993) Comput. Appl. Biosci. CABIOS , vol.9 , pp. 407-411
    • Milosavljevic, A.1    Jurka, J.2
  • 18
    • 0029852415
    • Compression and genetic sequence analysis
    • Rivals, E. et al. (1996) Compression and genetic sequence analysis. Biochimie, 78, 315-322.
    • (1996) Biochimie , vol.78 , pp. 315-322
    • Rivals, E.1
  • 19
    • 77954238055
    • Efficient construction of an assembly string graph using the FM-index
    • Simpson, J.T. and Durbin, R. (2010) Efficient construction of an assembly string graph using the FM-index. Bioinformatics, 26, i367-i373.
    • (2010) Bioinformatics , vol.26
    • Simpson, J.T.1    Durbin, R.2
  • 20
    • 84857838310
    • Efficient de novo assembly of large genomes using compressed data structures
    • Simpson, J.T. and Durbin, R. (2012) Efficient de novo assembly of large genomes using compressed data structures. Genome Res., 22, 549-556.
    • (2012) Genome Res , vol.22 , pp. 549-556
    • Simpson, J.T.1    Durbin, R.2
  • 21
    • 77955886068
    • G-SQZ: compact encoding of genomic sequence and quality data
    • Tembe, W. et al. (2010) G-SQZ: compact encoding of genomic sequence and quality data. Bioinformatics, 26, 2192-2194.
    • (2010) Bioinformatics , vol.26 , pp. 2192-2194
    • Tembe, W.1
  • 22
    • 80053647283
    • ReCoil - an algorithm for compression of extremely large datasets of DNA data
    • Yanovsky, V. (2011) ReCoil - an algorithm for compression of extremely large datasets of DNA data. Algor. Mol. Biol., 6, 23.
    • (2011) Algor. Mol. Biol. , vol.6 , pp. 23
    • Yanovsky, V.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.