Volume 8, Issue 1, 2013, Pages

Data compression for sequencing data

Author keywords

[No Author keywords available]

Indexed keywords


EID: 84888042406     PISSN: None     EISSN: 17487188     Source Type: Journal    
DOI: 10.1186/1748-7188-8-25     Document Type: Review
Times cited: 79

References (99)
  • 1
    • 72849144434
    • Sequencing technologies-the next generation
    • 10.1038/nrg2626, 19997069
    • Metzker ML. Sequencing technologies-the next generation. Nat Rev Genet 2010, 11:31-46. 10.1038/nrg2626, 19997069.
    • (2010) Nat Rev Genet , vol.11 , pp. 31-46
    • Metzker, M.L.1
  • 2
    • 79951493627
    • On the future of genomic data
    • 10.1126/science.1197891, 21311016
    • Kahn SD. On the future of genomic data. Science 2011, 331:728-729. 10.1126/science.1197891, 21311016.
    • (2011) Science , vol.331 , pp. 728-729
    • Kahn, S.D.1
  • 3
    • 84888043034
    • Million veterans sequenced
    • Roberts JP. Million veterans sequenced. Nat Biotechnol 2013, 31(6):470.
    • (2013) Nat Biotechnol , vol.31 , Issue.6 , pp. 470
    • Roberts, J.P.1
  • 4
    • 84883754411
    • After the gold rush
    • 10.1186/gb-2013-14-5-115, 3663089, 23657273
    • Hall N. After the gold rush. Genome Biol 2013, 14(5):115. 10.1186/gb-2013-14-5-115, 3663089, 23657273.
    • (2013) Genome Biol , vol.14 , Issue.5 , pp. 115
    • Hall, N.1
  • 5
    • 84871887720
    • National Human Genome Research Institute, DNA Sequencing Costs
    • (accessed February 14, 2013)
    • National Human Genome Research Institute, DNA Sequencing Costs. [http://www.genome.gov/sequencingcosts/] (accessed February 14, 2013).
  • 6
    • 84856496740
    • A new efficient data structure for storage and retrieval of multiple biosequences
    • Steinbiss S, Kurtz S. A new efficient data structure for storage and retrieval of multiple biosequences. IEEE/ACM Trans Comput Biol Bioinformatics 2012, 9(2):345-357.
    • (2012) IEEE/ACM Trans Comput Biol Bioinformatics , vol.9 , Issue.2 , pp. 345-357
    • Steinbiss, S.1    Kurtz, S.2
  • 7
    • 84862198590
    • The sequence read archive: explosive growth of sequencing data
    • Database issue
    • Kodama Y, Shumway M, Leinonen R. The sequence read archive: explosive growth of sequencing data. Nucleic Acids Res 2012, 40(Database issue):54-56.
    • (2012) Nucleic Acids Res , vol.40 , pp. 54-56
    • Kodama, Y.1    Shumway, M.2    Leinonen, R.3
  • 8
    • 84871826374
    • The future of DNA sequence archiving
    • article no. 2
    • Cochrane G, Cook CE, Birney E. The future of DNA sequence archiving. GigaScience 2012, 1(1). article no. 2.
    • (2012) GigaScience , vol.1 , Issue.1
    • Cochrane, G.1    Cook, C.E.2    Birney, E.3
  • 9
    • 67649170975
    • Textual data compression in computational biology: A synopsis
    • 10.1093/bioinformatics/btp117, 19251772
    • Giancarlo R, Scaturro D, Utro F. Textual data compression in computational biology: A synopsis. Bioinformatics 2009, 25(13):1575-1586. 10.1093/bioinformatics/btp117, 19251772.
    • (2009) Bioinformatics , vol.25 , Issue.13 , pp. 1575-1586
    • Giancarlo, R.1    Scaturro, D.2    Utro, F.3
  • 10
    • 84856608719
    • Textual data compression in computational biology: Algorithmic techniques
    • Giancarlo R, Scaturro D, Utro F. Textual data compression in computational biology: Algorithmic techniques. Comput Sci Rev 2012, 6(1):1-25.
    • (2012) Comput Sci Rev , vol.6 , Issue.1 , pp. 1-25
    • Giancarlo, R.1    Scaturro, D.2    Utro, F.3
  • 11
    • 84867320545
    • Prospects and limitations of full-text index structures in genome analysis
    • 10.1093/nar/gks408, 3424560, 22584621
    • Vyverman M, De Baets B, Fack V, Dawyndt P. Prospects and limitations of full-text index structures in genome analysis. Nucleic Acids Res 2012, 40(15):6993-7015. 10.1093/nar/gks408, 3424560, 22584621.
    • (2012) Nucleic Acids Res , vol.40 , Issue.15 , pp. 6993-7015
    • Vyverman, M.1    De Baets, B.2    Fack, V.3    Dawyndt, P.4
  • 14
    • 0017493286
    • A universal algorithm for sequential data compression
    • Ziv J, Lempel A. A universal algorithm for sequential data compression. IEEE Trans Inf Theory 1977, IT-23:337-343.
    • (1977) IEEE Trans Inf Theory , vol.IT 23 , pp. 337-343
    • Ziv, J.1    Lempel, A.2
  • 15
    • 0003573193
    • A block sorting lossless data compression algorithm
    • Technical Report 124, Digital Equipment Corporation 1994
    • Burrows M, Wheeler D. A block sorting lossless data compression algorithm. Technical Report 124, Digital Equipment Corporation 1994, http://www.hpl.hp.com/techreports/Compaq-DEC/SRC-RR-124.pdf.
    • Burrows, M.1    Wheeler, D.2
  • 16
    • 77951226627
    • The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants
    • 10.1093/nar/gkp1137, 2847217, 20015970
    • Cock PJA, Fields CJ, Goto N, Heuer ML, Rice PM. The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants. Nucleic Acids Res 2010, 38(6):1767-1771. 10.1093/nar/gkp1137, 2847217, 20015970.
    • (2010) Nucleic Acids Res , vol.38 , Issue.6 , pp. 1767-1771
    • Cock, P.J.A.1    Fields, C.J.2    Goto, N.3    Heuer, M.L.4    Rice, P.M.5
  • 17
    • 79952580139
    • Compression of DNA sequence reads in FASTQ format
    • 10.1093/bioinformatics/btr014, 21252073
    • Deorowicz S, Grabowski S. Compression of DNA sequence reads in FASTQ format. Bioinformatics 2011, 27(6):860-862. 10.1093/bioinformatics/btr014, 21252073.
    • (2011) Bioinformatics , vol.27 , Issue.6 , pp. 860-862
    • Deorowicz, S.1    Grabowski, S.2
  • 20
    • 80053647283
    • ReCoil-an algorithm for compression of extremely large datasets of DNA data
    • Yanovsky V. ReCoil-an algorithm for compression of extremely large datasets of DNA data. Algo Mol Biol 2011, 6:23.
    • (2011) Algo Mol Biol , vol.6 , pp. 23
    • Yanovsky, V.1
  • 21
    • 84861760100
    • Large-scale compression of genomic sequence databases with the Burrows-Wheeler transform
    • 10.1093/bioinformatics/bts173, 22556365
    • Cox AJ, Bauer MJ, Jakobi T, Rosone G. Large-scale compression of genomic sequence databases with the Burrows-Wheeler transform. Bioinformatics 2012, 28(11):1415-1419. 10.1093/bioinformatics/bts173, 22556365.
    • (2012) Bioinformatics , vol.28 , Issue.11 , pp. 1415-1419
    • Cox, A.J.1    Bauer, M.J.2    Jakobi, T.3    Rosone, G.4
  • 22
    • 84870429157
    • SCALCE: boosting Sequence Compression Algorithms using Locally Consistent Encoding
    • 10.1093/bioinformatics/bts593, 23047557
    • Hach F, Numanagić I, Alkan C, Sahinalp SC. SCALCE: boosting Sequence Compression Algorithms using Locally Consistent Encoding. Bioinformatics 2012, 28(23):3051-3057. 10.1093/bioinformatics/bts593, 23047557.
    • (2012) Bioinformatics , vol.28 , Issue.23 , pp. 3051-3057
    • Hach, F.1    Numanagić, I.2    Alkan, C.3    Sahinalp, S.C.4
  • 23
    • 77952886150
    • Assembly algorithms for next-generation sequencing data
    • 10.1016/j.ygeno.2010.03.001, 2874646, 20211242
    • Miller JR, Koren S, Sutton G. Assembly algorithms for next-generation sequencing data. Genomics 2010, 95(6):315-327. 10.1016/j.ygeno.2010.03.001, 2874646, 20211242.
    • (2010) Genomics , vol.95 , Issue.6 , pp. 315-327
    • Miller, J.R.1    Koren, S.2    Sutton, G.3
  • 24
    • 84857848401
    • Transformations for the compression of FASTQ quality scores of next generation sequencing data
    • Wan R, Anh VN, Asai K. Transformations for the compression of FASTQ quality scores of next generation sequencing data. Bioinformatics 2011, 28(5):628-635.
    • (2011) Bioinformatics , vol.28 , Issue.5 , pp. 628-635
    • Wan, R.1    Anh, V.N.2    Asai, K.3
  • 25
    • 79952410480
    • Compressing genomic sequence fragments using SlimGene
    • 10.1089/cmb.2010.0253, 3123913, 21385043
    • Kozanitis C, Saunders C, Kruglyak S, Bafna V, Varghese G. Compressing genomic sequence fragments using SlimGene. J Comput Biol 2011, 18(3):401-413. 10.1089/cmb.2010.0253, 3123913, 21385043.
    • (2011) J Comput Biol , vol.18 , Issue.3 , pp. 401-413
    • Kozanitis, C.1    Saunders, C.2    Kruglyak, S.3    Bafna, V.4    Varghese, G.5
  • 26
    • 84878634014
    • QualComp: a new lossy compressor for quality scores based on rate distortion theory
    • 10.1186/1471-2105-14-187, 3698011, 23758828
    • Ochoa I, Asnani H, Bharadia D, Chowdhury M, Weissman T, Yona G. QualComp: a new lossy compressor for quality scores based on rate distortion theory. BMC Bioinformatics 2013, 14:187. 10.1186/1471-2105-14-187, 3698011, 23758828.
    • (2013) BMC Bioinformatics , vol.14 , pp. 187
    • Ochoa, I.1    Asnani, H.2    Bharadia, D.3    Chowdhury, M.4    Weissman, T.5    Yona, G.6
  • 27
    • 84888048736
    • Casava v. 1.8.2 Documentation
    • Illumina
    • Illumina Casava v. 1.8.2 Documentation. 2013, [http://support.illumina.com/sequencing/sequencing_software/casava.ilmn], Illumina.
    • (2013)
  • 28
    • 84878300793
    • High-throughput compression of FASTQ data with SeqDB
    • Howison M. High-throughput compression of FASTQ data with SeqDB. IEEE/ACM Trans Comput Biol Bioinformatics 2013, 10(1):213-218.
    • (2013) IEEE/ACM Trans Comput Biol Bioinformatics , vol.10 , Issue.1 , pp. 213-218
    • Howison, M.1
  • 29
    • 84871199924
    • Compression of next-generation sequencing reads aided by highly efficient de novo assembly
    • 10.1093/nar/gks754, 3526293, 22904078
    • Jones DC, Ruzzo WL, Peng X, Katze MG. Compression of next-generation sequencing reads aided by highly efficient de novo assembly. Nucleic Acids Res 2012, 40(22):e171. 10.1093/nar/gks754, 3526293, 22904078.
    • (2012) Nucleic Acids Res , vol.40 , Issue.22
    • Jones, D.C.1    Ruzzo, W.L.2    Peng, X.3    Katze, M.G.4
  • 30
    • 84875363204
    • Compression of FASTQ and SAM format sequencing data
    • 10.1371/journal.pone.0059190, 3606433, 23533605
    • Bonfield JK, Mahoney MV. Compression of FASTQ and SAM format sequencing data. PLoS ONE 2013, 8(3):e59190. 10.1371/journal.pone.0059190, 3606433, 23533605.
    • (2013) PLoS ONE , vol.8 , Issue.3
    • Bonfield, J.K.1    Mahoney, M.V.2
  • 31
    • 77955886068
    • G-SQZ: compact encoding of genomic sequence and quality data
    • 10.1093/bioinformatics/btq346, 20605925
    • Tembe W, Lowey J, Suh E. G-SQZ: compact encoding of genomic sequence and quality data. Bioinformatics 2010, 26(17):2192-2194. 10.1093/bioinformatics/btq346, 20605925.
    • (2010) Bioinformatics , vol.26 , Issue.17 , pp. 2192-2194
    • Tembe, W.1    Lowey, J.2    Suh, E.3
  • 32
    • 68549104404
    • The sequence alignment/map (SAM) format and SAMtools
    • 10.1093/bioinformatics/btp352, 2723002, 19505943, 1000 Genome Project Data Processing Subgroup
    • Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, 1000 Genome Project Data Processing Subgroup. The sequence alignment/map (SAM) format and SAMtools. Bioinformatics 2009, 25(16):2078-2079. 10.1093/bioinformatics/btp352, 2723002, 19505943.
    • (2009) Bioinformatics , vol.25 , Issue.16 , pp. 2078-2079
    • Li, H.1    Handsaker, B.2    Wysoker, A.3    Fennell, T.4    Ruan, J.5    Homer, N.6    Marth, G.7    Abecasis, G.8    Durbin, R.9
  • 33
    • 79955554401
    • Efficient storage of high throughput DNA sequencing data using reference-based compression
    • 10.1101/gr.114819.110, 3083090, 21245279
    • Fritz MH-Y, Leinonen R, Cochrane G, Birney E. Efficient storage of high throughput DNA sequencing data using reference-based compression. Genome Res 2011, 21:734-740. 10.1101/gr.114819.110, 3083090, 21245279.
    • (2011) Genome Res , vol.21 , pp. 734-740
    • Fritz, M.H.-Y.1    Leinonen, R.2    Cochrane, G.3    Birney, E.4
  • 34
    • 82555175823
    • Improving transmission efficiency of large sequence alignment/map (SAM) files
    • 10.1371/journal.pone.0028251, 3229529, 22164252
    • Sakib MN, Tang J, Zheng WJ, Huang C-T. Improving transmission efficiency of large sequence alignment/map (SAM) files. PLoS ONE 2011, 6(12):e28251. 10.1371/journal.pone.0028251, 3229529, 22164252.
    • (2011) PLoS ONE , vol.6 , Issue.12
    • Sakib, M.N.1    Tang, J.2    Zheng, W.J.3    Huang, C.-T.4
  • 35
    • 8344261403
    • A simple and fast DNA compressor
    • Manzini G, Rastero M. A simple and fast DNA compressor. Softw Pract Exp 2004, 34(14):1397-1411.
    • (2004) Softw Pract Exp , vol.34 , Issue.14 , pp. 1397-1411
    • Manzini, G.1    Rastero, M.2
  • 36
    • 79959722141
    • On the representability of complete genomes by multiple competing finite-context (Markov) models
    • 10.1371/journal.pone.0021588, 3128062, 21738720
    • Pinho AJ, Ferreira PJSG, Neves AJR, Bastos CAC. On the representability of complete genomes by multiple competing finite-context (Markov) models. PLoS ONE 2011, 6(6):e21588. 10.1371/journal.pone.0021588, 3128062, 21738720.
    • (2011) PLoS ONE , vol.6 , Issue.6
    • Pinho, A.J.1    Ferreira, P.J.S.G.2    Neves, A.J.R.3    Bastos, C.A.C.4
  • 37
    • 34547630480
    • A simple statistical algorithm for biological sequence compression
    • Washington, DC, USA: IEEE Computer Society Press
    • Cao MD, Dix TI, Allison L, Mears C. A simple statistical algorithm for biological sequence compression. Proceedings of the Data Compression Conference 2007, 43-52. Washington, DC, USA: IEEE Computer Society Press.
    • (2007) Proceedings of the Data Compression Conference , pp. 43-52
    • Cao, M.D.1    Dix, T.I.2    Allison, L.3    Mears, C.4
  • 38
    • 84868670481
    • Adaptive efficient compression of genomes
    • Wandelt S, Leser U. Adaptive efficient compression of genomes. Algo Mol Biol 2012, 7:30.
    • (2012) Algo Mol Biol , vol.7 , pp. 30
    • Wandelt, S.1    Leser, U.2
  • 39
    • 80054918493
    • Robust relative compression of genomes with random access
    • Deorowicz S, Grabowski S. Robust relative compression of genomes with random access. Bioinformatics 2011, 27(21):2979-2986.
    • (2011) Bioinformatics , vol.27 , Issue.21 , pp. 2979-2986
    • Deorowicz, S.1    Grabowski, S.2
  • 40
    • 84857860662
    • GReEn: a tool for efficient compression of genome resequencing data
    • 10.1093/nar/gkr1124, 3287168, 22139935
    • Pinho AJ, Pratas D, Garcia SP. GReEn: a tool for efficient compression of genome resequencing data. Nucleic Acids Res 2012, 40(4):e27. 10.1093/nar/gkr1124, 3287168, 22139935.
    • (2012) Nucleic Acids Res , vol.40 , Issue.4
    • Pinho, A.J.1    Pratas, D.2    Garcia, S.P.3
  • 41
    • 79954595666
    • A novel compression tool for efficient storage of genome resequencing data
    • 10.1093/nar/gkr009, 3074166, 21266471
    • Wang C, Zhang D. A novel compression tool for efficient storage of genome resequencing data. Nucleic Acids Res 2011, 39(7):e45. 10.1093/nar/gkr009, 3074166, 21266471.
    • (2011) Nucleic Acids Res , vol.39 , Issue.7
    • Wang, C.1    Zhang, D.2
  • 44
    • 77957765256
    • Data structures and compression algorithms for high-throughput sequencing technologies
    • 10.1186/1471-2105-11-514, 2964686, 20946637
    • Daily K, Rigor P, Christley S, Xie X, Baldi P. Data structures and compression algorithms for high-throughput sequencing technologies. BMC Bioinformatics 2010, 11:514. 10.1186/1471-2105-11-514, 2964686, 20946637.
    • (2010) BMC Bioinformatics , vol.11 , pp. 514
    • Daily, K.1    Rigor, P.2    Christley, S.3    Xie, X.4    Baldi, P.5
  • 45
    • 84871807049
    • NGC: lossless and lossy compression of aligned high-throughput sequencing data
    • 10.1093/nar/gks939, 3592443, 23066097
    • Popitsch N, von Haeseler A. NGC: lossless and lossy compression of aligned high-throughput sequencing data. Nucleic Acids Res 2013, 41(1):e27. 10.1093/nar/gks939, 3592443, 23066097.
    • (2013) Nucleic Acids Res , vol.41 , Issue.1
    • Popitsch, N.1    von Haeseler, A.2
  • 46
    • 79951993896
    • Tabix: fast retrieval of sequence features from generic TAB-delimited files
    • 10.1093/bioinformatics/btq671, 3042176, 21208982
    • Li H. Tabix: fast retrieval of sequence features from generic TAB-delimited files. Bioinformatics 2011, 27(5):718-719. 10.1093/bioinformatics/btq671, 3042176, 21208982.
    • (2011) Bioinformatics , vol.27 , Issue.5 , pp. 718-719
    • Li, H.1
  • 48
    • 58349097721
    • Human genomes as email attachments
    • 10.1093/bioinformatics/btn582, 18996942
    • Christley S, Lu Y, Li C, Xie X. Human genomes as email attachments. Bioinformatics 2009, 25(2):274-275. 10.1093/bioinformatics/btn582, 18996942.
    • (2009) Bioinformatics , vol.25 , Issue.2 , pp. 274-275
    • Christley, S.1    Lu, Y.2    Li, C.3    Xie, X.4
  • 49
    • 84882651968
    • The human genome contracts again
    • 10.1093/bioinformatics/btt362, 23793748
    • Pavlichin D, Weissman T, Yona G. The human genome contracts again. Bioinformatics 2013, 29(17):2199-2202. 10.1093/bioinformatics/btt362, 23793748.
    • (2013) Bioinformatics , vol.29 , Issue.17 , pp. 2199-2202
    • Pavlichin, D.1    Weissman, T.2    Yona, G.3
  • 50
    • 84885611671
    • Genome compression: a novel approach for large collections
    • 10.1093/bioinformatics/btt460, 23969136
    • Deorowicz S, Danek A, Grabowski S. Genome compression: a novel approach for large collections. Bioinformatics 2013, 29(20):2572-2578. 10.1093/bioinformatics/btt460, 23969136.
    • (2013) Bioinformatics , vol.29 , Issue.20 , pp. 2572-2578
    • Deorowicz, S.1    Danek, A.2    Grabowski, S.3
  • 51
    • 84900831555
    • Reference based genome compression
    • Publicly available preprint arXiv:1204.1912v1 2012
    • Chern BG, Ochoa I, Manolakos A, No A, Venkat K, Weissman T. Reference based genome compression. Publicly available preprint arXiv:1204.1912v1 2012.
    • Chern, B.G.1    Ochoa, I.2    Manolakos, A.3    No, A.4    Venkat, K.5    Weissman, T.6
  • 53
    • 77952730117
    • LZ77-like compression with fast random access
    • Washington, DC, USA: IEEE Computer Society
    • Kreft S, Navarro G. LZ77-like compression with fast random access. Proceedings of the Data Compression Conference 2010, 239-248. Washington, DC, USA: IEEE Computer Society.
    • (2010) Proceedings of the Data Compression Conference , pp. 239-248
    • Kreft, S.1    Navarro, G.2
  • 55
    • 80755159050
    • How to apply de Bruijn graphs to genome assembly
    • 10.1038/nbt.2023, 22068540
    • Compeau PE, Pevzner PA, Tesler G. How to apply de Bruijn graphs to genome assembly. Nat Biotechnol 2011, 29(11):987-991. 10.1038/nbt.2023, 22068540.
    • (2011) Nat Biotechnol , vol.29 , Issue.11 , pp. 987-991
    • Compeau, P.E.1    Pevzner, P.A.2    Tesler, G.3
  • 56
    • 79951526698
    • Succinct data structures for assembling large genomes
    • 10.1093/bioinformatics/btq697, 21245053
    • Conway TC, Bromage AJ. Succinct data structures for assembling large genomes. Bioinformatics 2011, 27(4):479-486. 10.1093/bioinformatics/btq697, 21245053.
    • (2011) Bioinformatics , vol.27 , Issue.4 , pp. 479-486
    • Conway, T.C.1    Bromage, A.J.2
  • 57
    • 0014814325
    • Space/time trade-offs in hash coding with allowable errors
    • Bloom BH. Space/time trade-offs in hash coding with allowable errors. Commun ACM 1970, 13(7):422-426.
    • (1970) Commun ACM , vol.13 , Issue.7 , pp. 422-426
    • Bloom, B.H.1
  • 58
    • 84866687164
    • Space-efficient and exact de Bruijn graph representation based on a Bloom filter
    • Springer-Verlag, Berlin-Heidelberg: Springer, LNCS 7534, Raphael BJ, Tang J
    • Chikhi R, Rizk G. Space-efficient and exact de Bruijn graph representation based on a Bloom filter. Proceedings of the 12th International Workshop on Algorithms in Bioinformatics (WABI) 2012, 236-248. Springer-Verlag, Berlin-Heidelberg: Springer, LNCS 7534, Raphael BJ, Tang J.
    • (2012) Proceedings of the 12th International Workshop on Algorithms in Bioinformatics (WABI) , pp. 236-248
    • Chikhi, R.1    Rizk, G.2
  • 59
    • 84884410315
    • Using cascading Bloom filters to improve the memory usage for de Brujin graphs
    • Springer-Verlag, Berlin-Heidelberg: Springer, LNCS 8126, Darling A. E., Stoye J
    • Salikhov K, Sacomoto G, Kucherov G. Using cascading Bloom filters to improve the memory usage for de Brujin graphs. Proceedings of the 13th International Workshop on Algorithms in Bioinformatics (WABI) 2013, 364-376. Springer-Verlag, Berlin-Heidelberg: Springer, LNCS 8126, Darling A. E., Stoye J.
    • (2013) Proceedings of the 13th International Workshop on Algorithms in Bioinformatics (WABI) , pp. 364-376
    • Salikhov, K.1    Sacomoto, G.2    Kucherov, G.3
  • 61
    • 27544497879
    • The fragment assembly string graph
    • Myers EW. The fragment assembly string graph. Bioinformatics 2005, 21(suppl 2):ii79-ii85.
    • (2005) Bioinformatics , vol.21 , Issue.SUPPL 2
    • Myers, E.W.1
  • 62
    • 84857838310
    • Efficient de novo assembly of large genomes using compressed data structures
    • 10.1101/gr.126953.111, 3290790, 22156294
    • Simpson JT, Durbin R. Efficient de novo assembly of large genomes using compressed data structures. Genome Res 2012, 22:549-556. 10.1101/gr.126953.111, 3290790, 22156294.
    • (2012) Genome Res , vol.22 , pp. 549-556
    • Simpson, J.T.1    Durbin, R.2
  • 64
    • 84860523681
    • Readjoiner: a fast and memory efficient string graph-based sequence assembler
    • 10.1186/1471-2105-13-82, 3507659, 22559072
    • Gonnella G, Kurtz S. Readjoiner: a fast and memory efficient string graph-based sequence assembler. BMC Bioinformatics 2012, 13:82. 10.1186/1471-2105-13-82, 3507659, 22559072.
    • (2012) BMC Bioinformatics , vol.13 , pp. 82
    • Gonnella, G.1    Kurtz, S.2
  • 66
    • 84876408746
    • On compressing and indexing repetitive sequences
    • Kreft S, Navarro G. On compressing and indexing repetitive sequences. Theor Comput Sci 2013, 483:115-133.
    • (2013) Theor Comput Sci , vol.483 , pp. 115-133
    • Kreft, S.1    Navarro, G.2
  • 70
    • 84859342211
    • Hobbes: optimized gram-based methods for efficient read alignment
    • 10.1093/nar/gkr1246, 3315303, 22199254
    • Ahmadi A, Behm A, Honnalli N, Li C, Weng L, Xie X. Hobbes: optimized gram-based methods for efficient read alignment. Nucleic Acids Res 2012, 40(6):e41. 10.1093/nar/gkr1246, 3315303, 22199254.
    • (2012) Nucleic Acids Res , vol.40 , Issue.6
    • Ahmadi, A.1    Behm, A.2    Honnalli, N.3    Li, C.4    Weng, L.5    Xie, X.6
  • 71
    • 62349130698
    • Ultrafast and memory-efficient alignment of short DNA sequences to the human genome
    • 10.1186/gb-2009-10-3-r25, 2690996, 19261174
    • Langmead B, Trapnell C, Pop M, Salzberg SL. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 2009, 10(3):R25. 10.1186/gb-2009-10-3-r25, 2690996, 19261174.
    • (2009) Genome Biol , vol.10 , Issue.3
    • Langmead, B.1    Trapnell, C.2    Pop, M.3    Salzberg, S.L.4
  • 72
    • 84859210032
    • Fast gapped-read alignment with Bowtie
    • 10.1038/nmeth.1923, 3322381, 22388286
    • Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie. Nature Methods 2012, 9:357-359. 10.1038/nmeth.1923, 3322381, 22388286.
    • (2012) Nature Methods , vol.9 , pp. 357-359
    • Langmead, B.1    Salzberg, S.L.2
  • 73
    • 67649884743
    • Fast and accurate short read alignment with Burrows-Wheeler transform
    • 10.1093/bioinformatics/btp324, 2705234, 19451168
    • Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 2009, 25(14):1754-1760. 10.1093/bioinformatics/btp324, 2705234, 19451168.
    • (2009) Bioinformatics , vol.25 , Issue.14 , pp. 1754-1760
    • Li, H.1    Durbin, R.2
  • 74
    • 77949587649
    • Fast and accurate long-read alignment with Burrows-Wheeler transform
    • 10.1093/bioinformatics/btp698, 2828108, 20080505
    • Li H, Durbin R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics 2010, 26(5):589-595. 10.1093/bioinformatics/btp698, 2828108, 20080505.
    • (2010) Bioinformatics , vol.26 , Issue.5 , pp. 589-595
    • Li, H.1    Durbin, R.2
  • 75
    • 67650711615
    • SOAP2: an improved ultrafast tool for short read alignment
    • 10.1093/bioinformatics/btp336, 19497933
    • Li R, Yu C, Li Y, Lam T-W, Yiu S-M, Kristiansen K, Wang J. SOAP2: an improved ultrafast tool for short read alignment. Bioinformatics 2009, 25(15):1966-1967. 10.1093/bioinformatics/btp336, 19497933.
    • (2009) Bioinformatics , vol.25 , Issue.15 , pp. 1966-1967
    • Li, R.1    Yu, C.2    Li, Y.3    Lam, T.-W.4    Yiu, S.-M.5    Kristiansen, K.6    Wang, J.7
  • 76
    • 84870837088
    • The GEM mapper: fast, accurate and versatile alignment by filtration
    • 10.1038/nmeth.2221, 23103880
    • Marco-Sola S, Sammeth M, Guigó R, Ribeca P. The GEM mapper: fast, accurate and versatile alignment by filtration. Nat Methods 2012, 9(12):1185-1188. 10.1038/nmeth.2221, 23103880.
    • (2012) Nat Methods , vol.9 , Issue.12 , pp. 1185-1188
    • Marco-Sola, S.1    Sammeth, M.2    Guigó, R.3    Ribeca, P.4
  • 77
    • 35449006300
    • Fast BWT in small space by blockwise suffix sorting
    • Kärkkäinen J. Fast BWT in small space by blockwise suffix sorting. Theor Comput Sci 2007, 387:249-257.
    • (2007) Theor Comput Sci , vol.387 , pp. 249-257
    • Kärkkäinen, J.1
  • 78
    • 84942303205
    • Lightweight data indexing and compression in external memory
    • Ferragina P, Gagie T, Manzini G. Lightweight data indexing and compression in external memory. Algorithmica 2012, 63(3):707-730.
    • (2012) Algorithmica , vol.63 , Issue.3 , pp. 707-730
    • Ferragina, P.1    Gagie, T.2    Manzini, G.3
  • 79
    • 57749195712
    • RNA-Seq: a revolutionary tool for transcriptomics
    • 10.1038/nrg2484, 2949280, 19015660
    • Wang Z, Gerstein M, Snyder M. RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet 2009, 10(1):57-63. 10.1038/nrg2484, 2949280, 19015660.
    • (2009) Nat Rev Genet , vol.10 , Issue.1 , pp. 57-63
    • Wang, Z.1    Gerstein, M.2    Snyder, M.3
  • 80
    • 65449136284
    • TopHat: discovering splice junctions with RNA-Seq
    • 10.1093/bioinformatics/btp120, 2672628, 19289445
    • Trapnell C, Pachter L, Salzberg SL. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics 2009, 25(9):1105-1111. 10.1093/bioinformatics/btp120, 2672628, 19289445.
    • (2009) Bioinformatics , vol.25 , Issue.9 , pp. 1105-1111
    • Trapnell, C.1    Pachter, L.2    Salzberg, S.L.3
  • 81
    • 84875292779
    • CRAC: an integrated approach to the analysis of RNA-seq reads
    • 10.1186/gb-2013-14-3-r30, 23537109
    • Rivals E. CRAC: an integrated approach to the analysis of RNA-seq reads. Genome Biol 2013, 14(3):R30. 10.1186/gb-2013-14-3-r30, 23537109.
    • (2013) Genome Biol , vol.14 , Issue.3
    • Rivals, E.1
  • 82
    • 84888032311
    • Methods to study splicing from high-throughput RNA Sequencing data
    • Publicly available preprint arXiv:1304.5952v1
    • Alamancos GP, Agirre E, Eyras E. Methods to study splicing from high-throughput RNA Sequencing data. Publicly available preprint arXiv:1304.5952v1.
    • Alamancos, G.P.1    Agirre, E.2    Eyras, E.3
  • 83
    • 84864119729
    • Exploring single-sample SNP and INDEL calling with whole-genome de novo assembly
    • 10.1093/bioinformatics/bts280, 3389770, 22569178
    • Li H. Exploring single-sample SNP and INDEL calling with whole-genome de novo assembly. Bioinformatics 2012, 28(14):1838-1844. 10.1093/bioinformatics/bts280, 3389770, 22569178.
    • (2012) Bioinformatics , vol.28 , Issue.14 , pp. 1838-1844
    • Li, H.1
  • 86
    • 84907874378
    • Optimized succinct data structures for massive data
    • doi: 10.1002/spe.2198
    • Gog S, Petri M. Optimized succinct data structures for massive data. Softw Pract Exp 2013, doi: 10.1002/spe.2198.
    • (2013) Softw Pract Exp
    • Gog, S.1    Petri, M.2
  • 87
    • 84863702145
    • Compressive genomics
    • 10.1038/nbt.2241, 22781691
    • Loh P-R, Baym M, Berger B. Compressive genomics. Nat Biotechnol 2012, 30(7):627-630. 10.1038/nbt.2241, 22781691.
    • (2012) Nat Biotechnol , vol.30 , Issue.7 , pp. 627-630
    • Loh, P.-R.1    Baym, M.2    Berger, B.3
  • 89
    • 0036226603
    • BLAT-the BLAST-like alignment tool
    • 187518, 11932250
    • Kent WJ. BLAT-the BLAST-like alignment tool. Genome Res 2002, 12(4):656-664. 187518, 11932250.
    • (2002) Genome Res , vol.12 , Issue.4 , pp. 656-664
    • Kent, W.J.1
  • 91
    • 43149115851
    • Velvet: algorithms for de novo short read assembly using de Bruijn graphs
    • 10.1101/gr.074492.107, 2336801, 18349386
    • Zerbino DR, Birney E. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res 2008, 18(5):821-829. 10.1101/gr.074492.107, 2336801, 18349386.
    • (2008) Genome Res , vol.18 , Issue.5 , pp. 821-829
    • Zerbino, D.R.1    Birney, E.2
  • 92
    • 66449136667
    • ABySS: A parallel assembler for short read sequence data
    • 10.1101/gr.089532.108, 2694472, 19251739
    • Simpson JT, Wong K, Jackman SD, Schein JE, Jones SJM, Birol I. ABySS: A parallel assembler for short read sequence data. Genome Res 2009, 19(6):1117-1123. 10.1101/gr.089532.108, 2694472, 19251739.
    • (2009) Genome Res , vol.19 , Issue.6 , pp. 1117-1123
    • Simpson, J.T.1    Wong, K.2    Jackman, S.D.3    Schein, J.E.4    Jones, S.J.M.5    Birol, I.6
  • 93
    • 78650087192
    • A genome alignment algorithm based on compression
    • 10.1186/1471-2105-11-599, 3022628, 21159205
    • Cao MD, Dix TI, Allison L. A genome alignment algorithm based on compression. BMC Bioinformatics 2010, 11(1):599. 10.1186/1471-2105-11-599, 3022628, 21159205.
    • (2010) BMC Bioinformatics , vol.11 , Issue.1 , pp. 599
    • Cao, M.D.1    Dix, T.I.2    Allison, L.3
  • 94
    • 84859770226
    • Rapid identification of nonhuman sequences in high throughput sequencing data sets
    • 10.1093/bioinformatics/bts100, 3324519, 22377895
    • Bhaduri A, Qu K, Lee CS, Ungewickell A, Khavari P. Rapid identification of nonhuman sequences in high throughput sequencing data sets. Bioinformatics 2012, 28(8):1174-1175. 10.1093/bioinformatics/bts100, 3324519, 22377895.
    • (2012) Bioinformatics , vol.28 , Issue.8 , pp. 1174-1175
    • Bhaduri, A.1    Qu, K.2    Lee, C.S.3    Ungewickell, A.4    Khavari, P.5
  • 95
    • 34547753523
    • Compression-based classification of biological sequences and structures via the universal similarity metric: experimental assessment
    • 10.1186/1471-2105-8-252, 1939857, 17629909
    • Ferragina P, Giancarlo R, Greco V, Manzini G, Valiente G. Compression-based classification of biological sequences and structures via the universal similarity metric: experimental assessment. BMC Bioinformatics 2007, 8:252. 10.1186/1471-2105-8-252, 1939857, 17629909.
    • (2007) BMC Bioinformatics , vol.8 , pp. 252
    • Ferragina, P.1    Giancarlo, R.2    Greco, V.3    Manzini, G.4    Valiente, G.5
  • 97
    • 84859457621
    • A lossy compression technique enabling duplication-aware sequence alignment
    • Freschi V, Bogliolo A. A lossy compression technique enabling duplication-aware sequence alignment. Evol Bioinformatics 2012, 8:171-180.
    • (2012) Evol Bioinformatics , vol.8 , pp. 171-180
    • Freschi, V.1    Bogliolo, A.2
  • 98
    • 84888069845
    • HiSeq 2500 system user guide
    • Illumina
    • Illumina HiSeq 2500 system user guide. 2012, [http://supportres.illumina.com/documents/myillumina/223bf628-0b46-409f-aa3d-4f3495fe4f69/hiseq2500_ug_15035786_ a_public.pdf], Illumina.
    • (2012)
  • 99
    • 84888022587
    • New algorithms increase computing efficiency for IGN whole-genome analysis
    • Illumina
    • Illumina New algorithms increase computing efficiency for IGN whole-genome analysis. 2013, [http://res.illumina.com/documents/products/technotes/technote_ign_isaac_software.pdf], Illumina.
    • (2013)


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.