메뉴 건너뛰기




Volumn 8394 LNBI, Issue , 2014, Pages 385-399

Traversing the k-mer landscape of NGS read datasets for quality score sparsification

Author keywords

accuracy; compression; quality score; RQS; sparsification; variant calling

Indexed keywords

COMPACTION; DATA HANDLING; GENES; MOLECULAR BIOLOGY;

EID: 84958549972     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-319-05269-4_31     Document Type: Conference Paper
Times cited : (18)

References (28)
  • 2
    • 79951493627 scopus 로고    scopus 로고
    • On the future of genomic data
    • Kahn, S.D.: On the future of genomic data. Science 331(6018), 728-729 (2011
    • (2011) Science , vol.331 , Issue.6018 , pp. 728-729
    • Kahn, S.D.1
  • 5
    • 84871199924 scopus 로고    scopus 로고
    • Compression of next-generation sequencing reads aided by highly efficient de novo assembly
    • Jones, D.C., Ruzzo, W.L., Peng, X., Katze, M.G.: Compression of next-generation sequencing reads aided by highly efficient de novo assembly. Nucleic Acids Research 40(22), e171 (2012
    • (2012) Nucleic Acids Research , vol.40 , Issue.22
    • Jones, D.C.1    Ruzzo, W.L.2    Peng, X.3    Katze, M.G.4
  • 6
    • 79955554401 scopus 로고    scopus 로고
    • Efficient storage of high throughput DNA sequencing data using reference-based compression
    • Fritz, M.H.Y., Leinonen, R., Cochrane, G., Birney, E.: Efficient storage of high throughput DNA sequencing data using reference-based compression. Genome Research 21, 734-740 (2011
    • (2011) Genome Research , vol.21 , pp. 734-740
    • Fritz, M.H.Y.1    Leinonen, R.2    Cochrane, G.3    Birney, E.4
  • 7
    • 79952580139 scopus 로고    scopus 로고
    • Compression of DNA sequence reads in FASTQ format
    • Deorowicz, S., Grabowski, S.: Compression of DNA sequence reads in FASTQ format. Bioinformatics 27(6), 860-862 (2011
    • (2011) Bioinformatics , vol.27 , Issue.6 , pp. 860-862
    • Deorowicz, S.1    Grabowski, S.2
  • 9
    • 84875363204 scopus 로고    scopus 로고
    • Compression of FASTQ and SAM format sequencing data
    • Bonfield, J.K., Mahoney, M.V.: Compression of FASTQ and SAM format sequencing data. PloS one 8(3), e59190 (2013
    • (2013) PloS One , vol.8 , Issue.3
    • Bonfield, J.K.1    Mahoney, M.V.2
  • 10
    • 84870429157 scopus 로고    scopus 로고
    • Scalce: Boosting sequence compression algorithms using locally consistent encoding
    • Hach, F., Numanagic, I., Alkan, C., Sahinalp, S.C.: SCALCE: Boosting sequence compression algorithms using locally consistent encoding. Bioinformatics 28(23), 3051-3057 (2012
    • (2012) Bioinformatics , vol.28 , Issue.23 , pp. 3051-3057
    • Hach, F.1    Numanagic, I.2    Alkan, C.3    Sahinalp, S.C.4
  • 11
    • 77955886068 scopus 로고    scopus 로고
    • G-SQZ: Compact encoding of genomic sequence and quality data
    • Tembe, W., Lowey, J., Suh, E.: G-SQZ: Compact encoding of genomic sequence and quality data. Bioinformatics 26(17), 2192-2194 (2010
    • (2010) Bioinformatics , vol.26 , Issue.17 , pp. 2192-2194
    • Tembe, W.1    Lowey, J.2    Suh, E.3
  • 12
    • 84871807049 scopus 로고    scopus 로고
    • NGC: Lossless and lossy compression of aligned high-Throughput sequencing data
    • Popitsch, N., von Haeseler, A.: NGC: Lossless and lossy compression of aligned high-Throughput sequencing data. Nucleic Acids Research 41(1), e27 (2013
    • (2013) Nucleic Acids Research , vol.41 , Issue.1
    • Popitsch, N.1    Von Haeseler, A.2
  • 13
    • 84857848401 scopus 로고    scopus 로고
    • Transformations for the compression of FASTQ quality scores of next-generation sequencing data
    • Wan, R., Anh, V.N., Asai, K.: Transformations for the compression of FASTQ quality scores of next-generation sequencing data. Bioinformatics 28(5), 628-635 (2012
    • (2012) Bioinformatics , vol.28 , Issue.5 , pp. 628-635
    • Wan, R.1    Anh, V.N.2    Asai, K.3
  • 14
    • 58349097721 scopus 로고    scopus 로고
    • Human genomes as email attachments
    • Christley, S., Lu, Y., Li, C., Xie, X.: Human genomes as email attachments. Bioinformatics 25(2), 274-275 (2009
    • (2009) Bioinformatics , vol.25 , Issue.2 , pp. 274-275
    • Christley, S.1    Lu, Y.2    Li, C.3    Xie, X.4
  • 15
    • 84891350227 scopus 로고    scopus 로고
    • Adaptive reference-free compression of sequence quality scores
    • Janin, L., Rosone, G., Cox, A.J.: Adaptive reference-free compression of sequence quality scores. Bioinformatics (2013
    • (2013) Bioinformatics
    • Janin, L.1    Rosone, G.2    Cox, A.J.3
  • 16
    • 84975795680 scopus 로고    scopus 로고
    • An integrated map of genetic variation from 1,092 human genomes
    • Consortium, T.G.P.: An integrated map of genetic variation from 1,092 human genomes. Nature 491, 1 (2012
    • (2012) Nature , vol.491 , pp. 1
    • Consortium, T.G.P.1
  • 17
    • 84865992574 scopus 로고    scopus 로고
    • A survey of error-correction methods for next-generation sequencing
    • Yang, X., Chockalingam, S.P., Aluru, S.: A survey of error-correction methods for next-generation sequencing. Briefings in Bioinformatics 14(1), 56-66 (2013
    • (2013) Briefings in Bioinformatics , vol.14 , Issue.1 , pp. 56-66
    • Yang, X.1    Chockalingam, S.P.2    Aluru, S.3
  • 18
    • 81155158421 scopus 로고    scopus 로고
    • Efficient counting of k-mers in DNA sequences using a bloom filter
    • Melsted, P., Pritchard, J.K.: Efficient counting of k-mers in DNA sequences using a bloom filter. BMC Bioinformatics 12(1), 333 (2011
    • (2011) BMC Bioinformatics , vol.12 , Issue.1 , pp. 333
    • Melsted, P.1    Pritchard, J.K.2
  • 19
    • 79952592810 scopus 로고    scopus 로고
    • A fast, lock-free approach for efficient parallel counting of occurrences of k-mers
    • Maŗcais, G., Kingsford, C.: A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics 27(6), 764-770 (2011
    • (2011) Bioinformatics , vol.27 , Issue.6 , pp. 764-770
    • Maŗcais, G.1    Kingsford, C.2
  • 20
    • 78649358717 scopus 로고    scopus 로고
    • Quake: Quality-Aware detection and correction of sequencing errors
    • Kelley, D.R., Schatz, M.C., Salzberg, S.L., et al.: Quake: Quality-Aware detection and correction of sequencing errors. Genome. Biol. 11(11), 116 (2010
    • (2010) Genome. Biol. , vol.11 , Issue.11 , pp. 116
    • Kelley, D.R.1    Schatz, M.C.2    Salzberg, S.L.3
  • 21
    • 84873307492 scopus 로고    scopus 로고
    • Musket: A multistage k-mer spectrum-based error corrector for Illumina sequence data
    • Liu, Y., Schröder, J., Schmidt, B.: Musket: A multistage k-mer spectrum-based error corrector for Illumina sequence data. Bioinformatics 29(3), 308-315 (2013
    • (2013) Bioinformatics , vol.29 , Issue.3 , pp. 308-315
    • Liu, Y.1    Schröder, J.2    Schmidt, B.3
  • 22
    • 84897110598 scopus 로고    scopus 로고
    • RACER: Rapid and accurate correction of errors in reads
    • Ilie, L., Molnar, M.: RACER: Rapid and accurate correction of errors in reads. Bioinformatics 29(19), 2490-2493 (2013
    • (2013) Bioinformatics , vol.29 , Issue.19 , pp. 2490-2493
    • Ilie, L.1    Molnar, M.2
  • 25
    • 77949587649 scopus 로고    scopus 로고
    • Fast and accurate long-read alignment with Burrows-Wheeler transform
    • Li, H., Durbin, R.: Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics 26(5), 589-595 (2010
    • (2010) Bioinformatics , vol.26 , Issue.5 , pp. 589-595
    • Li, H.1    Durbin, R.2
  • 28
    • 84975742565 scopus 로고    scopus 로고
    • A map of human genome variation from population-scale sequencing
    • Consortium, T.G.P.: A map of human genome variation from population-scale sequencing. Nature 467, 1061-1073 (2010
    • (2010) Nature , vol.467 , pp. 1061-1073
    • Consortium, T.G.P.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.