메뉴 건너뛰기




Volumn 59, Issue 10, 2013, Pages 6273-6289

Information theory of DNA shotgun sequencing

Author keywords

de novo assembly; DNA sequencing; information theory

Indexed keywords

ASSEMBLY ALGORITHM; ASYMPTOTIC LIMITS; BIOLOGY AND MEDICINE; CRITICAL PHENOMENON; DE NOVO ASSEMBLIES; DNA SEQUENCING; IMPACT OF NOISE; STATISTICAL MODELING;

EID: 84884382382     PISSN: 00189448     EISSN: None     Source Type: Journal    
DOI: 10.1109/TIT.2013.2270273     Document Type: Conference Paper
Times cited : (125)

References (32)
  • 1
    • 84972496740 scopus 로고
    • Poisson approximation and the Chen-Stein method
    • R. Arratia, L. Goldstein, and L. Gordon, "Poisson approximation and the Chen-Stein method," Statist. Sci., vol. 5, no. 4, pp. 403-434, 1990.
    • (1990) Statist. Sci , vol.5 , Issue.4 , pp. 403-434
    • Arratia, R.1    Goldstein, L.2    Gordon, L.3
  • 2
    • 0029792949 scopus 로고    scopus 로고
    • Poisson process approximation for sequence repeats, and sequencing by hybridization
    • R. Arratia, D. Martin, G. Reinert, and M. S. Waterman, "Poisson process approximation for sequence repeats, and sequencing by hybridization," J. Comput. Biol., vol. 3, pp. 425-463, 1996. (Pubitemid 26337809)
    • (1996) Journal of Computational Biology , vol.3 , Issue.3 , pp. 425-463
    • Arratia, R.1
  • 5
    • 35948929094 scopus 로고    scopus 로고
    • SHARCGS, a fast and highly accurate short-read assembly algorithm for de novo genomic sequencing
    • DOI 10.1101/gr.6435207
    • C. Dohm, C. Lottaz, T. Borodina, and H. Himmelbauer, "SHARCGS: A fast and highly accurate short-read assembly algorithm for de novo genomic sequencing," Genome Res., vol. 17, pp. 1697-1706, 2007. (Pubitemid 350074864)
    • (2007) Genome Research , vol.17 , Issue.11 , pp. 1697-1706
    • Dohm, J.C.1    Lottaz, C.2    Borodina, T.3    Himmelbauer, H.4
  • 6
    • 0028457460 scopus 로고
    • The probability of unique solutions of sequencing by hybridization
    • M. Dyer, A. Frieze, and S. Suen, "The probability of unique solutions of sequencing by hybridization," J. Comput. Biol., vol. 1, no. 2, pp. 105-110, 1994.
    • (1994) J. Comput. Biol , vol.1 , Issue.2 , pp. 105-110
    • Dyer, M.1    Frieze, A.2    Suen, S.3
  • 7
    • 0010835479 scopus 로고    scopus 로고
    • Greedy algorithms for the shortest common superstring that are asymptotically optimal
    • A. Frieze and W. Szpankowski, "Greedy algorithms for the shortest common superstring that are asymptotically optimal," Algorithmica, vol. 21, no. 1, pp. 21-36, 1998. (Pubitemid 128664051)
    • (1998) Algorithmica (New York) , vol.21 , Issue.1 , pp. 21-36
    • Frieze, A.1    Szpankowski, W.2
  • 8
    • 77950393797 scopus 로고
    • Ph.D. dissertation Princeton Univ., Princeton, NJ, USA
    • J. K. Gallant, "String compression algorithms," Ph.D. dissertation, Princeton Univ., Princeton, NJ, USA, 1982.
    • (1982) String Compression Algorithms
    • Gallant, J.K.1
  • 9
    • 0032849859 scopus 로고    scopus 로고
    • CAP3: A DNA sequence assembly program
    • DOI 10.1101/gr.9.9.868
    • X. Huang and A. Madan, "CAP3: A DNA sequence assembly program," Genome Res., vol. 9, no. 9, pp. 868-877, 1999. (Pubitemid 29488446)
    • (1999) Genome Research , vol.9 , Issue.9 , pp. 868-877
    • Huang, X.1    Madan, A.2
  • 10
    • 0029312687 scopus 로고
    • A new algorithm for DNA sequence assembly
    • M. S.Waterman and R. M. Idury, "A new algorithm for DNA sequence assembly," J. Comput. Biol., vol. 2, pp. 291-306, 1995.
    • (1995) J. Comput. Biol , vol.2 , pp. 291-306
    • Waterman, M.S.1    Idury, R.M.2
  • 12
    • 9944232572 scopus 로고    scopus 로고
    • The greedy algorithm for shortest superstrings
    • H. Kaplan and N. Shafrir, "The greedy algorithm for shortest superstrings," Inf. Process. Lett., vol. 93, no. 1, pp. 13-17, 2005.
    • (2005) Inf. Process. Lett , vol.93 , Issue.1 , pp. 13-17
    • Kaplan, H.1    Shafrir, N.2
  • 13
    • 0023988195 scopus 로고
    • Genomic mapping by fingerprinting random clones: A mathematical analysis
    • E. S. Lander and M. S. Waterman, "Genomic mapping by fingerprinting random clones: A mathematical analysis," Genomics, vol. 2, no. 3, pp. 231-239, 1988.
    • (1988) Genomics , vol.2 , Issue.3 , pp. 231-239
    • Lander, E.S.1    Waterman, M.S.2
  • 14
    • 0025720355 scopus 로고
    • Towards a DNA sequencing theory (learning a string)
    • M. Li, "Towards a DNA sequencing theory (learning a string)," Found. Comput. Sci., vol. 1, pp. 125-134, Oct. 1990. (Pubitemid 21663289)
    • (1991) IEEE Transactions on Industry Applications , vol.27 , Issue.1 PART 1 , pp. 125-134
    • Li Ming1
  • 15
    • 70349928668 scopus 로고    scopus 로고
    • Why greed works for shortest common superstring problem
    • B. Ma, "Why greed works for shortest common superstring problem," Theor. Comput. Sci., vol. 410, no. 51, pp. 5374-5381, 2009.
    • (2009) Theor. Comput. Sci , vol.410 , Issue.51 , pp. 5374-5381
    • Ma, B.1
  • 16
    • 70349204183 scopus 로고    scopus 로고
    • Maximum likelihood genome assembly
    • P. Medvedev and M. Brudno, "Maximum likelihood genome assembly," J. Comput. Biol., vol. 16, no. 8, pp. 1101-1116, 2009.
    • (2009) J. Comput. Biol , vol.16 , Issue.8 , pp. 1101-1116
    • Medvedev, P.1    Brudno, M.2
  • 17
    • 77952886150 scopus 로고    scopus 로고
    • Assembly algorithms for next-generation sequencing data
    • J.Miller, S. Koren, and G. Sutton, "Assembly algorithms for next-generation sequencing data," Genomics, vol. 95, pp. 315-327, 2010.
    • Genomics , vol.95 , Issue.2010 , pp. 315-327
    • Miller, J.1    Koren, S.2    Sutton, G.3
  • 19
    • 77957287342 scopus 로고    scopus 로고
    • De novo assembly of short sequence reads
    • K. Paszkiewicz and D. J. Studholme, "De novo assembly of short sequence reads," Brief. Bioinformat., vol. 11, no. 5, pp. 457-472, 2010.
    • (2010) Brief. Bioinformat , vol.11 , Issue.5 , pp. 457-472
    • Paszkiewicz, K.1    Studholme, D.J.2
  • 21
    • 67449095888 scopus 로고    scopus 로고
    • Genome assembly reborn: Recent computational challenges
    • M. Pop, "Genome assembly reborn: Recent computational challenges," Brief. Bioinformat., vol. 10, no. 4, pp. 354-366, 2009.
    • (2009) Brief. Bioinformat , vol.10 , Issue.4 , pp. 354-366
    • Pop, M.1
  • 22
    • 0035333562 scopus 로고    scopus 로고
    • Rényi's divergence and entropy rates for finite alphabet Markov sources
    • DOI 10.1109/18.923736, PII S001894480102853X
    • Z. Rached, F. Alajaji, and L. L. Campbell, "Renyi's divergence and entropy rates for finite alphabet Markov sources," IEEE Trans. Inf. Theory, vol. 47, no. 4, pp. 1553-1561, May 2001. (Pubitemid 32508494)
    • (2001) IEEE Transactions on Information Theory , vol.47 , Issue.4 , pp. 1553-1561
    • Rached, Z.1    Alajaji, F.2    Campbell, L.L.3
  • 24
    • 0017681196 scopus 로고
    • DNA sequencing with chain-terminating inhibitors
    • F. Sanger, S. Nicklen, and A. R. Coulson, "DNA sequencing with chain-terminating inhibitors," Proc. Nat. Acad. Sci. USA, vol. 74, no. 12, pp. 5463-5467, 1977.
    • (1977) Proc. Nat. Acad. Sci. USA , vol.74 , Issue.12 , pp. 5463-5467
    • Sanger, F.1    Nicklen, S.2    Coulson, A.R.3
  • 25
    • 0001899550 scopus 로고
    • TIGR Assembler: A new tool for assembling large shotgun sequencing projects
    • G. G. Sutton, O. White, M. D. Adams, and A. Kerlavage, "TIGR Assembler: A new tool for assembling large shotgun sequencing projects," Genome Sci. Technol., vol. 1, pp. 9-19, 1995.
    • (1995) Genome Sci. Technol , vol.1 , pp. 9-19
    • Sutton, G.G.1    White, O.2    Adams, M.D.3    Kerlavage, A.4
  • 26
    • 0024749527 scopus 로고
    • Approximation algorithms for the shortest common superstring problem
    • J. S. Turner, "Approximation algorithms for the shortest common superstring problem," Inf. Comput., vol. 83, no. 1, pp. 1-20, 1989.
    • (1989) Inf. Comput , vol.83 , Issue.1 , pp. 1-20
    • Turner, J.S.1
  • 27
    • 0027113212 scopus 로고
    • Approximate stringmatching with q-grams and maximal matches
    • E. Ukkonen, "Approximate stringmatching with q-grams and maximal matches," Theor. Comput. Sci., vol. 92, no. 1, pp. 191-211, 1992.
    • (1992) Theor. Comput. Sci , vol.92 , Issue.1 , pp. 191-211
    • Ukkonen, E.1
  • 28
    • 52449148946 scopus 로고
    • Linear-time algorithm for finding approximate shortest common superstrings
    • E. Ukkonen, "A linear-time algorithm for finding approximate shortest common superstrings," Algorithmica, vol. 5, pp. 313-323, 1990. (Pubitemid 20717703)
    • (1990) Algorithmica (New York) , vol.5 , Issue.3 , pp. 313-323
    • Ukkonen Esko1
  • 29
    • 33847307402 scopus 로고    scopus 로고
    • Assembling millions of short DNA sequences using SSAKE
    • DOI 10.1093/bioinformatics/btl629
    • R. L.Warren, G. G. Sutton, S. J. Jones, and R. A. Holt, "Assembling millions of short DNA sequences using SSAKE," Bioinformatics, vol. 23, pp. 500-501, 2007. (Pubitemid 46323134)
    • (2007) Bioinformatics , vol.23 , Issue.4 , pp. 500-501
    • Warren, R.L.1    Sutton, G.G.2    Jones, S.J.M.3    Holt, R.A.4
  • 30
    • 84870549091 scopus 로고    scopus 로고
    • DNA Sequencing Theory-Wikipedia Wikipedia
    • DNA Sequencing Theory-Wikipedia, The Free Encyclopedia, Wikipedia, 2012.
    • (2012) The Free Encyclopedia
  • 31
    • 84870549091 scopus 로고    scopus 로고
    • Sequence Assembly-Wikipedia Wikipedia
    • Sequence Assembly-Wikipedia, The Free Encyclopedia, Wikipedia, 2012.
    • (2012) The Free Encyclopedia


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.