



Volume 10, Issue 2, 2012, Pages 58-73

Review of General Algorithmic Features for Genome Assemblers for Next Generation Sequencers

Author keywords

Comparative assembly; De Bruijn graphs; De novo assembly; Genome assembly; Next generation sequencing

Indexed keywords

DNA;

EID: 84863444675     PISSN: 16720229     EISSN: 22103244     Source Type: Journal    
DOI: 10.1016/j.gpb.2012.05.006     Document Type: Review
Times cited: 34

References (90)
  • 1
    • 84863498615
    • Oxford Molecular Group PLC. AssemblyLIGN 1.0. 9. Oxford, United Kingdom: Oxford Molecular Group PLC.
    • Oxford Molecular Group PLC. AssemblyLIGN 1.0. 9. Oxford, United Kingdom: Oxford Molecular Group PLC; 1998.
    • (1998)
  • 3
    • 0030629329
    • Autoassembler sequence assembly software
    • Parker S. Autoassembler sequence assembly software. Methods Mol Biol 1997, 70:107-118.
    • (1997) Methods Mol Biol , vol.70 , pp. 107-118
    • Parker, S.1
  • 5
    • 0004045278
    • Gene Codes Corporation
    • Bromberg C., et al. Sequencher 1995, Gene Codes Corporation.
    • (1995) Sequencher
    • Bromberg, C.1
  • 6
    • 84863498609
    • A fragment assembly project environment. University of Arizona, Dept. of Computer Science
    • Miller S, Myers E. A fragment assembly project environment. University of Arizona, Dept. of Computer Science; 1991.
    • (1991)
    • Miller, S.1    Myers, E.2
  • 7
    • 0025817983
    • An x windows and unix implementation of our sequence analysis package
    • Gleeson T., Staden R. An x windows and unix implementation of our sequence analysis package. Comput Appl Biosci 1991, 7:398.
    • (1991) Comput Appl Biosci , vol.7 , pp. 398
    • Gleeson, T.1    Staden, R.2
  • 8
    • 0028679357
    • A quantitative comparison of DNA sequence assembly programs
    • Miller M.J., Powell J.I. A quantitative comparison of DNA sequence assembly programs. J Comput Biol 1994, 1:257-269.
    • (1994) J Comput Biol , vol.1 , pp. 257-269
    • Miller, M.J.1    Powell, J.I.2
  • 9
    • 0020465927
    • Nucleotide sequence of bacteriophage lambda DNA
    • Sanger F., et al. Nucleotide sequence of bacteriophage lambda DNA. J Mol Biol 1982, 162:729-773.
    • (1982) J Mol Biol , vol.162 , pp. 729-773
    • Sanger, F.1
  • 10
    • 43349084843
    • Assembling genomic DNA sequences with PHRAP. Curr Protoc Bioinformatics;
    • Bastide M, McCombie WR. Assembling genomic DNA sequences with PHRAP. Curr Protoc Bioinformatics; 2007.
    • (2007)
    • Bastide, M.1    McCombie, W.R.2
  • 11
    • 0001899550
    • Tigr assembler: a new tool for assembling large shotgun sequencing projects
    • Sutton G., et al. Tigr assembler: a new tool for assembling large shotgun sequencing projects. Genome Sci Technol 1995, 1:9-19.
    • (1995) Genome Sci Technol , vol.1 , pp. 9-19
    • Sutton, G.1
  • 12
    • 0034708758
    • A whole-genome assembly of drosophila
    • Myers E., et al. A whole-genome assembly of drosophila. Science 2000, 287:2196-2204.
    • (2000) Science , vol.287 , pp. 2196-2204
    • Myers, E.1
  • 13
    • 0036144823
    • Arachne: a whole-genome shotgun assembler
    • Batzoglou S., et al. Arachne: a whole-genome shotgun assembler. Genome Res 2002, 12:177-189.
    • (2002) Genome Res , vol.12 , pp. 177-189
    • Batzoglou, S.1
  • 14
    • 0032849859
    • Cap3: a DNA sequence assembly program
    • Huang X., Madan A. Cap3: a DNA sequence assembly program. Genome Res 1999, 9:868-877.
    • (1999) Genome Res , vol.9 , pp. 868-877
    • Huang, X.1    Madan, A.2
  • 15
    • 0036644865
    • Genome sequence assembly: algorithms and issues
    • Pop M. Genome sequence assembly: algorithms and issues. Computer 2002, 47-54.
    • (2002) Computer , pp. 47-54
    • Pop, M.1
  • 16
    • 33846241220
    • Spoligotype signatures in the mycobacterium tuberculosis complex
    • Streicher E.M., et al. Spoligotype signatures in the mycobacterium tuberculosis complex. J Clin Microbiol 2007, 45:237-240.
    • (2007) J Clin Microbiol , vol.45 , pp. 237-240
    • Streicher, E.M.1
  • 17
    • 0034791388
    • Spoligotype diversity of mycobacterium bovis strains isolated in france from 1979 to 2000
    • Haddad N., et al. Spoligotype diversity of mycobacterium bovis strains isolated in france from 1979 to 2000. J Clin Microbiol 2001, 39:3623-3632.
    • (2001) J Clin Microbiol , vol.39 , pp. 3623-3632
    • Haddad, N.1
  • 18
    • 0035345996
    • Spoligotype database of mycobacterium tuberculosis: biogeographic distribution of shared types and epidemiologic and phylogenetic perspectives
    • Sola C., et al. Spoligotype database of mycobacterium tuberculosis: biogeographic distribution of shared types and epidemiologic and phylogenetic perspectives. Emerg Infect Dis 2001, 7:390-396.
    • (2001) Emerg Infect Dis , vol.7 , pp. 390-396
    • Sola, C.1
  • 19
    • 47049111621
    • Spoligotype diversity of mycobacterium bovis and mycobacterium caprae animal isolates
    • Duarte E.L., et al. Spoligotype diversity of mycobacterium bovis and mycobacterium caprae animal isolates. Vet Microbiol 2008, 130:415-421.
    • (2008) Vet Microbiol , vol.130 , pp. 415-421
    • Duarte, E.L.1
  • 20
    • 0033841143
    • Use of spoligotype analysis to detect laboratory cross-contamination
    • Nivin B., et al. Use of spoligotype analysis to detect laboratory cross-contamination. Infect Control Hosp Epidemiol 2000, 21:525-527.
    • (2000) Infect Control Hosp Epidemiol , vol.21 , pp. 525-527
    • Nivin, B.1
  • 21
    • 64149123778
    • Next-generation sequencing: from basic research to diagnostics
    • Voelkerding K. Next-generation sequencing: from basic research to diagnostics. Clin Chem 2009, 55:641-658.
    • (2009) Clin Chem , vol.55 , pp. 641-658
    • Voelkerding, K.1
  • 22
    • 52949096084
    • Next-generation dna sequencing methods
    • Mardis E. Next-generation dna sequencing methods. Annu Rev Genomics Hum Genet 2008, 9:387-402.
    • (2008) Annu Rev Genomics Hum Genet , vol.9 , pp. 387-402
    • Mardis, E.1
  • 23
    • 53649106195
    • Next-generation DNA sequencing
    • Shendure J., Ji H. Next-generation DNA sequencing. Nat Biotechnol 2008, 26:1135-1145.
    • (2008) Nat Biotechnol , vol.26 , pp. 1135-1145
    • Shendure, J.1    Ji, H.2
  • 24
    • 77956279237
    • Assembly of large genomes using second-generation sequencing
    • Schatz M., et al. Assembly of large genomes using second-generation sequencing. Genome Res 2010, 20:1165-1173.
    • (2010) Genome Res , vol.20 , pp. 1165-1173
    • Schatz, M.1
  • 25
    • 67449095888
    • Genome assembly reborn: recent computational challenges
    • Pop M. Genome assembly reborn: recent computational challenges. Brief Bioinform 2009, 10:354-366.
    • (2009) Brief Bioinform , vol.10 , pp. 354-366
    • Pop, M.1
  • 26
    • 84874189501
    • Introduction to algorithms. Chennai: MIT Press and McGraw-Hill Book Company
    • Cormen TH et al. Introduction to algorithms, vol. 7. Chennai: MIT Press and McGraw-Hill Book Company; 1976. p. 1162-71.
    • (1976) , vol.7 , pp. 1162-71
    • Cormen, T.H.1
  • 27
    • 84874193809
    • Supplementary information section: review of general algorithmic features for genome assemblers for next generation sequencers.
    • Wajid B, Serpedin E. Supplementary information section: review of general algorithmic features for genome assemblers for next generation sequencers. 2011. http://www.dl.dropbox.com/u/57205928/Appendix_Review_Paper_Bilal_Erchin.pdf.
    • (2011)
    • Wajid, B.1    Serpedin, E.2
  • 37
    • 0029312158
    • Toward simplifying and accurately formulating fragment assembly
    • Myers E. Toward simplifying and accurately formulating fragment assembly. J Comput Biol 1995, 2:275-290.
    • (1995) J Comput Biol , vol.2 , pp. 275-290
    • Myers, E.1
  • 38
    • 77952886150
    • Assembly algorithms for next-generation sequencing data
    • Miller J., et al. Assembly algorithms for next-generation sequencing data. Genomics 2010, 95:315-327.
    • (2010) Genomics , vol.95 , pp. 315-327
    • Miller, J.1
  • 39
    • 77951813504
    • Genome assembly quality: assessment and improvement using the neutral indel model
    • Meader S., et al. Genome assembly quality: assessment and improvement using the neutral indel model. Genome Res 2010, 20:675-684.
    • (2010) Genome Res , vol.20 , pp. 675-684
    • Meader, S.1
  • 40
    • 78650909427
    • Limitations of next-generation genome sequence assembly
    • Alkan C., et al. Limitations of next-generation genome sequence assembly. Nat Methods 2010, 8:61-65.
    • (2010) Nat Methods , vol.8 , pp. 61-65
    • Alkan, C.1
  • 42
    • 84863452584
    • Genome assembly, techniques.
    • Marcais G. Genome assembly, techniques; 2011.
    • (2011)
    • Marcais, G.1
  • 44
    • 77956479670
    • An algorithm for automated closure during assembly
    • Koren S., et al. An algorithm for automated closure during assembly. BMC bioinformatics 2010, 11:457.
    • (2010) BMC bioinformatics , vol.11 , pp. 457
    • Koren, S.1
  • 45
    • 57249105124
    • Aggressive assembly of pyrosequencing reads with mates
    • Miller J.R., et al. Aggressive assembly of pyrosequencing reads with mates. Bioinformatics 2008, 24:2818-2824.
    • (2008) Bioinformatics , vol.24 , pp. 2818-2824
    • Miller, J.R.1
  • 50
    • 0035859921
    • An Eulerian path approach to DNA fragment assembly
    • Pevzner P., et al. An Eulerian path approach to DNA fragment assembly. Proc Natl Acad Sci USA 2001, 98:9748-9753.
    • (2001) Proc Natl Acad Sci USA , vol.98 , pp. 9748-9753
    • Pevzner, P.1
  • 51
    • 4644275238
    • De novo repeat classification and fragment assembly
    • Pevzner P., et al. De novo repeat classification and fragment assembly. Genome Res 2004, 14:1786-1796.
    • (2004) Genome Res , vol.14 , pp. 1786-1796
    • Pevzner, P.1
  • 52
    • 39049156065
    • Short read fragment assembly of bacterial genomes
    • Chaisson M., Pevzner P. Short read fragment assembly of bacterial genomes. Genome Res 2008, 18:324-330.
    • (2008) Genome Res , vol.18 , pp. 324-330
    • Chaisson, M.1    Pevzner, P.2
  • 53
    • 59949093527
    • De novo fragment assembly with short mate paired reads: does the read length matter?
    • Chaisson M.J., et al. De novo fragment assembly with short mate paired reads: does the read length matter?. Genome Res 2009, 19:336-346.
    • (2009) Genome Res , vol.19 , pp. 336-346
    • Chaisson, M.J.1
  • 54
    • 8744312854
    • A novel method for multiple alignment of sequences with repeated and shuffled elements
    • Raphael B., et al. A novel method for multiple alignment of sequences with repeated and shuffled elements. Genome Res 2004, 14:2336-2346.
    • (2004) Genome Res , vol.14 , pp. 2336-2346
    • Raphael, B.1
  • 56
    • 33745014559
    • Identifying repeat domains in large genomes
    • Zhi D., et al. Identifying repeat domains in large genomes. Genome Biol 2006, 7:R7.
    • (2006) Genome Biol , vol.7
    • Zhi, D.1
  • 57
    • 84863452588
    • Algorithmic graph theory. Prentice Hall.
    • McHugh JA. Algorithmic graph theory. Prentice Hall; 1990.
    • (1990)
    • McHugh, J.A.1
  • 60
    • 84863452593
    • Discrete mathematics, some notes. Technical reports (CIS).
    • Gallier JH. Discrete mathematics, some notes. Technical reports (CIS); 2009. p. 897.
    • (2009) , pp. 897
    • Gallier, J.H.1
  • 61
    • 43149115851
    • Velvet: algorithms for de novo short read assembly using de-bruijn graphs
    • Zerbino D.R., Birney E. Velvet: algorithms for de novo short read assembly using de-bruijn graphs. Genome Res 2008, 18:821-829.
    • (2008) Genome Res , vol.18 , pp. 821-829
    • Zerbino, D.R.1    Birney, E.2
  • 67
    • 43149086380
    • Allpaths: de novo assembly of whole-genome shotgun microreads
    • Butler J., et al. Allpaths: de novo assembly of whole-genome shotgun microreads. Genome Res 2008, 18:810-820.
    • (2008) Genome Res , vol.18 , pp. 810-820
    • Butler, J.1
  • 68
    • 79952178131
    • High-quality draft assemblies of mammalian genomes from massively parallel sequence data
    • Gnerre S., et al. High-quality draft assemblies of mammalian genomes from massively parallel sequence data. Proc Natl Acad Sci USA 2011, 108:1513-1518.
    • (2011) Proc Natl Acad Sci USA , vol.108 , pp. 1513-1518
    • Gnerre, S.1
  • 69
    • 43149085041
    • De novo bacterial genome sequencing: millions of very short reads assembled on a desktop computer
    • Hernandez D., et al. De novo bacterial genome sequencing: millions of very short reads assembled on a desktop computer. Genome Res 2008, 18:802-809.
    • (2008) Genome Res , vol.18 , pp. 802-809
    • Hernandez, D.1
  • 70
    • 0027681165
    • Suffix arrays: a new method for on-line string searches
    • Manber U., Myers G. Suffix arrays: a new method for on-line string searches. SIAM J Comput 1993, 22:935-948.
    • (1993) SIAM J Comput , vol.22 , pp. 935-948
    • Manber, U.1    Myers, G.2
  • 71
    • 27544497879
    • The fragment assembly string graph
    • Myers E.W. The fragment assembly string graph. Bioinformatics 2005, 21(Suppl 2):ii79-ii85.
    • (2005) Bioinformatics , vol.21 , Issue.SUPPL. 2
    • Myers, E.W.1
  • 72
    • 69949167273
    • A fast hybrid short read fragment assembly algorithm
    • Schmidt B., et al. A fast hybrid short read fragment assembly algorithm. Bioinformatics 2009, 25:2279-2280.
    • (2009) Bioinformatics , vol.25 , pp. 2279-2280
    • Schmidt, B.1
  • 73
    • 79952655567
    • A practical comparison of de novo genome assembly software tools for next-generation sequencing technologies
    • Zhang W., et al. A practical comparison of de novo genome assembly software tools for next-generation sequencing technologies. PLoS One 2011, 6:e17915.
    • (2011) PLoS One , vol.6
    • Zhang, W.1
  • 74
    • 33847307402
    • Assembling millions of short dna sequences using ssake
    • Warren R.L., et al. Assembling millions of short dna sequences using ssake. Bioinformatics 2007, 23:500-501.
    • (2007) Bioinformatics , vol.23 , pp. 500-501
    • Warren, R.L.1
  • 75
    • 36448948250
    • Extending assembly of short dna sequences to handle error
    • Jeck W.R., et al. Extending assembly of short dna sequences to handle error. Bioinformatics 2007, 23:2942-2944.
    • (2007) Bioinformatics , vol.23 , pp. 2942-2944
    • Jeck, W.R.1
  • 76
    • 62549098185
    • Qsra-a quality-value guided de novo short read assembler
    • Bryant D.W., et al. Qsra-a quality-value guided de novo short read assembler. BMC Bioinformatics 2009, 10:69.
    • (2009) BMC Bioinformatics , vol.10 , pp. 69
    • Bryant, D.W.1
  • 77
    • 70350012234
    • Assisted assembly: how to improve a de novo genome assembly by using related species
    • Gnerre S., et al. Assisted assembly: how to improve a de novo genome assembly by using related species. Genome Biol 2009, 10:R88.
    • (2009) Genome Biol , vol.10
    • Gnerre, S.1
  • 78
    • 16644381328
    • Comparative genome assembly
    • Pop M., et al. Comparative genome assembly. Brief Bioinform 2004, 5:237-248.
    • (2004) Brief Bioinform , vol.5 , pp. 237-248
    • Pop, M.1
  • 79
    • 2942538300
    • Versatile and open software for comparing large genomes
    • Kurtz S., et al. Versatile and open software for comparing large genomes. Genome Biol 2004, 5:R12.
    • (2004) Genome Biol , vol.5
    • Kurtz, S.1
  • 80
    • 0346505466
    • Hierarchical scaffolding with bambus
    • Pop M., et al. Hierarchical scaffolding with bambus. Genome Res 2004, 14:149-159.
    • (2004) Genome Res , vol.14 , pp. 149-159
    • Pop, M.1
  • 81
    • 52949083195
    • Gene-boosted assembly of a novel bacterial genome from very short reads
    • Salzberg S.L., et al. Gene-boosted assembly of a novel bacterial genome from very short reads. PLoS Comput Biol 2008, 4:e1000186.
    • (2008) PLoS Comput Biol , vol.4
    • Salzberg, S.L.1
  • 82
    • 34147132825
    • Identifying bacterial genes and endosymbiont dna with glimmer
    • Delcher A.L., et al. Identifying bacterial genes and endosymbiont dna with glimmer. Bioinformatics 2007, 23:673-679.
    • (2007) Bioinformatics , vol.23 , pp. 673-679
    • Delcher, A.L.1
  • 83
    • 0030801002
    • Gapped blast and psi-blast: a new generation of protein database search programs
    • Altschul S.F., et al. Gapped blast and psi-blast: a new generation of protein database search programs. Nucleic Acids Res 1997, 25:3389-3402.
    • (1997) Nucleic Acids Res , vol.25 , pp. 3389-3402
    • Altschul, S.F.1
  • 84
    • 33846659488
    • Composition-based statistics and translated nucleotide searches: improving the tblastn module of blast
    • Gertz E.M., et al. Composition-based statistics and translated nucleotide searches: improving the tblastn module of blast. BMC Biol 2006, 4:41.
    • (2006) BMC Biol , vol.4 , pp. 41
    • Gertz, E.M.1
  • 85
    • 35948929094
    • Sharcgs, a fast and highly accurate short-read assembly algorithm for de novo genomic sequencing
    • Dohm J.C., et al. Sharcgs, a fast and highly accurate short-read assembly algorithm for de novo genomic sequencing. Genome Res 2007, 17:1697-1706.
    • (2007) Genome Res , vol.17 , pp. 1697-1706
    • Dohm, J.C.1
  • 86
    • 79952374428
    • Genovo: de novo assembly for metagenomes
    • Laserson J., Jojic V., Koller D. Genovo: de novo assembly for metagenomes. J Comput Biol 2011, 18:429-443.
    • (2011) J Comput Biol , vol.18 , pp. 429-443
    • Laserson, J.1    Jojic, V.2    Koller, D.3
  • 87
    • 77955801615
    • Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences
    • Goecks J., et al. Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences. Genome Biol 2010, 11:R86.
    • (2010) Genome Biol , vol.11
    • Goecks, J.1
  • 89
    • 77956295988
    • The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data
    • McKenna A., et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res 2010, 20:1297-1303.
    • (2010) Genome Res , vol.20 , pp. 1297-1303
    • McKenna, A.1
  • 90
    • 77954492012
    • Cloud computing and the DNA data race
    • Schatz M.C., et al. Cloud computing and the DNA data race. Nat Biotechnol 2010, 28:691-693.
    • (2010) Nat Biotechnol , vol.28 , pp. 691-693
    • Schatz, M.C.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.