메뉴 건너뛰기




Volumn 11, Issue , 2010, Pages

A genome alignment algorithm based on compression

Author keywords

[No Author keywords available]

Indexed keywords

CONVENTIONAL METHODS; GENETIC INFORMATION; INFORMATION CONTENTS; INFORMATION-THEORETIC APPROACH; LOSSLESS COMPRESSION; MUTUAL INFORMATIONS; OBJECTIVE FUNCTIONS; SEQUENCE ALIGNMENTS;

EID: 78650087192     PISSN: None     EISSN: 14712105     Source Type: Journal    
DOI: 10.1186/1471-2105-11-599     Document Type: Article
Times cited : (9)

References (49)
  • 1
    • 0014757386 scopus 로고
    • A General Method Applicable to the Search for Similarities in the Amino Acid Sequences of Two Proteins
    • 10.1016/0022-2836(70)90057-4, 5420325
    • Needleman SB, Wunsch CD. A General Method Applicable to the Search for Similarities in the Amino Acid Sequences of Two Proteins. Journal of Molecular Biology 1970, 48:443-453. 10.1016/0022-2836(70)90057-4, 5420325.
    • (1970) Journal of Molecular Biology , vol.48 , pp. 443-453
    • Needleman, S.B.1    Wunsch, C.D.2
  • 2
    • 0019887799 scopus 로고
    • Identification of Common Molecular Subsequences
    • 10.1016/0022-2836(81)90087-5, 7265238
    • Smith TF, Waterman MS. Identification of Common Molecular Subsequences. Journal of Molecular Biology 1981, 147:195-147. 10.1016/0022-2836(81)90087-5, 7265238.
    • (1981) Journal of Molecular Biology , vol.147 , pp. 195-1147
    • Smith, T.F.1    Waterman, M.S.2
  • 6
    • 0030801002 scopus 로고    scopus 로고
    • Gapped BLAST and PSI-BLAST: a New Generation of Protein Database Search Programs
    • 10.1093/nar/25.17.3389, 146917, 9254694
    • Altschul SF, Madden T, Schaffer A, Zhang J, Zhang Z, Miller W, Lipman D. Gapped BLAST and PSI-BLAST: a New Generation of Protein Database Search Programs. Nucleic Acids Research 1997, 25(17):3389-3402. 10.1093/nar/25.17.3389, 146917, 9254694., http://nar.oxfordjournals.org/cgi/content/abstract/25/17/3389
    • (1997) Nucleic Acids Research , vol.25 , Issue.17 , pp. 3389-3402
    • Altschul, S.F.1    Madden, T.2    Schaffer, A.3    Zhang, J.4    Zhang, Z.5    Miller, W.6    Lipman, D.7
  • 7
    • 0031732094 scopus 로고    scopus 로고
    • A Computer Program for Aligning a cDNA Sequence with a Genomic DNA Sequence
    • 310774, 9750195
    • Florea L, Hartzell G, Zhang Z, Rubin GM, Miller W. A Computer Program for Aligning a cDNA Sequence with a Genomic DNA Sequence. Genome Research 1998, 8:967-974. 310774, 9750195.
    • (1998) Genome Research , vol.8 , pp. 967-974
    • Florea, L.1    Hartzell, G.2    Zhang, Z.3    Rubin, G.M.4    Miller, W.5
  • 8
    • 0034764307 scopus 로고    scopus 로고
    • SSAHA: A Fast Search Method for Large DNA Databases
    • 10.1101/gr.194201, 311141, 11591649
    • Ning Z, Cox AJ, Mullikin JC. SSAHA: A Fast Search Method for Large DNA Databases. Genome Research 2001, 11(10):1725-1729. 10.1101/gr.194201, 311141, 11591649., http://www.genome.org/cgi/content/abstract/11/10/1725
    • (2001) Genome Research , vol.11 , Issue.10 , pp. 1725-1729
    • Ning, Z.1    Cox, A.J.2    Mullikin, J.C.3
  • 9
    • 0032945593 scopus 로고    scopus 로고
    • DIALIGN 2: Improvement of the Segment-to-segment Approach to Multiple Sequence Alignment
    • 10.1093/bioinformatics/15.3.211, 10222408
    • Morgenstern B. DIALIGN 2: Improvement of the Segment-to-segment Approach to Multiple Sequence Alignment. Bioinformatics 1999, 15:211-218. 10.1093/bioinformatics/15.3.211, 10222408.
    • (1999) Bioinformatics , vol.15 , pp. 211-218
    • Morgenstern, B.1
  • 10
    • 0242470267 scopus 로고    scopus 로고
    • Efficient Multiple Genome Alignment
    • Höhl M, Kurtz S, Ohlebusch E. Efficient Multiple Genome Alignment. Bioinformatics 2002, 18(Suppl. 1):S312-S320., http://www.zbh.uni-hamburg.de/staff/kurtz/papers/HoehKurOhl2002.pdf
    • (2002) Bioinformatics , vol.18 , Issue.SUPPL. 1
    • Höhl, M.1    Kurtz, S.2    Ohlebusch, E.3
  • 11
    • 0036606576 scopus 로고    scopus 로고
    • Fast Algorithms for Large-scale Genome Alignment and Comparison
    • 10.1093/nar/30.11.2478, 117189, 12034836
    • Delcher AL, Phillippy A, Carlton JM, Salzberg SL. Fast Algorithms for Large-scale Genome Alignment and Comparison. Nucleic Acids Research 2002, 30(11):2478-2483. 10.1093/nar/30.11.2478, 117189, 12034836., http://nar.oxfordjournals.org/cgi/content/abstract/30/11/2478
    • (2002) Nucleic Acids Research , vol.30 , Issue.11 , pp. 2478-2483
    • Delcher, A.L.1    Phillippy, A.2    Carlton, J.M.3    Salzberg, S.L.4
  • 12
    • 2942538300 scopus 로고    scopus 로고
    • Versatile and Open Software for Comparing Large Genomes
    • 10.1186/gb-2004-5-2-r12, 395750, 14759262
    • Kurtz S, Phillippy A, Delcher AL, Smoot M, Shumway M, Antonescu C, Salzberg S. Versatile and Open Software for Comparing Large Genomes. Genome Biology 2004, 5(2). 10.1186/gb-2004-5-2-r12, 395750, 14759262., http://genomebiology.com/2004/5/2/R12
    • (2004) Genome Biology , vol.5 , Issue.2
    • Kurtz, S.1    Phillippy, A.2    Delcher, A.L.3    Smoot, M.4    Shumway, M.5    Antonescu, C.6    Salzberg, S.7
  • 14
    • 2942603015 scopus 로고    scopus 로고
    • Fast and Sensitive Multiple Alignment of Large Genomic Sequences
    • 10.1186/1471-2105-4-66, 521198, 14693042
    • Brudno M, Chapman M, Gottgens B, Batzoglou S, Morgenstern B. Fast and Sensitive Multiple Alignment of Large Genomic Sequences. BMC Bioinformatics 2003, 4:66. 10.1186/1471-2105-4-66, 521198, 14693042., http://www.biomedcentral.com/1471-2105/4/66
    • (2003) BMC Bioinformatics , vol.4 , pp. 66
    • Brudno, M.1    Chapman, M.2    Gottgens, B.3    Batzoglou, S.4    Morgenstern, B.5
  • 15
    • 0037270315 scopus 로고    scopus 로고
    • AVID: A Global Alignment Program
    • 10.1101/gr.789803, 430967, 12529311
    • Bray N, Dubchak I, Pachter L. AVID: A Global Alignment Program. Genome Research 2003, 13:97-102. 10.1101/gr.789803, 430967, 12529311.
    • (2003) Genome Research , vol.13 , pp. 97-102
    • Bray, N.1    Dubchak, I.2    Pachter, L.3
  • 16
    • 17244379179 scopus 로고    scopus 로고
    • The Many Faces of Sequence Alignment
    • 10.1093/bib/6.1.6, 15826353
    • Batzoglou S. The Many Faces of Sequence Alignment. Brief Bioinform 2005, 6:6-22. 10.1093/bib/6.1.6, 15826353., http://bib.oxfordjournals.org/cgi/content/abstract/6/1/6
    • (2005) Brief Bioinform , vol.6 , pp. 6-22
    • Batzoglou, S.1
  • 18
    • 0031694427 scopus 로고    scopus 로고
    • An Evaluation of Measures of Synonymous Codon Usage Bias
    • 10.1007/PL00006384, 9732453
    • Comeron JM, Aguade M. An Evaluation of Measures of Synonymous Codon Usage Bias. Journal of Molecular Evolution 1998, 47(3):268-274. 10.1007/PL00006384, 9732453.
    • (1998) Journal of Molecular Evolution , vol.47 , Issue.3 , pp. 268-274
    • Comeron, J.M.1    Aguade, M.2
  • 19
    • 0001514262 scopus 로고
    • Statistics of local complexity in amino acid sequences and sequence databases
    • Wootton JC, Federhen S. Statistics of local complexity in amino acid sequences and sequence databases. Computers & Chemistry 1993, 17(2):149-163., http://www.sciencedirect.com/science/article/B6TFV-44PXMF3-45/2/5ecbb4a876d356f8572bde2b43015788
    • (1993) Computers & Chemistry , vol.17 , Issue.2 , pp. 149-163
    • Wootton, J.C.1    Federhen, S.2
  • 20
    • 0007517571 scopus 로고    scopus 로고
    • Simple sequences of protein and DNA
    • Oxford University Press, Bishop MJ, Rawlings CJ
    • Wootton JC. Simple sequences of protein and DNA. DNA and Protein Sequence Analysis: A Practical Approach 1997, 169-183. Oxford University Press, Bishop MJ, Rawlings CJ.
    • (1997) DNA and Protein Sequence Analysis: A Practical Approach , pp. 169-183
    • Wootton, J.C.1
  • 21
    • 84856043672 scopus 로고
    • A Mathematical Theory of Communication
    • Shannon CE. A Mathematical Theory of Communication. The Bell System Technical Journal 1948, 27:379-423., http://cm.bell-labs.com/cm/ms/what/shannonday/shannon1948.pdf
    • (1948) The Bell System Technical Journal , vol.27 , pp. 379-423
    • Shannon, C.E.1
  • 22
    • 0000107517 scopus 로고
    • An Information Measure for Classification
    • Wallace CS, Boulton DM. An Information Measure for Classification. Computer Journal 1968, 11(2):185-194.
    • (1968) Computer Journal , vol.11 , Issue.2 , pp. 185-194
    • Wallace, C.S.1    Boulton, D.M.2
  • 24
    • 0025116731 scopus 로고
    • Minimum Message Length Encoding and the Comparison of Macromolecules
    • Allison L, Yee CN. Minimum Message Length Encoding and the Comparison of Macromolecules. Bulletin of Mathematical Biology 1990, 52(3):431-452.
    • (1990) Bulletin of Mathematical Biology , vol.52 , Issue.3 , pp. 431-452
    • Allison, L.1    Yee, C.N.2
  • 26
    • 0026786131 scopus 로고
    • Finite-state Models in the Alignment of Macromolecules
    • 10.1007/BF00160262, 1518085
    • Allison L, Wallace CS, Yee CN. Finite-state Models in the Alignment of Macromolecules. Journal of Molecular Evolution 1992, 35:77-89. 10.1007/BF00160262, 1518085.
    • (1992) Journal of Molecular Evolution , vol.35 , pp. 77-89
    • Allison, L.1    Wallace, C.S.2    Yee, C.N.3
  • 27
    • 34547630480 scopus 로고    scopus 로고
    • A Simple Statistical Algorithm for Biological Sequence Compression
    • Cao MD, Dix TI, Allison L, Mears C. A Simple Statistical Algorithm for Biological Sequence Compression. Data Compression Conference 2007, 43-52., http://doi.ieeecomputersociety.org/10.1109/DCC.2007.7
    • (2007) Data Compression Conference , pp. 43-52
    • Cao, M.D.1    Dix, T.I.2    Allison, L.3    Mears, C.4
  • 29
    • 0014516784 scopus 로고
    • The Information Content of a Multistate Distribution
    • 10.1016/0022-5193(69)90041-1, 5821532
    • Boulton DM, Wallace CS. The Information Content of a Multistate Distribution. Journal of Theoretical Biology 1969, 23(2):269-278. 10.1016/0022-5193(69)90041-1, 5821532.
    • (1969) Journal of Theoretical Biology , vol.23 , Issue.2 , pp. 269-278
    • Boulton, D.M.1    Wallace, C.S.2
  • 30
    • 1642632854 scopus 로고    scopus 로고
    • On Spaced Seeds for Similarity Search
    • Keich U, Li M, Ma B, Tromp J. On Spaced Seeds for Similarity Search. Discrete Appl Math 2004, 138(3):253-263.
    • (2004) Discrete Appl Math , vol.138 , Issue.3 , pp. 253-263
    • Keich, U.1    Li, M.2    Ma, B.3    Tromp, J.4
  • 31
    • 67650686477 scopus 로고    scopus 로고
    • Computing Substitution Matrices for Genomic Comparative Analysis
    • Cao MD, Dix TI, Allison L. Computing Substitution Matrices for Genomic Comparative Analysis. PAKDD 2009, LNAI 5476 2009, 647-655.
    • (2009) PAKDD 2009, LNAI 5476 , pp. 647-655
    • Cao, M.D.1    Dix, T.I.2    Allison, L.3
  • 32
    • 0025878149 scopus 로고
    • Amino Acid Substitution Matrices from an Information Theoretic Perspective
    • 10.1016/0022-2836(91)90193-A, 2051488
    • Altschul SF. Amino Acid Substitution Matrices from an Information Theoretic Perspective. Journal of Molecular Biology 1991, 219(3):555-565. 10.1016/0022-2836(91)90193-A, 2051488., http://www.sciencedirect.com/science/article/B6WK7-4FNGD09-4X/2/a0f9e00dbe41135c2072a3f7463c46d6
    • (1991) Journal of Molecular Biology , vol.219 , Issue.3 , pp. 555-565
    • Altschul, S.F.1
  • 34
    • 0030585734 scopus 로고    scopus 로고
    • Evaluation of Gene Structure Prediction Programs
    • 10.1006/geno.1996.0298, 8786136
    • Burset M, Guigó R. Evaluation of Gene Structure Prediction Programs. Genomics 1996, 34(3):353-367. 10.1006/geno.1996.0298, 8786136.
    • (1996) Genomics , vol.34 , Issue.3 , pp. 353-367
    • Burset, M.1    Guigó, R.2
  • 35
    • 2942516151 scopus 로고    scopus 로고
    • Benchmarking Tools for the Alignment of Functional Noncoding DNA
    • 10.1186/1471-2105-5-6, 344529, 14736341
    • Pollard DA, Bergman CM, Stoye J, Celniker SE, Eisen MB. Benchmarking Tools for the Alignment of Functional Noncoding DNA. BMC Bioinformatics 2004, 5:6. 10.1186/1471-2105-5-6, 344529, 14736341., http://www.biomedcentral.com/1471-2105/5/6
    • (2004) BMC Bioinformatics , vol.5 , pp. 6
    • Pollard, D.A.1    Bergman, C.M.2    Stoye, J.3    Celniker, S.E.4    Eisen, M.B.5
  • 38
    • 0032884653 scopus 로고    scopus 로고
    • Comparative Analysis of Noncoding Regions of 77 Orthologous Mouse and Human Gene Pairs
    • 10.1101/gr.9.9.815, 310816, 10508839
    • Jareborg N, Birney E, Durbin R. Comparative Analysis of Noncoding Regions of 77 Orthologous Mouse and Human Gene Pairs. Genome Research 1999, 9(9):815-824. 10.1101/gr.9.9.815, 310816, 10508839., http://genome.cshlp.org/content/9/9/815.abstract
    • (1999) Genome Research , vol.9 , Issue.9 , pp. 815-824
    • Jareborg, N.1    Birney, E.2    Durbin, R.3
  • 39
    • 76949098763 scopus 로고    scopus 로고
    • Towards realistic benchmarks for multiple alignments of non-coding sequences
    • 10.1186/1471-2105-11-54, 2823711, 20102627
    • Kim J, Sinha S. Towards realistic benchmarks for multiple alignments of non-coding sequences. BMC Bioinformatics 2010, 11:54. 10.1186/1471-2105-11-54, 2823711, 20102627., http://www.biomedcentral.com/1471-2105/11/54
    • (2010) BMC Bioinformatics , vol.11 , pp. 54
    • Kim, J.1    Sinha, S.2
  • 40
    • 84874759211 scopus 로고    scopus 로고
    • PlasmoDB: Plasmodium Genome Resource, Release 6.2
    • [Accessed Nov 2009], PlasmoDB
    • PlasmoDB PlasmoDB: Plasmodium Genome Resource, Release 6.2. 2009, [Accessed Nov 2009], PlasmoDB., http://www.plasmodb.org/common/downloads/release-6.2/
    • (2009)
  • 41
    • 69249217827 scopus 로고    scopus 로고
    • Plasmodium falciparum and Plasmodium vivax: so similar, yet very different
    • 10.1007/s00436-009-1521-y, 19543915
    • Das A, Sharma M, Gupta B, Dash A. Plasmodium falciparum and Plasmodium vivax: so similar, yet very different. Parasitology Research 2009, 105(4):1169-1171. 10.1007/s00436-009-1521-y, 19543915.
    • (2009) Parasitology Research , vol.105 , Issue.4 , pp. 1169-1171
    • Das, A.1    Sharma, M.2    Gupta, B.3    Dash, A.4
  • 42
    • 45149113022 scopus 로고    scopus 로고
    • Comparative Analysis of Long DNA Sequences by Per Element Information Content Using Different Contexts
    • 10.1186/1471-2105-8-S2-S10, 1892068, 17493248
    • Dix TI, Powell D, Allison L, Bernal J, Jaeger S, Stern L. Comparative Analysis of Long DNA Sequences by Per Element Information Content Using Different Contexts. BMC Bioinformatics 2007, 8(Suppl 2):S10. 10.1186/1471-2105-8-S2-S10, 1892068, 17493248., http://www.biomedcentral.com/1471-2105/8/S2/S10
    • (2007) BMC Bioinformatics , vol.8 , Issue.SUPPL 2
    • Dix, T.I.1    Powell, D.2    Allison, L.3    Bernal, J.4    Jaeger, S.5    Stern, L.6
  • 43
    • 78651513931 scopus 로고    scopus 로고
    • A Genome Alignment Algorithm Based on Compression
    • Tech. Rep. 2009/233, Faculty of Information Technology, Monash University, Victoria, Australia
    • Cao MD, Dix TI, Allison L. A Genome Alignment Algorithm Based on Compression. 2009, Tech. Rep. 2009/233, Faculty of Information Technology, Monash University, Victoria, Australia.
    • (2009)
    • Cao, M.D.1    Dix, T.I.2    Allison, L.3
  • 45
    • 0026458378 scopus 로고
    • Amino Acid Substitution Matrices from Protein Blocks
    • Henikoff S, Henikoff JG. Amino Acid Substitution Matrices from Protein Blocks. Proceedings of the National Academy of Sciences 1992, 89(22):10915-10919., http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=50453
    • (1992) Proceedings of the National Academy of Sciences , vol.89 , Issue.22 , pp. 10915-10919
    • Henikoff, S.1    Henikoff, J.G.2
  • 46
    • 57149085484 scopus 로고    scopus 로고
    • Genome bias influences amino acid choices: analysis of amino acid substitution and re-compilation of substitution matrices exclusive to an AT-biased genome
    • 10.1093/nar/gkn635, 2588515, 18948281
    • Paila U, Kondam R, Ranjan A. Genome bias influences amino acid choices: analysis of amino acid substitution and re-compilation of substitution matrices exclusive to an AT-biased genome. Nucleic Acids Research 2008, 36(21):6664-6675. 10.1093/nar/gkn635, 2588515, 18948281., http://nar.oxfordjournals.org/content/36/21/6664.abstract
    • (2008) Nucleic Acids Research , vol.36 , Issue.21 , pp. 6664-6675
    • Paila, U.1    Kondam, R.2    Ranjan, A.3
  • 47
    • 63149153318 scopus 로고    scopus 로고
    • Pairwise statistical significance of local sequence alignment using multiple parameter sets and empirical justification of parameter set change penalty
    • 10.1186/1471-2105-10-S3-S1, 2665049, 19344477
    • Agrawal A, Huang X. Pairwise statistical significance of local sequence alignment using multiple parameter sets and empirical justification of parameter set change penalty. BMC Bioinformatics 2009, 10(Suppl 3):S1. 10.1186/1471-2105-10-S3-S1, 2665049, 19344477., http://www.biomedcentral.com/1471-2105/10/S3/S1
    • (2009) BMC Bioinformatics , vol.10 , Issue.SUPPL 3
    • Agrawal, A.1    Huang, X.2
  • 48
    • 16344388556 scopus 로고    scopus 로고
    • The construction of amino acid substitution matrices for the comparison of proteins with non-standard compositions
    • 10.1093/bioinformatics/bti070, 15509610
    • Yu YK, Altschul SF. The construction of amino acid substitution matrices for the comparison of proteins with non-standard compositions. Bioinformatics 2005, 21(7):902-911. 10.1093/bioinformatics/bti070, 15509610., http://bioinformatics.oxfordjournals.org/content/21/7/902.abstract
    • (2005) Bioinformatics , vol.21 , Issue.7 , pp. 902-911
    • Yu, Y.K.1    Altschul, S.F.2
  • 49
    • 0035878724 scopus 로고    scopus 로고
    • Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements
    • 55814, 11452024, 10.1093/nar/29.14.2994
    • Schäffer AA, Aravind L, Madden TL, Shavirin S, Spouge JL, Wolf YI, Koonin EV, Altschul SF. Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements. Nucleic Acids Research 2001, 29(14):2994-3005. 55814, 11452024, 10.1093/nar/29.14.2994., http://nar.oxfordjournals.org/content/29/14/2994.abstract
    • (2001) Nucleic Acids Research , vol.29 , Issue.14 , pp. 2994-3005
    • Schäffer, A.A.1    Aravind, L.2    Madden, T.L.3    Shavirin, S.4    Spouge, J.L.5    Wolf, Y.I.6    Koonin, E.V.7    Altschul, S.F.8


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.