Volume 6, Issue 7, 2010, Pages 11-

The construction and use of log-odds substitution scores for multiple sequence alignment

Author keywords

[No Author keywords available]

Indexed keywords

BAYESIAN NETWORKS; BIOINFORMATICS;

EID: 77957880647     PISSN: 1553734X     EISSN: 15537358     Source Type: Journal    
DOI: 10.1371/journal.pcbi.1000852     Document Type: Article
Times cited : (55)

References (122)
  • 1
    • 0019887799
    • Identification of common molecular subsequences
    • Smith TF, Waterman MS (1981) Identification of common molecular subsequences. J Mol Biol 147: 195-197.
    • (1981) J Mol Biol , vol.147 , pp. 195-197
    • Smith, T.F.1    Waterman, M.S.2
  • 2
    • 0347637930
    • Pattern recognition in genetic sequences by mismatch density
    • Sellers PH (1984) Pattern recognition in genetic sequences by mismatch density. Bull Math Biol 46: 501-514.
    • (1984) Bull Math Biol , vol.46 , pp. 501-514
    • Sellers, P.H.1
  • 3
    • 0023989064
    • Improved tools for biological sequence comparison
    • Pearson WR, Lipman DJ (1988) Improved tools for biological sequence comparison. Proc Natl Acad Sci USA 85: 2444-2448.
    • (1988) Proc Natl Acad Sci USA , vol.85 , pp. 2444-2448
    • Pearson, W.R.1    Lipman, D.J.2
  • 4
    • 0030801002
    • Gapped BLAST and PSI-BLAST: A new generation of protein database search programs
    • Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, et al. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25: 3389-3402.
    • (1997) Nucleic Acids Res , vol.25 , pp. 3389-3402
    • Altschul, S.F.1    Madden, T.L.2    Schäffer, A.A.3    Zhang, J.4    Zhang, Z.5
  • 6
    • 34547916098
    • The identification of complete domains within protein sequences using accurate e-values for semi-global alignment
    • Kann MG, Sheetlin SL, Park Y, Bryant SH, Spouge JL (2007) The identification of complete domains within protein sequences using accurate e-values for semi-global alignment. Nucleic Acids Res 35: 4678-4685.
    • (2007) Nucleic Acids Res , vol.35 , pp. 4678-4685
    • Kann, M.G.1    Sheetlin, S.L.2    Park, Y.3    Bryant, S.H.4    Spouge, J.L.5
  • 7
    • 0000228203
    • A model of evolutionary change in proteins
    • In: Dayhoff MO, ed, Washington, DC: Natl. Biomed. Res. Found
    • Dayhoff MO, Schwartz RM, Orcutt BC (1978) A model of evolutionary change in proteins. In: Dayhoff MO, ed. Atlas of Protein Sequence and Structure. Washington, DC: Natl. Biomed. Res. Found., volume 5, suppl. 3. pp 345-352.
    • (1978) Atlas of Protein Sequence and Structure , vol.5 , Issue.SUPPL. 3 , pp. 345-352
    • Dayhoff, M.O.1    Schwartz, R.M.2    Orcutt, B.C.3
  • 8
    • 0000507749
    • Matrices for detecting distant relationships
    • In: Dayhoff MO, ed, Washington, DC: Natl. Biomed. Res. Found
    • Schwartz RM, Dayhoff MO (1978) Matrices for detecting distant relationships. In: Dayhoff MO, ed. Atlas of Protein Sequence and Structure. Washington, DC: Natl. Biomed. Res. Found., volume 5, suppl. 3. pp 353-358.
    • (1978) Atlas of Protein Sequence and Structure , vol.5 , Issue.SUPPL. 3 , pp. 353-358
    • Schwartz, R.M.1    Dayhoff, M.O.2
  • 9
    • 0021712450
    • Aligning amino acid sequences: Comparison of commonly used methods
    • Feng DF, Johnson MS, Doolittle RF (1985) Aligning amino acid sequences: comparison of commonly used methods. J Mol Evol 21: 112-125.
    • (1985) J Mol Evol , vol.21 , pp. 112-125
    • Feng, D.F.1    Johnson, M.S.2    Doolittle, R.F.3
  • 10
    • 0022591495
    • The classification of amino acid conservation
    • Taylor WR (1986) The classification of amino acid conservation. J Theor Biol 119: 205-218.
    • (1986) J Theor Biol , vol.119 , pp. 205-218
    • Taylor, W.R.1
  • 11
    • 0023286275
    • New scoring matrix for amino acid residue exchanges based on residue characteristic physical parameters
    • Rao JKM (1987) New scoring matrix for amino acid residue exchanges based on residue characteristic physical parameters. Int J Peptide Protein Res 29: 276-281.
    • (1987) Int J Peptide Protein Res , vol.29 , pp. 276-281
    • Rao, J.K.M.1
  • 12
    • 0024267619
    • Amino acid substitutions in structurally related proteins
    • Risler JL, Delorme MO, Delacroix H, Henaut A (1988) Amino acid substitutions in structurally related proteins. J Mol Biol 204: 1019-1029.
    • (1988) J Mol Biol , vol.204 , pp. 1019-1029
    • Risler, J.L.1    Delorme, M.O.2    Delacroix, H.3    Henaut, A.4
  • 13
    • 0026656815
    • Exhaustive matching of the entire protein sequence database
    • Gonnet GH, Cohen MA, Benner SA (1992) Exhaustive matching of the entire protein sequence database. Science 256: 1443-1445.
    • (1992) Science , vol.256 , pp. 1443-1445
    • Gonnet, G.H.1    Cohen, M.A.2    Benner, S.A.3
  • 14
    • 0026458378
    • Amino acid substitution matrices from protein blocks
    • Henikoff S, Henikoff JG (1992) Amino acid substitution matrices from protein blocks. Proc Natl Acad Sci USA 89: 10915-10919.
    • (1992) Proc Natl Acad Sci USA , vol.89 , pp. 10915-10919
    • Henikoff, S.1    Henikoff, J.G.2
  • 15
    • 0027062943
    • Environment-specific amino acid substitution tables: Tertiary templates and prediction of protein folds
    • Overington J, Donnelly D, Johnson MS, Sali A, Blundell TL (1992) Environment-specific amino acid substitution tables: Tertiary templates and prediction of protein folds. Prot Sci 1: 216-226.
    • (1992) Prot Sci , vol.1 , pp. 216-226
    • Overington, J.1    Donnelly, D.2    Johnson, M.S.3    Sali, A.4    Blundell, T.L.5
  • 16
    • 0026691182
    • The rapid generation of mutation data matrices from protein sequences
    • Jones DT, Taylor WR, Thornton JM (1992) The rapid generation of mutation data matrices from protein sequences. Comput Appl Biosci 8: 275-282.
    • (1992) Comput Appl Biosci , vol.8 , pp. 275-282
    • Jones, D.T.1    Taylor, W.R.2    Thornton, J.M.3
  • 17
    • 0034562066
    • Optimization of a new score function for the detection of remote homologs
    • Kann M, Qian B, Goldstein RA (2000) Optimization of a new score function for the detection of remote homologs. Proteins 41: 498-503.
    • (2000) Proteins , vol.41 , pp. 498-503
    • Kann, M.1    Qian, B.2    Goldstein, R.A.3
  • 18
    • 0033670313
    • PHAT: A transmembrane-specific substitution matrix
    • Ng PC, Henikoff JG, Henikoff S (2000) PHAT: a transmembrane-specific substitution matrix. Bioinformatics 16: 760-766.
    • (2000) Bioinformatics , vol.16 , pp. 760-766
    • Ng, P.C.1    Henikoff, J.G.2    Henikoff, S.3
  • 19
    • 0035230037
    • Non-symmetric score matrices and the detection of homologous transmembrane proteins
    • Müller T, Rahmann S, Rehmsmeier M (2001) Non-symmetric score matrices and the detection of homologous transmembrane proteins. Bioinformatics 17, Suppl. 1: S182-S189.
    • (2001) Bioinformatics , vol.17 , Issue.SUPPL. 1 , pp. S182-S189
    • Müller, T.1    Rahmann, S.2    Rehmsmeier, M.3
  • 20
    • 41149112563
    • Context-specific amino acid substitution matrices and their use in the detection of protein homologs
    • Goonesekere NC, Lee B (2008) Context-specific amino acid substitution matrices and their use in the detection of protein homologs. Proteins 71: 910-919.
    • (2008) Proteins , vol.71 , pp. 910-919
    • Goonesekere, N.C.1    Lee, B.2
  • 21
    • 0002164370
    • Improved sensitivity of nucleic acid database searches using application-specific scoring matrices
    • States DJ, Gish W, Altschul SF (1991) Improved sensitivity of nucleic acid database searches using application-specific scoring matrices. Methods 3: 66-70.
    • (1991) Methods , vol.3 , pp. 66-70
    • States, D.J.1    Gish, W.2    Altschul, S.F.3
  • 22
    • 0036372452
    • Scoring pairwise genomic sequence alignments
    • In: Altman R, Dunker AK, Hunter L, Lauderdale K, Klein TE, eds, Mountain View, CA: World Scientific
    • Chiaromonte F, Yap VB, Miller W (2002) Scoring pairwise genomic sequence alignments. In: Altman R, Dunker AK, Hunter L, Lauderdale K, Klein TE, eds. Proc. Pacific Symp. Biocomput. Mountain View, CA: World Scientific. pp 115-126.
    • (2002) Proc. Pacific Symp. Biocomput , pp. 115-126
    • Chiaromonte, F.1    Yap, V.B.2    Miller, W.3
  • 23
    • 0025259313
    • Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes
    • Karlin S, Altschul SF (1990) Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes. Proc Natl Acad Sci USA 87: 2264-2268.
    • (1990) Proc Natl Acad Sci USA , vol.87 , pp. 2264-2268
    • Karlin, S.1    Altschul, S.F.2
  • 24
    • 0000526802
    • Limit distribution of maximal nonaligned two-sequence segmental score
    • Dembo A, Karlin S, Zeitouni O (1994) Limit distribution of maximal nonaligned two-sequence segmental score. Ann Prob 22: 2022-2039.
    • (1994) Ann Prob , vol.22 , pp. 2022-2039
    • Dembo, A.1    Karlin, S.2    Zeitouni, O.3
  • 25
    • 0016424695
    • Minimal mutation trees of sequences
    • Sankoff D (1975) Minimal mutation trees of sequences. SIAM J Appl Math 28: 35-42.
    • (1975) SIAM J Appl Math , vol.28 , pp. 35-42
    • Sankoff, D.1
  • 28
    • 0022552744
    • Multiple sequence alignment
    • Bacon DJ, Anderson WF (1986) Multiple sequence alignment. J Mol Biol 191: 153-161.
    • (1986) J Mol Biol , vol.191 , pp. 153-161
    • Bacon, D.J.1    Anderson, W.F.2
  • 29
    • 0023042012
    • Information content of binding sites on nucleotide sequences
    • Schneider TD, Stormo GD, Gold L, Ehrenfeucht A (1986) Information content of binding sites on nucleotide sequences. J Mol Biol 188: 415-431.
    • (1986) J Mol Biol , vol.188 , pp. 415-431
    • Schneider, T.D.1    Stormo, G.D.2    Gold, L.3    Ehrenfeucht, A.4
  • 30
    • 0025878149
    • Amino acid substitution matrices from an information theoretic perspective
    • Altschul SF (1991) Amino acid substitution matrices from an information theoretic perspective. J Mol Biol 219: 555-565.
    • (1991) J Mol Biol , vol.219 , pp. 555-565
    • Altschul, S.F.1
  • 31
    • 0027903113
    • Using Dirichlet mixture priors to derive hidden Markov models for protein families
    • In: Hunter L, Searls D, Shavlik J, eds, Menlo Park, CA: AAAI Press
    • Brown M, Hughey R, Krogh A, Mian IS, Sjölander K, et al. (1993) Using Dirichlet mixture priors to derive hidden Markov models for protein families. In: Hunter L, Searls D, Shavlik J, eds. Proc. First Int. Conf. on Intelligent System for Mol. Biol. Menlo Park, CA: AAAI Press. pp 47-55.
    • (1993) Proc. First Int. Conf. on Intelligent System for Mol. Biol. , pp. 47-55
    • Brown, M.1    Hughey, R.2    Krogh, A.3    Mian, I.S.4    Sjölander, K.5
  • 32
    • 0029906607
    • Dirichlet mixtures: A method for improved detection of weak but significant protein sequence homology
    • Sjölander K, Karplus K, Brown M, Hughey R, Krogh A, et al. (1996) Dirichlet mixtures: a method for improved detection of weak but significant protein sequence homology. Comput Appl Biosci 12: 327-345.
    • (1996) Comput Appl Biosci , vol.12 , pp. 327-345
    • Sjölander, K.1    Karplus, K.2    Brown, M.3    Hughey, R.4    Krogh, A.5
  • 34
    • 3242662380
    • MotifPrototyper: A Bayesian profile model for motif families
    • Xing EP, Karp RM (2004) MotifPrototyper: a Bayesian profile model for motif families. Proc Natl Acad Sci USA 101: 10523-10528.
    • (2004) Proc Natl Acad Sci USA , vol.101 , pp. 10523-10528
    • Xing, E.P.1    Karp, R.M.2
  • 37
    • 70349205853
    • BigFoot: Bayesian alignment and phylogenetic footprinting with MCMC
    • Satija R, Novák A, Miklós I, Lyngsø R, Hein J (2009) BigFoot: Bayesian alignment and phylogenetic footprinting with MCMC. BMC Evol Biol 9: 217.
    • (2009) BMC Evol Biol , vol.9 , pp. 217
    • Satija, R.1    Novák, A.2    Miklós, I.3    Lyngsø, R.4    Hein, J.5
  • 38
    • 0023084055
    • Progressive sequence alignment as a prerequisite to correct phylogenetic trees
    • Feng DF, Doolittle RF (1987) Progressive sequence alignment as a prerequisite to correct phylogenetic trees. J Mol Evol 25: 351-360.
    • (1987) J Mol Evol , vol.25 , pp. 351-360
    • Feng, D.F.1    Doolittle, R.F.2
  • 39
    • 0027968068
    • CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice
    • Thompson JD, Higgins DG, Gibson TJ (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res 22: 4673-4680.
    • (1994) Nucleic Acids Res , vol.22 , pp. 4673-4680
    • Thompson, J.D.1    Higgins, D.G.2    Gibson, T.J.3
  • 40
    • 0029789461
    • Searching databases of conserved sequence regions by aligning protein multiple-alignments
    • Pietrokovski S (1996) Searching databases of conserved sequence regions by aligning protein multiple-alignments. Nucleic Acids Res 24: 3836-3845.
    • (1996) Nucleic Acids Res , vol.24 , pp. 3836-3845
    • Pietrokovski, S.1
  • 41
    • 0033997036
    • Comparison of sequence profiles. Strategies for structural predictions using sequence information
    • Rychlewski L, Jaroszewski L, Li W, Godzik A (2000) Comparison of sequence profiles. Strategies for structural predictions using sequence information. Protein Sci 9: 232-241.
    • (2000) Protein Sci , vol.9 , pp. 232-241
    • Rychlewski, L.1    Jaroszewski, L.2    Li, W.3    Godzik, A.4
  • 42
    • 0036307493
    • Within the twilight zone: A sensitive profile-profile comparison tool based on information theory
    • Yona G, Levitt M (2002) Within the twilight zone: a sensitive profile-profile comparison tool based on information theory. J Mol Biol 315: 1257-1275.
    • (2002) J Mol Biol , vol.315 , pp. 1257-1275
    • Yona, G.1    Levitt, M.2
  • 43
    • 0042594474
    • SATCHMO: Sequence alignment and tree construction using hidden Markov models
    • Edgar RC, Sjölander K (2003) SATCHMO: sequence alignment and tree construction using hidden Markov models. Bioinformatics 19: 1404-1411.
    • (2003) Bioinformatics , vol.19 , pp. 1404-1411
    • Edgar, R.C.1    Sjölander, K.2
  • 44
    • 0037440190
    • Finding weak similarities between proteins by sequence profile comparison
    • Panchenko AR (2003) Finding weak similarities between proteins by sequence profile comparison. Nucleic Acids Res 31: 683-689.
    • (2003) Nucleic Acids Res , vol.31 , pp. 683-689
    • Panchenko, A.R.1
  • 45
    • 0037423702
    • COMPASS: A tool for comparison of multiple protein alignments with assessment of statistical significance
    • Sadreyev R, Grishin N (2003) COMPASS: a tool for comparison of multiple protein alignments with assessment of statistical significance. J Mol Biol 326: 317-336.
    • (2003) J Mol Biol , vol.326 , pp. 317-336
    • Sadreyev, R.1    Grishin, N.2
  • 46
    • 3042550791
    • A comparison of scoring functions for protein sequence profile alignment
    • Edgar RC, Sjölander K (2004) A comparison of scoring functions for protein sequence profile alignment. Bioinformatics 20: 1301-1308.
    • (2004) Bioinformatics , vol.20 , pp. 1301-1308
    • Edgar, R.C.1    Sjölander, K.2
  • 47
    • 2442663920
    • Scoring profile-to-profile sequence alignments
    • Wang G, Dunbrack RL, Jr. (2004) Scoring profile-to-profile sequence alignments. Protein Sci 13: 1612-1626.
    • (2004) Protein Sci , vol.13 , pp. 1612-1626
    • Wang, G.1    Dunbrack Jr., R.L.2
  • 48
    • 16344373015
    • Protein homology detection by HMM-HMM comparison
    • Söding J (2005) Protein homology detection by HMM-HMM comparison. Bioinformatics 21: 951-960.
    • (2005) Bioinformatics , vol.21 , pp. 951-960
    • Söding, J.1
  • 51
    • 0025641641
    • Weighting aligned protein or nucleic acid sequences to correct for unequal representation
    • Sibbald PR, Argos P (1990) Weighting aligned protein or nucleic acid sequences to correct for unequal representation. J Mol Biol 216: 813-818.
    • (1990) J Mol Biol , vol.216 , pp. 813-818
    • Sibbald, P.R.1    Argos, P.2
  • 52
    • 0026030641
    • Database of homology-derived protein structures and the structural meaning of sequence alignment
    • Sander C, Schneider R (1991) Database of homology-derived protein structures and the structural meaning of sequence alignment. Proteins 9: 56-68.
    • (1991) Proteins , vol.9 , pp. 56-68
    • Sander, C.1    Schneider, R.2
  • 53
    • 0027496980
    • Weighting in sequence space: A comparison of methods in terms of generalized sequences
    • Vingron M, Sibbald PR (1993) Weighting in sequence space: a comparison of methods in terms of generalized sequences. Proc Natl Acad Sci USA 90: 8777-8781.
    • (1993) Proc Natl Acad Sci USA , vol.90 , pp. 8777-8781
    • Vingron, M.1    Sibbald, P.R.2
  • 54
    • 0028221642
    • Volume changes in protein evolution. Appendix: A method to weight protein sequences to correct for unequal representation
    • Gerstein M, Sonnhammer ELL, Chothia C (1994) Volume changes in protein evolution. Appendix: A method to weight protein sequences to correct for unequal representation. J Mol Biol 236: 1067-1078.
    • (1994) J Mol Biol , vol.236 , pp. 1067-1078
    • Gerstein, M.1    Sonnhammer, E.L.L.2    Chothia, C.3
  • 55
    • 0028043552
    • Position-based sequence weights
    • Henikoff S, Henikoff JG (1994) Position-based sequence weights. J Mol Biol 243: 574-578.
    • (1994) J Mol Biol , vol.243 , pp. 574-578
    • Henikoff, S.1    Henikoff, J.G.2
  • 56
    • 0028013177
    • Improved sensitivity of profile searches through the use of sequence weights and gap excision
    • Thompson JD, Higgins DG, Gibson TJ (1994) Improved sensitivity of profile searches through the use of sequence weights and gap excision. Comput Appl Biosci 10: 19-29.
    • (1994) Comput Appl Biosci , vol.10 , pp. 19-29
    • Thompson, J.D.1    Higgins, D.G.2    Gibson, T.J.3
  • 57
    • 0029259085
    • Maximum discrimination hidden Markov models of sequence consensus
    • Eddy SR, Mitchison G, Durbin R (1995) Maximum discrimination hidden Markov models of sequence consensus. J Comput Biol 2: 9-23.
    • (1995) J Comput Biol , vol.2 , pp. 9-23
    • Eddy, S.R.1    Mitchison, G.2    Durbin, R.3
  • 58
    • 0028787980
    • A weighting system and algorithm for aligning many phylogenetically related sequences
    • Gotoh O (1995) A weighting system and algorithm for aligning many phylogenetically related sequences. Comput Appl Biosci 11: 543-551.
    • (1995) Comput Appl Biosci , vol.11 , pp. 543-551
    • Gotoh, O.1
  • 59
    • 0029198940
    • Maximum entropy weighting of aligned sequences of protein or DNA
    • In: Rawlings C, Clark D, Altman R, Hunter L, Lengauer T, et al. (1995), Menlo Park, CA: AAAI Press
    • Krogh A, Mitchison G (1995) Maximum entropy weighting of aligned sequences of protein or DNA. In: Rawlings C, Clark D, Altman R, Hunter L, Lengauer T, et al. (1995) Proc. Third Int. Conf. on Intelligent System for Mol. Biol. Menlo Park, CA: AAAI Press. pp 215-221.
    • (1995) Proc. Third Int. Conf. on Intelligent System for Mol. Biol. , pp. 215-221
    • Krogh, A.1    Mitchison, G.2
  • 60
    • 0030332515
    • The megaprior heuristic for discovering protein sequence patterns
    • In: States D, Agarwal P, Gaasterland T, Hunter L, Smith R, eds
    • Bailey TL, Gribskov M (1996) The megaprior heuristic for discovering protein sequence patterns. In: States D, Agarwal P, Gaasterland T, Hunter L, Smith R, eds. Proc. Fourth Int. Conf. on Intelligent System for Mol. Biol. pp 15-24.
    • (1996) Proc. Fourth Int. Conf. on Intelligent System for Mol. Biol. , pp. 15-24
    • Bailey, T.L.1    Gribskov, M.2
  • 61
    • 0033028454
    • PSIC: Profile extraction from sequence alignments with position-specific counts of independent observations
    • Sunyaev SR, Eisenhaber F, Rodchenkov IV, Eisenhaber B, Tumanyan VG, et al. (1999) PSIC: profile extraction from sequence alignments with position-specific counts of independent observations. Protein Eng 12: 387-394.
    • (1999) Protein Eng , vol.12 , pp. 387-394
    • Sunyaev, S.R.1    Eisenhaber, F.2    Rodchenkov, I.V.3    Eisenhaber, B.4    Tumanyan, V.G.5
  • 64
    • 0346734129
    • The compositional adjustment of amino acid substitution matrices
    • Yu YK, Wootton JC, Altschul SF (2003) The compositional adjustment of amino acid substitution matrices. Proc Natl Acad Sci USA 100: 15688-15693.
    • (2003) Proc Natl Acad Sci USA , vol.100 , pp. 15688-15693
    • Yu, Y.K.1    Wootton, J.C.2    Altschul, S.F.3
  • 65
    • 16344388556
    • The construction of amino acid substitution matrices for the comparison of proteins with non-standard compositions
    • Yu YK, Altschul SF (2005) The construction of amino acid substitution matrices for the comparison of proteins with non-standard compositions. Bioinformatics 21: 902-911.
    • (2005) Bioinformatics , vol.21 , pp. 902-911
    • Yu, Y.K.1    Altschul, S.F.2
  • 66
    • 84873751778
    • An invariant form of the prior probability in estimation problems
    • Jeffreys H (1946) An invariant form of the prior probability in estimation problems. Proc Royal Soc London Series A 186: 453-461.
    • (1946) Proc Royal Soc London Series A , vol.186 , pp. 453-461
    • Jeffreys, H.1
  • 67
    • 63349083142
    • Pseudocounts for transcription factor binding sites
    • Nishida K, Frith MC, Nakai K (2009) Pseudocounts for transcription factor binding sites. Nucleic Acids Res 37: 939-944.
    • (2009) Nucleic Acids Res , vol.37 , pp. 939-944
    • Nishida, K.1    Frith, M.C.2    Nakai, K.3
  • 68
    • 0028047892
    • Sequence alignment and penalty choice. Review of concepts, case studies and implications
    • Vingron M, Waterman MS (1994) Sequence alignment and penalty choice. Review of concepts, case studies and implications. J Mol Biol 235: 1-12.
    • (1994) J Mol Biol , vol.235 , pp. 1-12
    • Vingron, M.1    Waterman, M.S.2
  • 69
    • 0027912333
    • Detecting subtle sequence signals: A Gibbs sampling strategy for multiple alignment
    • Lawrence CE, Altschul SF, Boguski MS, Liu JS, Neuwald AF, et al. (1993) Detecting subtle sequence signals: A Gibbs sampling strategy for multiple alignment. Science 262: 208-214.
    • (1993) Science , vol.262 , pp. 208-214
    • Lawrence, C.E.1    Altschul, S.F.2    Boguski, M.S.3    Liu, J.S.4    Neuwald, A.F.5
  • 71
    • 0031604140
    • Phylogenetic inference in protein superfamilies: Analysis of SH2 domains
    • In: Glasgow J, Littlejohn T, Major F, Lathrop R, Sankoff D, et al. (1998), Menlo Park, CA: AAAI Press
    • Sjölander K (1998) Phylogenetic inference in protein superfamilies: analysis of SH2 domains. In: Glasgow J, Littlejohn T, Major F, Lathrop R, Sankoff D, et al. (1998) Proc. Sixth Int. Conf. on Intelligent System for Mol. Biol. Menlo Park, CA: AAAI Press. pp 165-174.
    • (1998) Proc. Sixth Int. Conf. on Intelligent System for Mol. Biol. , pp. 165-174
    • Sjölander, K.1
  • 72
    • 49549118253
    • Efficient functional clustering of protein sequences using the Dirichlet process
    • Brown DP (2008) Efficient functional clustering of protein sequences using the Dirichlet process. Bioinformatics 24: 1765-1771.
    • (2008) Bioinformatics , vol.24 , pp. 1765-1771
    • Brown, D.P.1
  • 73
    • 0024964655
    • Gap costs for multiple sequence alignment
    • Altschul SF (1989) Gap costs for multiple sequence alignment. J Theor Biol 138: 297-309.
    • (1989) J Theor Biol , vol.138 , pp. 297-309
    • Altschul, S.F.1
  • 74
    • 0026079507
    • An evolutionary model for maximum likelihood alignment of DNA sequences
    • Thorne JL, Kishino H, Felsenstein J (1991) An evolutionary model for maximum likelihood alignment of DNA sequences. J Mol Evol 33: 114-124.
    • (1991) J Mol Evol , vol.33 , pp. 114-124
    • Thorne, J.L.1    Kishino, H.2    Felsenstein, J.3
  • 75
    • 0026528734
    • Inching toward reality: An improved likelihood model of sequence evolution
    • Thorne JL, Kishino H, Felsenstein J (1992) Inching toward reality: an improved likelihood model of sequence evolution. J Mol Evol 34: 3-16.
    • (1992) J Mol Evol , vol.34 , pp. 3-16
    • Thorne, J.L.1    Kishino, H.2    Felsenstein, J.3
  • 76
    • 0027902062
    • Hidden Markov models and iterative aligners: Study of their equivalence and possibilities
    • In: Hunter L, Searls D, Shavlik J, eds, Menlo Park, CA: AAAI Press
    • Tanaka H, Ishikawa M, Asai K, Konagaya A (1993) Hidden Markov models and iterative aligners: study of their equivalence and possibilities. In: Hunter L, Searls D, Shavlik J, eds. Proc. First Int. Conf. on Intelligent System for Mol. Biol. Menlo Park, CA: AAAI Press. pp 395-401.
    • (1993) Proc. First Int. Conf. on Intelligent System for Mol. Biol. , pp. 395-401
    • Tanaka, H.1    Ishikawa, M.2    Asai, K.3    Konagaya, A.4
  • 78
    • 0028181441
    • Hidden Markov models in computational biology. Applications to protein modeling
    • Krogh A, Brown M, Mian IS, Sjölander K, Haussler D (1994) Hidden Markov models in computational biology. Applications to protein modeling. J Mol Biol 235: 1501-1531.
    • (1994) J Mol Biol , vol.235 , pp. 1501-1531
    • Krogh, A.1    Brown, M.2    Mian, I.S.3    Sjölander, K.4    Haussler, D.5
  • 80
    • 0031743421
    • Profile hidden Markov models
    • Eddy SR (1998) Profile hidden Markov models. Bioinformatics 14: 755-763.
    • (1998) Bioinformatics , vol.14 , pp. 755-763
    • Eddy, S.R.1
  • 81
    • 0032438987
    • Hidden Markov models for detecting remote protein homologies
    • Karplus K, Barrett C, Hughey R (1998) Hidden Markov models for detecting remote protein homologies. Bioinformatics 14: 846-856.
    • (1998) Bioinformatics , vol.14 , pp. 846-856
    • Karplus, K.1    Barrett, C.2    Hughey, R.3
  • 82
    • 13244299130
    • Gapped alignment of protein sequence motifs through Monte Carlo optimization of a hidden Markov model
    • Neuwald AF, Liu JS (2004) Gapped alignment of protein sequence motifs through Monte Carlo optimization of a hidden Markov model. BMC Bioinformatics 5: 157.
    • (2004) BMC Bioinformatics , vol.5 , pp. 157
    • Neuwald, A.F.1    Liu, J.S.2
  • 83
    • 0020484488
    • An improved algorithm for matching biological sequences
    • Gotoh O (1982) An improved algorithm for matching biological sequences. J Mol Biol 162: 705-708.
    • (1982) J Mol Biol , vol.162 , pp. 705-708
    • Gotoh, O.1
  • 85
    • 0022899010
    • Optimal sequence alignment using affine gap costs
    • Altschul SF, Erickson BW (1986) Optimal sequence alignment using affine gap costs. Bull Math Biol 48: 603-616.
    • (1986) Bull Math Biol , vol.48 , pp. 603-616
    • Altschul, S.F.1    Erickson, B.W.2
  • 87
    • 0024236865
    • Sequence comparison with concave weighting functions
    • Miller W, Myers EW (1988) Sequence comparison with concave weighting functions. Bull Math Biol 50: 97-120.
    • (1988) Bull Math Biol , vol.50 , pp. 97-120
    • Miller, W.1    Myers, E.W.2
  • 88
    • 0027483434
    • Empirical and structural models for insertions and deletions in the divergent evolution of proteins
    • Benner SA, Cohen MA, Gonnet GH (1993) Empirical and structural models for insertions and deletions in the divergent evolution of proteins. J Mol Biol 229: 1065-1082.
    • (1993) J Mol Biol , vol.229 , pp. 1065-1082
    • Benner, S.A.1    Cohen, M.A.2    Gonnet, G.H.3
  • 89
    • 3042852894
    • Frequency of gaps observed in a structurally aligned protein pair database suggests a simple gap penalty function
    • Goonesekere NC, Lee B (2004) Frequency of gaps observed in a structurally aligned protein pair database suggests a simple gap penalty function. Nucleic Acids Res 32: 2838-2843.
    • (2004) Nucleic Acids Res , vol.32 , pp. 2838-2843
    • Goonesekere, N.C.1    Lee, B.2
  • 90
    • 0031576337
    • Glutamine, alanine or glycine repeats inserted into the loop of a protein have minimal effects on stability and folding rates
    • Ladurner AG, Fersht AR (1997) Glutamine, alanine or glycine repeats inserted into the loop of a protein have minimal effects on stability and folding rates. J Mol Biol 273: 330-337.
    • (1997) J Mol Biol , vol.273 , pp. 330-337
    • Ladurner, A.G.1    Fersht, A.R.2
  • 91
    • 0037305938
    • Low free energy cost of very long loop insertions in proteins
    • Scalley-Kim M, Minard P, Baker D (2003) Low free energy cost of very long loop insertions in proteins. Protein Sci 12: 197-206.
    • (2003) Protein Sci , vol.12 , pp. 197-206
    • Scalley-Kim, M.1    Minard, P.2    Baker, D.3
  • 94
    • 0033649176
    • Stochastic heuristic algorithms for target motif identification (extended abstract)
    • Wareham HT, Jiang T, Zhang X, Trendall CG (2000) Stochastic heuristic algorithms for target motif identification (extended abstract). Pac Symp Biocomput. pp 392-403.
    • (2000) Pac Symp Biocomput , pp. 392-403
    • Wareham, H.T.1    Jiang, T.2    Zhang, X.3    Trendall, C.G.4
  • 95
    • 0032988850
    • BAliBASE: A benchmark alignment database for the evaluation of multiple alignment programs
    • Thompson JD, Plewniak F, Poch O (1999) BAliBASE: a benchmark alignment database for the evaluation of multiple alignment programs. Bioinformatics 15: 87-88.
    • (1999) Bioinformatics , vol.15 , pp. 87-88
    • Thompson, J.D.1    Plewniak, F.2    Poch, O.3
  • 97
    • 0031857684
    • Rose: Generating sequence families
    • Stoye J, Evers D, Meyer F (1998) Rose: generating sequence families. Bioinformatics 14: 157-163.
    • (1998) Bioinformatics , vol.14 , pp. 157-163
    • Stoye, J.1    Evers, D.2    Meyer, F.3
  • 98
    • 45949105543
    • DIALIGN-TX: Greedy and progressive approaches for segment-based multiple sequence alignment
    • Subramanian AR, Kaufmann M, Morgenstern B (2008) DIALIGN-TX: greedy and progressive approaches for segment-based multiple sequence alignment. Algorithms Mol Biol 3: 6.
    • (2008) Algorithms Mol Biol , vol.3 , pp. 6
    • Subramanian, A.R.1    Kaufmann, M.2    Morgenstern, B.3
  • 99
    • 34249857539
    • COBALT: Constraint-based alignment tool for multiple protein sequences
    • Papadopoulos JS, Agarwala R (2007) COBALT: constraint-based alignment tool for multiple protein sequences. Bioinformatics 23: 1073-1079.
    • (2007) Bioinformatics , vol.23 , pp. 1073-1079
    • Papadopoulos, J.S.1    Agarwala, R.2
  • 100
    • 0027968068
    • CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice
    • Thompson JD, Higgins DG, Gibson TJ (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res 22: 4673-4680.
    • (1994) Nucleic Acids Res , vol.22 , pp. 4673-4680
    • Thompson, J.D.1    Higgins, D.G.2    Gibson, T.J.3
  • 101
    • 0037433034
    • PCMA: Fast and accurate multiple sequence alignment based on profile consistency
    • Pei J, Sadreyev R, Grishin NV (2003) PCMA: fast and accurate multiple sequence alignment based on profile consistency. Bioinformatics 19: 427-428.
    • (2003) Bioinformatics , vol.19 , pp. 427-428
    • Pei, J.1    Sadreyev, R.2    Grishin, N.V.3
  • 102
    • 3042666256
    • MUSCLE: Multiple sequence alignment with high accuracy and high throughput
    • Edgar RC (2004) MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 32: 1792-1797.
    • (2004) Nucleic Acids Res , vol.32 , pp. 1792-1797
    • Edgar, R.C.1
  • 103
    • 13244255415
    • MUSCLE: A multiple sequence alignment method with reduced time and space complexity
    • Edgar RC (2004) MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics 5: 113.
    • (2004) BMC Bioinformatics , vol.5 , pp. 113
    • Edgar, R.C.1
  • 104
    • 14644430471
    • ProbCons: Probabilistic consistency-based multiple sequence alignment
    • Do CB, Mahabhashyam MS, Brudno M, Batzoglou S (2005) ProbCons: Probabilistic consistency-based multiple sequence alignment. Genome Res 15: 330-340.
    • (2005) Genome Res , vol.15 , pp. 330-340
    • Do, C.B.1    Mahabhashyam, M.S.2    Brudno, M.3    Batzoglou, S.4
  • 105
    • 0030925920
    • Pfam: A comprehensive database of protein domain families based on seed alignments
    • Sonnhammer EL, Eddy SR, Durbin R (1997) Pfam: a comprehensive database of protein domain families based on seed alignments. Proteins 28: 405-420.
    • (1997) Proteins , vol.28 , pp. 405-420
    • Sonnhammer, E.L.1    Eddy, S.R.2    Durbin, R.3
  • 108
    • 0030310347
    • Extraction of hidden Markov model representations of signal patterns in DNA sequences
    • Yada T, Ishikawa M, Tanaka H, Asai K (1996) Extraction of hidden Markov model representations of signal patterns in DNA sequences. Pac Symp Biocomput. pp 686-696.
    • (1996) Pac Symp Biocomput , pp. 686-696
    • Yada, T.1    Ishikawa, M.2    Tanaka, H.3    Asai, K.4
  • 109
    • 12344325673
    • Training HMM structure with genetic algorithm for biological sequence analysis
    • Won KJ, Prugel-Bennett A, Krogh A (2004) Training HMM structure with genetic algorithm for biological sequence analysis. Bioinformatics 20: 3613-3619.
    • (2004) Bioinformatics , vol.20 , pp. 3613-3619
    • Won, K.J.1    Prugel-Bennett, A.2    Krogh, A.3
  • 110
    • 48249097859
    • Modeling promoter grammars with evolving hidden Markov models
    • Won KJ, Sandelin A, Marstrand TT, Krogh A (2008) Modeling promoter grammars with evolving hidden Markov models. Bioinformatics 24: 1669-1675.
    • (2008) Bioinformatics , vol.24 , pp. 1669-1675
    • Won, K.J.1    Sandelin, A.2    Marstrand, T.T.3    Krogh, A.4
  • 111
    • 0032770364
    • Local sequence alignments with monotonic gap penalties
    • Mott R (1999) Local sequence alignments with monotonic gap penalties. Bioinformatics 15: 455-462.
    • (1999) Bioinformatics , vol.15 , pp. 455-462
    • Mott, R.1
  • 112
    • 23144448511
    • Discovery of the principal specific transcription factors of Apicomplexa and their implication for the evolution of the AP2-integrase DNA binding domains
    • Balaji S, Babu MM, Iyer LM, Aravind L (2005) Discovery of the principal specific transcription factors of Apicomplexa and their implication for the evolution of the AP2-integrase DNA binding domains. Nucleic Acids Res 33: 3994-4006.
    • (2005) Nucleic Acids Res , vol.33 , pp. 3994-4006
    • Balaji, S.1    Babu, M.M.2    Iyer, L.M.3    Aravind, L.4
  • 113
    • 4544372760
    • From endonucleases to transcription factors: Evolution of the AP2 DNA binding domain in plants
    • Magnani E, Sjölander K, Hake S (2004) From endonucleases to transcription factors: evolution of the AP2 DNA binding domain in plants. Plant Cell 16: 2265-2277.
    • (2004) Plant Cell , vol.16 , pp. 2265-2277
    • Magnani, E.1    Sjölander, K.2    Hake, S.3
  • 114
    • 2942683518
    • Homing endonucleases encoded by germ line-limited genes in Tetrahymena thermophila have APETELA2 DNA binding domains
    • Wuitschick JD, Lindstrom PR, Meyer AE, Karrer KM (2004) Homing endonucleases encoded by germ line-limited genes in Tetrahymena thermophila have APETELA2 DNA binding domains. Eukaryotic Cell 3: 685-694.
    • (2004) Eukaryotic Cell , vol.3 , pp. 685-694
    • Wuitschick, J.D.1    Lindstrom, P.R.2    Meyer, A.E.3    Karrer, K.M.4
  • 116
    • 62449221794
    • Identification of a transcription factor in the mosquito-invasive stage of malaria parasites
    • Yuda M, Iwanaga S, Shigenobu S, Mair GR, Janse CJ, et al. (2009) Identification of a transcription factor in the mosquito-invasive stage of malaria parasites. Mol Microbiol 71: 1402-1414.
    • (2009) Mol Microbiol , vol.71 , pp. 1402-1414
    • Yuda, M.1    Iwanaga, S.2    Shigenobu, S.3    Mair, G.R.4    Janse, C.J.5
  • 117
    • 33845682909
    • Multiple alignment of protein sequences with repeats and rearrangements
    • Phuong TM, Do CB, Edgar RC, Batzoglou S (2006) Multiple alignment of protein sequences with repeats and rearrangements. Nucleic Acids Res 34: 5932-5942.
    • (2006) Nucleic Acids Res , vol.34 , pp. 5932-5942
    • Phuong, T.M.1    Do, C.B.2    Edgar, R.C.3    Batzoglou, S.4
  • 118
    • 8744312854
    • A novel method for multiple alignment of sequences with repeated and shuffled elements
    • Raphael B, Zhi D, Tang H, Pevzner P (2004) A novel method for multiple alignment of sequences with repeated and shuffled elements. Genome Res 14: 2336-2346.
    • (2004) Genome Res , vol.14 , pp. 2336-2346
    • Raphael, B.1    Zhi, D.2    Tang, H.3    Pevzner, P.4
  • 119
    • 0028501914
    • Non-globular domains in protein sequences: Automated segmentation using complexity measures
    • Wootton JC (1994) Non-globular domains in protein sequences: automated segmentation using complexity measures. Comput Chem 18: 269-285.
    • (1994) Comput Chem , vol.18 , pp. 269-285
    • Wootton, J.C.1
  • 120
    • 0025008168
    • Sequence logos: A new way to display consensus sequences
    • Schneider TD, Stephens RM (1990) Sequence logos: a new way to display consensus sequences. Nucleic Acids Res 18: 6097-6100.
    • (1990) Nucleic Acids Res , vol.18 , pp. 6097-6100
    • Schneider, T.D.1    Stephens, R.M.2
  • 121
    • 0032530719
    • A novel mode of DNA recognition by a beta-sheet revealed by the solution structure of the GCC-box binding domain in complex with DNA
    • Allen MD, Yamasaki K, Ohme-Takagi M, Tateno M, Suzuki M (1998) A novel mode of DNA recognition by a beta-sheet revealed by the solution structure of the GCC-box binding domain in complex with DNA. EMBO J 17: 5484-5496.
    • (1998) EMBO J , vol.17 , pp. 5484-5496
    • Allen, M.D.1    Yamasaki, K.2    Ohme-Takagi, M.3    Tateno, M.4    Suzuki, M.5
  • 122
    • 73249119294
    • Structural determinants of DNA binding by a P. falciparum ApiAP2 transcriptional regulator
    • Lindner SE, De Silva EK, Keck JL, Llinás M (2010) Structural determinants of DNA binding by a P. falciparum ApiAP2 transcriptional regulator. J Mol Biol 395: 558-567.
    • (2010) J Mol Biol , vol.395 , pp. 558-567
    • Lindner, S.E.1    De Silva, E.K.2    Keck, J.L.3    Llinás, M.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.