Volume 67, Issue 2, 2018, Pages 216-235

Modeling Site Heterogeneity with Posterior Mean Site Frequency Profiles Accelerates Accurate Phylogenomic Estimation

Author keywords

Long branch attraction; long branch repulsion; maximum likelihood; mixture model; posterior mean site frequency; site heterogeneity

Indexed keywords

AMINO ACID SUBSTITUTION; BIOLOGICAL MODEL; CLASSIFICATION; COMPUTER SIMULATION; MOLECULAR EVOLUTION; NONPARAMETRIC TEST; PHYLOGENY; PROCEDURES;

EID: 85041590310     PISSN: 10635157     EISSN: 1076836X     Source Type: Journal    
DOI: 10.1093/sysbio/syx068     Document Type: Article
Times cited : (297)

References (51)
  • 2
    • 0016355478
    • A new look at the statistical model identification
    • Akaike H. 1974. A new look at the statistical model identification. IEEE Trans. Automat. Control 19:716-723.
    • (1974) IEEE Trans. Automat. Control , vol.19 , pp. 716-723
    • Akaike, H.1
  • 6
    • 0036061865
    • A phylogenomic approach to bacterial phylogeny: Evidence of a core of genes sharing a common history
    • Daubin V., Gouy M., Perrière G. 2002. A phylogenomic approach to bacterial phylogeny: evidence of a core of genes sharing a common history. Genome Res. 12:1080-1090.
    • (2002) Genome Res. , vol.12 , pp. 1080-1090
    • Daubin, V.1    Gouy, M.2    Perrière, G.3
  • 7
    • 17744394753
    • Phylogenomics and the reconstruction of the tree of life
    • Delsuc F., Brinkmann H., Philippe H. 2005. Phylogenomics and the reconstruction of the tree of life. Nat. Rev. Genet. 6:361-375.
    • (2005) Nat. Rev. Genet. , vol.6 , pp. 361-375
    • Delsuc, F.1    Brinkmann, H.2    Philippe, H.3
  • 10
    • 84959798530
    • Cases in which parsimony or compatibility methods will be positively misleading
    • Felsenstein J. 1978. Cases in which parsimony or compatibility methods will be positively misleading. Syst. Zool. 27:401-410.
    • (1978) Syst. Zool. , vol.27 , pp. 401-410
    • Felsenstein, J.1
  • 11
    • 0031805465
    • Assessing the impact of secondary structure and solvent accessibility on protein evolution
    • Goldman N., Thorne J.L., Jones D.T. 1998. Assessing the impact of secondary structure and solvent accessibility on protein evolution. Genetics 149:445-458.
    • (1998) Genetics , vol.149 , pp. 445-458
    • Goldman, N.1    Thorne, J.L.2    Jones, D.T.3
  • 12
    • 77950806408
    • New algorithms and methods to estimate maximum-likelihood phylogenies: Assessing the performance of PhyML 3.0
    • Guindon S., Dufayard J.F., Lefort V., Anisimova M., Hordijk W., Gascuel O. 2010. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst. Biol. 59:307-321.
    • (2010) Syst. Biol. , vol.59 , pp. 307-321
    • Guindon, S.1    Dufayard, J.F.2    Lefort, V.3    Anisimova, M.4    Hordijk, W.5    Gascuel, O.6
  • 13
    • 0031875569
    • Evolutionary distances for protein-coding sequences: Modeling site-specific residue frequencies
    • Halpern A.L., Bruno W.J. 1998. Evolutionary distances for protein-coding sequences: modeling site-specific residue frequencies. Mol. Biol. Evol. 15:910-917.
    • (1998) Mol. Biol. Evol. , vol.15 , pp. 910-917
    • Halpern, A.L.1    Bruno, W.J.2
  • 15
    • 0026691182
    • The rapid generation of mutation data matrices from protein sequences
    • Jones D.T., Taylor W.R., Thornton J.M. 1992. The rapid generation of mutation data matrices from protein sequences. Comput. Appl. Biosci. 8:275-282.
    • (1992) Comput. Appl. Biosci. , vol.8 , pp. 275-282
    • Jones, D.T.1    Taylor, W.R.2    Thornton, J.M.3
  • 16
    • 0004033713
    • Probability and statistical inference
    • New York: Springer
    • Kalbfleisch, J.G. (1985). Probability and statistical inference. Statistical inference, Vol. 2. New York: Springer.
    • (1985) Statistical Inference , vol.2
    • Kalbfleisch, J.G.1
  • 19
    • 84861380450
    • PartitionFinder: Combined selection of partitioning schemes and substitution models for phylogenetic analyses
    • Lanfear R., Calcott B., Ho S.Y.W., Guindon S. 2012. PartitionFinder: combined selection of partitioning schemes and substitution models for phylogenetic analyses. Mol. Biol. Evol. 29:1695-1701.
    • (2012) Mol. Biol. Evol. , vol.29 , pp. 1695-1701
    • Lanfear, R.1    Calcott, B.2    Ho, S.Y.W.3    Guindon, S.4
  • 20
    • 34248159030
    • Suppression of long-branch attraction artefacts in the animal phylogeny using a site-heterogeneous model
    • Lartillot N., Brinkmann H., Philippe H. 2007. Suppression of long-branch attraction artefacts in the animal phylogeny using a site-heterogeneous model. BMC Evol. Biol. 7(1 Suppl):S4.
    • (2007) BMC Evol. Biol. , vol.7 , Issue.1 , pp. S4
    • Lartillot, N.1    Brinkmann, H.2    Philippe, H.3
  • 21
    • 2442691520
    • A Bayesian mixture model for across-site heterogeneities in the amino-acid replacement process
    • Lartillot N., Philippe H. 2004. A Bayesian mixture model for across-site heterogeneities in the amino-acid replacement process. Mol. Biol. Evol. 21:1095-1109.
    • (2004) Mol. Biol. Evol. , vol.21 , pp. 1095-1109
    • Lartillot, N.1    Philippe, H.2
  • 22
    • 84881629698
    • PhyloBayes MPI: Phylogenetic reconstruction with infinite mixtures of profiles in a parallel environment
    • Lartillot N., Rodrigue N., Stubbs D., Richer J. 2013. PhyloBayes MPI: phylogenetic reconstruction with infinite mixtures of profiles in a parallel environment. Syst. Biol. 62:611-615.
    • (2013) Syst. Biol. , vol.62 , pp. 611-615
    • Lartillot, N.1    Rodrigue, N.2    Stubbs, D.3    Richer, J.4
  • 23
    • 84866924443
    • Modeling protein evolution with several amino acid replacement matrices depending on site rates
    • Le S.Q., Dang C.C., Gascuel O. 2012. Modeling protein evolution with several amino acid replacement matrices depending on site rates. Mol. Biol. Evol. 29:2921-2936.
    • (2012) Mol. Biol. Evol. , vol.29 , pp. 2921-2936
    • Le, S.Q.1    Dang, C.C.2    Gascuel, O.3
  • 24
    • 45849154166
    • An improved general amino acid replacement matrix
    • Le S.Q., Gascuel O. 2008. An improved general amino acid replacement matrix. Mol. Biol. Evol. 25:1307-1320.
    • (2008) Mol. Biol. Evol. , vol.25 , pp. 1307-1320
    • Le, S.Q.1    Gascuel, O.2
  • 25
    • 77950851573
    • Accounting for solvent accessibility and secondary structure in protein phylogenetics is clearly beneficial
    • Le S.Q., Gascuel O. 2010. Accounting for solvent accessibility and secondary structure in protein phylogenetics is clearly beneficial. Syst. Biol. 59:277-287.
    • (2010) Syst. Biol. , vol.59 , pp. 277-287
    • Le, S.Q.1    Gascuel, O.2
  • 26
    • 53749090508
    • Empirical profile mixture models for phylogenetic reconstruction
    • Le S.Q., Gascuel O., Lartillot N. 2008a. Empirical profile mixture models for phylogenetic reconstruction. Bioinformatics. 24:2317-2323.
    • (2008) Bioinformatics. , vol.24 , pp. 2317-2323
    • Le, S.Q.1    Gascuel, O.2    Lartillot, N.3
  • 29
    • 84876519306
    • Ultrafast approximation for phylogenetic bootstrap
    • Minh B.Q., Nguyen M.A.T., von Haeseler A. 2013. Ultrafast approximation for phylogenetic bootstrap. Mol. Biol. Evol. 30:1188-1195.
    • (2013) Mol. Biol. Evol. , vol.30 , pp. 1188-1195
    • Minh, B.Q.1    Nguyen, M.A.T.2    Von Haeseler, A.3
  • 30
    • 0001831031
    • Consistent estimates based on partially consistent observations
    • Neyman J., Scott E.L. 1948. Consistent estimates based on partially consistent observations. Econometrica 16:1-32.
    • (1948) Econometrica , vol.16 , pp. 1-32
    • Neyman, J.1    Scott, E.L.2
  • 31
    • 84922362345
    • IQ-TREE: A fast and effective stochastic algorithm for estimating maximum likelihood phylogenies
    • Nguyen L.-T., Schmidt H.A., von Haeseler A., Minh B.Q. 2015. IQ-TREE: A fast and effective stochastic algorithm for estimating maximum likelihood phylogenies. Mol. Biol. Evol. 32:268-274.
    • (2015) Mol. Biol. Evol. , vol.32 , pp. 268-274
    • Nguyen, L.-T.1    Schmidt, H.A.2    Von Haeseler, A.3    Minh, B.Q.4
  • 34
    • 0036900111
    • Combining multiple data sets in a likelihood analysis: Which models are the best?
    • Pupko T., Huchon D., Cao Y., Okada N., Hasegawa M. 2002. Combining multiple data sets in a likelihood analysis: which models are the best? Mol. Biol. Evol. 19:2294-2307.
    • (2002) Mol. Biol. Evol. , vol.19 , pp. 2294-2307
    • Pupko, T.1    Huchon, D.2    Cao, Y.3    Okada, N.4    Hasegawa, M.5
  • 35
    • 0030928378
    • Seq-Gen: An application for the Monte Carlo simulation of DNA sequence evolution along phylogenetic trees
    • Rambaut A., Grassly N.C. 1997. Seq-Gen: an application for the Monte Carlo simulation of DNA sequence evolution along phylogenetic trees. Comput. Appl. Biosci. 13:235-238.
    • (1997) Comput. Appl. Biosci. , vol.13 , pp. 235-238
    • Rambaut, A.1    Grassly, N.C.2
  • 36
  • 37
    • 0019424782
    • Comparison of phylogenetic trees
    • Robinson D.R., Foulds L.R. 1981. Comparison of phylogenetic trees. Math. Biosci. 53:131-147.
    • (1981) Math. Biosci. , vol.53 , pp. 131-147
    • Robinson, D.R.1    Foulds, L.R.2
  • 38
    • 84876372052
    • On the statistical interpretation of site-specific variables in phylogeny-based substitution models
    • Rodrigue N. 2013. On the statistical interpretation of site-specific variables in phylogeny-based substitution models. Genetics 193:557-564.
    • (2013) Genetics , vol.193 , pp. 557-564
    • Rodrigue, N.1
  • 39
    • 0028071565
    • The HSSP database of protein structure-sequence alignments
    • Sander C., Schneider R. 1994. The HSSP database of protein structure-sequence alignments. Nucleic Acids Res. 22:3597-3599.
    • (1994) Nucleic Acids Res. , vol.22 , pp. 3597-3599
    • Sander, C.1    Schneider, R.2
  • 40
    • 84907319426
    • Asymptotic properties of maximum likelihood estimators and likelihood ratio tests under non-standard conditions
    • Self S., Liang K. 1987. Asymptotic properties of maximum likelihood estimators and likelihood ratio tests under non-standard conditions. J. Am. Stat. Assoc. 82:605-610.
    • (1987) J. Am. Stat. Assoc. , vol.82 , pp. 605-610
    • Self, S.1    Liang, K.2
  • 42
    • 0242487523
    • Estimation of rates-across-sites distributions in phylogenetic substitution models
    • Susko E., Field C., Blouin C., Roger A.J. 2003. Estimation of rates-across-sites distributions in phylogenetic substitution models. Syst. Biol. 52:594-603.
    • (2003) Syst. Biol. , vol.52 , pp. 594-603
    • Susko, E.1    Field, C.2    Blouin, C.3    Roger, A.J.4
  • 43
    • 3042545431
    • On inconsistency of the neighbor-joining, least squares, and minimum evolution estimation when substitution processes are incorrectly modeled
    • Susko E., Inagaki Y., Roger A.J. 2004. On inconsistency of the neighbor-joining, least squares, and minimum evolution estimation when substitution processes are incorrectly modeled. Mol. Biol. Evol. 21:1629-1642.
    • (2004) Mol. Biol. Evol. , vol.21 , pp. 1629-1642
    • Susko, E.1    Inagaki, Y.2    Roger, A.J.3
  • 44
    • 84943382227
    • Phylogenomic insights into animal evolution
    • Telford M.J., Budd G.E., Philippe H. 2015. Phylogenomic insights into animal evolution. Curr. Biol. 25:R876-R887.
    • (2015) Curr. Biol. , vol.25 , pp. R876-R887
    • Telford, M.J.1    Budd, G.E.2    Philippe, H.3
  • 45
    • 60049091295
    • A class frequency mixture model that adjusts for site specific amino acid frequencies and improves inference of protein phylogeny
    • Wang H.C., Li L., Susko E., Roger A.J. 2008. A class frequency mixture model that adjusts for site specific amino acid frequencies and improves inference of protein phylogeny. BMC Evol. Biol. 8:331.
    • (2008) BMC Evol. Biol. , vol.8 , pp. 331
    • Wang, H.C.1    Li, L.2    Susko, E.3    Roger, A.J.4
  • 46
    • 84897856309
    • An amino acid substitution-selection model adjusts residue fitness to improve phylogenetic estimation
    • Wang H.C., Susko E., Roger A.J. 2014. An amino acid substitution-selection model adjusts residue fitness to improve phylogenetic estimation. Mol. Biol. Evol. 31:779-792.
    • (2014) Mol. Biol. Evol. , vol.31 , pp. 779-792
    • Wang, H.C.1    Susko, E.2    Roger, A.J.3
  • 47
    • 85016938196
    • Who let the CAT out of the bag? Accurately dealing with substitutional heterogeneity in phylogenomic analyses
    • Whelan N.V., Halanych K.M. 2016. Who let the CAT out of the bag? Accurately dealing with substitutional heterogeneity in phylogenomic analyses. Syst. Biol. 66:232-255.
    • (2016) Syst. Biol. , vol.66 , pp. 232-255
    • Whelan, N.V.1    Halanych, K.M.2
  • 48
    • 84928974643
    • Error, signal, and the placement of Ctenophora sister to all other animals
    • Whelan N.V., Kocot K.M., Moroz L.L., Halanych K.M. 2015. Error, signal, and the placement of Ctenophora sister to all other animals. Proc. Natl. Acad. Sci. USA 112:5773-5778.
    • (2015) Proc. Natl. Acad. Sci. USA , vol.112 , pp. 5773-5778
    • Whelan, N.V.1    Kocot, K.M.2    Moroz, L.L.3    Halanych, K.M.4
  • 49
    • 0035031966
    • A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach
    • Whelan S., Goldman N. 2001. A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach. Mol. Biol. Evol. 18:691-699.
    • (2001) Mol. Biol. Evol. , vol.18 , pp. 691-699
    • Whelan, S.1    Goldman, N.2
  • 51
    • 0029970097
    • Maximum-Likelihood models for combined analyses of multiple sequence data
    • Yang Z. 1996. Maximum-Likelihood models for combined analyses of multiple sequence data. J. Mol. Evol. 42:587-596.
    • (1996) J. Mol. Evol. , vol.42 , pp. 587-596
    • Yang, Z.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.