메뉴 건너뛰기




Volumn 34, Issue 2, 2006, Pages 647-658

Hierarchical clustering algorithm for comprehensive orthologous-domain classification in multiple genomes

Author keywords

[No Author keywords available]

Indexed keywords

ANALYTIC METHOD; ARTICLE; CLASSIFICATION; CLUSTER ANALYSIS; COMPUTER PROGRAM; CONTROLLED STUDY; DATA ANALYSIS; GENE CLUSTER; GENETIC ALGORITHM; GENETIC DATABASE; GENOME ANALYSIS; INTERMETHOD COMPARISON; ORTHOLOGY; PARALOGY; PRIORITY JOURNAL;

EID: 32644443138     PISSN: 03051048     EISSN: None     Source Type: Journal    
DOI: 10.1093/nar/gkj448     Document Type: Article
Times cited : (63)

References (45)
  • 1
    • 0037305939 scopus 로고    scopus 로고
    • Domains, motifs and clusters in the protein universe
    • Liu,J. and Rost,B. (2003) Domains, motifs and clusters in the protein universe. Curr. Opin. Chem. Biol., 7, 5-11.
    • (2003) Curr. Opin. Chem. Biol. , vol.7 , pp. 5-11
    • Liu, J.1    Rost, B.2
  • 3
    • 0030925920 scopus 로고    scopus 로고
    • Pfam: A comprehensive database of protein domain families based on seed alignments
    • Sonnhammer,E.L., Eddy,S.R. and Durbin,R. (1997) Pfam: A comprehensive database of protein domain families based on seed alignments. Proteins, 28, 405-420.
    • (1997) Proteins , vol.28 , pp. 405-420
    • Sonnhammer, E.L.1    Eddy, S.R.2    Durbin, R.3
  • 4
    • 0032568655 scopus 로고    scopus 로고
    • SMART,a simple modular architecture research tool: Identification of signaling domains
    • Schultz,J., Milpetz,F., Bork,P. and Ponting,C.P. (1998)SMART,a simple modular architecture research tool: Identification of signaling domains. Proc. Natl Acad. Sci. USA, 95, 5857-5864.
    • (1998) Proc. Natl Acad. Sci. USA , vol.95 , pp. 5857-5864
    • Schultz, J.1    Milpetz, F.2    Bork, P.3    Ponting, C.P.4
  • 6
    • 0028218683 scopus 로고
    • Modular arrangement of proteins as inferred from analysis of homology
    • Sonnhammer,E.L. and Kahn,D. (1994) Modular arrangement of proteins as inferred from analysis of homology. Protein Sci., 3, 482-492.
    • (1994) Protein Sci. , vol.3 , pp. 482-492
    • Sonnhammer, E.L.1    Kahn, D.2
  • 7
    • 0031857779 scopus 로고    scopus 로고
    • Automated protein sequence database classification. II. Delineation of domain boundaries from sequence similarities
    • Gracy,J. and Argos,P. (1998) Automated protein sequence database classification. II. Delineation of domain boundaries from sequence similarities. Bioinformatics, 14, 174-187.
    • (1998) Bioinformatics , vol.14 , pp. 174-187
    • Gracy, J.1    Argos, P.2
  • 8
    • 0032726692 scopus 로고    scopus 로고
    • ProtoMap: Automatic classification of protein sequences, a hierarchy of protein families, and local maps of the protein space
    • Yona,G., Linial,N. and Linial,M. (1999) ProtoMap: Automatic classification of protein sequences, a hierarchy of protein families, and local maps of the protein space. Proteins, 37, 360-378.
    • (1999) Proteins , vol.37 , pp. 360-378
    • Yona, G.1    Linial, N.2    Linial, M.3
  • 9
    • 0036529479 scopus 로고    scopus 로고
    • An efficient algorithm for large-scale detection of protein families
    • Enright,A.J., Van Dongen,S. and Ouzounis,C.A. (2002) An efficient algorithm for large-scale detection of protein families. Nucleic Acids Res., 30, 1575-1584.
    • (2002) Nucleic Acids Res. , vol.30 , pp. 1575-1584
    • Enright, A.J.1    Van Dongen, S.2    Ouzounis, C.A.3
  • 10
    • 0037414465 scopus 로고    scopus 로고
    • Exhaustive enumeration of protein domain families
    • Heger,A. and Holm,L. (2003) Exhaustive enumeration of protein domain families. J. Mol. Biol., 328, 749-767.
    • (2003) J. Mol. Biol. , vol.328 , pp. 749-767
    • Heger, A.1    Holm, L.2
  • 11
    • 0033944826 scopus 로고    scopus 로고
    • GeneRAGE: A robust algorithm for sequence clustering and domain detection
    • Enright,A.J. and Ouzounis,C.A. (2000) GeneRAGE: A robust algorithm for sequence clustering and domain detection. Bioinformatics, 16, 451-457.
    • (2000) Bioinformatics , vol.16 , pp. 451-457
    • Enright, A.J.1    Ouzounis, C.A.2
  • 12
    • 0014800108 scopus 로고
    • Distinguishing homologous from analogous proteins
    • Fitch,W.M. (1970) Distinguishing homologous from analogous proteins. Syst. Zool., 19, 99-113.
    • (1970) Syst. Zool. , vol.19 , pp. 99-113
    • Fitch, W.M.1
  • 13
    • 0000365320 scopus 로고
    • Fitting the gene lineage into its species lineage, a parsimony strategy illustrated by cladograms constructed from globin sequences
    • Goodman,M., Czelusniak,J., Moore,W.M., Romero-Herrera,A.E. and Matsuda,G. (1979) Fitting the gene lineage into its species lineage, a parsimony strategy illustrated by cladograms constructed from globin sequences. Syst. Zool., 28, 132-163.
    • (1979) Syst. Zool. , vol.28 , pp. 132-163
    • Goodman, M.1    Czelusniak, J.2    Moore, W.M.3    Romero-Herrera, A.E.4    Matsuda, G.5
  • 14
    • 12044253257 scopus 로고
    • Maps between trees and cladistic analysis of historical associations among genes, organisms, and areas
    • Page,R.D.M. (1994) Maps between trees and cladistic analysis of historical associations among genes, organisms, and areas. Syst. Biol., 43, 58-77.
    • (1994) Syst. Biol. , vol.43 , pp. 58-77
    • Page, R.D.M.1
  • 16
    • 0033618555 scopus 로고    scopus 로고
    • Detecting protein function and protein-protein interactions from genome sequences
    • Marcotte,E.M., Pellegrini,M., Ng,H.L., Rice,D.W., Yeates,T.O. and Eisenberg,D. (1999) Detecting protein function and protein-protein interactions from genome sequences. Science, 285, 751-753.
    • (1999) Science , vol.285 , pp. 751-753
    • Marcotte, E.M.1    Pellegrini, M.2    Ng, H.L.3    Rice, D.W.4    Yeates, T.O.5    Eisenberg, D.6
  • 17
    • 0033523989 scopus 로고    scopus 로고
    • Protein interaction maps for complete genomes based on gene fusion events
    • Enright,A.J., Iliopoulos,I., Kyrpides,N.C. and Ouzounis,C.A. (1999) Protein interaction maps for complete genomes based on gene fusion events. Nature, 402, 86-90.
    • (1999) Nature , vol.402 , pp. 86-90
    • Enright, A.J.1    Iliopoulos, I.2    Kyrpides, N.C.3    Ouzounis, C.A.4
  • 20
    • 0034084865 scopus 로고    scopus 로고
    • Who's your neighbor? New computational approaches for functional genomics
    • Galperin,M.Y. and Koonin,E.V. (2000) Who's your neighbor? New computational approaches for functional genomics. Nat. Biotechnol., 18, 609-613.
    • (2000) Nat. Biotechnol. , vol.18 , pp. 609-613
    • Galperin, M.Y.1    Koonin, E.V.2
  • 21
    • 0346505465 scopus 로고    scopus 로고
    • A cross-genomic approach for systematic mapping of phenotypic traits to genes
    • Jim,K., Parmar,K., Singh,M. and Tavazoie,S. (2004) A cross-genomic approach for systematic mapping of phenotypic traits to genes. Genome Res., 14, 109-115.
    • (2004) Genome Res. , vol.14 , pp. 109-115
    • Jim, K.1    Parmar, K.2    Singh, M.3    Tavazoie, S.4
  • 22
    • 0030660581 scopus 로고    scopus 로고
    • A genomic perspective on protein families
    • Tatusov,R.L., Koonin,E.V. and Lipman,D.J. (1997) A genomic perspective on protein families. Science, 278, 631-637.
    • (1997) Science , vol.278 , pp. 631-637
    • Tatusov, R.L.1    Koonin, E.V.2    Lipman, D.J.3
  • 24
    • 0035861990 scopus 로고    scopus 로고
    • Automatic clustering of orthologs and in-paralogs from pairwise species comparisons
    • Remm,M., Storm,C.E. and Sonnhammer,E.L. (2001) Automatic clustering of orthologs and in-paralogs from pairwise species comparisons. J. Mol. Biol., 314, 1041-1052.
    • (2001) J. Mol. Biol. , vol.314 , pp. 1041-1052
    • Remm, M.1    Storm, C.E.2    Sonnhammer, E.L.3
  • 25
    • 0036327988 scopus 로고    scopus 로고
    • Clustering of proximal sequence space for the identification of protein families
    • Abascal,F. and Valencia,A. (2002) Clustering of proximal sequence space for the identification of protein families. Bioinformatics, 18, 908-921.
    • (2002) Bioinformatics , vol.18 , pp. 908-921
    • Abascal, F.1    Valencia, A.2
  • 26
    • 0013017592 scopus 로고    scopus 로고
    • RIO: Analyzing proteomes by automated phylogenomics using resampled inference of orthologs
    • Zmasek,C.M. and Eddy,S.R. (2002) RIO: Analyzing proteomes by automated phylogenomics using resampled inference of orthologs. BMC Bioinformatics, 3, 14.
    • (2002) BMC Bioinformatics , vol.3 , pp. 14
    • Zmasek, C.M.1    Eddy, S.R.2
  • 27
    • 0036173130 scopus 로고    scopus 로고
    • Automated ortholog inference from phylogenetic trees and calculation of orthology reliability
    • Storm,C.E. and Sonnhammer,E.L. (2002) Automated ortholog inference from phylogenetic trees and calculation of orthology reliability. Bioinformatics, 18, 92-99.
    • (2002) Bioinformatics , vol.18 , pp. 92-99
    • Storm, C.E.1    Sonnhammer, E.L.2
  • 28
    • 0036170118 scopus 로고    scopus 로고
    • Improved database searches for orthologous sequences by conditioning on outgroup sequences
    • Cotter,P.J., Caffrey,D.R. and Shields,D.C. (2002) Improved database searches for orthologous sequences by conditioning on outgroup sequences. Bioinformatics, 18, 83-91.
    • (2002) Bioinformatics , vol.18 , pp. 83-91
    • Cotter, P.J.1    Caffrey, D.R.2    Shields, D.C.3
  • 29
    • 0642340503 scopus 로고    scopus 로고
    • OrthoParaMap: Distinguishing orthologs from paralogs by integrating comparativegenomedata and gene phylogenies
    • Cannon,S.B. and Young,N.D. (2003) OrthoParaMap: Distinguishing orthologs from paralogs by integrating comparativegenomedata and gene phylogenies. BMC Bioinformatics, 4, 35.
    • (2003) BMC Bioinformatics , vol.4 , pp. 35
    • Cannon, S.B.1    Young, N.D.2
  • 30
    • 0141519279 scopus 로고    scopus 로고
    • OrthoMCL: Identification of ortholog groups for eukaryotic genomes
    • Li,L., Stoeckert,C.J.,Jr and Roos,D.S. (2003) OrthoMCL: Identification of ortholog groups for eukaryotic genomes. Genome Res., 13, 2178-2189.
    • (2003) Genome Res. , vol.13 , pp. 2178-2189
    • Li, L.1    Stoeckert Jr, C.J.2    Roos, D.S.3
  • 31
    • 0142028973 scopus 로고    scopus 로고
    • Comprehensive analysis of orthologous protein domains using the HOPS database
    • Storm,C.E. and Sonnhammer,E.L. (2003) Comprehensive analysis of orthologous protein domains using the HOPS database. Genome Res., 13, 2353-2362.
    • (2003) Genome Res. , vol.13 , pp. 2353-2362
    • Storm, C.E.1    Sonnhammer, E.L.2
  • 32
    • 0037249403 scopus 로고    scopus 로고
    • MBGD: Microbial genome database for comparative analysis
    • Uchiyama,I. (2003) MBGD: Microbial genome database for comparative analysis. Nucleic Acids Res., 31, 58-62.
    • (2003) Nucleic Acids Res. , vol.31 , pp. 58-62
    • Uchiyama, I.1
  • 34
    • 0019887799 scopus 로고
    • Identification of common molecular subsequences
    • Smith,T.F. and Waterman,M.S. (1981) Identification of common molecular subsequences. J. Mol. Biol., 147, 195-197.
    • (1981) J. Mol. Biol. , vol.147 , pp. 195-197
    • Smith, T.F.1    Waterman, M.S.2
  • 35
    • 0026691182 scopus 로고
    • The rapid generation of mutation data matrices from protein sequences
    • Jones,D.T., Taylor,W.R. and Thornton,J.M. (1992) The rapid generation of mutation data matrices from protein sequences. Comput. Appl. Biosci., 8, 275-282.
    • (1992) Comput. Appl. Biosci. , vol.8 , pp. 275-282
    • Jones, D.T.1    Taylor, W.R.2    Thornton, J.M.3
  • 36
    • 0004069901 scopus 로고
    • Numerical taxonomy
    • Freeman, San Francisco, CA
    • Sneath,P.H.A. and Sokal,R.R. (1973) Numerical taxonomy. Freeman, San Francisco, CA.
    • (1973)
    • Sneath, P.H.A.1    Sokal, R.R.2
  • 37
    • 0033621536 scopus 로고    scopus 로고
    • Genome evolution. Gene fusion versus gene fission
    • Snel,B., Bork,P. and Huynen,M. (2000) Genome evolution. Gene fusion versus gene fission. Trends Genet., 16, 9-11.
    • (2000) Trends Genet. , vol.16 , pp. 9-11
    • Snel, B.1    Bork, P.2    Huynen, M.3
  • 39
    • 0031722273 scopus 로고    scopus 로고
    • Functional dissection of the molybdate-responsive transcription regulator, ModE, from Escherichia coli
    • McNicholas,P.M., Mazzotta,M.M., Rech,S.A. and Gunsalus,R.P. (1998) Functional dissection of the molybdate-responsive transcription regulator, ModE, from Escherichia coli. J. Bacteriol., 180, 4638-4643.
    • (1998) J. Bacteriol. , vol.180 , pp. 4638-4643
    • McNicholas, P.M.1    Mazzotta, M.M.2    Rech, S.A.3    Gunsalus, R.P.4
  • 40
    • 0033559680 scopus 로고    scopus 로고
    • The high-resolution crystal structure of the molybdate-dependent transcriptional regulator (ModE) from Escherichia coli: A novel combination of domain folds
    • Hall,D.R., Gourley,D.G., Leonard,G.A., Duke,E.M., Anderson,L.A., Boxer,D.H. and Hunter,W.N. (1999) The high-resolution crystal structure of the molybdate-dependent transcriptional regulator (ModE) from Escherichia coli: A novel combination of domain folds. EMBO J., 18, 1435-1446.
    • (1999) EMBO J. , vol.18 , pp. 1435-1446
    • Hall, D.R.1    Gourley, D.G.2    Leonard, G.A.3    Duke, E.M.4    Anderson, L.A.5    Boxer, D.H.6    Hunter, W.N.7
  • 41
    • 0036889686 scopus 로고    scopus 로고
    • Orthology, paralogy and proposed classification for paralog subtypes
    • Sonnhammer,E.L. and Koonin,E.V. (2002) Orthology, paralogy and proposed classification for paralog subtypes. Trends Genet., 18, 619-620.
    • (2002) Trends Genet. , vol.18 , pp. 619-620
    • Sonnhammer, E.L.1    Koonin, E.V.2
  • 42
    • 0034178257 scopus 로고    scopus 로고
    • Homology a personal view on some of the problems
    • Fitch,W.M. (2000) Homology a personal view on some of the problems. Trends Genet., 16, 227-231.
    • (2000) Trends Genet. , vol.16 , pp. 227-231
    • Fitch, W.M.1
  • 43
    • 0033740039 scopus 로고    scopus 로고
    • Towards a covering set of protein family profiles
    • Heger,A. and Holm,L. (2000) Towards a covering set of protein family profiles. Prog. Biophys. Mol. Biol., 73, 321-337.
    • (2000) Prog. Biophys. Mol. Biol. , vol.73 , pp. 321-337
    • Heger, A.1    Holm, L.2
  • 45
    • 0001899680 scopus 로고    scopus 로고
    • The metric space of proteins-comparative study of clustering algorithms
    • Sasson,O., Linial,N. and Linial,M. (2002) The metric space of proteins-comparative study of clustering algorithms. Bioinformatics, 18, S14-S21.
    • (2002) Bioinformatics , vol.18
    • Sasson, O.1    Linial, N.2    Linial, M.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.