2013, Pages 1-24

Automated Sequence-Based Approaches for Identifying Domain Families

Author keywords

Automatic domain delineation algorithm (ADDA); Cysteine free proteins; Cysteine rich domains; Protein domain families; Quality assessment; Sequence clustering algorithms; Sequence space graph

Indexed keywords

AMINO ACIDS; PROTEINS

EID: 85016017902     PISSN: None     EISSN: None     Source Type: Book    
DOI: 10.1002/9781118743089.ch1     Document Type: Chapter
Times cited: 1

References (93)
  • 4
    • 0034777598
    • Clustering protein sequences-structure prediction by transitive homology
    • Bolten, E., Schliep, A., Schneckener, S., Schomburg, D., and Schrader, R. (2001) Clustering protein sequences-structure prediction by transitive homology. Bioinformatics, 17, 935-941.
    • (2001) Bioinformatics , vol.17 , pp. 935-941
    • Bolten, E.1    Schliep, A.2    Schneckener, S.3    Schomburg, D.4    Schrader, R.5
  • 6
    • 37849023306
    • Assessing performance of orthology detection strategies applied to eukaryotic genomes
    • Chen, F., Mackey, A.J., Vermunt, J.K., and Roos, D.S. (2007) Assessing performance of orthology detection strategies applied to eukaryotic genomes. PLoS One, 2, e383.
    • (2007) PLoS One , vol.2 , pp. e383
    • Chen, F.1    Mackey, A.J.2    Vermunt, J.K.3    Roos, D.S.4
  • 7
    • 77956938868
    • DomSVR: domain boundary prediction with support vector regression from sequence information alone
    • Chen, P., Liu, C., Burge, L., Li, J., Mohammad, M., Southerland, W., Gloster, C., and Wang, B. (2010) DomSVR: domain boundary prediction with support vector regression from sequence information alone. Amino Acids, 39, 713-726.
    • (2010) Amino Acids , vol.39 , pp. 713-726
    • Chen, P.1    Liu, C.2    Burge, L.3    Li, J.4    Mohammad, M.5    Southerland, W.6    Gloster, C.7    Wang, B.8
  • 8
    • 33745101459
    • DOMpro: protein domain prediction using profiles, secondary structure, relative solvent accessibility, and recursive neural networks
    • Cheng, J., Sweredoski, M.J., and Baldi, P. (2006) DOMpro: protein domain prediction using profiles, secondary structure, relative solvent accessibility, and recursive neural networks. Data Min Knowl Discov, 13, 1-10.
    • (2006) Data Min Knowl Discov , vol.13 , pp. 1-10
    • Cheng, J.1    Sweredoski, M.J.2    Baldi, P.3
  • 10
    • 0020649886
    • Establishing homologies in protein sequences
    • Dayhoff, M.O., Barker, W.C., and Hunt, L.T. (1983) Establishing homologies in protein sequences. Methods Enzymol, 91, 524-545.
    • (1983) Methods Enzymol , vol.91 , pp. 524-545
    • Dayhoff, M.O.1    Barker, W.C.2    Hunt, L.T.3
  • 11
    • 0031841279
    • The HSSP database of protein structure-sequence alignments and family profiles
    • Dodge, C., Schneider, R., and Sander, C. (1998) The HSSP database of protein structure-sequence alignments and family profiles. Nucleic Acids Res, 26, 313-315.
    • (1998) Nucleic Acids Res , vol.26 , pp. 313-315
    • Dodge, C.1    Schneider, R.2    Sander, C.3
  • 12
    • 0027364941
    • Evolutionarily mobile modules in proteins
    • Doolittle, R.F. and Bork, P. (1993) Evolutionarily mobile modules in proteins. Sci Am, 269, 50-56.
    • (1993) Sci Am , vol.269 , pp. 50-56
    • Doolittle, R.F.1    Bork, P.2
  • 13
    • 21744461895
    • Armadillo: domain boundary prediction by amino acid composition
    • Dumontier, M., Yao, R., Feldman, H.J., and Hogue, C.W.V. (2005) Armadillo: domain boundary prediction by amino acid composition. J Mol Biol, 350, 1061-1073.
    • (2005) J Mol Biol , vol.350 , pp. 1061-1073
    • Dumontier, M.1    Yao, R.2    Feldman, H.J.3    Hogue, C.W.V.4
  • 14
    • 0031743421
    • Profile hidden Markov models
    • Eddy, S.R. (1998) Profile hidden Markov models. Bioinformatics, 14, 755-763.
    • (1998) Bioinformatics , vol.14 , pp. 755-763
    • Eddy, S.R.1
  • 15
    • 0031576913
    • On punctuated equilibria
    • Eldredge, N. and Gould, S.J. (1997) On punctuated equilibria. Science, 276, 338-341.
    • (1997) Science , vol.276 , pp. 338-341
    • Eldredge, N.1    Gould, S.J.2
  • 16
    • 0036529479
    • An efficient algorithm for large-scale detection of protein families
    • Enright, A.J., Van Dongen, S., and Ouzounis, C.A. (2002) An efficient algorithm for large-scale detection of protein families. Nucleic Acids Res, 30, 1575-1584.
    • (2002) Nucleic Acids Res , vol.30 , pp. 1575-1584
    • Enright, A.J.1    Van Dongen, S.2    Ouzounis, C.A.3
  • 17
    • 0033944826
    • GeneRAGE: a robust algorithm for sequence clustering and domain detection
    • Enright, A.J. and Ouzounis, C.A. (2000) GeneRAGE: a robust algorithm for sequence clustering and domain detection. Bioinformatics, 16, 451-457.
    • (2000) Bioinformatics , vol.16 , pp. 451-457
    • Enright, A.J.1    Ouzounis, C.A.2
  • 19
    • 0014800108
    • Distinguishing homologous from analogous proteins
    • Fitch, W.M. (1970) Distinguishing homologous from analogous proteins. Syst Zool, 19, 99-113.
    • (1970) Syst Zool , vol.19 , pp. 99-113
    • Fitch, W.M.1
  • 21
    • 33847172327
    • Clustering by passing messages between data points
    • Frey, B.J. and Dueck, D. (2007) Clustering by passing messages between data points. Science, 315, 972-976.
    • (2007) Science , vol.315 , pp. 972-976
    • Frey, B.J.1    Dueck, D.2
  • 23
    • 0037377548
    • Prediction of protein domain boundaries from sequence alone
    • Galzitskaya, O.V. and Melnik, B.S. (2003) Prediction of protein domain boundaries from sequence alone. Protein Sci, 12, 696-701.
    • (2003) Protein Sci , vol.12 , pp. 696-701
    • Galzitskaya, O.V.1    Melnik, B.S.2
  • 24
    • 0036306348
    • SnapDRAGON: a method to delineate protein structural domains from sequence data
    • George, R.A. and Heringa, J. (2002) SnapDRAGON: a method to delineate protein structural domains from sequence data. J Mol Biol, 316, 839-851.
    • (2002) J Mol Biol , vol.316 , pp. 839-851
    • George, R.A.1    Heringa, J.2
  • 25
    • 0034710876
    • Coupled two-way clustering analysis of gene microarray data
    • Getz, G., Levine, E., and Domany, E. (2000) Coupled two-way clustering analysis of gene microarray data. Proc Natl Acad Sci USA, 97, 12079-12084.
    • (2000) Proc Natl Acad Sci USA , vol.97 , pp. 12079-12084
    • Getz, G.1    Levine, E.2    Domany, E.3
  • 26
    • 0035798406
    • Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structure
    • Gough, J., Karplus, K., Hughey, R., and Chothia, C. (2001) Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structure. J Mol Biol, 313, 903-919.
    • (2001) J Mol Biol , vol.313 , pp. 903-919
    • Gough, J.1    Karplus, K.2    Hughey, R.3    Chothia, C.4
  • 27
    • 0033563522
    • Whole genome protein domain analysis using a new method for domain clustering
    • Gouzy, J., Corpet, F., and Kahn, D. (1999) Whole genome protein domain analysis using a new method for domain clustering. Comput Chem, 23, 333-340.
    • (1999) Comput Chem , vol.23 , pp. 333-340
    • Gouzy, J.1    Corpet, F.2    Kahn, D.3
  • 28
    • 0023375315
    • Profile analysis: detection of distantly related proteins
    • Gribskov, M., McLachlan, A.D., and Eisenberg, D. (1987) Profile analysis: detection of distantly related proteins. Proc Natl Acad Sci USA, 84, 4355-4358.
    • (1987) Proc Natl Acad Sci USA , vol.84 , pp. 4355-4358
    • Gribskov, M.1    McLachlan, A.D.2    Eisenberg, D.3
  • 29
    • 0031744176
    • Domain identification by clustering sequence alignments
    • Guan, X. and Du, L. (1998) Domain identification by clustering sequence alignments. Bioinformatics, 14, 783-788.
    • (1998) Bioinformatics , vol.14 , pp. 783-788
    • Guan, X.1    Du, L.2
  • 30
    • 0034333196
    • Rapid automatic detection and alignment of repeats in protein sequences
    • Heger, A. and Holm, L. (2000) Rapid automatic detection and alignment of repeats in protein sequences. Proteins Struct Funct Bioinform, 41, 224-237.
    • (2000) Proteins Struct Funct Bioinform , vol.41 , pp. 224-237
    • Heger, A.1    Holm, L.2
  • 31
    • 0035070578
    • Picasso: generating a covering set of protein family profiles
    • Heger, A. and Holm, L. (2001) Picasso: generating a covering set of protein family profiles. Bioinformatics, 17, 272-279.
    • (2001) Bioinformatics , vol.17 , pp. 272-279
    • Heger, A.1    Holm, L.2
  • 32
    • 0037414465
    • Exhaustive enumeration of protein domain families
    • Heger, A. and Holm, L. (2003) Exhaustive enumeration of protein domain families. J Mol Biol, 328, 749-767.
    • (2003) J Mol Biol , vol.328 , pp. 749-767
    • Heger, A.1    Holm, L.2
  • 34
    • 34548733641
    • The global trace graph, a novel paradigm for searching protein sequence databases
    • Heger, A., Mallick, S., Wilton, C., and Holm, L. (2007) The global trace graph, a novel paradigm for searching protein sequence databases. Bioinformatics, 23, 2361-2367.
    • (2007) Bioinformatics , vol.23 , pp. 2361-2367
    • Heger, A.1    Mallick, S.2    Wilton, C.3    Holm, L.4
  • 35
    • 0027491666
    • A method to recognize distant repeats in protein sequences
    • Heringa, J. and Argos, P. (1993) A method to recognize distant repeats in protein sequences. Proteins, 17, 391-411.
    • (1993) Proteins , vol.17 , pp. 391-411
    • Heringa, J.1    Argos, P.2
  • 36
    • 0031829372
    • Removing near-neighbour redundancy from large protein sequence collections
    • Holm, L. and Sander, C. (1998a) Removing near-neighbour redundancy from large protein sequence collections. Bioinformatics, 14, 423-429.
    • (1998) Bioinformatics , vol.14 , pp. 423-429
    • Holm, L.1    Sander, C.2
  • 37
    • 0031865006
    • Touring protein fold space with Dali/FSSP
    • Holm, L. and Sander, C. (1998b) Touring protein fold space with Dali/FSSP. Nucleic Acids Res, 26, 316-319.
    • (1998) Nucleic Acids Res , vol.26 , pp. 316-319
    • Holm, L.1    Sander, C.2
  • 39
    • 66349138035
    • Family classification without domain chaining
    • Joseph, J.M. and Durand, D. (2009) Family classification without domain chaining. Bioinformatics, 25, i45-i53.
    • (2009) Bioinformatics , vol.25 , pp. i45-i53
    • Joseph, J.M.1    Durand, D.2
  • 40
    • 34548605214
    • CLUSS: clustering of protein sequences based on a new similarity measure
    • Kelil, A., Wang, S., Brzezinski, R., and Fleury, A. (2007) CLUSS: clustering of protein sequences based on a new similarity measure. BMC Bioinformatics, 8, 286.
    • (2007) BMC Bioinformatics , vol.8 , pp. 286
    • Kelil, A.1    Wang, S.2    Brzezinski, R.3    Fleury, A.4
  • 41
    • 30344438515
    • Automated prediction of domain boundaries in CASP6 targets using Ginzu and RosettaDOM
    • Kim, D.E., Chivian, D., Malmström, L., and Baker, D. (2005) Automated prediction of domain boundaries in CASP6 targets using Ginzu and RosettaDOM. Proteins, 61(7 Suppl), 193-200.
    • (2005) Proteins , vol.61 , Issue.7 , pp. 193-200
    • Kim, D.E.1    Chivian, D.2    Malmström, L.3    Baker, D.4
  • 42
    • 13244291417
    • Large scale hierarchical clustering of protein sequences
    • Krause, A., Stoye, J., and Vingron, M. (2005) Large scale hierarchical clustering of protein sequences. BMC Bioinformatics, 6, 15.
    • (2005) BMC Bioinformatics , vol.6 , pp. 15
    • Krause, A.1    Stoye, J.2    Vingron, M.3
  • 43
    • 0031876711
    • A set-theoretic approach to database searching and clustering
    • Krause, A. and Vingron, M. (1998) A set-theoretic approach to database searching and clustering. Bioinformatics, 14, 430-438.
    • (1998) Bioinformatics , vol.14 , pp. 430-438
    • Krause, A.1    Vingron, M.2
  • 44
    • 77954198131
    • A low-polynomial algorithm for assembling clusters of orthologous groups from intergenomic symmetric best matches
    • Kristensen, D.M., Kannan, L., Coleman, M.K., Wolf, Y.I., Sorokin, A., Koonin, E.V., and Mushegian, A. (2010) A low-polynomial algorithm for assembling clusters of orthologous groups from intergenomic symmetric best matches. Bioinformatics, 26, 1481-1487.
    • (2010) Bioinformatics , vol.26 , pp. 1481-1487
    • Kristensen, D.M.1    Kannan, L.2    Coleman, M.K.3    Wolf, Y.I.4    Sorokin, A.5    Koonin, E.V.6    Mushegian, A.7
  • 46
    • 58149194624
    • SMART 6: recent updates and new developments
    • Letunic, I., Doerks, T., and Bork, P. (2009) SMART 6: recent updates and new developments. Nucleic Acids Res, 37, D229-D232.
    • (2009) Nucleic Acids Res , vol.37 , pp. D229-D232
    • Letunic, I.1    Doerks, T.2    Bork, P.3
  • 47
    • 0141519279
    • OrthoMCL: identification of ortholog groups for eukaryotic genomes
    • Li, L., Stoeckert, C.J., and Roos, D.S. (2003) OrthoMCL: identification of ortholog groups for eukaryotic genomes. Genome Res, 13, 2178-2189.
    • (2003) Genome Res , vol.13 , pp. 2178-2189
    • Li, L.1    Stoeckert, C.J.2    Roos, D.S.3
  • 48
    • 0035072551
    • Clustering of highly homologous sequences to reduce the size of large protein databases
    • Li, W., Jaroszewski, L., and Godzik, A. (2001) Clustering of highly homologous sequences to reduce the size of large protein databases. Bioinformatics, 17, 282-283.
    • (2001) Bioinformatics , vol.17 , pp. 282-283
    • Li, W.1    Jaroszewski, L.2    Godzik, A.3
  • 49
    • 0036169928
    • Tolerating some redundancy significantly speeds up clustering of large protein databases
    • Li, W., Jaroszewski, L., and Godzik, A. (2002) Tolerating some redundancy significantly speeds up clustering of large protein databases. Bioinformatics, 18, 77-82.
    • (2002) Bioinformatics , vol.18 , pp. 77-82
    • Li, W.1    Jaroszewski, L.2    Godzik, A.3
  • 50
    • 3242891265
    • CHOP: parsing proteins into structural domains
    • Liu, J. and Rost, B. (2004) CHOP: parsing proteins into structural domains. Nucleic Acids Res, 32, W569-W571.
    • (2004) Nucleic Acids Res , vol.32 , pp. W569-W571
    • Liu, J.1    Rost, B.2
  • 51
    • 46249133773
    • Efficient algorithms for accurate hierarchical clustering of huge datasets: tackling the entire protein space
    • Loewenstein, Y., Portugaly, E., Fromer, M., and Linial, M. (2008) Efficient algorithms for accurate hierarchical clustering of huge datasets: tackling the entire protein space. Bioinformatics, 24, i41-i49.
    • (2008) Bioinformatics , vol.24 , pp. i41-i49
    • Loewenstein, Y.1    Portugaly, E.2    Fromer, M.3    Linial, M.4
  • 53
    • 0036893072
    • Rapid protein domain assignment from amino acid sequence using predicted secondary structure
    • Marsden, R.L., McGuffin, L.J., and Jones, D.T. (2002) Rapid protein domain assignment from amino acid sequence using predicted secondary structure. Protein Sci, 11, 2814-2824.
    • (2002) Protein Sci , vol.11 , pp. 2814-2824
    • Marsden, R.L.1    McGuffin, L.J.2    Jones, D.T.3
  • 54
    • 0028961335
    • SCOP: a structural classification of proteins database for the investigation of sequences and structures
    • Murzin, A.G., Brenner, S.E., Hubbard, T., and Chothia, C. (1995) SCOP: a structural classification of proteins database for the investigation of sequences and structures. J Mol Biol, 247, 536-540.
    • (1995) J Mol Biol , vol.247 , pp. 536-540
    • Murzin, A.G.1    Brenner, S.E.2    Hubbard, T.3    Chothia, C.4
  • 55
    • 3142680264
    • Automatic prediction of protein domains from sequence information using a hybrid learning system
    • Nagarajan, N. and Yona, G. (2004) Automatic prediction of protein domains from sequence information using a hybrid learning system. Bioinformatics, 20, 1335-1360.
    • (2004) Bioinformatics , vol.20 , pp. 1335-1360
    • Nagarajan, N.1    Yona, G.2
  • 56
    • 77950430912
    • SCPS: a fast implementation of a spectral method for detecting protein families on a genome-wide scale
    • Nepusz, T., Sasidharan, R., and Paccanaro, A. (2010) SCPS: a fast implementation of a spectral method for detecting protein families on a genome-wide scale. BMC Bioinformatics, 11, 120.
    • (2010) BMC Bioinformatics , vol.11 , pp. 120
    • Nepusz, T.1    Sasidharan, R.2    Paccanaro, A.3
  • 57
    • 0029358115
    • Parallel algorithms for hierarchical clustering
    • Olson, C.F. (1995) Parallel algorithms for hierarchical clustering. Parallel Comput, 21, 1313-1325.
    • (1995) Parallel Comput , vol.21 , pp. 1313-1325
    • Olson, C.F.1
  • 58
  • 59
    • 0033940118
    • RSDB: representative protein sequence databases have high information content
    • Park, J., Holm, L., Heger, A., and Chothia, C. (2000) RSDB: representative protein sequence databases have high information content. Bioinformatics, 16, 458-464.
    • (2000) Bioinformatics , vol.16 , pp. 458-464
    • Park, J.1    Holm, L.2    Heger, A.3    Chothia, C.4
  • 60
    • 0031576361
    • Intermediate sequences increase the detection of homology between sequences
    • Park, J., Teichmann, S.A., Hubbard, T., and Chothia, C. (1997) Intermediate sequences increase the detection of homology between sequences. J Mol Biol, 273, 349-354.
    • (1997) J Mol Biol , vol.273 , pp. 349-354
    • Park, J.1    Teichmann, S.A.2    Hubbard, T.3    Chothia, C.4
  • 61
    • 0023989064
    • Improved tools for biological sequence comparison
    • Pearson, W.R. and Lipman, D.J. (1988) Improved tools for biological sequence comparison. Proc Natl Acad Sci USA, 85, 2444-2448.
    • (1988) Proc Natl Acad Sci USA , vol.85 , pp. 2444-2448
    • Pearson, W.R.1    Lipman, D.J.2
  • 62
    • 0033151949
    • A fast algorithm for genome-wide analysis of proteins with repeated sequences
    • Pellegrini, M., Marcotte, E.M., and Yeates, T.O. (1999) A fast algorithm for genome-wide analysis of proteins with repeated sequences. Proteins, 35, 440-446.
    • (1999) Proteins , vol.35 , pp. 440-446
    • Pellegrini, M.1    Marcotte, E.M.2    Yeates, T.O.3
  • 64
    • 0346652457
    • ProClust: improved clustering of protein sequences with an extended graph-based approach
    • Pipenbacher, P., Schliep, A., Schneckener, S., Schönhuth, A., Schomburg, D., and Schrader, R. (2002) ProClust: improved clustering of protein sequences with an extended graph-based approach. Bioinformatics, 18(2 Suppl), S182-S191.
    • (2002) Bioinformatics , vol.18 , Issue.2 , pp. S182-S191
    • Pipenbacher, P.1    Schliep, A.2    Schneckener, S.3    Schönhuth, A.4    Schomburg, D.5    Schrader, R.6
  • 65
    • 33746962182
    • EVEREST: automatic identification and classification of protein domains in all protein sequences
    • Portugaly, E., Harel, A., Linial, N., and Linial, M. (2006) EVEREST: automatic identification and classification of protein domains in all protein sequences. BMC Bioinformatics, 7, 277.
    • (2006) BMC Bioinformatics , vol.7 , pp. 277
    • Portugaly, E.1    Harel, A.2    Linial, N.3    Linial, M.4
  • 68
    • 0036220048
    • Use of covariance analysis for the prediction of structural domain boundaries from multiple protein sequence alignments
    • Rigden, D.J. (2002) Use of covariance analysis for the prediction of structural domain boundaries from multiple protein sequence alignments. Protein Eng, 15, 65-77.
    • (2002) Protein Eng , vol.15 , pp. 65-77
    • Rigden, D.J.1
  • 69
    • 0018015137
    • Modeling by shortest data description
    • Rissanen, J. (1978) Modeling by shortest data description. Automatica, 14, 465-471.
    • (1978) Automatica , vol.14 , pp. 465-471
    • Rissanen, J.1
  • 70
    • 0038419681
    • Functional links between proteins
    • Sali, A. (1999) Functional links between proteins. Nature, 402(23), 25-26.
    • (1999) Nature , vol.402 , Issue.23 , pp. 25-26
    • Sali, A.1
  • 72
    • 0038438514
    • IMPALA: matching a protein sequence against a collection of PSI-BLAST-constructed position-specific score matrices
    • Schäffer, A.A., Wolf, Y.I., Ponting, C.P., Koonin, E.V., Aravind, L., and Altschul, S.F. (1999) IMPALA: matching a protein sequence against a collection of PSI-BLAST-constructed position-specific score matrices. Bioinformatics, 15, 1000-1011.
    • (1999) Bioinformatics , vol.15 , pp. 1000-1011
    • Schäffer, A.A.1    Wolf, Y.I.2    Ponting, C.P.3    Koonin, E.V.4    Aravind, L.5    Altschul, S.F.6
  • 74
    • 33947385412
    • Improving the performance of DomainDiscovery of protein domain boundary assignment using inter-domain linker index
    • Sikder, A.R. and Zomaya, A.Y. (2006) Improving the performance of DomainDiscovery of protein domain boundary assignment using inter-domain linker index. BMC Bioinformatics, 7(Suppl 5), S6.
    • (2006) BMC Bioinformatics , vol.7 , pp. S6
    • Sikder, A.R.1    Zomaya, A.Y.2
  • 75
    • 17844363963
    • PPRODO: prediction of protein domain boundaries using neural networks
    • Sim, J., Kim, S.-Y., and Lee, J. (2005) PPRODO: prediction of protein domain boundaries using neural networks. Proteins, 59, 627-632.
    • (2005) Proteins , vol.59 , pp. 627-632
    • Sim, J.1    Kim, S.-Y.2    Lee, J.3
  • 76
    • 44949207428
    • Sequence similarity network reveals common ancestry of multidomain proteins
    • Song, N., Joseph, J.M., Davis, G.B., and Durand, D. (2008) Sequence similarity network reveals common ancestry of multidomain proteins. PLoS Comput Biol, 4, e1000063.
    • (2008) PLoS Comput Biol , vol.4 , pp. e1000063
    • Song, N.1    Joseph, J.M.2    Davis, G.B.3    Durand, D.4
  • 77
    • 0037460953
    • DomCut: prediction of inter-domain linker regions in amino acid sequences
    • Suyama, M. and Ohara, O. (2003) DomCut: prediction of inter-domain linker regions in amino acid sequences. Bioinformatics, 19, 673-674.
    • (2003) Bioinformatics , vol.19 , pp. 673-674
    • Suyama, M.1    Ohara, O.2
  • 78
    • 0030660581
    • A genomic perspective on protein families
    • Tatusov, R.L., Koonin, E.V., and Lipman, D.J. (1997) A genomic perspective on protein families. Science, 278, 631-637.
    • (1997) Science , vol.278 , pp. 631-637
    • Tatusov, R.L.1    Koonin, E.V.2    Lipman, D.J.3
  • 79
    • 0034062874
    • Fast assignment of protein structures to sequences using the intermediate sequence library PDB-ISL
    • Teichmann, S.A., Chothia, C., Church, G.M., and Park, J. (2000) Fast assignment of protein structures to sequences using the intermediate sequence library PDB-ISL. Bioinformatics, 16, 117-124.
    • (2000) Bioinformatics , vol.16 , pp. 117-124
    • Teichmann, S.A.1    Chothia, C.2    Church, G.M.3    Park, J.4
  • 81
    • 78651319979
    • Ongoing and future developments at the Universal Protein Resource
    • The UniProt Consortium (2011) Ongoing and future developments at the Universal Protein Resource. Nucleic Acids Res, 39, D214-D219.
    • (2011) Nucleic Acids Res , vol.39 , pp. D214-D219
  • 82
    • 0005924596
    • Graph clustering by flow simulation
    • Van Dongen, S. (2000) Graph clustering by flow simulation. PhD Thesis. University of Utrecht, The Netherlands.
    • (2000) PhD Thesis, University of Utrecht, The Netherlands
    • Van Dongen, S.1
  • 83
    • 0000107517
    • An Information Measure for Classification
    • Wallace, C.S. and Boulton, D.M. (1968) An Information Measure for Classification. Comput J, 11, 185-194.
    • (1968) Comput J , vol.11 , pp. 185-194
    • Wallace, C.S.1    Boulton, D.M.2
  • 84
    • 0015597839
    • Nucleation, rapid folding, and globular intrachain regions in proteins
    • Wetlaufer, D.B. (1973) Nucleation, rapid folding, and globular intrachain regions in proteins. Proc Natl Acad Sci USA, 70, 697-701.
    • (1973) Proc Natl Acad Sci USA , vol.70 , pp. 697-701
    • Wetlaufer, D.B.1
  • 85
    • 0033753811
    • Domain size distributions can predict domain boundaries
    • Wheelan, S.J., Marchler-Bauer, A., and Bryant, S.H. (2000) Domain size distributions can predict domain boundaries. Bioinformatics, 16, 613-618.
    • (2000) Bioinformatics , vol.16 , pp. 613-618
    • Wheelan, S.J.1    Marchler-Bauer, A.2    Bryant, S.H.3
  • 86
    • 37249051926
    • Large scale clustering of protein sequences with FORCE -a layout based heuristic for weighted cluster editing
    • Wittkop, T., Baumbach, J., Lobo, F.P., and Rahmann, S. (2007) Large scale clustering of protein sequences with FORCE -a layout based heuristic for weighted cluster editing. BMC Bioinformatics, 8, 396.
    • (2007) BMC Bioinformatics , vol.8 , pp. 396
    • Wittkop, T.1    Baumbach, J.2    Lobo, F.P.3    Rahmann, S.4
  • 87
    • 46249129962
    • MACHOS: Markov clusters of homologous subsequences
    • Wong, S. and Ragan, M.A. (2008) MACHOS: Markov clusters of homologous subsequences. Bioinformatics, 24, i77-i85.
    • (2008) Bioinformatics , vol.24 , pp. i77-i85
    • Wong, S.1    Ragan, M.A.2
  • 88
    • 77955979374
    • Using affinity propagation combined post-processing to cluster protein sequences
    • Yang, F., Zhu, Q., Tang, D., and Zhao, M. (2010) Using affinity propagation combined post-processing to cluster protein sequences. Protein Pept Lett, 17, 681-689.
    • (2010) Protein Pept Lett , vol.17 , pp. 681-689
    • Yang, F.1    Zhu, Q.2    Tang, D.3    Zhao, M.4
  • 89
    • 47249148373
    • Performance comparison of gene family clustering methods with expert curated gene family data set in Arabidopsis thaliana
    • Yang, K. and Zhang, L. (2008) Performance comparison of gene family clustering methods with expert curated gene family data set in Arabidopsis thaliana. Planta, 228, 439-447.
    • (2008) Planta , vol.228 , pp. 439-447
    • Yang, K.1    Zhang, L.2
  • 90
    • 40549099733
    • Sequence-based protein domain boundary prediction using BP neural network with various property profiles
    • Ye, L., Liu, T., Wu, Z., and Zhou, R. (2008) Sequence-based protein domain boundary prediction using BP neural network with various property profiles. Proteins, 71, 300-307.
    • (2008) Proteins , vol.71 , pp. 300-307
    • Ye, L.1    Liu, T.2    Wu, Z.3    Zhou, R.4
  • 91
    • 77951946371
    • A fast and automated solution for accurately resolving protein domain architectures
    • Yeats, C., Redfern, O.C., and Orengo, C. (2010) A fast and automated solution for accurately resolving protein domain architectures. Bioinformatics, 26, 745-751.
    • (2010) Bioinformatics , vol.26 , pp. 745-751
    • Yeats, C.1    Redfern, O.C.2    Orengo, C.3
  • 92
    • 0032726692
    • ProtoMap: automatic classification of protein sequences, a hierarchy of protein families, and local maps of the protein space
    • Yona, G., Linial, N., and Linial, M. (1999) ProtoMap: automatic classification of protein sequences, a hierarchy of protein families, and local maps of the protein space. Proteins, 37, 360-378.
    • (1999) Proteins , vol.37 , pp. 360-378
    • Yona, G.1    Linial, N.2    Linial, M.3
  • 93
    • 41949117705
    • Improved general regression network for protein domain boundary prediction
    • Yoo, P.D., Sikder, A.R., Zhou, B.B., and Zomaya, A.Y. (2008) Improved general regression network for protein domain boundary prediction. BMC Bioinformatics, 9(1 Suppl), S12.
    • (2008) BMC Bioinformatics , vol.9 , Issue.1 , pp. S12
    • Yoo, P.D.1    Sikder, A.R.2    Zhou, B.B.3    Zomaya, A.Y.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.