-
1
-
-
0025183708
-
Basic local alignment search tool
-
Altschul, S.F., Gish, W., Miller, W., Myers, E.W., and Lipman, D.J. (1990) Basic local alignment search tool. J Mol Biol, 215, 403-410.
-
(1990)
J Mol Biol
, vol.215
, pp. 403-410
-
-
Altschul, S.F.1
Gish, W.2
Miller, W.3
Myers, E.W.4
Lipman, D.J.5
-
2
-
-
0030801002
-
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
-
Altschul, S.F., Madden, T.L., Schäffer, A.A., Zhang, J., Zhang, Z., Miller, W., and Lipman, D.J. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res, 25, 3389-3402.
-
(1997)
Nucleic Acids Res
, vol.25
, pp. 3389-3402
-
-
Altschul, S.F.1
Madden, T.L.2
Schäffer, A.A.3
Zhang, J.4
Zhang, Z.5
Miller, W.6
Lipman, D.J.7
-
3
-
-
38549153238
-
Data growth and its impact on the SCOP database: new developments
-
Andreeva, A., Howorth, D., Chandonia, J.-M., Brenner, S.E., Hubbard, T.J.P., Chothia, C., and Murzin, A.G. (2008) Data growth and its impact on the SCOP database: new developments. Nucleic Acids Res, 36, D419-D425.
-
(2008)
Nucleic Acids Res
, vol.36
, pp. D419-D425
-
-
Andreeva, A.1
Howorth, D.2
Chandonia, J.-M.3
Brenner, S.E.4
Hubbard, T.J.P.5
Chothia, C.6
Murzin, A.G.7
-
4
-
-
0034777598
-
Clustering protein sequences-structure prediction by transitive homology
-
Bolten, E., Schliep, A., Schneckener, S., Schomburg, D., and Schrader, R. (2001) Clustering protein sequences-structure prediction by transitive homology. Bioinformatics, 17, 935-941.
-
(2001)
Bioinformatics
, vol.17
, pp. 935-941
-
-
Bolten, E.1
Schliep, A.2
Schneckener, S.3
Schomburg, D.4
Schrader, R.5
-
5
-
-
13444305296
-
The ProDom database of protein domain families: more emphasis on 3D
-
Bru, C., Courcelle, E., Carrère, S., Beausse, Y., Dalmar, S., and Kahn, D. (2005) The ProDom database of protein domain families: more emphasis on 3D. Nucleic Acids Res, 33, D212-D215.
-
(2005)
Nucleic Acids Res
, vol.33
, pp. D212-D215
-
-
Bru, C.1
Courcelle, E.2
Carrère, S.3
Beausse, Y.4
Dalmar, S.5
Kahn, D.6
-
6
-
-
37849023306
-
Assessing performance of orthology detection strategies applied to eukaryotic genomes
-
Chen, F., Mackey, A.J., Vermunt, J.K., and Roos, D.S. (2007) Assessing performance of orthology detection strategies applied to eukaryotic genomes. PLoS One, 2, e383.
-
(2007)
PLoS One
, vol.2
, pp. e383
-
-
Chen, F.1
Mackey, A.J.2
Vermunt, J.K.3
Roos, D.S.4
-
7
-
-
77956938868
-
DomSVR: domain boundary prediction with support vector regression from sequence information alone
-
Chen, P., Liu, C., Burge, L., Li, J., Mohammad, M., Southerland, W., Gloster, C., and Wang, B. (2010) DomSVR: domain boundary prediction with support vector regression from sequence information alone. Amino Acids, 39, 713-726.
-
(2010)
Amino Acids
, vol.39
, pp. 713-726
-
-
Chen, P.1
Liu, C.2
Burge, L.3
Li, J.4
Mohammad, M.5
Southerland, W.6
Gloster, C.7
Wang, B.8
-
8
-
-
33745101459
-
DOMpro: protein domain prediction using profiles, secondary structure, relative solvent accessibility, and recursive neural networks
-
Cheng, J., Sweredoski, M.J., and Baldi, P. (2006) DOMpro: protein domain prediction using profiles, secondary structure, relative solvent accessibility, and recursive neural networks. Data Min Knowl Discov, 13, 1-10.
-
(2006)
Data Min Knowl Discov
, vol.13
, pp. 1-10
-
-
Cheng, J.1
Sweredoski, M.J.2
Baldi, P.3
-
9
-
-
78651338019
-
Extending CATH: increasing coverage of the protein structure universe and linking structure with function
-
Cuff, A.L., Sillitoe, I., Lewis, T., Clegg, A.B., Rentzsch, R., Furnham, N., Pellegrini-Calace, M., Jones, D., Thornton, J., and Orengo, C.A. (2011) Extending CATH: increasing coverage of the protein structure universe and linking structure with function. Nucleic Acids Res, 39, D420-D426.
-
(2011)
Nucleic Acids Res
, vol.39
, pp. D420-D426
-
-
Cuff, A.L.1
Sillitoe, I.2
Lewis, T.3
Clegg, A.B.4
Rentzsch, R.5
Furnham, N.6
Pellegrini-Calace, M.7
Jones, D.8
Thornton, J.9
Orengo, C.A.10
-
10
-
-
0020649886
-
Establishing homologies in protein sequences
-
Dayhoff, M.O., Barker, W.C., and Hunt, L.T. (1983) Establishing homologies in protein sequences. Methods Enzymol, 91, 524-545.
-
(1983)
Methods Enzymol
, vol.91
, pp. 524-545
-
-
Dayhoff, M.O.1
Barker, W.C.2
Hunt, L.T.3
-
11
-
-
0031841279
-
The HSSP database of protein structure-sequence alignments and family profiles
-
Dodge, C., Schneider, R., and Sander, C. (1998) The HSSP database of protein structure-sequence alignments and family profiles. Nucleic Acids Res, 26, 313-315.
-
(1998)
Nucleic Acids Res
, vol.26
, pp. 313-315
-
-
Dodge, C.1
Schneider, R.2
Sander, C.3
-
12
-
-
0027364941
-
Evolutionarily mobile modules in proteins
-
Doolittle, R.F. and Bork, P. (1993) Evolutionarily mobile modules in proteins. Sci Am, 269, 50-56.
-
(1993)
Sci Am
, vol.269
, pp. 50-56
-
-
Doolittle, R.F.1
Bork, P.2
-
13
-
-
21744461895
-
Armadillo: domain boundary prediction by amino acid composition
-
Dumontier, M., Yao, R., Feldman, H.J., and Hogue, C.W.V. (2005) Armadillo: domain boundary prediction by amino acid composition. J Mol Biol, 350, 1061-1073.
-
(2005)
J Mol Biol
, vol.350
, pp. 1061-1073
-
-
Dumontier, M.1
Yao, R.2
Feldman, H.J.3
Hogue, C.W.V.4
-
14
-
-
0031743421
-
Profile hidden Markov models
-
Eddy, S.R. (1998) Profile hidden Markov models. Bioinformatics, 14, 755-763.
-
(1998)
Bioinformatics
, vol.14
, pp. 755-763
-
-
Eddy, S.R.1
-
15
-
-
0031576913
-
On punctuated equilibria
-
Eldredge, N. and Gould, S.J. (1997) On punctuated equilibria. Science, 276, 338-341.
-
(1997)
Science
, vol.276
, pp. 338-341
-
-
Eldredge, N.1
Gould, S.J.2
-
16
-
-
0036529479
-
An efficient algorithm for large-scale detection of protein families
-
Enright, A.J., Van Dongen, S., and Ouzounis, C.A. (2002) An efficient algorithm for large-scale detection of protein families. Nucleic Acids Res, 30, 1575-1584.
-
(2002)
Nucleic Acids Res
, vol.30
, pp. 1575-1584
-
-
Enright, A.J.1
Van Dongen, S.2
Ouzounis, C.A.3
-
17
-
-
0033944826
-
GeneRAGE: a robust algorithm for sequence clustering and domain detection
-
Enright, A.J. and Ouzounis, C.A. (2000) GeneRAGE: a robust algorithm for sequence clustering and domain detection. Bioinformatics, 16, 451-457.
-
(2000)
Bioinformatics
, vol.16
, pp. 451-457
-
-
Enright, A.J.1
Ouzounis, C.A.2
-
18
-
-
75549090603
-
The Pfam protein families database
-
Finn, R.D., Mistry, J., Tate, J., Coggill, P., Heger, A., Pollington, J.E., Gavin, O.L., Gunasekaran, P., Ceric, G., Forslund, K. et al. (2010) The Pfam protein families database. Nucleic Acids Res, 38, D211-D222.
-
(2010)
Nucleic Acids Res
, vol.38
, pp. D211-D222
-
-
Finn, R.D.1
Mistry, J.2
Tate, J.3
Coggill, P.4
Heger, A.5
Pollington, J.E.6
Gavin, O.L.7
Gunasekaran, P.8
Ceric, G.9
Forslund, K.10
-
19
-
-
0014800108
-
Distinguishing homologous from analogous proteins
-
Fitch, W.M. (1970) Distinguishing homologous from analogous proteins. Syst Zool, 19, 99-113.
-
(1970)
Syst Zool
, vol.19
, pp. 99-113
-
-
Fitch, W.M.1
-
20
-
-
78651289449
-
Ensembl 2011
-
Flicek, P., Amode, M.R., Barrell, D., Beal, K., Brent, S., Chen, Y., Clapham, P., Coates, G., Fairley, S., Fitzgerald, S. et al. (2011) Ensembl 2011. Nucleic Acids Res, 39, D800-D806.
-
(2011)
Nucleic Acids Res
, vol.39
, pp. D800-D806
-
-
Flicek, P.1
Amode, M.R.2
Barrell, D.3
Beal, K.4
Brent, S.5
Chen, Y.6
Clapham, P.7
Coates, G.8
Fairley, S.9
Fitzgerald, S.10
-
21
-
-
33847172327
-
Clustering by passing messages between data points
-
Frey, B.J. and Dueck, D. (2007) Clustering by passing messages between data points. Science, 315, 972-976.
-
(2007)
Science
, vol.315
, pp. 972-976
-
-
Frey, B.J.1
Dueck, D.2
-
22
-
-
33746755926
-
Improving the specificity of high-throughput ortholog prediction
-
Fulton, D.L., Li, Y.Y., Laird, M.R., Horsman, B.G.S., Roche, F.M., and Brinkman, F.S.L. (2006) Improving the specificity of high-throughput ortholog prediction. BMC Bioinformatics, 7, 270.
-
(2006)
BMC Bioinformatics
, vol.7
, pp. 270
-
-
Fulton, D.L.1
Li, Y.Y.2
Laird, M.R.3
Horsman, B.G.S.4
Roche, F.M.5
Brinkman, F.S.L.6
-
23
-
-
0037377548
-
Prediction of protein domain boundaries from sequence alone
-
Galzitskaya, O.V. and Melnik, B.S. (2003) Prediction of protein domain boundaries from sequence alone. Protein Sci, 12, 696-701.
-
(2003)
Protein Sci
, vol.12
, pp. 696-701
-
-
Galzitskaya, O.V.1
Melnik, B.S.2
-
24
-
-
0036306348
-
SnapDRAGON: a method to delineate protein structural domains from sequence data
-
George, R.A. and Heringa, J. (2002) SnapDRAGON: a method to delineate protein structural domains from sequence data. J Mol Biol, 316, 839-851.
-
(2002)
J Mol Biol
, vol.316
, pp. 839-851
-
-
George, R.A.1
Heringa, J.2
-
25
-
-
0034710876
-
Coupled two-way clustering analysis of gene microarray data
-
Getz, G., Levine, E., and Domany, E. (2000) Coupled two-way clustering analysis of gene microarray data. Proc Natl Acad Sci USA, 97, 12079-12084.
-
(2000)
Proc Natl Acad Sci USA
, vol.97
, pp. 12079-12084
-
-
Getz, G.1
Levine, E.2
Domany, E.3
-
26
-
-
0035798406
-
Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structure
-
Gough, J., Karplus, K., Hughey, R., and Chothia, C. (2001) Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structure. J Mol Biol, 313, 903-919.
-
(2001)
J Mol Biol
, vol.313
, pp. 903-919
-
-
Gough, J.1
Karplus, K.2
Hughey, R.3
Chothia, C.4
-
27
-
-
0033563522
-
Whole genome protein domain analysis using a new method for domain clustering
-
Gouzy, J., Corpet, F., and Kahn, D. (1999) Whole genome protein domain analysis using a new method for domain clustering. Comput Chem, 23, 333-340.
-
(1999)
Comput Chem
, vol.23
, pp. 333-340
-
-
Gouzy, J.1
Corpet, F.2
Kahn, D.3
-
28
-
-
0023375315
-
Profile analysis: detection of distantly related proteins
-
Gribskov, M., McLachlan, A.D., and Eisenberg, D. (1987) Profile analysis: detection of distantly related proteins. Proc Natl Acad Sci USA, 84, 4355-4358.
-
(1987)
Proc Natl Acad Sci USA
, vol.84
, pp. 4355-4358
-
-
Gribskov, M.1
McLachlan, A.D.2
Eisenberg, D.3
-
29
-
-
0031744176
-
Domain identification by clustering sequence alignments
-
Guan, X. and Du, L. (1998) Domain identification by clustering sequence alignments. Bioinformatics, 14, 783-788.
-
(1998)
Bioinformatics
, vol.14
, pp. 783-788
-
-
Guan, X.1
Du, L.2
-
30
-
-
0034333196
-
Rapid automatic detection and alignment of repeats in protein sequences
-
Heger, A. and Holm, L. (2000) Rapid automatic detection and alignment of repeats in protein sequences. Proteins Struct Funct Bioinform, 41, 224-237.
-
(2000)
Proteins Struct Funct Bioinform
, vol.41
, pp. 224-237
-
-
Heger, A.1
Holm, L.2
-
31
-
-
0035070578
-
Picasso: generating a covering set of protein family profiles
-
Heger, A. and Holm, L. (2001) Picasso: generating a covering set of protein family profiles. Bioinformatics, 17, 272-279.
-
(2001)
Bioinformatics
, vol.17
, pp. 272-279
-
-
Heger, A.1
Holm, L.2
-
32
-
-
0037414465
-
Exhaustive enumeration of protein domain families
-
Heger, A. and Holm, L. (2003) Exhaustive enumeration of protein domain families. J Mol Biol, 328, 749-767.
-
(2003)
J Mol Biol
, vol.328
, pp. 749-767
-
-
Heger, A.1
Holm, L.2
-
33
-
-
38549113430
-
PairsDB atlas of protein sequence space
-
Heger, A., Korpelainen, E., Hupponen, T., Mattila, K., Ollikainen, V., and Holm, L. (2008) PairsDB atlas of protein sequence space. Nucleic Acids Res, 36, D276-D280.
-
(2008)
Nucleic Acids Res
, vol.36
, pp. D276-D280
-
-
Heger, A.1
Korpelainen, E.2
Hupponen, T.3
Mattila, K.4
Ollikainen, V.5
Holm, L.6
-
34
-
-
34548733641
-
The global trace graph, a novel paradigm for searching protein sequence databases
-
Heger, A., Mallick, S., Wilton, C., and Holm, L. (2007) The global trace graph, a novel paradigm for searching protein sequence databases. Bioinformatics, 23, 2361-2367.
-
(2007)
Bioinformatics
, vol.23
, pp. 2361-2367
-
-
Heger, A.1
Mallick, S.2
Wilton, C.3
Holm, L.4
-
35
-
-
0027491666
-
A method to recognize distant repeats in protein sequences
-
391-411
-
Heringa, J. and Argos, P. (1993) A method to recognize distant repeats in protein sequences. Proteins, 17, 391-411.
-
(1993)
Proteins
, vol.17
-
-
Heringa, J.1
Argos, P.2
-
36
-
-
0031829372
-
Removing near-neighbour redundancy from large protein sequence collections
-
Holm, L. and Sander, C. (1998a) Removing near-neighbour redundancy from large protein sequence collections. Bioinformatics, 14, 423-429.
-
(1998)
Bioinformatics
, vol.14
, pp. 423-429
-
-
Holm, L.1
Sander, C.2
-
37
-
-
0031865006
-
Touring protein fold space with Dali/FSSP
-
Holm, L. and Sander, C. (1998b) Touring protein fold space with Dali/FSSP. Nucleic Acids Res, 26, 316-319.
-
(1998)
Nucleic Acids Res
, vol.26
, pp. 316-319
-
-
Holm, L.1
Sander, C.2
-
38
-
-
75549086174
-
Eukaryotic protein domains as functional units of cellular evolution
-
Jin, J., Xie, X., Chen, C., Park, J.G., Stark, C., James, D.A., Olhovsky, M., Linding, R., Mao, Y., and Pawson, T. (2009) Eukaryotic protein domains as functional units of cellular evolution. Sci Signal, 2, ra76.
-
(2009)
Sci Signal
, vol.2
, pp. ra76
-
-
Jin, J.1
Xie, X.2
Chen, C.3
Park, J.G.4
Stark, C.5
James, D.A.6
Olhovsky, M.7
Linding, R.8
Mao, Y.9
Pawson, T.10
-
39
-
-
66349138035
-
Family classification without domain chaining
-
Joseph, J.M. and Durand, D. (2009) Family classification without domain chaining. Bioinformatics, 25, i45-i53.
-
(2009)
Bioinformatics
, vol.25
, pp. i45-i53
-
-
Joseph, J.M.1
Durand, D.2
-
40
-
-
34548605214
-
CLUSS: clustering of protein sequences based on a new similarity measure
-
Kelil, A., Wang, S., Brzezinski, R., and Fleury, A. (2007) CLUSS: clustering of protein sequences based on a new similarity measure. BMC Bioinformatics, 8, 286.
-
(2007)
BMC Bioinformatics
, vol.8
, pp. 286
-
-
Kelil, A.1
Wang, S.2
Brzezinski, R.3
Fleury, A.4
-
41
-
-
30344438515
-
Automated prediction of domain boundaries in CASP6 targets using Ginzu and RosettaDOM
-
Kim, D.E., Chivian, D., Malmström, L., and Baker, D. (2005) Automated prediction of domain boundaries in CASP6 targets using Ginzu and RosettaDOM. Proteins, 61(7 Suppl), 193-200.
-
(2005)
Proteins
, vol.61
, Issue.7
, pp. 193-200
-
-
Kim, D.E.1
Chivian, D.2
Malmström, L.3
Baker, D.4
-
42
-
-
13244291417
-
Large scale hierarchical clustering of protein sequences
-
Krause, A., Stoye, J., and Vingron, M. (2005) Large scale hierarchical clustering of protein sequences. BMC Bioinformatics, 6, 15.
-
(2005)
BMC Bioinformatics
, vol.6
, pp. 15
-
-
Krause, A.1
Stoye, J.2
Vingron, M.3
-
43
-
-
0031876711
-
A set-theoretic approach to database searching and clustering
-
Krause, A. and Vingron, M. (1998) A set-theoretic approach to database searching and clustering. Bioinformatics, 14, 430-438.
-
(1998)
Bioinformatics
, vol.14
, pp. 430-438
-
-
Krause, A.1
Vingron, M.2
-
44
-
-
77954198131
-
A low-polynomial algorithm for assembling clusters of orthologous groups from intergenomic symmetric best matches
-
Kristensen, D.M., Kannan, L., Coleman, M.K., Wolf, Y.I., Sorokin, A., Koonin, E.V., and Mushegian, A. (2010) A low-polynomial algorithm for assembling clusters of orthologous groups from intergenomic symmetric best matches. Bioinformatics, 26, 1481-1487.
-
(2010)
Bioinformatics
, vol.26
, pp. 1481-1487
-
-
Kristensen, D.M.1
Kannan, L.2
Coleman, M.K.3
Wolf, Y.I.4
Sorokin, A.5
Koonin, E.V.6
Mushegian, A.7
-
45
-
-
0035174881
-
CluSTr: a database of clusters of SWISS-PROT+TrEMBL proteins
-
Kriventseva, E.V., Fleischmann, W., Zdobnov, E.M., and Apweiler, R. (2001) CluSTr: a database of clusters of SWISS-PROT+TrEMBL proteins. Nucleic Acids Res, 29, 33-36.
-
(2001)
Nucleic Acids Res
, vol.29
, pp. 33-36
-
-
Kriventseva, E.V.1
Fleischmann, W.2
Zdobnov, E.M.3
Apweiler, R.4
-
46
-
-
58149194624
-
SMART 6: recent updates and new developments
-
Letunic, I., Doerks, T., and Bork, P. (2009) SMART 6: recent updates and new developments. Nucleic Acids Res, 37, D229-D232.
-
(2009)
Nucleic Acids Res
, vol.37
, pp. D229-D232
-
-
Letunic, I.1
Doerks, T.2
Bork, P.3
-
47
-
-
0141519279
-
OrthoMCL: identification of ortholog groups for eukaryotic genomes
-
Li, L., Stoeckert, C.J., and Roos, D.S. (2003) OrthoMCL: identification of ortholog groups for eukaryotic genomes. Genome Res, 13, 2178-2189.
-
(2003)
Genome Res
, vol.13
, pp. 2178-2189
-
-
Li, L.1
Stoeckert, C.J.2
Roos, D.S.3
-
48
-
-
0035072551
-
Clustering of highly homologous sequences to reduce the size of large protein databases
-
Li, W., Jaroszewski, L., and Godzik, A. (2001) Clustering of highly homologous sequences to reduce the size of large protein databases. Bioinformatics, 17, 282-283.
-
(2001)
Bioinformatics
, vol.17
, pp. 282-283
-
-
Li, W.1
Jaroszewski, L.2
Godzik, A.3
-
49
-
-
0036169928
-
Tolerating some redundancy significantly speeds up clustering of large protein databases
-
Li, W., Jaroszewski, L., and Godzik, A. (2002) Tolerating some redundancy significantly speeds up clustering of large protein databases. Bioinformatics, 18, 77-82.
-
(2002)
Bioinformatics
, vol.18
, pp. 77-82
-
-
Li, W.1
Jaroszewski, L.2
Godzik, A.3
-
50
-
-
3242891265
-
CHOP: parsing proteins into structural domains
-
Liu, J. and Rost, B. (2004) CHOP: parsing proteins into structural domains. Nucleic Acids Res, 32, W569-W571.
-
(2004)
Nucleic Acids Res
, vol.32
, pp. W569-W571
-
-
Liu, J.1
Rost, B.2
-
51
-
-
46249133773
-
Efficient algorithms for accurate hierarchical clustering of huge datasets: tackling the entire protein space
-
Loewenstein, Y., Portugaly, E., Fromer, M., and Linial, M. (2008) Efficient algorithms for accurate hierarchical clustering of huge datasets: tackling the entire protein space. Bioinformatics, 24, i41-i49.
-
(2008)
Bioinformatics
, vol.24
, pp. i41-i49
-
-
Loewenstein, Y.1
Portugaly, E.2
Fromer, M.3
Linial, M.4
-
52
-
-
78651285748
-
CDD: a conserved domain database for the functional annotation of proteins
-
Marchler-Bauer, A., Lu, S., Anderson, J.B., Chitsaz, F., Derbyshire, M.K., DeWeese-Scott, C., Fong, J.H., Geer, L.Y., Geer, R.C., Gonzales, N.R. et al. (2011) CDD: a conserved domain database for the functional annotation of proteins. Nucleic Acids Res, 39, D225-D229.
-
(2011)
Nucleic Acids Res
, vol.39
, pp. D225-D229
-
-
Marchler-Bauer, A.1
Lu, S.2
Anderson, J.B.3
Chitsaz, F.4
Derbyshire, M.K.5
DeWeese-Scott, C.6
Fong, J.H.7
Geer, L.Y.8
Geer, R.C.9
Gonzales, N.R.10
-
53
-
-
0036893072
-
Rapid protein domain assignment from amino acid sequence using predicted secondary structure
-
Marsden, R.L., McGuffin, L.J., and Jones, D.T. (2002) Rapid protein domain assignment from amino acid sequence using predicted secondary structure. Protein Sci, 11, 2814-2824.
-
(2002)
Protein Sci
, vol.11
, pp. 2814-2824
-
-
Marsden, R.L.1
McGuffin, L.J.2
Jones, D.T.3
-
54
-
-
0028961335
-
SCOP: a structural classification of proteins database for the investigation of sequences and structures
-
Murzin, A.G., Brenner, S.E., Hubbard, T., and Chothia, C. (1995) SCOP: a structural classification of proteins database for the investigation of sequences and structures. J Mol Biol, 247, 536-540.
-
(1995)
J Mol Biol
, vol.247
, pp. 536-540
-
-
Murzin, A.G.1
Brenner, S.E.2
Hubbard, T.3
Chothia, C.4
-
55
-
-
3142680264
-
Automatic prediction of protein domains from sequence information using a hybrid learning system
-
Nagarajan, N. and Yona, G. (2004) Automatic prediction of protein domains from sequence information using a hybrid learning system. Bioinformatics, 20, 1335-1360.
-
(2004)
Bioinformatics
, vol.20
, pp. 1335-1360
-
-
Nagarajan, N.1
Yona, G.2
-
56
-
-
77950430912
-
SCPS: a fast implementation of a spectral method for detecting protein families on a genome-wide scale
-
Nepusz, T., Sasidharan, R., and Paccanaro, A. (2010) SCPS: a fast implementation of a spectral method for detecting protein families on a genome-wide scale. BMC Bioinformatics, 11, 120.
-
(2010)
BMC Bioinformatics
, vol.11
, pp. 120
-
-
Nepusz, T.1
Sasidharan, R.2
Paccanaro, A.3
-
57
-
-
0029358115
-
Parallel algorithms for hierarchical clustering
-
Olson, C.F. (1995) Parallel algorithms for hierarchical clustering. Parallel Comput, 21, 1313-1325.
-
(1995)
Parallel Comput
, vol.21
, pp. 1313-1325
-
-
Olson, C.F.1
-
58
-
-
33645523636
-
Spectral clustering of protein sequences
-
Paccanaro, A., Casbon, J.A., and Saqi, M.A.S. (2006) Spectral clustering of protein sequences. Nucleic Acids Res, 34, 1571-1580.
-
(2006)
Nucleic Acids Res
, vol.34
, pp. 1571-1580
-
-
Paccanaro, A.1
Casbon, J.A.2
Saqi, M.A.S.3
-
59
-
-
0033940118
-
RSDB: representative protein sequence databases have high information content
-
Park, J., Holm, L., Heger, A., and Chothia, C. (2000) RSDB: representative protein sequence databases have high information content. Bioinformatics, 16, 458-464.
-
(2000)
Bioinformatics
, vol.16
, pp. 458-464
-
-
Park, J.1
Holm, L.2
Heger, A.3
Chothia, C.4
-
60
-
-
0031576361
-
Intermediate sequences increase the detection of homology between sequences
-
Park, J., Teichmann, S.A., Hubbard, T., and Chothia, C. (1997) Intermediate sequences increase the detection of homology between sequences. J Mol Biol, 273, 349-354.
-
(1997)
J Mol Biol
, vol.273
, pp. 349-354
-
-
Park, J.1
Teichmann, S.A.2
Hubbard, T.3
Chothia, C.4
-
61
-
-
0023989064
-
Improved tools for biological sequence comparison
-
Pearson, W.R. and Lipman, D.J. (1988) Improved tools for biological sequence comparison. Proc Natl Acad Sci USA, 85, 2444-2448.
-
(1988)
Proc Natl Acad Sci USA
, vol.85
, pp. 2444-2448
-
-
Pearson, W.R.1
Lipman, D.J.2
-
62
-
-
0033151949
-
A fast algorithm for genome-wide analysis of proteins with repeated sequences
-
Pellegrini, M., Marcotte, E.M., and Yeates, T.O. (1999) A fast algorithm for genome-wide analysis of proteins with repeated sequences. Proteins, 35, 440-446.
-
(1999)
Proteins
, vol.35
, pp. 440-446
-
-
Pellegrini, M.1
Marcotte, E.M.2
Yeates, T.O.3
-
63
-
-
24644474560
-
The predictive power of the CluSTr database
-
Petryszak, R., Kretschmann, E., Wieser, D., and Apweiler, R. (2005) The predictive power of the CluSTr database. Bioinformatics, 21, 3604-3609.
-
(2005)
Bioinformatics
, vol.21
, pp. 3604-3609
-
-
Petryszak, R.1
Kretschmann, E.2
Wieser, D.3
Apweiler, R.4
-
64
-
-
0346652457
-
ProClust: improved clustering of protein sequences with an extended graph-based approach
-
Pipenbacher, P., Schliep, A., Schneckener, S., Schönhuth, A., Schomburg, D., and Schrader, R. (2002) ProClust: improved clustering of protein sequences with an extended graph-based approach. Bioinformatics, 18(2 Suppl), S182-S191.
-
(2002)
Bioinformatics
, vol.18
, Issue.2
, pp. S182-S191
-
-
Pipenbacher, P.1
Schliep, A.2
Schneckener, S.3
Schönhuth, A.4
Schomburg, D.5
Schrader, R.6
-
65
-
-
33746962182
-
EVEREST: automatic identification and classification of protein domains in all protein sequences
-
Portugaly, E., Harel, A., Linial, N., and Linial, M. (2006) EVEREST: automatic identification and classification of protein domains in all protein sequences. BMC Bioinformatics, 7, 277.
-
(2006)
BMC Bioinformatics
, vol.7
, pp. 277
-
-
Portugaly, E.1
Harel, A.2
Linial, N.3
Linial, M.4
-
66
-
-
37249000536
-
Exact and heuristic algorithms for weighted cluster editing
-
Rahmann, S., Wittkop, T., Baumbach, J., Martin, M., Truss, A., and Böcker, S. (2007) Exact and heuristic algorithms for weighted cluster editing. Comput Syst Bioinform Conf, 6, 391-401.
-
(2007)
Comput Syst Bioinform Conf
, vol.6
, pp. 391-401
-
-
Rahmann, S.1
Wittkop, T.2
Baumbach, J.3
Martin, M.4
Truss, A.5
Böcker, S.6
-
67
-
-
33644874175
-
SIMAP: the similarity matrix of proteins
-
Rattei, T., Arnold, R., Tischler, P., Lindner, D., Stümpflen, V., and Mewes, H.W. (2006) SIMAP: the similarity matrix of proteins. Nucleic Acids Res, 34, D252-D256.
-
(2006)
Nucleic Acids Res
, vol.34
, pp. D252-D256
-
-
Rattei, T.1
Arnold, R.2
Tischler, P.3
Lindner, D.4
Stümpflen, V.5
Mewes, H.W.6
-
68
-
-
0036220048
-
Use of covariance analysis for the prediction of structural domain boundaries from multiple protein sequence alignments
-
Rigden, D.J. (2002) Use of covariance analysis for the prediction of structural domain boundaries from multiple protein sequence alignments. Protein Eng, 15, 65-77.
-
(2002)
Protein Eng
, vol.15
, pp. 65-77
-
-
Rigden, D.J.1
-
69
-
-
0018015137
-
Modeling by shortest data description
-
Rissanen, J. (1978) Modeling by shortest data description. Automatica, 14, 465-471.
-
(1978)
Automatica
, vol.14
, pp. 465-471
-
-
Rissanen, J.1
-
70
-
-
0038419681
-
Functional links between proteins
-
Sali, A. (1999) Functional links between proteins. Nature, 402, 23, 25-26.
-
(1999)
Nature
, vol.402
, pp. 23, 25-26
-
-
Sali, A.1
-
71
-
-
0037250523
-
ProtoNet: hierarchical classification of the protein space
-
Sasson, O., Vaaknin, A., Fleischer, H., Portugaly, E., Bilu, Y., Linial, N., and Linial, M. (2003) ProtoNet: hierarchical classification of the protein space. Nucleic Acids Res, 31, 348-352.
-
(2003)
Nucleic Acids Res
, vol.31
, pp. 348-352
-
-
Sasson, O.1
Vaaknin, A.2
Fleischer, H.3
Portugaly, E.4
Bilu, Y.5
Linial, N.6
Linial, M.7
-
72
-
-
0038438514
-
IMPALA: matching a protein sequence against a collection of PSI-BLAST-constructed position-specific score matrices
-
Schäffer, A.A., Wolf, Y.I., Ponting, C.P., Koonin, E.V., Aravind, L., and Altschul, S.F. (1999) IMPALA: matching a protein sequence against a collection of PSI-BLAST-constructed position-specific score matrices. Bioinformatics, 15, 1000-1011.
-
(1999)
Bioinformatics
, vol.15
, pp. 1000-1011
-
-
Schäffer, A.A.1
Wolf, Y.I.2
Ponting, C.P.3
Koonin, E.V.4
Aravind, L.5
Altschul, S.F.6
-
73
-
-
0001290045
-
PROSITE: a documented database using patterns and profiles as motif descriptors
-
Sigrist, C.J.A., Cerutti, L., Hulo, N., Gattiker, A., Falquet, L., Pagni, M., Bairoch, A., and Bucher, P. (2002) PROSITE: a documented database using patterns and profiles as motif descriptors. Brief Bioinform, 3, 265-274.
-
(2002)
Brief Bioinform
, vol.3
, pp. 265-274
-
-
Sigrist, C.J.A.1
Cerutti, L.2
Hulo, N.3
Gattiker, A.4
Falquet, L.5
Pagni, M.6
Bairoch, A.7
Bucher, P.8
-
74
-
-
33947385412
-
Improving the performance of DomainDiscovery of protein domain boundary assignment using inter-domain linker index
-
Sikder, A.R. and Zomaya, A.Y. (2006) Improving the performance of DomainDiscovery of protein domain boundary assignment using inter-domain linker index. BMC Bioinformatics, 7(Suppl 5), S6.
-
(2006)
BMC Bioinformatics
, vol.7
, pp. S6
-
-
Sikder, A.R.1
Zomaya, A.Y.2
-
75
-
-
17844363963
-
PPRODO: prediction of protein domain boundaries using neural networks
-
Sim, J., Kim, S.-Y., and Lee, J. (2005) PPRODO: prediction of protein domain boundaries using neural networks. Proteins, 59, 627-632.
-
(2005)
Proteins
, vol.59
, pp. 627-632
-
-
Sim, J.1
Kim, S.-Y.2
Lee, J.3
-
76
-
-
44949207428
-
Sequence similarity network reveals common ancestry of multidomain proteins
-
Song, N., Joseph, J.M., Davis, G.B., and Durand, D. (2008) Sequence similarity network reveals common ancestry of multidomain proteins. PLoS Comput Biol, 4, e1000063.
-
(2008)
PLoS Comput Biol
, vol.4
, pp. e1000063
-
-
Song, N.1
Joseph, J.M.2
Davis, G.B.3
Durand, D.4
-
77
-
-
0037460953
-
DomCut: prediction of inter-domain linker regions in amino acid sequences
-
Suyama, M. and Ohara, O. (2003) DomCut: prediction of inter-domain linker regions in amino acid sequences. Bioinformatics, 19, 673-674.
-
(2003)
Bioinformatics
, vol.19
, pp. 673-674
-
-
Suyama, M.1
Ohara, O.2
-
78
-
-
0030660581
-
A genomic perspective on protein families
-
Tatusov, R.L., Koonin, E.V., and Lipman, D.J. (1997) A genomic perspective on protein families. Science, 278, 631-637.
-
(1997)
Science
, vol.278
, pp. 631-637
-
-
Tatusov, R.L.1
Koonin, E.V.2
Lipman, D.J.3
-
79
-
-
0034062874
-
Fast assignment of protein structures to sequences using the intermediate sequence library PDB-ISL
-
Teichmann, S.A., Chothia, C., Church, G.M., and Park, J. (2000) Fast assignment of protein structures to sequences using the intermediate sequence library PDB-ISL. Bioinformatics, 16, 117-124.
-
(2000)
Bioinformatics
, vol.16
, pp. 117-124
-
-
Teichmann, S.A.1
Chothia, C.2
Church, G.M.3
Park, J.4
-
80
-
-
25444458854
-
Super paramagnetic clustering of protein sequences
-
Tetko, I.V., Facius, A., Ruepp, A., and Mewes, H.-W. (2005) Super paramagnetic clustering of protein sequences. BMC Bioinformatics, 6, 82.
-
(2005)
BMC Bioinformatics
, vol.6
, pp. 82
-
-
Tetko, I.V.1
Facius, A.2
Ruepp, A.3
Mewes, H.-W.4
-
81
-
-
78651319979
-
Ongoing and future developments at the Universal Protein Resource
-
The UniProt Consortium (2011) Ongoing and future developments at the Universal Protein Resource. Nucleic Acids Res, 39, D214-D219.
-
(2011)
Nucleic Acids Res
, vol.39
, pp. D214-D219
-
-
-
82
-
-
0005924596
-
Graph clustering by flow simulation
-
PhD Thesis. University of Utrecht, The Netherlands
-
Van Dongen, S. (2000) Graph clustering by flow simulation. PhD Thesis. University of Utrecht, The Netherlands.
-
(2000)
-
-
Van Dongen, S.1
-
83
-
-
0000107517
-
An Information Measure for Classification
-
Wallace, C.S. and Boulton, D.M. (1968) An Information Measure for Classification. Comput J, 11, 185-194.
-
(1968)
Comput J
, vol.11
, pp. 185-194
-
-
Wallace, C.S.1
Boulton, D.M.2
-
84
-
-
0015597839
-
Nucleation, rapid folding, and globular intrachain regions in proteins
-
Wetlaufer, D.B. (1973) Nucleation, rapid folding, and globular intrachain regions in proteins. Proc Natl Acad Sci USA, 70, 697-701.
-
(1973)
Proc Natl Acad Sci USA
, vol.70
, pp. 697-701
-
-
Wetlaufer, D.B.1
-
85
-
-
0033753811
-
Domain size distributions can predict domain boundaries
-
Wheelan, S.J., Marchler-Bauer, A., and Bryant, S.H. (2000) Domain size distributions can predict domain boundaries. Bioinformatics, 16, 613-618.
-
(2000)
Bioinformatics
, vol.16
, pp. 613-618
-
-
Wheelan, S.J.1
Marchler-Bauer, A.2
Bryant, S.H.3
-
86
-
-
37249051926
-
Large scale clustering of protein sequences with FORCE - a layout based heuristic for weighted cluster editing
-
Wittkop, T., Baumbach, J., Lobo, F.P., and Rahmann, S. (2007) Large scale clustering of protein sequences with FORCE - a layout based heuristic for weighted cluster editing. BMC Bioinformatics, 8, 396.
-
(2007)
BMC Bioinformatics
, vol.8
, pp. 396
-
-
Wittkop, T.1
Baumbach, J.2
Lobo, F.P.3
Rahmann, S.4
-
87
-
-
46249129962
-
MACHOS: Markov clusters of homologous subsequences
-
Wong, S. and Ragan, M.A. (2008) MACHOS: Markov clusters of homologous subsequences. Bioinformatics, 24, i77-i85.
-
(2008)
Bioinformatics
, vol.24
, pp. i77-i85
-
-
Wong, S.1
Ragan, M.A.2
-
88
-
-
77955979374
-
Using affinity propagation combined post-processing to cluster protein sequences
-
Yang, F., Zhu, Q., Tang, D., and Zhao, M. (2010) Using affinity propagation combined post-processing to cluster protein sequences. Protein Pept Lett, 17, 681-689.
-
(2010)
Protein Pept Lett
, vol.17
, pp. 681-689
-
-
Yang, F.1
Zhu, Q.2
Tang, D.3
Zhao, M.4
-
89
-
-
47249148373
-
Performance comparison of gene family clustering methods with expert curated gene family data set in Arabidopsis thaliana
-
Yang, K. and Zhang, L. (2008) Performance comparison of gene family clustering methods with expert curated gene family data set in Arabidopsis thaliana. Planta, 228, 439-447.
-
(2008)
Planta
, vol.228
, pp. 439-447
-
-
Yang, K.1
Zhang, L.2
-
90
-
-
40549099733
-
Sequence-based protein domain boundary prediction using BP neural network with various property profiles
-
Ye, L., Liu, T., Wu, Z., and Zhou, R. (2008) Sequence-based protein domain boundary prediction using BP neural network with various property profiles. Proteins, 71, 300-307.
-
(2008)
Proteins
, vol.71
, pp. 300-307
-
-
Ye, L.1
Liu, T.2
Wu, Z.3
Zhou, R.4
-
91
-
-
77951946371
-
A fast and automated solution for accurately resolving protein domain architectures
-
Yeats, C., Redfern, O.C., and Orengo, C. (2010) A fast and automated solution for accurately resolving protein domain architectures. Bioinformatics, 26, 745-751.
-
(2010)
Bioinformatics
, vol.26
, pp. 745-751
-
-
Yeats, C.1
Redfern, O.C.2
Orengo, C.3
-
92
-
-
0032726692
-
ProtoMap: automatic classification of protein sequences, a hierarchy of protein families, and local maps of the protein space
-
Yona, G., Linial, N., and Linial, M. (1999) ProtoMap: automatic classification of protein sequences, a hierarchy of protein families, and local maps of the protein space. Proteins, 37, 360-378.
-
(1999)
Proteins
, vol.37
, pp. 360-378
-
-
Yona, G.1
Linial, N.2
Linial, M.3
-
93
-
-
41949117705
-
Improved general regression network for protein domain boundary prediction
-
Yoo, P.D., Sikder, A.R., Zhou, B.B., and Zomaya, A.Y. (2008) Improved general regression network for protein domain boundary prediction. BMC Bioinformatics, 9(1 Suppl), S12.
-
(2008)
BMC Bioinformatics
, vol.9
, Issue.1
, pp. S12
-
-
Yoo, P.D.1
Sikder, A.R.2
Zhou, B.B.3
Zomaya, A.Y.4