-
1
-
-
0036080338
-
Detecting cryptically simple protein sequences using the SIMPLE algorithm
-
Alba, M.M., Laskowski, R.A., and Hancock, J.M. 2002. Detecting cryptically simple protein sequences using the SIMPLE algorithm. Bioinformatics 18, 672-678.
-
(2002)
Bioinformatics
, vol.18
, pp. 672-678
-
-
Alba, M.M.1
Laskowski, R.A.2
Hancock, J.M.3
-
2
-
-
0029889221
-
Local alignment statistics
-
Altschul, S.F., and Gish, W. 1996. Local alignment statistics. Methods Enzymol. 266, 460-480.
-
(1996)
Methods Enzymol.
, vol.266
, pp. 460-480
-
-
Altschul, S.F.1
Gish, W.2
-
3
-
-
0000051438
-
An extreme value theory for sequence matching
-
Arratia, R., Gordon, L., and Waterman, M.S. 1986. An extreme value theory for sequence matching. Ann. Stat. 14, 971-993.
-
(1986)
Ann. Stat.
, vol.14
, pp. 971-993
-
-
Arratia, R.1
Gordon, L.2
Waterman, M.S.3
-
4
-
-
0001619220
-
A phase transition for the score in matching random sequences allowing deletions
-
Arratia, R., and Waterman, M.S. 1994. A phase transition for the score in matching random sequences allowing deletions. Ann. Appl. Prob. 4, 200-225.
-
(1994)
Ann. Appl. Prob.
, vol.4
, pp. 200-225
-
-
Arratia, R.1
Waterman, M.S.2
-
5
-
-
0034069495
-
Gene ontology: Tool for the unification of biology
-
The Gene Ontology Consortium
-
Ashburner, M., Ball, C.A., Blake, J.A., Botstein, D., Butler, H., Cherry, J.M., Davis, A.P., Dolinski, K., Dwight, S.S., Eppig, J.T., Harris, M.A., Hill, D.P., Issel-Tarver, L., Kasarskis, A., Lewis, S., Matese, J.C., Richardson, J.E., Ringwald, M., Rubin, G.M., and Sherlock, G. 2000. Gene ontology: Tool for the unification of biology. The Gene Ontology Consortium. Nat. Genet. 25, 25-29.
-
(2000)
Nat. Genet.
, vol.25
, pp. 25-29
-
-
Ashburner, M.1
Ball, C.A.2
Blake, J.A.3
Botstein, D.4
Butler, H.5
Cherry, J.M.6
Davis, A.P.7
Dolinski, K.8
Dwight, S.S.9
Eppig, J.T.10
Harris, M.A.11
Hill, D.P.12
Issel-Tarver, L.13
Kasarskis, A.14
Lewis, S.15
Matese, J.C.16
Richardson, J.E.17
Ringwald, M.18
Rubin, G.M.19
Sherlock, G.20
more..
-
6
-
-
0032918028
-
PRINTS prepares for the new millennium
-
Attwood, T.K., Flower, D.R., Lewis, A.P., Mabey, J.E., Morgan, S.R., Scordis, P., Selley, J., and Wright, W. 1999. PRINTS prepares for the new millennium. Nucl. Acids Res. 27, 220-225.
-
(1999)
Nucl. Acids Res.
, vol.27
, pp. 220-225
-
-
Attwood, T.K.1
Flower, D.R.2
Lewis, A.P.3
Mabey, J.E.4
Morgan, S.R.5
Scordis, P.6
Selley, J.7
Wright, W.8
-
7
-
-
0032952229
-
Pfam 3.1: 1313 multiple alignments and profile HMMs match the majority of proteins
-
Bateman, A., Birney, E., Durbin, R., Eddy, S.R., Finn R.D., and Sonnhammer E.L. 1999. Pfam 3.1: 1313 multiple alignments and profile HMMs match the majority of proteins. Nucl. Acids Res. 27, 260-262.
-
(1999)
Nucl. Acids Res.
, vol.27
, pp. 260-262
-
-
Bateman, A.1
Birney, E.2
Durbin, R.3
Eddy, S.R.4
Finn, R.D.5
Sonnhammer, E.L.6
-
8
-
-
0001282761
-
Information enhancement methods for large scale sequence analysis
-
Claverie, J.M., and States, D.J. 1993. Information enhancement methods for large scale sequence analysis. Comput. Chem. 17, 191-201.
-
(1993)
Comput. Chem.
, vol.17
, pp. 191-201
-
-
Claverie, J.M.1
States, D.J.2
-
9
-
-
2942580909
-
Computational identification of transcription factor binding sites by functional analysis of sets of genes sharing overrep-resented upstream motifs
-
Cora, D., Di Cunto, P., Provero, P., Silengo, L., and Caselle, M. 2004. Computational identification of transcription factor binding sites by functional analysis of sets of genes sharing overrep-resented upstream motifs. BMC Bioinformatics 5, 57.
-
(2004)
BMC Bioinformatics
, vol.5
, pp. 57
-
-
Cora, D.1
Di Cunto, P.2
Provero, P.3
Silengo, L.4
Caselle, M.5
-
10
-
-
0032919372
-
Recent improvements of the ProDom database of protein domain families
-
Corpet, F., Gouzy, J., and Kahn, D. 1999. Recent improvements of the ProDom database of protein domain families. Nucl. Acids Res. 27, 263-267.
-
(1999)
Nucl. Acids Res.
, vol.27
, pp. 263-267
-
-
Corpet, F.1
Gouzy, J.2
Kahn, D.3
-
11
-
-
0000387249
-
Strong limit theorems of empirical functionals for large exceedances of partial sums of i.i.d variables
-
Dembo, A., and Karlin, S. 1991. Strong limit theorems of empirical functionals for large exceedances of partial sums of i.i.d variables. Ann. Prob. 19, 1737-1755.
-
(1991)
Ann. Prob.
, vol.19
, pp. 1737-1755
-
-
Dembo, A.1
Karlin, S.2
-
12
-
-
0000526801
-
Critical phenomena for sequence matching with scoring
-
Dembo, A., Karlin, S., and Zeitouni, O. 1994a. Critical phenomena for sequence matching with scoring. Ann. Prob. 22, 1993-2021.
-
(1994)
Ann. Prob.
, vol.22
, pp. 1993-2021
-
-
Dembo, A.1
Karlin, S.2
Zeitouni, O.3
-
13
-
-
0000526802
-
Limit distribution of maximal non-aligned two-sequence segmental score
-
Dembo, A., Karlin, S., and Zeitouni, O. 1994b. Limit distribution of maximal non-aligned two-sequence segmental score. Ann. Prob. 22, 2022-2039.
-
(1994)
Ann. Prob.
, vol.22
, pp. 2022-2039
-
-
Dembo, A.1
Karlin, S.2
Zeitouni, O.3
-
14
-
-
84898935075
-
Agnostic classification of Markovian sequences
-
El-Yaniv, R., Fine, S., and Tishby, N. 1997. Agnostic classification of Markovian sequences. Advances in Neural Information Processing Systems 10, 465-471.
-
(1997)
Advances in Neural Information Processing Systems
, vol.10
, pp. 465-471
-
-
El-Yaniv, R.1
Fine, S.2
Tishby, N.3
-
16
-
-
0033027083
-
Simple sequence is abundant in eukaryotic proteins
-
Golding, G.B. 1999. Simple sequence is abundant in eukaryotic proteins. Protein Sci. 8, 1358-1361.
-
(1999)
Protein Sci.
, vol.8
, pp. 1358-1361
-
-
Golding, G.B.1
-
17
-
-
0020083498
-
The meaning and use of the area under the receiver operating characteristic (ROC) curve
-
Hanley, J.A., and McNeil, B.J. 1982. The meaning and use of the area under the receiver operating characteristic (ROC) curve. Radiology 143, 29-36.
-
(1982)
Radiology
, vol.143
, pp. 29-36
-
-
Hanley, J.A.1
McNeil, B.J.2
-
18
-
-
0026458378
-
Amino acid substitution matrices from protein blocks
-
Henikoff, S., and Henikoff, J.G. 1992. Amino acid substitution matrices from protein blocks. Proc. Natl. Acad. Sci. USA 89, 10915-10919.
-
(1992)
Proc. Natl. Acad. Sci. USA
, vol.89
, pp. 10915-10919
-
-
Henikoff, S.1
Henikoff, J.G.2
-
19
-
-
0029977162
-
Using substitution probabilities to improve position-specific scoring matrices
-
Henikoff, J.G., and Henikoff, S. 1996. Using substitution probabilities to improve position-specific scoring matrices. Comp. Appl. Biosci. 12(2), 135-143.
-
(1996)
Comp. Appl. Biosci.
, vol.12
, Issue.2
, pp. 135-143
-
-
Henikoff, J.G.1
Henikoff, S.2
-
20
-
-
0025259313
-
Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes
-
Karlin, S., and Altschul, S.F. 1990. Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes. Proc. Natl. Acad. Sci. USA 87, 2264-2268.
-
(1990)
Proc. Natl. Acad. Sci. USA
, vol.87
, pp. 2264-2268
-
-
Karlin, S.1
Altschul, S.F.2
-
21
-
-
0027175241
-
Applications and statistics for multiple high-scoring segments in molecular sequences
-
Karlin, S., and Altschul, S.F. 1993. Applications and statistics for multiple high-scoring segments in molecular sequences. Proc. Natl. Acad. Sci. USA 90, 5873-5877.
-
(1993)
Proc. Natl. Acad. Sci. USA
, vol.90
, pp. 5873-5877
-
-
Karlin, S.1
Altschul, S.F.2
-
22
-
-
25644439091
-
Calibrating E-values for hidden Markov models with reverse-sequence null models
-
in press
-
Karplus, K., Karchin, R., and Hughey, R. 2005. Calibrating E-values for hidden Markov models with reverse-sequence null models. Bioinformatics, in press.
-
(2005)
Bioinformatics
-
-
Karplus, K.1
Karchin, R.2
Hughey, R.3
-
24
-
-
0025952277
-
Divergence measures based on the Shannon entropy
-
Lin, J. 1991. Divergence measures based on the Shannon entropy. IEEE Trans. Info. Theory 37(1), 145-151.
-
(1991)
IEEE Trans. Info. Theory
, vol.37
, Issue.1
, pp. 145-151
-
-
Lin, J.1
-
25
-
-
0037480738
-
Investigating semantic similarity measures across the gene ontology: The relationship between sequence and annotation
-
Lord, P.W., Stevens, R.D., Brass, A., and Goble, C.A. 2003. Investigating semantic similarity measures across the gene ontology: The relationship between sequence and annotation. Bioinformatics 19, 1275-1283.
-
(2003)
Bioinformatics
, vol.19
, pp. 1275-1283
-
-
Lord, P.W.1
Stevens, R.D.2
Brass, A.3
Goble, C.A.4
-
26
-
-
0034647416
-
Accurate formula for P-values of gapped local sequence and profile alignments
-
Mott, R. 2000. Accurate formula for P-values of gapped local sequence and profile alignments. J. Mol. Biol. 300, 649-659.
-
(2000)
J. Mol. Biol.
, vol.300
, pp. 649-659
-
-
Mott, R.1
-
27
-
-
0032943842
-
Approximate statistics of gapped alignments
-
Mott, R., and Tribe, R. 1999. Approximate statistics of gapped alignments. J. Comp. Biol. 6, 91-112.
-
(1999)
J. Comp. Biol.
, vol.6
, pp. 91-112
-
-
Mott, R.1
Tribe, R.2
-
28
-
-
0033638015
-
CAST: An iterative algorithm for the complexity analysis of sequence tracts
-
Promponas, V.J., Enright, A.J., Tsoka, S.T., Kreil, D.P., Leroy, C., Hamodrakas, S., Sander, C., and Ouzounis, C.A. 2000. CAST: An iterative algorithm for the complexity analysis of sequence tracts. Bioinformatics 16, 915-922.
-
(2000)
Bioinformatics
, vol.16
, pp. 915-922
-
-
Promponas, V.J.1
Enright, A.J.2
Tsoka, S.T.3
Kreil, D.P.4
Leroy, C.5
Hamodrakas, S.6
Sander, C.7
Ouzounis, C.A.8
-
29
-
-
0035188314
-
Sequence complexity of disordered protein
-
Romero, P., Obradovic, Z., Li, X., Garner, E.G., Brown, C.J., and Dunker, A.K. 2001. Sequence complexity of disordered protein. Proteins 42, 38-48.
-
(2001)
Proteins
, vol.42
, pp. 38-48
-
-
Romero, P.1
Obradovic, Z.2
Li, X.3
Garner, E.G.4
Brown, C.J.5
Dunker, A.K.6
-
30
-
-
0035878724
-
Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements
-
Schaffer, A.A., Aravind, L., Madden, T.L., Shavirin, S., Spouge, J.L., Wolf, Y.I., Koonin, E.V., and Altschul, S.F. 2001. Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements. Nucl. Acids Res. 29, 2994-3005.
-
(2001)
Nucl. Acids Res.
, vol.29
, pp. 2994-3005
-
-
Schaffer, A.A.1
Aravind, L.2
Madden, T.L.3
Shavirin, S.4
Spouge, J.L.5
Wolf, Y.I.6
Koonin, E.V.7
Altschul, S.F.8
-
31
-
-
0030735796
-
Performance standards and evaluations in IR test collections: Cluster-based retrieval models
-
Shaw, W.M., Burgin, R., and Howell, P. 1997. Performance standards and evaluations in IR test collections: Cluster-based retrieval models. Information Processing and Management 33, 1-14.
-
(1997)
Information Processing and Management
, vol.33
, pp. 1-14
-
-
Shaw, W.M.1
Burgin, R.2
Howell, P.3
-
32
-
-
0022431785
-
The statistical distribution of nucleic acid similarities
-
Smith, T.F., Waterman, M.S., and Burks, C. 1985. The statistical distribution of nucleic acid similarities. Nucl. Acids. Res. 13, 645-656.
-
(1985)
Nucl. Acids. Res.
, vol.13
, pp. 645-656
-
-
Smith, T.F.1
Waterman, M.S.2
Burks, C.3
-
33
-
-
0028234758
-
Rapid and accurate estimates of statistical significance for sequence data base searches
-
Waterman, M.S., and Vingron, M. 1994. Rapid and accurate estimates of statistical significance for sequence data base searches. Proc. Natl. Acad. Sci. USA 91, 4625-4628.
-
(1994)
Proc. Natl. Acad. Sci. USA
, vol.91
, pp. 4625-4628
-
-
Waterman, M.S.1
Vingron, M.2
-
34
-
-
0028234347
-
Sequences with 'unusual' amino acid compositions
-
Wootton, J.C. 1994. Sequences with 'unusual' amino acid compositions. Curr. Opin. Struct. Biol. 4, 413-421.
-
(1994)
Curr. Opin. Struct. Biol.
, vol.4
, pp. 413-421
-
-
Wootton, J.C.1
-
35
-
-
0001514262
-
Statistics of local complexity in amino acid sequences and sequence databases
-
Wootton, J.C., and Federhen, S. 1993. Statistics of local complexity in amino acid sequences and sequence databases. Comp. Chem. 17, 149-163.
-
(1993)
Comp. Chem.
, vol.17
, pp. 149-163
-
-
Wootton, J.C.1
Federhen, S.2
-
36
-
-
1042269463
-
Shared relationship analysis: Ranking set cohesion and commonalities within a literature-derived relationship network
-
Wren, J.D., and Garner, H.R. 2004. Shared relationship analysis: Ranking set cohesion and commonalities within a literature-derived relationship network. Bioinformatics 20, 191-198.
-
(2004)
Bioinformatics
, vol.20
, pp. 191-198
-
-
Wren, J.D.1
Garner, H.R.2
-
37
-
-
0033705812
-
A unified sequence-structure classification of proteins: Combining sequence and structure in a map of protein space
-
Yona, G., and Levitt, M. 2000a. A unified sequence-structure classification of proteins: Combining sequence and structure in a map of protein space. Proc. RECOMB 2000, 308-317.
-
(2000)
Proc. RECOMB
, vol.2000
, pp. 308-317
-
-
Yona, G.1
Levitt, M.2
|