Volume 12, Issue 7, 2005, Pages 980-1003

Correcting BLAST e-values for low-complexity segments

Author keywords

BLAST; Extreme value distribution; Gene ontology; Low complexity sequences; SEG; Statistics of sequence similarity

Indexed keywords

ARTICLE; BLAST E-VALUE; COMPUTER PROGRAM; EVALUATION; GENE ONTOLOGY; GENE STRUCTURE; PRIORITY JOURNAL; REFERENCE VALUE; SEQUENCE HOMOLOGY; STATISTICAL ANALYSIS; STATISTICS;

EID: 25644451272     PISSN: 10665277     EISSN: None     Source Type: Journal    
DOI: 10.1089/cmb.2005.12.980     Document Type: Article
Times cited : (11)

References (37)
  • 1
    • 0036080338
    • Detecting cryptically simple protein sequences using the SIMPLE algorithm
    • Alba, M.M., Laskowski, R.A., and Hancock, J.M. 2002. Detecting cryptically simple protein sequences using the SIMPLE algorithm. Bioinformatics 18, 672-678.
    • (2002) Bioinformatics , vol.18 , pp. 672-678
    • Alba, M.M.1    Laskowski, R.A.2    Hancock, J.M.3
  • 2
    • 0029889221
    • Local alignment statistics
    • Altschul, S.F., and Gish, W. 1996. Local alignment statistics. Methods Enzymol. 266, 460-480.
    • (1996) Methods Enzymol. , vol.266 , pp. 460-480
    • Altschul, S.F.1    Gish, W.2
  • 3
    • 0000051438
    • An extreme value theory for sequence matching
    • Arratia, R., Gordon, L., and Waterman, M.S. 1986. An extreme value theory for sequence matching. Ann. Stat. 14, 971-993.
    • (1986) Ann. Stat. , vol.14 , pp. 971-993
    • Arratia, R.1    Gordon, L.2    Waterman, M.S.3
  • 4
    • 0001619220
    • A phase transition for the score in matching random sequences allowing deletions
    • Arratia, R., and Waterman, M.S. 1994. A phase transition for the score in matching random sequences allowing deletions. Ann. Appl. Prob. 4, 200-225.
    • (1994) Ann. Appl. Prob. , vol.4 , pp. 200-225
    • Arratia, R.1    Waterman, M.S.2
  • 8
    • 0001282761
    • Information enhancement methods for large scale sequence analysis
    • Claverie, J.M., and States, D.J. 1993. Information enhancement methods for large scale sequence analysis. Comput. Chem. 17, 191-201.
    • (1993) Comput. Chem. , vol.17 , pp. 191-201
    • Claverie, J.M.1    States, D.J.2
  • 9
    • 2942580909
    • Computational identification of transcription factor binding sites by functional analysis of sets of genes sharing overrepresented upstream motifs
    • Cora, D., Di Cunto, P., Provero, P., Silengo, L., and Caselle, M. 2004. Computational identification of transcription factor binding sites by functional analysis of sets of genes sharing overrepresented upstream motifs. BMC Bioinformatics 5, 57.
    • (2004) BMC Bioinformatics , vol.5 , pp. 57
    • Cora, D.1    Di Cunto, P.2    Provero, P.3    Silengo, L.4    Caselle, M.5
  • 10
    • 0032919372
    • Recent improvements of the ProDom database of protein domain families
    • Corpet, F., Gouzy, J., and Kahn, D. 1999. Recent improvements of the ProDom database of protein domain families. Nucl. Acids Res. 27, 263-267.
    • (1999) Nucl. Acids Res. , vol.27 , pp. 263-267
    • Corpet, F.1    Gouzy, J.2    Kahn, D.3
  • 11
    • 0000387249
    • Strong limit theorems of empirical functionals for large exceedances of partial sums of i.i.d variables
    • Dembo, A., and Karlin, S. 1991. Strong limit theorems of empirical functionals for large exceedances of partial sums of i.i.d variables. Ann. Prob. 19, 1737-1755.
    • (1991) Ann. Prob. , vol.19 , pp. 1737-1755
    • Dembo, A.1    Karlin, S.2
  • 12
    • 0000526801
    • Critical phenomena for sequence matching with scoring
    • Dembo, A., Karlin, S., and Zeitouni, O. 1994a. Critical phenomena for sequence matching with scoring. Ann. Prob. 22, 1993-2021.
    • (1994) Ann. Prob. , vol.22 , pp. 1993-2021
    • Dembo, A.1    Karlin, S.2    Zeitouni, O.3
  • 13
    • 0000526802
    • Limit distribution of maximal non-aligned two-sequence segmental score
    • Dembo, A., Karlin, S., and Zeitouni, O. 1994b. Limit distribution of maximal non-aligned two-sequence segmental score. Ann. Prob. 22, 2022-2039.
    • (1994) Ann. Prob. , vol.22 , pp. 2022-2039
    • Dembo, A.1    Karlin, S.2    Zeitouni, O.3
  • 16
    • 0033027083
    • Simple sequence is abundant in eukaryotic proteins
    • Golding, G.B. 1999. Simple sequence is abundant in eukaryotic proteins. Protein Sci. 8, 1358-1361.
    • (1999) Protein Sci. , vol.8 , pp. 1358-1361
    • Golding, G.B.1
  • 17
    • 0020083498
    • The meaning and use of the area under the receiver operating characteristic (ROC) curve
    • Hanley, J.A., and McNeil, B.J. 1982. The meaning and use of the area under the receiver operating characteristic (ROC) curve. Radiology 143, 29-36.
    • (1982) Radiology , vol.143 , pp. 29-36
    • Hanley, J.A.1    McNeil, B.J.2
  • 18
    • 0026458378
    • Amino acid substitution matrices from protein blocks
    • Henikoff, S., and Henikoff, J.G. 1992. Amino acid substitution matrices from protein blocks. Proc. Natl. Acad. Sci. USA 89, 10915-10919.
    • (1992) Proc. Natl. Acad. Sci. USA , vol.89 , pp. 10915-10919
    • Henikoff, S.1    Henikoff, J.G.2
  • 19
    • 0029977162
    • Using substitution probabilities to improve position-specific scoring matrices
    • Henikoff, J.G., and Henikoff, S. 1996. Using substitution probabilities to improve position-specific scoring matrices. Comp. Appl. Biosci. 12(2), 135-143.
    • (1996) Comp. Appl. Biosci. , vol.12 , Issue.2 , pp. 135-143
    • Henikoff, J.G.1    Henikoff, S.2
  • 20
    • 0025259313
    • Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes
    • Karlin, S., and Altschul, S.F. 1990. Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes. Proc. Natl. Acad. Sci. USA 87, 2264-2268.
    • (1990) Proc. Natl. Acad. Sci. USA , vol.87 , pp. 2264-2268
    • Karlin, S.1    Altschul, S.F.2
  • 21
    • 0027175241
    • Applications and statistics for multiple high-scoring segments in molecular sequences
    • Karlin, S., and Altschul, S.F. 1993. Applications and statistics for multiple high-scoring segments in molecular sequences. Proc. Natl. Acad. Sci. USA 90, 5873-5877.
    • (1993) Proc. Natl. Acad. Sci. USA , vol.90 , pp. 5873-5877
    • Karlin, S.1    Altschul, S.F.2
  • 22
    • 25644439091
    • Calibrating E-values for hidden Markov models with reverse-sequence null models
    • in press
    • Karplus, K., Karchin, R., and Hughey, R. 2005. Calibrating E-values for hidden Markov models with reverse-sequence null models. Bioinformatics, in press.
    • (2005) Bioinformatics
    • Karplus, K.1    Karchin, R.2    Hughey, R.3
  • 24
    • 0025952277
    • Divergence measures based on the Shannon entropy
    • Lin, J. 1991. Divergence measures based on the Shannon entropy. IEEE Trans. Info. Theory 37(1), 145-151.
    • (1991) IEEE Trans. Info. Theory , vol.37 , Issue.1 , pp. 145-151
    • Lin, J.1
  • 25
    • 0037480738
    • Investigating semantic similarity measures across the gene ontology: The relationship between sequence and annotation
    • Lord, P.W., Stevens, R.D., Brass, A., and Goble, C.A. 2003. Investigating semantic similarity measures across the gene ontology: The relationship between sequence and annotation. Bioinformatics 19, 1275-1283.
    • (2003) Bioinformatics , vol.19 , pp. 1275-1283
    • Lord, P.W.1    Stevens, R.D.2    Brass, A.3    Goble, C.A.4
  • 26
    • 0034647416
    • Accurate formula for P-values of gapped local sequence and profile alignments
    • Mott, R. 2000. Accurate formula for P-values of gapped local sequence and profile alignments. J. Mol. Biol. 300, 649-659.
    • (2000) J. Mol. Biol. , vol.300 , pp. 649-659
    • Mott, R.1
  • 27
    • 0032943842
    • Approximate statistics of gapped alignments
    • Mott, R., and Tribe, R. 1999. Approximate statistics of gapped alignments. J. Comp. Biol. 6, 91-112.
    • (1999) J. Comp. Biol. , vol.6 , pp. 91-112
    • Mott, R.1    Tribe, R.2
  • 31
    • 0030735796
    • Performance standards and evaluations in IR test collections: Cluster-based retrieval models
    • Shaw, W.M., Burgin, R., and Howell, P. 1997. Performance standards and evaluations in IR test collections: Cluster-based retrieval models. Information Processing and Management 33, 1-14.
    • (1997) Information Processing and Management , vol.33 , pp. 1-14
    • Shaw, W.M.1    Burgin, R.2    Howell, P.3
  • 32
    • 0022431785
    • The statistical distribution of nucleic acid similarities
    • Smith, T.F., Waterman, M.S., and Burks, C. 1985. The statistical distribution of nucleic acid similarities. Nucl. Acids Res. 13, 645-656.
    • (1985) Nucl. Acids Res. , vol.13 , pp. 645-656
    • Smith, T.F.1    Waterman, M.S.2    Burks, C.3
  • 33
    • 0028234758
    • Rapid and accurate estimates of statistical significance for sequence data base searches
    • Waterman, M.S., and Vingron, M. 1994. Rapid and accurate estimates of statistical significance for sequence data base searches. Proc. Natl. Acad. Sci. USA 91, 4625-4628.
    • (1994) Proc. Natl. Acad. Sci. USA , vol.91 , pp. 4625-4628
    • Waterman, M.S.1    Vingron, M.2
  • 34
    • 0028234347
    • Sequences with 'unusual' amino acid compositions
    • Wootton, J.C. 1994. Sequences with 'unusual' amino acid compositions. Curr. Opin. Struct. Biol. 4, 413-421.
    • (1994) Curr. Opin. Struct. Biol. , vol.4 , pp. 413-421
    • Wootton, J.C.1
  • 35
    • 0001514262
    • Statistics of local complexity in amino acid sequences and sequence databases
    • Wootton, J.C., and Federhen, S. 1993. Statistics of local complexity in amino acid sequences and sequence databases. Comp. Chem. 17, 149-163.
    • (1993) Comp. Chem. , vol.17 , pp. 149-163
    • Wootton, J.C.1    Federhen, S.2
  • 36
    • 1042269463
    • Shared relationship analysis: Ranking set cohesion and commonalities within a literature-derived relationship network
    • Wren, J.D., and Garner, H.R. 2004. Shared relationship analysis: Ranking set cohesion and commonalities within a literature-derived relationship network. Bioinformatics 20, 191-198.
    • (2004) Bioinformatics , vol.20 , pp. 191-198
    • Wren, J.D.1    Garner, H.R.2
  • 37
    • 0033705812
    • A unified sequence-structure classification of proteins: Combining sequence and structure in a map of protein space
    • Yona, G., and Levitt, M. 2000a. A unified sequence-structure classification of proteins: Combining sequence and structure in a map of protein space. Proc. RECOMB 2000, 308-317.
    • (2000) Proc. RECOMB , vol.2000 , pp. 308-317
    • Yona, G.1    Levitt, M.2


* This information was extracted and analyzed by KISTI from Elsevier's SCOPUS database.