메뉴 건너뛰기




Volumn 61, Issue 4, 2005, Pages 926-937

Sensitive detection of sequence similarity using combinatorial pattern discovery: A challenging study of two distantly related protein families

Author keywords

Protein families; Sensitivity detection; Sequence similarity

Indexed keywords

ALLERGEN; AMICYANIN; AURACYANIN A; AURACYANIN B; AZURIN; BACTERIAL PROTEIN; BINDING PROTEIN; COPPER; COPPER PROTEIN; CUPREDOXIN; CUSACYANIN; HALOCYANIN; OXIDOREDUCTASE; PLANTACYANIN; PLASTOCYANIN; POLYPEPTIDE; PROTEIN; PSEUDOAZURIN; RAGWEED ANTIGEN; RUSTICYANIN; STELLACYANINE; UMECYANIN; UNCLASSIFIED DRUG;

EID: 28644451639     PISSN: 08873585     EISSN: None     Source Type: Journal    
DOI: 10.1002/prot.20608     Document Type: Article
Times cited : (7)

References (70)
  • 1
    • 0032104511 scopus 로고    scopus 로고
    • Unification of protein families
    • Holm L. Unification of protein families, Curr Opin Struct Biol 1998;8:372-379.
    • (1998) Curr Opin Struct Biol , vol.8 , pp. 372-379
    • Holm, L.1
  • 4
    • 0031302793 scopus 로고    scopus 로고
    • Distant homology recognition using structural classification of proteins
    • Murzin AG, Bateman A. Distant homology recognition using structural classification of proteins. Proteins 1997;Suppl 1:105-112.
    • (1997) Proteins , Issue.SUPPL. 1 , pp. 105-112
    • Murzin, A.G.1    Bateman, A.2
  • 6
    • 0031839544 scopus 로고    scopus 로고
    • Superior performance in protein homology detection with the Blocks Database servers
    • Henikoff S, Pietrokovski S, Henikoff JG. Superior performance in protein homology detection with the Blocks Database servers. Nucleic Acids Res 1998;26:309-312.
    • (1998) Nucleic Acids Res , vol.26 , pp. 309-312
    • Henikoff, S.1    Pietrokovski, S.2    Henikoff, J.G.3
  • 7
    • 0031901903 scopus 로고    scopus 로고
    • Methods and statistics for combining motif match scores
    • Bailey TL, Gribskov M. Methods and statistics for combining motif match scores. J Comput Biol 1998;5:211-221.
    • (1998) J Comput Biol , vol.5 , pp. 211-221
    • Bailey, T.L.1    Gribskov, M.2
  • 8
    • 0031743421 scopus 로고    scopus 로고
    • Profile hidden Markov models
    • Eddy SR. Profile hidden Markov models. Bioinformatics 1998;14:755-763.
    • (1998) Bioinformatics , vol.14 , pp. 755-763
    • Eddy, S.R.1
  • 9
    • 0032438987 scopus 로고    scopus 로고
    • Hidden Markov models for detecting remote protein homologies
    • Karplus K, Barrett C, Hughey R. Hidden Markov models for detecting remote protein homologies. Bioinformatics 1998;14:846-856.
    • (1998) Bioinformatics , vol.14 , pp. 846-856
    • Karplus, K.1    Barrett, C.2    Hughey, R.3
  • 10
    • 0025830469 scopus 로고
    • A method to identify protein sequences that fold into a known three-dimensional structure
    • Bowie JU, Luthy R, Eisenberg D. A method to identify protein sequences that fold into a known three-dimensional structure. Science 1991;253:164-170.
    • (1991) Science , vol.253 , pp. 164-170
    • Bowie, J.U.1    Luthy, R.2    Eisenberg, D.3
  • 11
    • 0027302043 scopus 로고
    • Prediction of protein structure by evaluation of sequence-structure fitness. Aligning sequences to contact profiles derived from three-dimensional structures
    • Ouzounis C, Sander C, Scharf M, Schneider R. Prediction of protein structure by evaluation of sequence-structure fitness. Aligning sequences to contact profiles derived from three-dimensional structures. J Mol Biol 1993;232:805-825.
    • (1993) J Mol Biol , vol.232 , pp. 805-825
    • Ouzounis, C.1    Sander, C.2    Scharf, M.3    Schneider, R.4
  • 12
    • 0027459747 scopus 로고
    • Structural alignment of globing, phycocyanins and colicin a
    • Holm L, Sander C. Structural alignment of globing, phycocyanins and colicin A. FEBS Lett 1993;315:301-306.
    • (1993) FEBS Lett , vol.315 , pp. 301-306
    • Holm, L.1    Sander, C.2
  • 13
    • 0026490256 scopus 로고
    • Structure of a fibronectin type III domain from tenascin phased by MAD analysis of the selenomethionyl protein
    • Leahy DJ, Hendrickson WA, Aukhil I, Erickson HP. Structure of a fibronectin type III domain from tenascin phased by MAD analysis of the selenomethionyl protein. Science 1992;258:987-991.
    • (1992) Science , vol.258 , pp. 987-991
    • Leahy, D.J.1    Hendrickson, W.A.2    Aukhil, I.3    Erickson, H.P.4
  • 14
    • 0013776758 scopus 로고
    • Molecules as documents of evolutionary history
    • Zuckerkandl E, Pauling L. Molecules as documents of evolutionary history. J Theor Biol 1965;8:357-366.
    • (1965) J Theor Biol , vol.8 , pp. 357-366
    • Zuckerkandl, E.1    Pauling, L.2
  • 15
    • 0027057526 scopus 로고
    • A database of protein structure families with common folding motifs
    • Holm L, Ouzounis C, Sander C, Tuparev G, Vriend G. A database of protein structure families with common folding motifs. Protein Sci 1992;1:1691-1698.
    • (1992) Protein Sci , vol.1 , pp. 1691-1698
    • Holm, L.1    Ouzounis, C.2    Sander, C.3    Tuparev, G.4    Vriend, G.5
  • 16
    • 0029785147 scopus 로고    scopus 로고
    • Mapping the protein universe
    • Holm L, Sander C. Mapping the protein universe. Science 1996;273:595-603.
    • (1996) Science , vol.273 , pp. 595-603
    • Holm, L.1    Sander, C.2
  • 17
    • 0032104477 scopus 로고    scopus 로고
    • How far divergent evolution goes in proteins
    • Murzin AG. How far divergent evolution goes in proteins. Curr Opin Struct Biol 1998;8:380-387.
    • (1998) Curr Opin Struct Biol , vol.8 , pp. 380-387
    • Murzin, A.G.1
  • 18
    • 0022706389 scopus 로고
    • The relation between the divergence of sequence and structure in proteins
    • Chothia C, Lesk AM. The relation between the divergence of sequence and structure in proteins. EMBO J 1986;5:823-826.
    • (1986) EMBO J , vol.5 , pp. 823-826
    • Chothia, C.1    Lesk, A.M.2
  • 21
    • 0030623641 scopus 로고    scopus 로고
    • Predicting enzyme function from sequence: A systematic appraisal
    • Shah I, Hunter L. Predicting enzyme function from sequence: a systematic appraisal. Proc Int Conf Intell Syst Mol Biol 1997;5:276-283.
    • (1997) Proc Int Conf Intell Syst Mol Biol , vol.5 , pp. 276-283
    • Shah, I.1    Hunter, L.2
  • 22
    • 0037460964 scopus 로고    scopus 로고
    • Prediction of human protein function according to Gene Ontology categories
    • Jensen LJ, Gupta R, Staerfeldt HH, Brunak S. Prediction of human protein function according to Gene Ontology categories. Bioinformatics 2003;19:635-642.
    • (2003) Bioinformatics , vol.19 , pp. 635-642
    • Jensen, L.J.1    Gupta, R.2    Staerfeldt, H.H.3    Brunak, S.4
  • 23
    • 0037480738 scopus 로고    scopus 로고
    • Investigating semantic similarity measures across the Gene Ontology: The relationship between sequence and annotation
    • Lord PW, Stevens RD, Brass A, Goble CA, Investigating semantic similarity measures across the Gene Ontology: the relationship between sequence and annotation. Bioinformatics 2003;19:1275-1283.
    • (2003) Bioinformatics , vol.19 , pp. 1275-1283
    • Lord, P.W.1    Stevens, R.D.2    Brass, A.3    Goble, C.A.4
  • 24
    • 0019887799 scopus 로고
    • Identification of common molecular subsequences
    • Smith TF, Waterman MS. Identification of common molecular subsequences. J Mol Biol 1981;147:195-197.
    • (1981) J Mol Biol , vol.147 , pp. 195-197
    • Smith, T.F.1    Waterman, M.S.2
  • 26
    • 0025272240 scopus 로고
    • Rapid and sensitive sequence comparison with FASTP and FASTA
    • Pearson WR. Rapid and sensitive sequence comparison with FASTP and FASTA. Methods Enzymol 1990;183:63-98.
    • (1990) Methods Enzymol , vol.183 , pp. 63-98
    • Pearson, W.R.1
  • 29
    • 0034048878 scopus 로고    scopus 로고
    • A discriminative framework for detecting remote protein homologies
    • Jaakkola T, Diekhans M, Haussler D. A discriminative framework for detecting remote protein homologies. J Comput Biol 2000;7:95-114.
    • (2000) J Comput Biol , vol.7 , pp. 95-114
    • Jaakkola, T.1    Diekhans, M.2    Haussler, D.3
  • 31
    • 0031827544 scopus 로고    scopus 로고
    • Comparative accuracy of methods for protein sequence similarity search
    • Agarwal P, States DJ. Comparative accuracy of methods for protein sequence similarity search. Bioinformatics 1998;14:40-47.
    • (1998) Bioinformatics , vol.14 , pp. 40-47
    • Agarwal, P.1    States, D.J.2
  • 32
    • 0029889221 scopus 로고    scopus 로고
    • Local alignment statistics
    • Altschul SF, Gish W. Local alignment statistics. Methods Enzymol 1996;266:460-480.
    • (1996) Methods Enzymol , vol.266 , pp. 460-480
    • Altschul, S.F.1    Gish, W.2
  • 33
    • 0032568596 scopus 로고    scopus 로고
    • Assessing sequence comparison methods with reliable structurally identified distant evolutionary relationships
    • Brenner SE, Chothia C, Hubbard TJ. Assessing sequence comparison methods with reliable structurally identified distant evolutionary relationships. Proc Natl Acad Sci USA 1998;95:6073-6078.
    • (1998) Proc Natl Acad Sci USA , vol.95 , pp. 6073-6078
    • Brenner, S.E.1    Chothia, C.2    Hubbard, T.J.3
  • 34
    • 0032509105 scopus 로고    scopus 로고
    • Sequence comparisons using multiple sequences detect three times as many remote homologues as pairwise methods
    • Park J, Karplus K, Barrett C, Hughey R, Haussler D, Hubbard T, et al. Sequence comparisons using multiple sequences detect three times as many remote homologues as pairwise methods. J Mol Biol 1998;284:1201-1210.
    • (1998) J Mol Biol , vol.284 , pp. 1201-1210
    • Park, J.1    Karplus, K.2    Barrett, C.3    Hughey, R.4    Haussler, D.5    Hubbard, T.6
  • 35
    • 0036796379 scopus 로고    scopus 로고
    • A comparison of profile hidden Markov model procedures for remote homology detection
    • Madera M, Gough J. A comparison of profile hidden Markov model procedures for remote homology detection. Nucleic Acids Res 2002;30:4321-4328.
    • (2002) Nucleic Acids Res , vol.30 , pp. 4321-4328
    • Madera, M.1    Gough, J.2
  • 36
    • 0036166451 scopus 로고    scopus 로고
    • Classifying G-protein coupled receptors with support vector machines
    • Karchin R, Karplus K, Haussler D. Classifying G-protein coupled receptors with support vector machines. Bioinformatics 2002;18:147-159.
    • (2002) Bioinformatics , vol.18 , pp. 147-159
    • Karchin, R.1    Karplus, K.2    Haussler, D.3
  • 38
  • 39
    • 0028685354 scopus 로고
    • A generalized profile syntax for biomolecular sequence motifs and its function in automatic sequence interpretation
    • Bucher P, Bairoch A. A generalized profile syntax for biomolecular sequence motifs and its function in automatic sequence interpretation. Proc Int Conf Intell Syst Mol Biol 1994;2:53-61.
    • (1994) Proc Int Conf Intell Syst Mol Biol , vol.2 , pp. 53-61
    • Bucher, P.1    Bairoch, A.2
  • 40
    • 0028181441 scopus 로고
    • Hidden Markov models in computational biology. Applications to protein modeling
    • Krogh A, Brown M, Mian IS, Sjolander K, Haussler D. Hidden Markov models in computational biology. Applications to protein modeling. J Mol Biol 1994;235:1501-1531.
    • (1994) J Mol Biol , vol.235 , pp. 1501-1531
    • Krogh, A.1    Brown, M.2    Mian, I.S.3    Sjolander, K.4    Haussler, D.5
  • 43
    • 0026527758 scopus 로고
    • Pattern-induced multi-sequence alignment (PIMA) algorithm employing secondary structure-dependent gap penalties for use in comparative protein modelling
    • Smith RF, Smith TF. Pattern-induced multi-sequence alignment (PIMA) algorithm employing secondary structure-dependent gap penalties for use in comparative protein modelling. Protein Eng 1992;5:35-41.
    • (1992) Protein Eng , vol.5 , pp. 35-41
    • Smith, R.F.1    Smith, T.F.2
  • 45
    • 0032719352 scopus 로고    scopus 로고
    • Dictionary building via unsupervised hierarchical motif discovery in the sequence space of natural proteins
    • Rigoutsos I, Floratos A, Ouzounis C, Gao Y, Parida L. Dictionary building via unsupervised hierarchical motif discovery in the sequence space of natural proteins. Proteins 1999;37:264-277.
    • (1999) Proteins , vol.37 , pp. 264-277
    • Rigoutsos, I.1    Floratos, A.2    Ouzounis, C.3    Gao, Y.4    Parida, L.5
  • 47
    • 0037096851 scopus 로고    scopus 로고
    • Dictionary-driven prokaryotic gene finding
    • Shibuya T, Rigoutsos I. Dictionary-driven prokaryotic gene finding. Nucleic Acids Res 2002;30:2710-2725.
    • (2002) Nucleic Acids Res , vol.30 , pp. 2710-2725
    • Shibuya, T.1    Rigoutsos, I.2
  • 48
    • 0033671736 scopus 로고    scopus 로고
    • The emergence of pattern discovery techniques in computational biology
    • Rigoutsos I, Floratos A, Parida L, Gao Y, Platt D. The emergence of pattern discovery techniques in computational biology. Metab Eng 2000;2:159-177.
    • (2000) Metab Eng , vol.2 , pp. 159-177
    • Rigoutsos, I.1    Floratos, A.2    Parida, L.3    Gao, Y.4    Platt, D.5
  • 49
    • 0031684427 scopus 로고    scopus 로고
    • Combinatorial pattern discovery in biological sequences: The TEIRESIAS algorithm
    • Rigoutsos I, Floratos A. Combinatorial pattern discovery in biological sequences: The TEIRESIAS algorithm. Bioinformatics 1998;14:55-67.
    • (1998) Bioinformatics , vol.14 , pp. 55-67
    • Rigoutsos, I.1    Floratos, A.2
  • 51
    • 0036940872 scopus 로고    scopus 로고
    • A novel approach to remote homology detection: Jumping alignments
    • Spang R, Rehmsmeier M, Stoye J. A novel approach to remote homology detection: jumping alignments. J Comput Biol 2002;9:747-760.
    • (2002) J Comput Biol , vol.9 , pp. 747-760
    • Spang, R.1    Rehmsmeier, M.2    Stoye, J.3
  • 52
    • 0028961335 scopus 로고
    • SCOP: A structural classification of proteins database for the investigation of sequences and structures
    • Murzin AG, Brenner SE, Hubbard T, Chothia C. SCOP: a structural classification of proteins database for the investigation of sequences and structures. J Mol Biol 1995;247:536-540.
    • (1995) J Mol Biol , vol.247 , pp. 536-540
    • Murzin, A.G.1    Brenner, S.E.2    Hubbard, T.3    Chothia, C.4
  • 54
    • 0026060726 scopus 로고
    • A structure-derived sequence pattern for the detection of type I copper binding domains in distantly related proteins
    • Ouzounis C, Sander C. A structure-derived sequence pattern for the detection of type I copper binding domains in distantly related proteins. FEBS Lett 1991;279:73-78.
    • (1991) FEBS Lett , vol.279 , pp. 73-78
    • Ouzounis, C.1    Sander, C.2
  • 55
    • 0033841688 scopus 로고    scopus 로고
    • Blue , copper proteins: A comparative analysis of their molecular interaction properties
    • De Rienzo F, Gabdoulline RR, Menziani MC, Wade RC. Blue , copper proteins: a comparative analysis of their molecular interaction properties. Protein Sci 2000;9:1439-1454.
    • (2000) Protein Sci , vol.9 , pp. 1439-1454
    • De Rienzo, F.1    Gabdoulline, R.R.2    Menziani, M.C.3    Wade, R.C.4
  • 56
    • 2542539752 scopus 로고    scopus 로고
    • Computational approaches to structural and functional analysis of plastocyanin and other blue copper proteins
    • De Rienzo F, Gabdoulline RR, Wade RC, Sola M, Menziani MC. Computational approaches to structural and functional analysis of plastocyanin and other blue copper proteins. Cell Mol Life Sci 2004;61:1123-1142.
    • (2004) Cell Mol Life Sci , vol.61 , pp. 1123-1142
    • De Rienzo, F.1    Gabdoulline, R.R.2    Wade, R.C.3    Sola, M.4    Menziani, M.C.5
  • 57
    • 0020483945 scopus 로고
    • Evolution of proteins formed by beta-sheets. I. Plastocyanin and azurin
    • Chothia C, Lesk AM. Evolution of proteins formed by beta-sheets. I. Plastocyanin and azurin. J Mol Biol 1982;160:309-323.
    • (1982) J Mol Biol , vol.160 , pp. 309-323
    • Chothia, C.1    Lesk, A.M.2
  • 58
    • 0027193825 scopus 로고
    • Engineering type 1 copper sites in proteins
    • Canters GW, Gilardi G. Engineering type 1 copper sites in proteins. FEBS Lett 1993;325:39-48.
    • (1993) FEBS Lett , vol.325 , pp. 39-48
    • Canters, G.W.1    Gilardi, G.2
  • 59
    • 0027529952 scopus 로고
    • Evolution of protein complexity: The blue copper-containing oxidases and related proteins
    • Ryden LG, Hunt LT. Evolution of protein complexity: the blue copper-containing oxidases and related proteins. J Mol Evol 1993;36:41-66.
    • (1993) J Mol Evol , vol.36 , pp. 41-66
    • Ryden, L.G.1    Hunt, L.T.2
  • 60
    • 0025058305 scopus 로고
    • The blue oxidases, ascorbate oxidase, laccase and ceruloplasmin. Modelling and structural relationships
    • Messerschmidt A, Huber R. The blue oxidases, ascorbate oxidase, laccase and ceruloplasmin. Modelling and structural relationships. Eur J Biochem 1990;187:341-352.
    • (1990) Eur J Biochem , vol.187 , pp. 341-352
    • Messerschmidt, A.1    Huber, R.2
  • 63
    • 0033638015 scopus 로고    scopus 로고
    • CAST: An iterative algorithm for the complexity analysis of sequence tracts. Complexity analysis of sequence tracts
    • Promponas VJ, Enright AJ, Tsoka S, Kreil DP, Leroy C, Hamodrakas S, et al. CAST: an iterative algorithm for the complexity analysis of sequence tracts. Complexity analysis of sequence tracts. Bioinformatics 2000;16:915-922.
    • (2000) Bioinformatics , vol.16 , pp. 915-922
    • Promponas, V.J.1    Enright, A.J.2    Tsoka, S.3    Kreil, D.P.4    Leroy, C.5    Hamodrakas, S.6
  • 64
    • 0022591495 scopus 로고
    • The classification of amino acid conservation
    • Taylor WR. The classification of amino acid conservation. J Theor Biol 1986;119:205-218.
    • (1986) J Theor Biol , vol.119 , pp. 205-218
    • Taylor, W.R.1
  • 65
    • 0035841458 scopus 로고    scopus 로고
    • NR-grep: A fast and flexible pattern matching tool
    • Navarro G. NR-grep: a fast and flexible pattern matching tool. Software Pract Exp (SPE) 2001;31:1265-1312.
    • (2001) Software Pract Exp (SPE) , vol.31 , pp. 1265-1312
    • Navarro, G.1
  • 66
    • 0027968068 scopus 로고
    • CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice
    • Thompson JD, Higgins DG, Gibson TJ. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res 1994;22:4673-4680.
    • (1994) Nucleic Acids Res , vol.22 , pp. 4673-4680
    • Thompson, J.D.1    Higgins, D.G.2    Gibson, T.J.3
  • 67
    • 0030464460 scopus 로고    scopus 로고
    • SEAVIEW and PHYLO_WIN: Two graphic tools for sequence alignment and molecular phylogeny
    • Galtier N, Gouy M, Gautier C. SEAVIEW and PHYLO_WIN: two graphic tools for sequence alignment and molecular phylogeny. Comput Appl Biosci 1996;12:543-548.
    • (1996) Comput Appl Biosci , vol.12 , pp. 543-548
    • Galtier, N.1    Gouy, M.2    Gautier, C.3
  • 70
    • 0037350415 scopus 로고    scopus 로고
    • The phylogenetic extent of metabolic enzymes and pathways
    • Peregrin-Alvarez JM, Tsoka S, Ouzounis CA. The phylogenetic extent of metabolic enzymes and pathways. Genome Res 2003;13:422-427.
    • (2003) Genome Res , vol.13 , pp. 422-427
    • Peregrin-Alvarez, J.M.1    Tsoka, S.2    Ouzounis, C.A.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.