메뉴 건너뛰기




Volumn 9, Issue 3, 2009, Pages 89-103

Remote homology detection using a kernel method that combines sequence and secondary-structure similarity scores

Author keywords

Remote homology detection; Secondary structures; Support vector machines

Indexed keywords

ACCURACY; AMINO ACID SEQUENCE; ARTICLE; COMPUTER PROGRAM; GENETIC CONSERVATION; KERNEL METHOD; PREDICTION; PROTEIN FAMILY; PROTEIN SECONDARY STRUCTURE; SCORING SYSTEM; SEQUENCE ALIGNMENT; SEQUENCE DATABASE; SEQUENCE HOMOLOGY; STRUCTURAL HOMOLOGY; SUPPORT VECTOR MACHINE;

EID: 67649876422     PISSN: 13866338     EISSN: None     Source Type: Journal    
DOI: 10.3233/ISB-2009-0390     Document Type: Article
Times cited : (7)

References (42)
  • 2
    • 33947238287 scopus 로고    scopus 로고
    • The Sorcerer II Global Ocean Sampling expedition: Northwest Atlantic through eastern tropical Pacific
    • Rusch, D. B., et al. (2007). The Sorcerer II Global Ocean Sampling expedition: Northwest Atlantic through eastern tropical Pacific. PLoS Biol. 5, e77.
    • (2007) PLoS Biol , vol.5
    • Rusch, D.B.1
  • 3
    • 33748776559 scopus 로고    scopus 로고
    • Automated protein function prediction-the genomic challenge
    • Friedberg, I. (2006). Automated protein function prediction-the genomic challenge. Brief. Bioinform. 7, 225-242.
    • (2006) Brief. Bioinform , vol.7 , pp. 225-242
    • Friedberg, I.1
  • 5
    • 0019887799 scopus 로고
    • Identification of common molecular subsequences
    • Smith, T. F. and Waterman, M. S. (1981). Identification of common molecular subsequences. J. Mol. Biol. 147, 195-197.
    • (1981) J. Mol. Biol , vol.147 , pp. 195-197
    • Smith, T.F.1    Waterman, M.S.2
  • 10
    • 0034776419 scopus 로고    scopus 로고
    • Automatic rule generation for protein annotation with the C4.5 data mining algorithm applied on SWISS-PROT
    • Kretschmann, E., Fleischmann, W. and Apweiler, R. (2001). Automatic rule generation for protein annotation with the C4.5 data mining algorithm applied on SWISS-PROT. Bioinformatics 17, 920-926.
    • (2001) Bioinformatics , vol.17 , pp. 920-926
    • Kretschmann, E.1    Fleischmann, W.2    Apweiler, R.3
  • 11
    • 0742287001 scopus 로고    scopus 로고
    • Combining pairwise sequence similarity and support vector machines for detecting remote protein evolutionary and structural relationships
    • Liao, L. and Noble, W. S. (2003). Combining pairwise sequence similarity and support vector machines for detecting remote protein evolutionary and structural relationships. J. Comput. Biol. 10, 857-868.
    • (2003) J. Comput. Biol , vol.10 , pp. 857-868
    • Liao, L.1    Noble, W.S.2
  • 12
    • 0022706389 scopus 로고
    • The relation between the divergence of sequence and structure in proteins
    • Chothia, C. and Lesk, A. M. (1986). The relation between the divergence of sequence and structure in proteins. EMBO J. 5, 823-826.
    • (1986) EMBO J , vol.5 , pp. 823-826
    • Chothia, C.1    Lesk, A.M.2
  • 13
    • 0028961335 scopus 로고
    • SCOP: A structural classification of proteinsdatabase for the investigation of sequences and structures
    • Murzin, A. G., Brenner, S. E., Hubbard, T. and Chothia, C. (1995). SCOP: a structural classification of proteinsdatabase for the investigation of sequences and structures. J. Mol. Biol. 247, 536-540.
    • (1995) J. Mol. Biol , vol.247 , pp. 536-540
    • Murzin, A.G.1    Brenner, S.E.2    Hubbard, T.3    Chothia, C.4
  • 14
    • 37549024654 scopus 로고    scopus 로고
    • The Protein Data Bank: A historical perspective
    • Berman, H. (2008). The Protein Data Bank: A historical perspective. Acta Crystallogr. A 64, 88-95.
    • (2008) Acta Crystallogr. A , vol.64 , pp. 88-95
    • Berman, H.1
  • 15
    • 0034649566 scopus 로고    scopus 로고
    • Analysis of the genome sequence of the flowering plant Arabidopsis thaliana
    • Arabidopsis Genome Initiative
    • Arabidopsis Genome Initiative (2000). Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 408, 796-815.
    • (2000) Nature , vol.408 , pp. 796-815
  • 17
    • 13244268370 scopus 로고    scopus 로고
    • GOtcha: A new method for prediction of protein function assessed by the annotation of seven genomes
    • Martin, D. M., Berriman, M. and Barton, G. J. (2004). GOtcha: A new method for prediction of protein function assessed by the annotation of seven genomes. BMC Bioinformatics 5, 178.
    • (2004) BMC Bioinformatics , vol.5 , pp. 178
    • Martin, D.M.1    Berriman, M.2    Barton, G.J.3
  • 19
    • 33748761629 scopus 로고    scopus 로고
    • Genome comparison using Gene Ontology (GO) with statistical testing
    • Cai, Z., Mao, X., Li, S. and Wei, L. (2006). Genome comparison using Gene Ontology (GO) with statistical testing. BMC Bioinformatics 7, 374.
    • (2006) BMC Bioinformatics , vol.7 , pp. 374
    • Cai, Z.1    Mao, X.2    Li, S.3    Wei, L.4
  • 21
    • 4444273377 scopus 로고    scopus 로고
    • Protein homology detection using string alignment kernels
    • Saigo, H., Vert, J. P., Ueda, N. and Akutsu, T. (2004). Protein homology detection using string alignment kernels. Bioinformatics 20, 1682-1689.
    • (2004) Bioinformatics , vol.20 , pp. 1682-1689
    • Saigo, H.1    Vert, J.P.2    Ueda, N.3    Akutsu, T.4
  • 22
    • 28444492998 scopus 로고    scopus 로고
    • Profile-based direct kernels for remote homology detection and fold recognition
    • Rangwala, H. and Karypis, G. (2005). Profile-based direct kernels for remote homology detection and fold recognition. Bioinformatics 21, 4239-4247.
    • (2005) Bioinformatics , vol.21 , pp. 4239-4247
    • Rangwala, H.1    Karypis, G.2
  • 23
    • 0036358995 scopus 로고    scopus 로고
    • The spectrum kernel: A string kernel for SVM protein classification
    • Leslie, C., Eskin, E. and Noble, W. S. (2002). The spectrum kernel: A string kernel for SVM protein classification. Pac. Symp. Biocomput. 7 564-575.
    • (2002) Pac. Symp. Biocomput , vol.7 , pp. 564-575
    • Leslie, C.1    Eskin, E.2    Noble, W.S.3
  • 24
    • 1542714925 scopus 로고    scopus 로고
    • Mismatch string kernels for discriminative protein classification
    • Leslie, C. S., Eskin, E., Cohen, A., Weston, J. and Noble, W. S. (2004). Mismatch string kernels for discriminative protein classification. Bioinformatics 20, 467-476.
    • (2004) Bioinformatics , vol.20 , pp. 467-476
    • Leslie, C.S.1    Eskin, E.2    Cohen, A.3    Weston, J.4    Noble, W.S.5
  • 25
    • 33846947543 scopus 로고    scopus 로고
    • Motif kernel generated by genetic programming improves remote homology and fold detection
    • Håndstad, T., Hestnes, A. J. and Saetrom, P. (2007). Motif kernel generated by genetic programming improves remote homology and fold detection. BMC Bioinformatics 8, 23.
    • (2007) BMC Bioinformatics , vol.8 , pp. 23
    • Håndstad, T.1    Hestnes, A.J.2    Saetrom, P.3
  • 26
    • 0028081403 scopus 로고
    • Structural features can be unconserved in proteins with similar folds. An analysis of side-chain to side-chain contacts secondary structure and accessibility
    • Russell, R. B. and Barton, G. J. (1994). Structural features can be unconserved in proteins with similar folds. An analysis of side-chain to side-chain contacts secondary structure and accessibility. J. Mol. Biol. 244, 332-350.
    • (1994) J. Mol. Biol , vol.244 , pp. 332-350
    • Russell, R.B.1    Barton, G.J.2
  • 27
    • 0034518144 scopus 로고    scopus 로고
    • Iterative sequence/secondary structure search for protein homologs: Comparison with amino acid sequence alignments and application to fold recognition in genome databases
    • Wallqvist, A., Fukunishi, Y., Murphy, L. R., Fadel, A. and Levy, R. M. (2000). Iterative sequence/secondary structure search for protein homologs: Comparison with amino acid sequence alignments and application to fold recognition in genome databases. Bioinformatics 16, 988-1002.
    • (2000) Bioinformatics , vol.16 , pp. 988-1002
    • Wallqvist, A.1    Fukunishi, Y.2    Murphy, L.R.3    Fadel, A.4    Levy, R.M.5
  • 28
    • 0042121000 scopus 로고    scopus 로고
    • ORFeus: Detection of distant homology using sequence profiles and predicted secondary structure
    • Ginalski, K., Pas, J., Wyrwicz, L. S., von Grotthuss, M., Bujnicki, J. M. and Rychlewski, L. (2003). ORFeus: Detection of distant homology using sequence profiles and predicted secondary structure. Nucleic Acids Res. 31, 3804-3807.
    • (2003) Nucleic Acids Res , vol.31 , pp. 3804-3807
    • Ginalski, K.1    Pas, J.2    Wyrwicz, L.S.3    von Grotthuss, M.4    Bujnicki, J.M.5    Rychlewski, L.6
  • 29
    • 0033578684 scopus 로고    scopus 로고
    • Protein secondary structure prediction based on position-specific scoring matrices
    • Jones, D. T. (1999). Protein secondary structure prediction based on position-specific scoring matrices. J. Mol. Biol. 292, 195-202.
    • (1999) J. Mol. Biol , vol.292 , pp. 195-202
    • Jones, D.T.1
  • 30
    • 0035967880 scopus 로고    scopus 로고
    • FUGUE: Sequence-structure homology recognition using environment-specific substitution tables and structure-dependent gap penalties
    • Shi, J., Blundell, T. L. and Mizuguchi, K. (2001). FUGUE: sequence-structure homology recognition using environment-specific substitution tables and structure-dependent gap penalties. J. Mol. Biol. 310, 243-257.
    • (2001) J. Mol. Biol , vol.310 , pp. 243-257
    • Shi, J.1    Blundell, T.L.2    Mizuguchi, K.3
  • 31
    • 13244255462 scopus 로고    scopus 로고
    • Protein family comparison using statistical models and predicted structural information
    • Chung, R. and Yona, G. (2004). Protein family comparison using statistical models and predicted structural information. BMC Bioinformatics 5, 183.
    • (2004) BMC Bioinformatics , vol.5 , pp. 183
    • Chung, R.1    Yona, G.2
  • 32
    • 0035109761 scopus 로고    scopus 로고
    • What are the baselines for protein fold recognition?
    • McGuffin, L. J., Bryson, K. and Jones, D. T. (2001). What are the baselines for protein fold recognition? Bioinformatics 17, 63-72.
    • (2001) Bioinformatics , vol.17 , pp. 63-72
    • McGuffin, L.J.1    Bryson, K.2    Jones, D.T.3
  • 33
    • 0036643498 scopus 로고    scopus 로고
    • Targeting novel folds for structural genomics
    • McGuffin, L. J. and Jones, D. T. (2002). Targeting novel folds for structural genomics. Proteins 48, 44-52.
    • (2002) Proteins , vol.48 , pp. 44-52
    • McGuffin, L.J.1    Jones, D.T.2
  • 34
    • 0036893072 scopus 로고    scopus 로고
    • Rapid protein domain assignment from amino acid sequence using predicted secondary structure
    • Marsden, R. L., McGuffin, L. J. and Jones, D. T. (2002). Rapid protein domain assignment from amino acid sequence using predicted secondary structure. Protein Sci. 11, 2814-2824.
    • (2002) Protein Sci , vol.11 , pp. 2814-2824
    • Marsden, R.L.1    McGuffin, L.J.2    Jones, D.T.3
  • 35
    • 6344261961 scopus 로고    scopus 로고
    • Remote homolog detection using local sequence-structure correlations
    • Hou, Y., Hsu, W., Lee, M. L. and Bystroff, C. (2004). Remote homolog detection using local sequence-structure correlations. Proteins 57 518-530.
    • (2004) Proteins , vol.57 , pp. 518-530
    • Hou, Y.1    Hsu, W.2    Lee, M.L.3    Bystroff, C.4
  • 36
    • 0032555696 scopus 로고    scopus 로고
    • Prediction of local structure in proteins using a library of sequence-structure motifs
    • Bystroff, C. and Baker, D. (1998). Prediction of local structure in proteins using a library of sequence-structure motifs. J. Mol. Biol. 281, 565-577.
    • (1998) J. Mol. Biol , vol.281 , pp. 565-577
    • Bystroff, C.1    Baker, D.2
  • 38
    • 0020997912 scopus 로고
    • Dictionary of protein secondary structure: Pattern recognition of hydrogen-bonded and geometrical features
    • Kabsch, W. and Sander, C. (1983). Dictionary of protein secondary structure: Pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22, 2577-2637.
    • (1983) Biopolymers , vol.22 , pp. 2577-2637
    • Kabsch, W.1    Sander, C.2
  • 42
    • 0023571979 scopus 로고
    • Evaluation and improvements in the automatic alignment of protein sequences
    • Barton, G. J. and Sternberg, M. J. (1987). Evaluation and improvements in the automatic alignment of protein sequences. Protein Eng. 1, 89-94.
    • (1987) Protein Eng , vol.1 , pp. 89-94
    • Barton, G.J.1    Sternberg, M.J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.