메뉴 건너뛰기




Volumn 253, Issue 2, 2008, Pages 375-380

Predicting protein structural class by SVM with class-wise optimized features and decision probabilities

Author keywords

Multi class SVM; Probability outputs SVM; SCOP class classification

Indexed keywords

ALGORITHM; DATA SET; DECISION ANALYSIS; PREDICTION; PROBABILITY; PROTEIN;

EID: 45649085108     PISSN: 00225193     EISSN: 10958541     Source Type: Journal    
DOI: 10.1016/j.jtbi.2008.02.031     Document Type: Article
Times cited : (57)

References (57)
  • 1
    • 0037076322 scopus 로고    scopus 로고
    • Selection bias in gene extraction on the basis of microarray gene-expression data
    • Ambroise C., and McLachlan G.J. Selection bias in gene extraction on the basis of microarray gene-expression data. Proc. Natl. Acad. Sci. 99 (2002) 6562-6566
    • (2002) Proc. Natl. Acad. Sci. , vol.99 , pp. 6562-6566
    • Ambroise, C.1    McLachlan, G.J.2
  • 2
    • 0000481188 scopus 로고
    • An empirical distribution function for sampling with incomplete information
    • Ayer M., Brunk H., Ewing G., Reid W., and Silverman E. An empirical distribution function for sampling with incomplete information. Ann. Math. Stat. 26 4 (1955) 641-647
    • (1955) Ann. Math. Stat. , vol.26 , Issue.4 , pp. 641-647
    • Ayer, M.1    Brunk, H.2    Ewing, G.3    Reid, W.4    Silverman, E.5
  • 3
    • 45649084885 scopus 로고    scopus 로고
    • Bottou, L., Cortes, C., Denker, J., Drucker, H., Guyon, I., Jackel, L., LeCun, Y., Muller, U., Sackinger, E., Simard, P., Vapnik, V., 1994. Comparison of classifier methods: a case study in handwriting digit recognition. In: Proc. Int. Conf. Pattern Recognition, pp. 77-87.
    • Bottou, L., Cortes, C., Denker, J., Drucker, H., Guyon, I., Jackel, L., LeCun, Y., Muller, U., Sackinger, E., Simard, P., Vapnik, V., 1994. Comparison of classifier methods: a case study in handwriting digit recognition. In: Proc. Int. Conf. Pattern Recognition, pp. 77-87.
  • 4
    • 2942601555 scopus 로고    scopus 로고
    • Support vector machines for predicting protein structural class
    • Cai Y.-D., Liu X.-J., Xu X.-B., and Zhou G.-P. Support vector machines for predicting protein structural class. BMC Bioinform. 2 (2001) 3
    • (2001) BMC Bioinform. , vol.2 , pp. 3
    • Cai, Y.-D.1    Liu, X.-J.2    Xu, X.-B.3    Zhou, G.-P.4
  • 5
    • 0037423777 scopus 로고    scopus 로고
    • Support vector machines for prediction of protein domain structural class
    • Cai Y.-D., Liu X.-J., Xu X.-B., and Chou K.-C. Support vector machines for prediction of protein domain structural class. J. Theor. Biol. 221 (2003) 115-120
    • (2003) J. Theor. Biol. , vol.221 , pp. 115-120
    • Cai, Y.-D.1    Liu, X.-J.2    Xu, X.-B.3    Chou, K.-C.4
  • 6
    • 28444439947 scopus 로고    scopus 로고
    • Using logitboost classifier to predict protein structural classes
    • Cai Y.-D., Feng K.-Y., Lu W.-C., and Chou K.-C. Using logitboost classifier to predict protein structural classes. J. Theor. Biol. 238 (2006) 172-176
    • (2006) J. Theor. Biol. , vol.238 , pp. 172-176
    • Cai, Y.-D.1    Feng, K.-Y.2    Lu, W.-C.3    Chou, K.-C.4
  • 8
    • 45649083943 scopus 로고    scopus 로고
    • Chai, H., Domeniconi, C., 2004. An evaluation of gene selection methods for multi-class microarray data classification. In: T. Scheffer (Ed.), Proceedings of the Second European Workshop on Data Mining and Text Mining in Bioinformatics, pp. 3-10.
    • Chai, H., Domeniconi, C., 2004. An evaluation of gene selection methods for multi-class microarray data classification. In: T. Scheffer (Ed.), Proceedings of the Second European Workshop on Data Mining and Text Mining in Bioinformatics, pp. 3-10.
  • 10
    • 45649083174 scopus 로고    scopus 로고
    • Chang, C.-C., Lin, C.-J., 2001. LIBSVM: a library for support vector machines 2001. Software available at 〈http://www.csie.ntu.edu.tw/~cjlin/libsvm〉.
    • Chang, C.-C., Lin, C.-J., 2001. LIBSVM: a library for support vector machines 2001. Software available at 〈http://www.csie.ntu.edu.tw/~cjlin/libsvm〉.
  • 11
    • 33750475941 scopus 로고    scopus 로고
    • Using pseudo-amino acid composition and support vector machine to predict protein structural class
    • Chen C., Tian Y.-X., Zou X.-Y., Cai P.-X., and Mo J.-Y. Using pseudo-amino acid composition and support vector machine to predict protein structural class. J. Theor. Biol. 243 (2006) 444-448
    • (2006) J. Theor. Biol. , vol.243 , pp. 444-448
    • Chen, C.1    Tian, Y.-X.2    Zou, X.-Y.3    Cai, P.-X.4    Mo, J.-Y.5
  • 12
    • 33748287103 scopus 로고    scopus 로고
    • Predicting protein structural class with pseudo-amino acid composition and support vector machine fusion network
    • Chen C., Zhou X., Tian Y., Zou X., and Cai P. Predicting protein structural class with pseudo-amino acid composition and support vector machine fusion network. Anal. Biochem. 357 (2006) 116-121
    • (2006) Anal. Biochem. , vol.357 , pp. 116-121
    • Chen, C.1    Zhou, X.2    Tian, Y.3    Zou, X.4    Cai, P.5
  • 13
    • 35348911371 scopus 로고    scopus 로고
    • Chen, K.E., Lukasz, K., Jishou, R., 2007. Prediction of protein structural class using PSI-BLAST profile based collocation of amino acid pairs. The 1st International Conference on Bioinformatics and Biomedical Engineering, pp. 17-20.
    • Chen, K.E., Lukasz, K., Jishou, R., 2007. Prediction of protein structural class using PSI-BLAST profile based collocation of amino acid pairs. The 1st International Conference on Bioinformatics and Biomedical Engineering, pp. 17-20.
  • 14
    • 0029051959 scopus 로고
    • A novel approach to predicting protein structural classes in a (20-1)-D amino acid composition space
    • Chou K.C. A novel approach to predicting protein structural classes in a (20-1)-D amino acid composition space. Proteins 21 4 (1995) 319-344
    • (1995) Proteins , vol.21 , Issue.4 , pp. 319-344
    • Chou, K.C.1
  • 15
    • 0034285487 scopus 로고    scopus 로고
    • Prediction of protein structural classes and subcellular locations
    • Chou K.C. Prediction of protein structural classes and subcellular locations. Curr. Protein Pept. Sci. 1 2 (2000) 171-208
    • (2000) Curr. Protein Pept. Sci. , vol.1 , Issue.2 , pp. 171-208
    • Chou, K.C.1
  • 16
    • 3843117638 scopus 로고    scopus 로고
    • Predicting protein structural class by functional domain composition
    • Chou K.-C., and Cai Y.-D. Predicting protein structural class by functional domain composition. Biochem. Biophys. Res. Commun. 321 (2004) 1007-1009
    • (2004) Biochem. Biophys. Res. Commun. , vol.321 , pp. 1007-1009
    • Chou, K.-C.1    Cai, Y.-D.2
  • 17
    • 0018110116 scopus 로고
    • Prediction of the secondary structure of proteins from their amino acid sequence
    • Chou P.Y., and Fasman G.D. Prediction of the secondary structure of proteins from their amino acid sequence. Adv. Enzymol. Relat. Areas Mol. Biol. 47 (1978) 45-148
    • (1978) Adv. Enzymol. Relat. Areas Mol. Biol. , vol.47 , pp. 45-148
    • Chou, P.Y.1    Fasman, G.D.2
  • 18
    • 0031928664 scopus 로고    scopus 로고
    • Domain structural class prediction
    • Chou K.-C., and Maggiora G.M. Domain structural class prediction. Protein Eng. 11 7 (1998) 523-538
    • (1998) Protein Eng. , vol.11 , Issue.7 , pp. 523-538
    • Chou, K.-C.1    Maggiora, G.M.2
  • 19
    • 0028121983 scopus 로고
    • Predicting protein-folding types by distance functions that make allowances for amino-acid interactions
    • Chou K.C., and Zhang C.T. Predicting protein-folding types by distance functions that make allowances for amino-acid interactions. J. Biol. Chem. 269 (1994) 22014-22020
    • (1994) J. Biol. Chem. , vol.269 , pp. 22014-22020
    • Chou, K.C.1    Zhang, C.T.2
  • 20
    • 0010442827 scopus 로고    scopus 로고
    • On the algorithmic implementation of multiclass kernel-based vector machines
    • Crammer K., and Singer Y. On the algorithmic implementation of multiclass kernel-based vector machines. J. Mach. Learn. Res. 2 (2001) 265-292
    • (2001) J. Mach. Learn. Res. , vol.2 , pp. 265-292
    • Crammer, K.1    Singer, Y.2
  • 21
    • 34548697717 scopus 로고    scopus 로고
    • Prediction of protein structure classes with pseudo amino acid composition and fuzzy support vector machine network
    • Ding Y.-S., Zhang T.-L., and Chou K.-C. Prediction of protein structure classes with pseudo amino acid composition and fuzzy support vector machine network. Protein Pept. Lett. 14 8 (2007) 811-815
    • (2007) Protein Pept. Lett. , vol.14 , Issue.8 , pp. 811-815
    • Ding, Y.-S.1    Zhang, T.-L.2    Chou, K.-C.3
  • 22
    • 33744988605 scopus 로고    scopus 로고
    • A contact energy function considering residue hydrophobic environment and its application in protein fold recognition
    • Duan M.J., and Zhou Y.H. A contact energy function considering residue hydrophobic environment and its application in protein fold recognition. Genomics Proteomics Bioinformatics 3 4 (2005) 218-224
    • (2005) Genomics Proteomics Bioinformatics , vol.3 , Issue.4 , pp. 218-224
    • Duan, M.J.1    Zhou, Y.H.2
  • 23
    • 35248866563 scopus 로고    scopus 로고
    • Multi-category classification by soft-max combination of binary classifiers
    • Windeatt T., and Roli F. (Eds), Springer, Heidelberg
    • Duan K., Keerthi S.S., Chu W., Shevade S.K., and Poo A.N. Multi-category classification by soft-max combination of binary classifiers. In: Windeatt T., and Roli F. (Eds). MCS 2003, LNCS vol. 2709 (2003), Springer, Heidelberg 125-134
    • (2003) MCS 2003, LNCS , vol.2709 , pp. 125-134
    • Duan, K.1    Keerthi, S.S.2    Chu, W.3    Shevade, S.K.4    Poo, A.N.5
  • 24
    • 45649083249 scopus 로고    scopus 로고
    • Duembgen, L., 2000. Available at 〈http://www.staff.unibe.ch/duembgen/software/#Isotone〉.
    • Duembgen, L., 2000. Available at 〈http://www.staff.unibe.ch/duembgen/software/#Isotone〉.
  • 25
    • 22144470177 scopus 로고    scopus 로고
    • Boosting classifier for predicting protein domain structural class
    • Feng K.Y., Cai Y.D., and Chou K.C. Boosting classifier for predicting protein domain structural class. Biochem. Biophys. Res. Commun. 334 1 (2005) 213-217
    • (2005) Biochem. Biophys. Res. Commun. , vol.334 , Issue.1 , pp. 213-217
    • Feng, K.Y.1    Cai, Y.D.2    Chou, K.C.3
  • 26
    • 0036161259 scopus 로고    scopus 로고
    • Gene selection for cancer classification using support vector machines
    • Guyon I., Weston J., Barnhill S., and Vapnik V. Gene selection for cancer classification using support vector machines. Mach. Learn. 46 (2002) 389-422
    • (2002) Mach. Learn. , vol.46 , pp. 389-422
    • Guyon, I.1    Weston, J.2    Barnhill, S.3    Vapnik, V.4
  • 28
    • 0036505670 scopus 로고    scopus 로고
    • A comparison of methods for multi-class support vector machines
    • Hsu C.-W., and Lin C.-J. A comparison of methods for multi-class support vector machines. IEEE Trans. Neural Netw. 13 (2002) 415-425
    • (2002) IEEE Trans. Neural Netw. , vol.13 , pp. 415-425
    • Hsu, C.-W.1    Lin, C.-J.2
  • 29
    • 0035018354 scopus 로고    scopus 로고
    • Environmental features are important in determining protein secondary structure
    • Johnson Jr. W.C., and Macdonald J.R. Environmental features are important in determining protein secondary structure. Protein Sci. 10 (2001) 1172-1177
    • (2001) Protein Sci. , vol.10 , pp. 1172-1177
    • Johnson Jr., W.C.1    Macdonald, J.R.2
  • 31
    • 0022777472 scopus 로고
    • Prediction of protein structural class from the amino-acid sequence
    • Klein P., and Delisi C. Prediction of protein structural class from the amino-acid sequence. Biopolymers 25 (1986) 1659-1672
    • (1986) Biopolymers , vol.25 , pp. 1659-1672
    • Klein, P.1    Delisi, C.2
  • 32
    • 0002229304 scopus 로고    scopus 로고
    • Pairwise classification and support vector machines
    • Scholkopf B., Burges C.J.C., and Smola A.J. (Eds), MIT press, Cambridge, MA, USA
    • Kreßel U.H.-G. Pairwise classification and support vector machines. In: Scholkopf B., Burges C.J.C., and Smola A.J. (Eds). Advances in Kernel Methods: Support Vector Learning (1999), MIT press, Cambridge, MA, USA 255-268
    • (1999) Advances in Kernel Methods: Support Vector Learning , pp. 255-268
    • Kreßel, U.H.-G.1
  • 33
    • 33748415440 scopus 로고    scopus 로고
    • Prediction of structural classes for protein sequences and domains-impact of prediction algorithms, sequence representation and homology, and test procedures on accuracy
    • Kurgan L.A., and Homaeian L. Prediction of structural classes for protein sequences and domains-impact of prediction algorithms, sequence representation and homology, and test procedures on accuracy. Pattern Recognit. 39 12 (2006) 2323-2343
    • (2006) Pattern Recognit. , vol.39 , Issue.12 , pp. 2323-2343
    • Kurgan, L.A.1    Homaeian, L.2
  • 34
    • 2142775432 scopus 로고    scopus 로고
    • Multicategory support vector machines: theory and application to the classification of microarray data and satellite radiance data
    • Lee Y., et al. Multicategory support vector machines: theory and application to the classification of microarray data and satellite radiance data. J. Am. Stat. Assoc. 99 (2004) 67-81
    • (2004) J. Am. Stat. Assoc. , vol.99 , pp. 67-81
    • Lee, Y.1
  • 35
    • 0017309766 scopus 로고
    • Structural patterns in globular proteins
    • Levitt M., and Chothia C. Structural patterns in globular proteins. Nature 261 (1976) 552-557
    • (1976) Nature , vol.261 , pp. 552-557
    • Levitt, M.1    Chothia, C.2
  • 36
    • 33745634395 scopus 로고    scopus 로고
    • Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences
    • Li W., and Godzik A. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 22 (2006) 1658-1659
    • (2006) Bioinformatics , vol.22 , pp. 1658-1659
    • Li, W.1    Godzik, A.2
  • 37
    • 7244248755 scopus 로고    scopus 로고
    • A comparative study of feature selection and multiclass classification methods for tissue classification based on gene expression
    • Li T., Zhang C., and Ogihara M. A comparative study of feature selection and multiclass classification methods for tissue classification based on gene expression. Bioinformatics 20 (2004) 2429-2437
    • (2004) Bioinformatics , vol.20 , pp. 2429-2437
    • Li, T.1    Zhang, C.2    Ogihara, M.3
  • 38
    • 0018543443 scopus 로고
    • Antiparallel and parallel beta-strands differ in amino acid residue preferences
    • Lifson S., and Sander C. Antiparallel and parallel beta-strands differ in amino acid residue preferences. Nature 282 (1979) 109-111
    • (1979) Nature , vol.282 , pp. 109-111
    • Lifson, S.1    Sander, C.2
  • 39
    • 0036051172 scopus 로고    scopus 로고
    • Prediction of protein structural class by amino acid and polypeptide composition
    • Luo R.-Y., Feng Z.-P., and Liu J.-K. Prediction of protein structural class by amino acid and polypeptide composition. Eur. J. Biochem. 269 (2002) 4219-4225
    • (2002) Eur. J. Biochem. , vol.269 , pp. 4219-4225
    • Luo, R.-Y.1    Feng, Z.-P.2    Liu, J.-K.3
  • 40
    • 0027217392 scopus 로고
    • Crossvalidation of protein structural class prediction using statistical clustering and neural networks
    • Metfessel B.A., Saurugger P.N., Connelly D.P., and Rich S. Crossvalidation of protein structural class prediction using statistical clustering and neural networks. Protein Sci. 2 (1993) 1171-1182
    • (1993) Protein Sci. , vol.2 , pp. 1171-1182
    • Metfessel, B.A.1    Saurugger, P.N.2    Connelly, D.P.3    Rich, S.4
  • 41
    • 25144494760 scopus 로고    scopus 로고
    • Prediction error estimation: a comparison of resampling methods
    • Molinaro A.M., Simon R., and Pfeiffer R.M. Prediction error estimation: a comparison of resampling methods. Bioinformatics 21 15 (2005) 3301-3307
    • (2005) Bioinformatics , vol.21 , Issue.15 , pp. 3301-3307
    • Molinaro, A.M.1    Simon, R.2    Pfeiffer, R.M.3
  • 42
    • 0028961335 scopus 로고
    • SCOP: a structural classification of proteins database for the investigation of sequences and structures
    • Murzin A.G., Brenner S.E., Hubbard T., and Chothia C. SCOP: a structural classification of proteins database for the investigation of sequences and structures. J. Mol. Biol. 247 (1995) 536-540
    • (1995) J. Mol. Biol. , vol.247 , pp. 536-540
    • Murzin, A.G.1    Brenner, S.E.2    Hubbard, T.3    Chothia, C.4
  • 43
    • 0022631926 scopus 로고
    • The folding type of a protein is relevant to the amino acid composition
    • Nakashima H., Nishikawa K., and Ooi T. The folding type of a protein is relevant to the amino acid composition. J. Biochem. 99 (1986) 153-162
    • (1986) J. Biochem. , vol.99 , pp. 153-162
    • Nakashima, H.1    Nishikawa, K.2    Ooi, T.3
  • 44
    • 45649085726 scopus 로고    scopus 로고
    • Øhrn, A., 1999. Discernibility and rough sets in medicine: tools and applications. Ph.D. volume, Department of Computer and Information Science, Norwegian University of Science and Technology, Norway.
    • Øhrn, A., 1999. Discernibility and rough sets in medicine: tools and applications. Ph.D. volume, Department of Computer and Information Science, Norwegian University of Science and Technology, Norway.
  • 45
    • 45649084970 scopus 로고    scopus 로고
    • Pawlak, Z., 1991. Rough Sets: theoretical aspects of reasoning about data. In: Theory and Decision Library Series D, System Theory, Knowledge Engineering and Problem Solving, Kluwer Academic Publishers.
    • Pawlak, Z., 1991. Rough Sets: theoretical aspects of reasoning about data. In: Theory and Decision Library Series D, System Theory, Knowledge Engineering and Problem Solving, Kluwer Academic Publishers.
  • 46
    • 0003243224 scopus 로고    scopus 로고
    • Probabilistic outputs for support vector machines and comparison to regularized likelihood methods
    • Smola A.J., Bartlett P.L., Scholkopf B., and Schuurmans D. (Eds), MIT Press, Cambridge
    • Platt J. Probabilistic outputs for support vector machines and comparison to regularized likelihood methods. In: Smola A.J., Bartlett P.L., Scholkopf B., and Schuurmans D. (Eds). Advances in Large Margin Classifiers (2000), MIT Press, Cambridge 61-74
    • (2000) Advances in Large Margin Classifiers , pp. 61-74
    • Platt, J.1
  • 47
    • 38649097852 scopus 로고    scopus 로고
    • A machine learning approach for the identification of odorant binding proteins from sequence-derived properties
    • Pugalenthi G., Tang Ke., Suganthan P.N., Archunan G., and Sowdhamini R. A machine learning approach for the identification of odorant binding proteins from sequence-derived properties. BMC Bioinform. 8 (2007) 351
    • (2007) BMC Bioinform. , vol.8 , pp. 351
    • Pugalenthi, G.1    Tang, Ke.2    Suganthan, P.N.3    Archunan, G.4    Sowdhamini, R.5
  • 49
    • 33750948399 scopus 로고    scopus 로고
    • Building multiclass classifiers for remote homology detection and fold recognition
    • Rangwala H., and Karypis G. Building multiclass classifiers for remote homology detection and fold recognition. BMC Bioinform. 7 (2006) 455
    • (2006) BMC Bioinform. , vol.7 , pp. 455
    • Rangwala, H.1    Karypis, G.2
  • 50
    • 56749117943 scopus 로고    scopus 로고
    • In defense of one-vs-all classification
    • Rifkin R., and Klautau A. In defense of one-vs-all classification. J. Mach. Learn. Res. 5 (2004) 101-141
    • (2004) J. Mach. Learn. Res. , vol.5 , pp. 101-141
    • Rifkin, R.1    Klautau, A.2
  • 53
    • 0034141493 scopus 로고    scopus 로고
    • How good is prediction of protein structural class by the component-coupled method?
    • Wang Z.-X., and Yuan Z. How good is prediction of protein structural class by the component-coupled method?. Proteins: Struct. Funct. Genet. 38 (2000) 165-175
    • (2000) Proteins: Struct. Funct. Genet. , vol.38 , pp. 165-175
    • Wang, Z.-X.1    Yuan, Z.2
  • 54
    • 45649083489 scopus 로고    scopus 로고
    • Weston, J., Watkins, C., 1999. Support vector machines for multiclass pattern recognition. In: Proceedings of the Seventh European Symposium on Artificial Neural Networks.
    • Weston, J., Watkins, C., 1999. Support vector machines for multiclass pattern recognition. In: Proceedings of the Seventh European Symposium on Artificial Neural Networks.
  • 55
    • 0242456763 scopus 로고    scopus 로고
    • Zadrozny B., Elkan C., 2002. Transforming classifier scores into accurate multiclass probability estimates. In: Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 694-699.
    • Zadrozny B., Elkan C., 2002. Transforming classifier scores into accurate multiclass probability estimates. In: Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 694-699.
  • 56
    • 0027080161 scopus 로고
    • An optimization approach to predicting protein structural class from amino-acid composition
    • Zhang C.T., and Chou K.C. An optimization approach to predicting protein structural class from amino-acid composition. Protein Sci. 1 (1992) 401-408
    • (1992) Protein Sci. , vol.1 , pp. 401-408
    • Zhang, C.T.1    Chou, K.C.2
  • 57
    • 54749084166 scopus 로고    scopus 로고
    • An intriguing controversy over protein structural class prediction
    • Zhou G.P. An intriguing controversy over protein structural class prediction. J. Protein Chem. 17 8 (1998) 729-738
    • (1998) J. Protein Chem. , vol.17 , Issue.8 , pp. 729-738
    • Zhou, G.P.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.