메뉴 건너뛰기




Volumn 70, Issue 4, 2008, Pages 1125-1132

Protein classification with imbalanced data

Author keywords

Ensemble classifier; Feature extraction; Hybrid sampling; Multi class classification; Rebalancing technique

Indexed keywords

PROTEIN;

EID: 39749147033     PISSN: 08873585     EISSN: 10970134     Source Type: Journal    
DOI: 10.1002/prot.21870     Document Type: Article
Times cited : (125)

References (35)
  • 2
    • 0028043552 scopus 로고
    • Position-based sequence weights
    • Henikoff S, Henikoff J. Position-based sequence weights. J Mol Biol 1994;243:574-578.
    • (1994) J Mol Biol , vol.243 , pp. 574-578
    • Henikoff, S.1    Henikoff, J.2
  • 3
    • 0028181441 scopus 로고
    • Hidden markov models in computational biology: Applications to protein modeling
    • Krogh A, Brown M, Mian I, Sjolander K, Haussler D. Hidden markov models in computational biology: applications to protein modeling. J Mol Biol 1994;235:1501-1531.
    • (1994) J Mol Biol , vol.235 , pp. 1501-1531
    • Krogh, A.1    Brown, M.2    Mian, I.3    Sjolander, K.4    Haussler, D.5
  • 4
    • 1542714925 scopus 로고    scopus 로고
    • Mismatch string kernels for discriminative protein classification
    • Leslie C, Eskin E, Cohen A, Weston J, Noble W. Mismatch string kernels for discriminative protein classification. Bioinformatics 2004;20:467-476.
    • (2004) Bioinformatics , vol.20 , pp. 467-476
    • Leslie, C.1    Eskin, E.2    Cohen, A.3    Weston, J.4    Noble, W.5
  • 5
    • 0742287001 scopus 로고    scopus 로고
    • Combining pairwise sequence similarity and support vector machines for detecting remote protein evolutionary and structural relationships
    • Liao L, Noble W. Combining pairwise sequence similarity and support vector machines for detecting remote protein evolutionary and structural relationships. J Comput Biol 2003;10:857-868.
    • (2003) J Comput Biol , vol.10 , pp. 857-868
    • Liao, L.1    Noble, W.2
  • 6
    • 26944462332 scopus 로고    scopus 로고
    • A novel approach to extracting features from motif content and protein composition for protein sequence classification
    • Zhao X, Cheung Y, Huang D. A novel approach to extracting features from motif content and protein composition for protein sequence classification. Neural Networks 2005;18:1019-1028.
    • (2005) Neural Networks , vol.18 , pp. 1019-1028
    • Zhao, X.1    Cheung, Y.2    Huang, D.3
  • 7
    • 33748437922 scopus 로고    scopus 로고
    • Classifying protein sequences using hydropathy blocks
    • Huang D, Zhao X, Huang G, Cheung Y. Classifying protein sequences using hydropathy blocks. Pattern Recog 2006;39:2293-2300.
    • (2006) Pattern Recog , vol.39 , pp. 2293-2300
    • Huang, D.1    Zhao, X.2    Huang, G.3    Cheung, Y.4
  • 8
    • 0035014847 scopus 로고    scopus 로고
    • Multi-class protein fold recognition using support vector machines and neural networks
    • Ding CH, Dubchak I. Multi-class protein fold recognition using support vector machines and neural networks. Bioinformatics 2001;17:349-358.
    • (2001) Bioinformatics , vol.17 , pp. 349-358
    • Ding, C.H.1    Dubchak, I.2
  • 9
    • 27144549260 scopus 로고    scopus 로고
    • Editorial: Special issue on learning from imbalanced data sets
    • Chawla NV, Japkowicz N, Kotcz A. Editorial: special issue on learning from imbalanced data sets. SIGKDD Explor Newslett 2004;6:1-6.
    • (2004) SIGKDD Explor Newslett , vol.6 , pp. 1-6
    • Chawla, N.V.1    Japkowicz, N.2    Kotcz, A.3
  • 11
    • 0037753593 scopus 로고    scopus 로고
    • PhD thesis. Delft University of Technology, Delft, The Netherlands;
    • Tax D. One-class classification. PhD thesis. Delft University of Technology, Delft, The Netherlands; 2001.
    • (2001) One-class classification
    • Tax, D.1
  • 12
    • 16644402628 scopus 로고    scopus 로고
    • Feature selection for text categorization on imbalanced data
    • Zheng Z, Wu X, Srihari R. Feature selection for text categorization on imbalanced data. ACM SIGKDD Explor Newslett 2004;6:80-89.
    • (2004) ACM SIGKDD Explor Newslett , vol.6 , pp. 80-89
    • Zheng, Z.1    Wu, X.2    Srihari, R.3
  • 15
    • 27144479454 scopus 로고    scopus 로고
    • Learning from imbalanced data sets with boosting and data generation: The databoost-im approach
    • Guo H, Viktor HL. Learning from imbalanced data sets with boosting and data generation: the databoost-im approach. SIGKDD Explor 2004;6:30-39.
    • (2004) SIGKDD Explor , vol.6 , pp. 30-39
    • Guo, H.1    Viktor, H.L.2
  • 17
    • 0343081513 scopus 로고    scopus 로고
    • Reduction techniques for instance-based learning algorithms
    • Wilson DR, Martinez TR. Reduction techniques for instance-based learning algorithms. Mach Learn 2000;38:257-286.
    • (2000) Mach Learn , vol.38 , pp. 257-286
    • Wilson, D.R.1    Martinez, T.R.2
  • 21
    • 0031361611 scopus 로고    scopus 로고
    • Machine learning research: Four current directions
    • Dietterich TG. Machine learning research: four current directions. AI Magazine 1998;18:97-136.
    • (1998) AI Magazine , vol.18 , pp. 97-136
    • Dietterich, T.G.1
  • 22
    • 33747880465 scopus 로고    scopus 로고
    • Ensemble classifier for protein fold pattern recognition
    • Shen H, Chou K. Ensemble classifier for protein fold pattern recognition. Bioinformatics 2006;22:1717-1722.
    • (2006) Bioinformatics , vol.22 , pp. 1717-1722
    • Shen, H.1    Chou, K.2
  • 23
    • 14944354760 scopus 로고    scopus 로고
    • Multi-class protein fold classification using a new ensemble machine learning approach
    • Tan A, Gilbert D, Deville Y. Multi-class protein fold classification using a new ensemble machine learning approach. Genome Informatics 2003;14:206-217.
    • (2003) Genome Informatics , vol.14 , pp. 206-217
    • Tan, A.1    Gilbert, D.2    Deville, Y.3
  • 25
    • 4444273377 scopus 로고    scopus 로고
    • Protein homology detection using string alignment kernels
    • Saigo H, Vert JP, Ueda N, Akutsu T. Protein homology detection using string alignment kernels. Bioinformatics 2004;20:1682-1689.
    • (2004) Bioinformatics , vol.20 , pp. 1682-1689
    • Saigo, H.1    Vert, J.P.2    Ueda, N.3    Akutsu, T.4
  • 26
    • 0019887799 scopus 로고
    • Identification of common molecular subsequences
    • Smith TF, Waterman MS. Identification of common molecular subsequences. J Mol Biol 1981;147:195-197.
    • (1981) J Mol Biol , vol.147 , pp. 195-197
    • Smith, T.F.1    Waterman, M.S.2
  • 27
    • 0014757386 scopus 로고
    • A general method applicable to the search for similarities in the amino acid sequence of two proteins
    • Needleman SB, Wunsch CD. A general method applicable to the search for similarities in the amino acid sequence of two proteins. J Mol Biol 1970;48:443-453.
    • (1970) J Mol Biol , vol.48 , pp. 443-453
    • Needleman, S.B.1    Wunsch, C.D.2
  • 32
    • 39749101564 scopus 로고    scopus 로고
    • Quinlan JR. C4.5: Programs for machine learning. San Francisco, CA: Morgan Kaufmann Publishers; 1993.
    • Quinlan JR. C4.5: Programs for machine learning. San Francisco, CA: Morgan Kaufmann Publishers; 1993.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.