메뉴 건너뛰기




Volumn 46, Issue 5, 2014, Pages 1343-1351

Selection of relevant features from amino acids enables development of robust classifiers

Author keywords

Feature extraction; Machine learning; Mitochondrial protein; Protein classifier design and evaluation; Protein sequence analysis

Indexed keywords

PROTEIN;

EID: 84898857296     PISSN: 09394451     EISSN: 14382199     Source Type: Journal    
DOI: 10.1007/s00726-014-1697-z     Document Type: Article
Times cited : (10)

References (42)
  • 1
    • 84865856641 scopus 로고    scopus 로고
    • Sequence and chromatin determinants of cell-type - Specific transcription factor binding
    • 3431489 22955984 10.1101/gr.127712.111
    • Arvey A, Agius P, Noble WS, Leslie C (2012) Sequence and chromatin determinants of cell-type - specific transcription factor binding. Genome Res 22(9):1723-1734
    • (2012) Genome Res , vol.22 , Issue.9 , pp. 1723-1734
    • Arvey, A.1    Agius, P.2    Noble, W.S.3    Leslie, C.4
  • 4
    • 0242361265 scopus 로고    scopus 로고
    • Properties and prediction of mitochondrial transit peptides from Plasmodium falciparum
    • 14599665 10.1016/j.molbiopara.2003.07.001
    • Bender A, van Dooren GG, Ralph SA, McFadden GI, Schneider G (2003) Properties and prediction of mitochondrial transit peptides from Plasmodium falciparum. Mol Biochem Parasitol 132(2):59-66
    • (2003) Mol Biochem Parasitol , vol.132 , Issue.2 , pp. 59-66
    • Bender, A.1    Van Dooren, G.G.2    Ralph, S.A.3    McFadden, G.I.4    Schneider, G.5
  • 7
    • 84875576158 scopus 로고    scopus 로고
    • Propy: A tool to generate various modes of Chou's PseAAC
    • 23426256 10.1093/bioinformatics/btt072
    • Cao DS, Xu QS, Liang YZ (2013) Propy: a tool to generate various modes of Chou's PseAAC. Bioinformatics 29(7):960-962
    • (2013) Bioinformatics , vol.29 , Issue.7 , pp. 960-962
    • Cao, D.S.1    Xu, Q.S.2    Liang, Y.Z.3
  • 8
    • 34047138318 scopus 로고    scopus 로고
    • Combining SVMs with various feature selection strategies
    • I. Guyon M. Nikravesh S. Gunn L. Zadeh (eds) Studies in fuzziness and soft computing 207 Springer Berlin 10.1007/978-3-540-35488-8-13
    • Chen YW, Lin CJ (2006) Combining SVMs with various feature selection strategies. In: Guyon I, Nikravesh M, Gunn S, Zadeh L (eds) Feature extraction, vol 207., Studies in fuzziness and soft computingSpringer, Berlin, pp 315-324
    • (2006) Feature Extraction , pp. 315-324
    • Chen, Y.W.1    Lin, C.J.2
  • 9
    • 84862701316 scopus 로고    scopus 로고
    • Using increment of diversity to predict mitochondrial proteins of malaria parasite: Integrating pseudo-Amino acid composition and structural alphabet
    • 21191803 10.1007/s00726-010-0825-7
    • Chen YL, Li QZ, Zhang LQ (2012) Using increment of diversity to predict mitochondrial proteins of malaria parasite: integrating pseudo-Amino acid composition and structural alphabet. Amino Acids 42(4):1309-1316
    • (2012) Amino Acids , vol.42 , Issue.4 , pp. 1309-1316
    • Chen, Y.L.1    Li, Q.Z.2    Zhang, L.Q.3
  • 10
    • 79955702502 scopus 로고    scopus 로고
    • LIBSVM: A library for support vector machines
    • 10.1145/1961189.1961199 10.1145/1961189.1961199
    • Chih-Chung C, Chih-Jen L (2011) LIBSVM: a library for support vector machines. ACM Trans Intell Syst Technol 2(3):1-27. doi: 10.1145/1961189.1961199
    • (2011) ACM Trans Intell Syst Technol , vol.2 , Issue.3 , pp. 1-27
    • Chih-Chung, C.1    Chih-Jen, L.2
  • 11
    • 12744279642 scopus 로고    scopus 로고
    • Using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes
    • 15308540 10.1093/bioinformatics/bth466
    • Chou KC (2005) Using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes. Bioinformatics 21(1):10-19
    • (2005) Bioinformatics , vol.21 , Issue.1 , pp. 10-19
    • Chou, K.C.1
  • 12
    • 18344391868 scopus 로고    scopus 로고
    • Prediction of membrane protein types by incorporating amphipathic effects
    • 10.1021/ci049686v10.1021/ci049686v 15807506 10.1021/ci049686v
    • Chou KC, Cai YD (2005) Prediction of membrane protein types by incorporating amphipathic effects. J Chem Inf Model 45(2):407-413. doi: 10.1021/ci049686v10.1021/ci049686v
    • (2005) J Chem Inf Model , vol.45 , Issue.2 , pp. 407-413
    • Chou, K.C.1    Cai, Y.D.2
  • 13
    • 57049095821 scopus 로고    scopus 로고
    • Function and structure of inherently disordered proteins
    • 18952168 10.1016/j.sbi.2008.10.002
    • Dunker AK, Silman I, Uversky VN, Sussman JL (2008) Function and structure of inherently disordered proteins. Curr Opin Struct Biol 18(6):756-764
    • (2008) Curr Opin Struct Biol , vol.18 , Issue.6 , pp. 756-764
    • Dunker, A.K.1    Silman, I.2    Uversky, V.N.3    Sussman, J.L.4
  • 14
    • 0034697980 scopus 로고    scopus 로고
    • Predicting subcellular localization of proteins based on their N-terminal amino acid sequence
    • 10891285 10.1006/jmbi.2000.3903
    • Emanuelsson O, Nielsen H, S Brunak, von Heijne G (2000) Predicting subcellular localization of proteins based on their N-terminal amino acid sequence. J Mol Biol 300(4):1005-1016
    • (2000) J Mol Biol , vol.300 , Issue.4 , pp. 1005-1016
    • Emanuelsson, O.1    Nielsen, H.2    Brunak, S.3    Von Heijne, G.4
  • 15
    • 0035229330 scopus 로고    scopus 로고
    • Analysis and prediction of mitochondrial targeting peptides
    • 11381593 10.1016/S0091-679X(01)65011-8
    • Emanuelsson O, von Heijne G, Schneider G (2001) Analysis and prediction of mitochondrial targeting peptides. Methods Cell Biol 65:175-187
    • (2001) Methods Cell Biol , vol.65 , pp. 175-187
    • Emanuelsson, O.1    Von Heijne, G.2    Schneider, G.3
  • 17
    • 3242875263 scopus 로고    scopus 로고
    • MITOPRED: A genome-scale method for prediction of nucleus-encoded mitochondrial proteins
    • 15037509 10.1093/bioinformatics/bth171
    • Guda C, Fahy E, Subramaniam S (2004) MITOPRED: a genome-scale method for prediction of nucleus-encoded mitochondrial proteins. Bioinformatics 20(11):1785-1794
    • (2004) Bioinformatics , vol.20 , Issue.11 , pp. 1785-1794
    • Guda, C.1    Fahy, E.2    Subramaniam, S.3
  • 19
    • 3543028004 scopus 로고    scopus 로고
    • Mitochondrial leader sequences: Structural similarities and sequence differences
    • 9723185 10.1002/(SICI)1097-010X(199809/10)282:1/2<280: AID-JEZ30>3.0.CO;2-V
    • Hammen PK, Weiner H (1998) Mitochondrial leader sequences: structural similarities and sequence differences. J Exp Zool 282(1-2):280-283
    • (1998) J Exp Zool , vol.282 , Issue.1-2 , pp. 280-283
    • Hammen, P.K.1    Weiner, H.2
  • 20
    • 1242274331 scopus 로고    scopus 로고
    • Prediction of RNA-binding proteins from primary sequence by a support vector machine approach
    • 1370931 14970381 10.1261/rna.5890304
    • Han LY, Cai CZ, Lo SL, Chung MCM, Chen YZ (2004) Prediction of RNA-binding proteins from primary sequence by a support vector machine approach. RNA 10(3):355-368
    • (2004) RNA , vol.10 , Issue.3 , pp. 355-368
    • Han, L.Y.1    Cai, C.Z.2    Lo, S.L.3    Chung, M.C.M.4    Chen, Y.Z.5
  • 22
    • 79952448515 scopus 로고    scopus 로고
    • Prediction of mitochondrial proteins of malaria parasite using bi-profile Bayes feature extraction
    • 21281691 10.1016/j.biochi.2011.01.013
    • Jia C, Liu T, Chang AK, Zhai Y (2011) Prediction of mitochondrial proteins of malaria parasite using bi-profile Bayes feature extraction. Biochimie 93(4):778-782
    • (2011) Biochimie , vol.93 , Issue.4 , pp. 778-782
    • Jia, C.1    Liu, T.2    Chang, A.K.3    Zhai, Y.4
  • 23
    • 33646852452 scopus 로고    scopus 로고
    • Prediction of mitochondrial proteins using support vector machine and hidden Markov model
    • 16339140 10.1074/jbc.M511061200
    • Kumar M, Verma R, Raghava GPS (2006) Prediction of mitochondrial proteins using support vector machine and hidden Markov model. J Biol Chem 281(9):5357-5363
    • (2006) J Biol Chem , vol.281 , Issue.9 , pp. 5357-5363
    • Kumar, M.1    Verma, R.2    Raghava, G.P.S.3
  • 24
    • 33745634395 scopus 로고    scopus 로고
    • Cd-hit: A fast program for clustering and comparing large sets of protein or nucleotide sequences
    • 16731699 10.1093/bioinformatics/btl158
    • Li W, Godzik A (2006) Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 22(13):1658-1659
    • (2006) Bioinformatics , vol.22 , Issue.13 , pp. 1658-1659
    • Li, W.1    Godzik, A.2
  • 25
    • 33747816816 scopus 로고    scopus 로고
    • PROFEAT: A web server for computing structural and physicochemical features of proteins and peptides from amino acid sequence
    • 1538821 16845018 10.1093/nar/gkl305
    • Li ZR, Lin HH, Han LY, Jiang L, Chen X, Chen YZ (2006) PROFEAT: a web server for computing structural and physicochemical features of proteins and peptides from amino acid sequence. Nucleic Acids Res 34(suppl 2):W32-W37
    • (2006) Nucleic Acids Res , vol.34 , Issue.SUPPL. 2
    • Li, Z.R.1    Lin, H.H.2    Han, L.Y.3    Jiang, L.4    Chen, X.5    Chen, Y.Z.6
  • 26
    • 50949094134 scopus 로고    scopus 로고
    • Prediction of protein structure class by coupling improved genetic algorithm and support vector machine
    • 18427714 10.1007/s00726-008-0084-z
    • Li ZC, Zhou XB, Lin YR, Zou XY (2008) Prediction of protein structure class by coupling improved genetic algorithm and support vector machine. Amino Acids 35(3):581-590
    • (2008) Amino Acids , vol.35 , Issue.3 , pp. 581-590
    • Li, Z.C.1    Zhou, X.B.2    Lin, Y.R.3    Zou, X.Y.4
  • 27
    • 33645308238 scopus 로고    scopus 로고
    • 2020 Computing: Exceeding human limits
    • 16554781 10.1038/440409a
    • Muggleton SH (2006) 2020 Computing: exceeding human limits. Nature 440(7083):409-410
    • (2006) Nature , vol.440 , Issue.7083 , pp. 409-410
    • Muggleton, S.H.1
  • 32
    • 79951963663 scopus 로고    scopus 로고
    • Faapred: A SVM-based prediction method for fungal adhesins and adhesin-like proteins
    • 2837750 20300572 10.1371/journal.pone.0009695
    • Ramana J, Gupta D (2010) Faapred: a SVM-based prediction method for fungal adhesins and adhesin-like proteins. PLoS ONE 5(3):e9695
    • (2010) PLoS ONE , vol.5 , Issue.3 , pp. 9695
    • Ramana, J.1    Gupta, D.2
  • 33
    • 35748932917 scopus 로고    scopus 로고
    • A review of feature selection techniques in bioinformatics
    • 17720704 10.1093/bioinformatics/btm344
    • Saeys Y, Inza I, Larrańaga P (2007) A review of feature selection techniques in bioinformatics. Bioinformatics 23(19):2507-2517
    • (2007) Bioinformatics , vol.23 , Issue.19 , pp. 2507-2517
    • Saeys, Y.1    Inza, I.2    Larrańaga, P.3
  • 34
    • 36949013631 scopus 로고    scopus 로고
    • Support vector machine-based classification of protein folds using the structural properties of amino acid residues and amino acid residue pairs
    • 17989092 10.1093/bioinformatics/btm527
    • Shamim MTA, Anwaruddin M, Nagarajaram HA (2007) Support vector machine-based classification of protein folds using the structural properties of amino acid residues and amino acid residue pairs. Bioinformatics 23(24):3320-3327
    • (2007) Bioinformatics , vol.23 , Issue.24 , pp. 3320-3327
    • Shamim, M.T.A.1    Anwaruddin, M.2    Nagarajaram, H.A.3
  • 35
    • 37549004451 scopus 로고    scopus 로고
    • PseAAC: A flexible web server for generating various kinds of protein pseudo amino acid composition
    • 17976365 10.1016/j.ab.2007.10.012
    • Shen H-B, Chou K-C (2008) PseAAC: a flexible web server for generating various kinds of protein pseudo amino acid composition. Anal Biochem 373(2):386-388
    • (2008) Anal Biochem , vol.373 , Issue.2 , pp. 386-388
    • Shen, H.-B.1    Chou, K.-C.2
  • 37
    • 77949539217 scopus 로고    scopus 로고
    • Pitfalls of supervised feature selection
    • 10.1093/bioinformatics/btp621 2815655 19880370 10.1093/bioinformatics/ btp621
    • Smialowski P, Frishman D, Kramer S (2010) Pitfalls of supervised feature selection. Bioinformatics 26(3):440-443. doi: 10.1093/bioinformatics/btp621
    • (2010) Bioinformatics , vol.26 , Issue.3 , pp. 440-443
    • Smialowski, P.1    Frishman, D.2    Kramer, S.3
  • 38
    • 0034669882 scopus 로고    scopus 로고
    • Why are "natively unfolded" proteins unstructured under physiologic conditions?
    • 11025552 10.1002/1097-0134(20001115)41:3<415: AID-PROT130>3.0.CO;2- 7
    • Uversky VN, Gillespie JR, Fink AL (2000) Why are "natively unfolded" proteins unstructured under physiologic conditions? Proteins 41(3):415-427
    • (2000) Proteins , vol.41 , Issue.3 , pp. 415-427
    • Uversky, V.N.1    Gillespie, J.R.2    Fink, A.L.3
  • 40
    • 77953622066 scopus 로고    scopus 로고
    • Prediction of mitochondrial proteins of malaria parasite using split amino acid composition and PSSM profile
    • 19908123 10.1007/s00726-009-0381-1
    • Verma R, Varshney G, Raghava GPS (2010) Prediction of mitochondrial proteins of malaria parasite using split amino acid composition and PSSM profile. Amino Acids 39(1):101-110
    • (2010) Amino Acids , vol.39 , Issue.1 , pp. 101-110
    • Verma, R.1    Varshney, G.2    Raghava, G.P.S.3
  • 41
    • 80051676719 scopus 로고    scopus 로고
    • NR-2L: A two-level predictor for identifying nuclear receptor subfamilies based on sequence-derived features
    • 3156231 21858146 10.1371/journal.pone.0023505
    • Wang P, Xiao X, Chou K-C (2011) NR-2L: a two-level predictor for identifying nuclear receptor subfamilies based on sequence-derived features. PLoS ONE 6(8):e23505
    • (2011) PLoS ONE , vol.6 , Issue.8 , pp. 23505
    • Wang, P.1    Xiao, X.2    Chou, K.-C.3
  • 42
    • 1942451938 scopus 로고    scopus 로고
    • Feature selection for high-dimensional data: A fast correlation-based filter solution
    • Yu L, Liu H (2003) Feature selection for high-dimensional data: a fast correlation-based filter solution. In: Machine learning-international workshop then conference, 2003, p 856
    • (2003) Machine Learning-international Workshop Then Conference , vol.2003 , pp. 856
    • Yu, L.1    Liu, H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.