메뉴 건너뛰기




Volumn 11, Issue 4, 2014, Pages 289-299

Exploratory predicting protein folding model with random forest and hybrid features

Author keywords

Classifier; n gram feature; Protein class; Protein structure prediction; Random forests; SCOP dataset

Indexed keywords

AMINO ACID; CELL SURFACE PROTEIN; PEPTIDE; PROTEIN;

EID: 84922326160     PISSN: 15701646     EISSN: None     Source Type: Journal    
DOI: 10.2174/157016461104150121115154     Document Type: Article
Times cited : (61)

References (46)
  • 1
    • 0029157083 scopus 로고
    • Prediction of protein structural classes
    • Chou, K.-C. and Zhang, C.-T. Prediction of protein structural classes. Crit. Rev. Biochem. Mol. Biol., 1995, 30(4), 275-349.
    • (1995) Crit. Rev. Biochem. Mol. Biol , vol.30 , Issue.4 , pp. 275-349
    • Chou, K.-C.1    Zhang, C.-T.2
  • 2
    • 0001040367 scopus 로고
    • An algorithm for protein secondary structure prediction based on class prediction
    • Deleage, G. and Roux, B. An algorithm for protein secondary structure prediction based on class prediction. Protein Eng., 1987, 1(4), 289-294.
    • (1987) Protein Eng , vol.1 , Issue.4 , pp. 289-294
    • Deleage, G.1    Roux, B.2
  • 3
    • 0026636312 scopus 로고
    • Hydrophobicity and structural classes in proteins
    • Cid, H.; Bunster, M.; Canales, M. and Gazitúa, F. Hydrophobicity and structural classes in proteins. Protein Eng., 1992, 5(5), 373-375.
    • (1992) Protein Eng , vol.5 , Issue.5 , pp. 373-375
    • Cid, H.1    Bunster, M.2    Canales, M.3    Gazitúa, F.4
  • 4
    • 0020817034 scopus 로고
    • Classification of proteins into groups based on amino acid composition and other characters. II. Grouping into four types
    • Nishikawa, K.; Kubota, Y. and Tatsuo, O. Classification of proteins into groups based on amino acid composition and other characters. II. Grouping into four types. J. Biochem., 1983, 94(3), 997-1007.
    • (1983) J. Biochem , vol.94 , Issue.3 , pp. 997-1007
    • Nishikawa, K.1    Kubota, Y.2    Tatsuo, O.3
  • 5
    • 0020820579 scopus 로고
    • Classification of proteins into groups based on amino acid composition and other characters. I. Angular distribution
    • Nishikawa, K.; Kubota Y. and Tatsuo, O. Classification of proteins into groups based on amino acid composition and other characters. I. Angular distribution. J. Biochem., 1983, 94(3), 981-995.
    • (1983) J. Biochem , vol.94 , Issue.3 , pp. 981-995
    • Nishikawa, K.1    Kubota, Y.2    Tatsuo, O.3
  • 6
    • 0017309766 scopus 로고
    • Structural patterns in globular proteins
    • Levitt, M. and Chothia, C. Structural patterns in globular proteins. Nature, 1976, 261(5561), 552-558.
    • (1976) Nature , vol.261 , Issue.5561 , pp. 552-558
    • Levitt, M.1    Chothia, C.2
  • 7
    • 0028953472 scopus 로고
    • Structural similarity between two-layer -/- and β-proteins
    • Efimov, A. V. Structural similarity between two-layer -/- and β-proteins. J. Mol. Biol., 1995, 245(4), 402-415.
    • (1995) J. Mol. Biol , vol.245 , Issue.4 , pp. 402-415
    • Efimov, A.V.1
  • 8
    • 0036007085 scopus 로고    scopus 로고
    • Prediction of protein structural classes by support vector machines
    • Cai, Y.-D.; Liu, X.-J.; Xu, X.-B. and Chou, K. C. Prediction of protein structural classes by support vector machines. Comput. Chem., 2002, 26(3), 293-296.
    • (2002) Comput. Chem , vol.26 , Issue.3 , pp. 293-296
    • Cai, Y.-D.1    Liu, X.-J.2    Xu, X.-B.3    Chou, K.C.4
  • 9
    • 37549004451 scopus 로고    scopus 로고
    • PseAAC: A flexible web server for generating various kinds of protein pseudo amino acid composition
    • Shen, H.-B. and Chou, K.-C. PseAAC: a flexible web server for generating various kinds of protein pseudo amino acid composition. Anal. Biochem., 2008, 373(2), 386-388.
    • (2008) Anal. Biochem , vol.373 , Issue.2 , pp. 386-388
    • Shen, H.-B.1    Chou, K.-C.2
  • 10
    • 84892972298 scopus 로고    scopus 로고
    • Protein remote homology detection by combining chou's pseudo amino acid composition and profile-based protein representation
    • Liu, B.; Wang, X.; Zou, Q.; Dong, Q. and Chen, Q. Protein remote homology detection by combining chou's pseudo amino acid composition and profile-based protein representation. Mol. Inform., 2013, 32(9), 775-782.
    • (2013) Mol. Inform , vol.32 , Issue.9 , pp. 775-782
    • Liu, B.1    Wang, X.2    Zou, Q.3    Dong, Q.4    Chen, Q.5
  • 11
    • 80053220447 scopus 로고    scopus 로고
    • Classification and analysis of regulatory pathways using graph property, biochemical and physicochemical property, and functional property
    • Huang, T.; Chen, L.; Cai, Y. D. and Chou, K. C. Classification and analysis of regulatory pathways using graph property, biochemical and physicochemical property, and functional property. PloS One, 2011, 6(9), e25297.
    • (2011) PloS One , vol.6 , Issue.9
    • Huang, T.1    Chen, L.2    Cai, Y.D.3    Chou, K.C.4
  • 12
    • 84874240374 scopus 로고    scopus 로고
    • Hierarchical classification of protein folds using a novel ensemble classifier
    • Lin, C.; Zou, Y.; Qin, J.; Liu, X.; Jiang, Y.; Ke, C. and Zou, Q. Hierarchical classification of protein folds using a novel ensemble classifier. PloS One, 2013, 8(2), e56944.
    • (2013) PloS One , vol.8 , Issue.2
    • Lin, C.1    Zou, Y.2    Qin, J.3    Liu, X.4    Jiang, Y.5    Ke, C.6    Zou, Q.7
  • 13
    • 84884255470 scopus 로고    scopus 로고
    • An Approach for Identifying Cytokines Based on a Novel Ensemble Classifier
    • Zou, Q.; Wang, Z.; Guan, X.; Liu, B.; Wu, Y. and Lin, Z. An Approach for Identifying Cytokines Based on a Novel Ensemble Classifier. BioMed Res. Int., 2013, 2013, 686090.
    • (2013) BioMed Res. Int , vol.2013
    • Zou, Q.1    Wang, Z.2    Guan, X.3    Liu, B.4    Wu, Y.5    Lin, Z.6
  • 14
    • 0033578684 scopus 로고    scopus 로고
    • Protein secondary structure prediction based on position-specific scoring matrices
    • Jones, D. T. Protein secondary structure prediction based on position-specific scoring matrices. J. Mol. Biol., 1999, 292(2), 195-202.
    • (1999) J. Mol. Biol , vol.292 , Issue.2 , pp. 195-202
    • Jones, D.T.1
  • 15
    • 70349985248 scopus 로고    scopus 로고
    • A new taxonomy-based protein fold recognition approach based on autocross-covariance transformation
    • Dong, Q.; Zhou S. and Guan, J. A new taxonomy-based protein fold recognition approach based on autocross-covariance transformation. Bioinformatics, 2009, 25(20), 2655-2662.
    • (2009) Bioinformatics , vol.25 , Issue.20 , pp. 2655-2662
    • Dong, Q.1    Zhou, S.2    Guan, J.3
  • 16
    • 84874229261 scopus 로고    scopus 로고
    • Identifying Multi-Functional Enzyme by Hierarchical Multi-Label Classifier
    • Quan, Z.; Weicheng, C.; Yong, H.; Xiangrong, L. and Yi, J. Identifying Multi-Functional Enzyme by Hierarchical Multi-Label Classifier. J. Comput. Theoret. Nanosci., 2013, 10(4), 1038-1043.
    • (2013) J. Comput. Theoret. Nanosci , vol.10 , Issue.4 , pp. 1038-1043
    • Quan, Z.1    Weicheng, C.2    Yong, H.3    Xiangrong, L.4    Yi, J.5
  • 17
    • 84881519843 scopus 로고    scopus 로고
    • BinMemPredict: A web server and software for predicting membrane protein types
    • Zou, Q.; Li, X.; Jiang, Y.; Zhao, Y. and Wang, G. BinMemPredict: a web server and software for predicting membrane protein types. Curr. Proteom., 2013, 10(1), 2-9.
    • (2013) Curr. Proteom , vol.10 , Issue.1 , pp. 2-9
    • Zou, Q.1    Li, X.2    Jiang, Y.3    Zhao, Y.4    Wang, G.5
  • 18
    • 71549172792 scopus 로고    scopus 로고
    • Prediction of protein binding sites in protein structures using hidden Markov support vector machine
    • Liu, B.; Wang, X.; Lin, L.; Tang, B.; Dong, Q. and Wang X. Prediction of protein binding sites in protein structures using hidden Markov support vector machine. BMC Bioinform., 2009, 10(1), 381.
    • (2009) BMC Bioinform , vol.10 , Issue.1
    • Liu, B.1    Wang, X.2    Lin, L.3    Tang, B.4    Dong, Q.5    Wang, X.6
  • 19
    • 68749110968 scopus 로고    scopus 로고
    • Exploiting three kinds of interface propensities to identify protein binding sites
    • Liu, B.; Wang, X.; Lin, L.; Dong, Q. and Wang, X. Exploiting three kinds of interface propensities to identify protein binding sites. Computat. Biol. Chem., 2009, 33, 303-311.
    • (2009) Computat. Biol. Chem , vol.33 , pp. 303-311
    • Liu, B.1    Wang, X.2    Lin, L.3    Dong, Q.4    Wang, X.5
  • 20
    • 84867004398 scopus 로고    scopus 로고
    • Using Amino Acid Physicochemical Distance Transformation for Fast Protein Remote Homology Detection
    • Liu, B.; Wang, X.; Chen, Q. Dong, Q. and Lan, X. Using Amino Acid Physicochemical Distance Transformation for Fast Protein Remote Homology Detection. PloS one, 2012, 7(9), e46633.
    • (2012) PloS one , vol.7 , Issue.9
    • Liu, B.1    Wang, X.2    Chen, Q.3    Dong, Q.4    Lan, X.5
  • 21
    • 84885838906 scopus 로고    scopus 로고
    • LibD3C: Ensemble classifiers with a clustering and dynamic selection strategy
    • Lin, C.; Chena, W.; Qiua, C.; Wua, Y.; Krishnanc, S. and Zou, Q. LibD3C: Ensemble classifiers with a clustering and dynamic selection strategy. Neurocomputing, 2014, 123, 424-435.
    • (2014) Neurocomputing , vol.123 , pp. 424-435
    • Lin, C.1    Chena, W.2    Qiua, C.3    Wua, Y.4    Krishnanc, S.5    Zou, Q.6
  • 22
    • 0028961335 scopus 로고
    • SCOP: A structural classification of proteins database for the investigation of sequences and structures
    • Murzin, A. G.; Brenner, S. E.; Hubbard, T.; and Chothia, C. SCOP: a structural classification of proteins database for the investigation of sequences and structures. J. Mol. Biol., 1995, 247(4), 536-540.
    • (1995) J. Mol. Biol , vol.247 , Issue.4 , pp. 536-540
    • Murzin, A.G.1    Brenner, S.E.2    Hubbard, T.3    Chothia, C.4
  • 24
    • 0022706389 scopus 로고
    • The relation between the divergence of sequence and structure in proteins
    • Chothia, C. and Lesk, A. M. The relation between the divergence of sequence and structure in proteins. EMBO J., 1986, 5(4), 823.
    • (1986) EMBO J , vol.5 , Issue.4
    • Chothia, C.1    Lesk, A.M.2
  • 25
    • 0000187138 scopus 로고
    • The response of protein structures to amino-acid sequence changes. Philosophical Transactions of the Royal Society of London. Series A
    • Lesk, A. and Chothia, C. The response of protein structures to amino-acid sequence changes. Philosophical Transactions of the Royal Society of London. Series A, Mathematical and Physical Sciences, vol. 317, no. 1540, pp. 345-356, 1986.
    • (1986) Mathematical and Physical Sciences , vol.317 , Issue.1540 , pp. 345-356
    • Lesk, A.1    Chothia, C.2
  • 27
    • 0017952955 scopus 로고
    • N-gram statistics for natural language understanding and text processing
    • Suen, C. Y. N-gram statistics for natural language understanding and text processing. IEEE Trans. Pattern Anal. Mach. Intell., 1979, 2, 164-172.
    • (1979) IEEE Trans. Pattern Anal. Mach. Intell , vol.2 , pp. 164-172
    • Suen, C.Y.1
  • 28
    • 84922301029 scopus 로고
    • N-gram-based text filtering for TREC-2
    • Cavnar, W. B. N-gram-based text filtering for TREC-2. Ann. Arbor., 1993, 1001, 48113-44001.
    • (1993) Ann. Arbor , vol.1001
    • Cavnar, W.B.1
  • 29
    • 0002636321 scopus 로고
    • N-gram-based text categorization
    • Cavnar, W. B. and Trenkle, J. M. N-gram-based text categorization. Ann. Arbor MI, 1994, 48113, 161-175.
    • (1994) Ann. Arbor MI , vol.48113 , pp. 161-175
    • Cavnar, W.B.1    Trenkle, J.M.2
  • 30
    • 58149345703 scopus 로고    scopus 로고
    • A Discriminative Method for Protein Remote Homology Detection and Fold Recognition Combining Top-n-grams and Latent Semantic Analysis
    • Liu, B.; Wang, X.; Lin, L.; Dong, Q. and Wang, X. A Discriminative Method for Protein Remote Homology Detection and Fold Recognition Combining Top-n-grams and Latent Semantic Analysis. BMC Bioinformatics, 2008, 9, 510.
    • (2008) BMC Bioinformatics , vol.9
    • Liu, B.1    Wang, X.2    Lin, L.3    Dong, Q.4    Wang, X.5
  • 32
    • 84876306464 scopus 로고    scopus 로고
    • Prediction of Golgi-resident protein types by using feature selection technique
    • Guo, F. B.; Huang, J.; Rao, N.; Chen, W. and Lin, H. Prediction of Golgi-resident protein types by using feature selection technique. Chemometrics Intelligent Laborat. Sys., 2013, 124, 9-13.
    • (2013) Chemometrics Intelligent Laborat. Sys , vol.124 , pp. 9-13
    • Guo, F.B.1    Huang, J.2    Rao, N.3    Chen, W.4    Lin, H.5
  • 33
    • 84878789801 scopus 로고    scopus 로고
    • Using overrepresented tetrapeptides to predict protein submitochondrial locations
    • Lin, H.; Chen, W.; Yuan, L. F.; Li, Z. Q. and Ding, H. Using overrepresented tetrapeptides to predict protein submitochondrial locations. Acta Biotheoretica, 2013, 61(2), 259-268.
    • (2013) Acta Biotheoretica , vol.61 , Issue.2 , pp. 259-268
    • Lin, H.1    Chen, W.2    Yuan, L.F.3    Li, Z.Q.4    Ding, H.5
  • 34
    • 0035478854 scopus 로고    scopus 로고
    • Random forests
    • Breiman, L. Random forests. Machine Learn., 2001, 45(1), 5-32.
    • (2001) Machine Learn , vol.45 , Issue.1 , pp. 5-32
    • Breiman, L.1
  • 35
    • 0032139235 scopus 로고    scopus 로고
    • The random subspace method for constructing decision forests
    • Ho, T. K. The random subspace method for constructing decision forests. IEEE Trans. Pattern Anal. Mach. Intell., 1998, 20, 832-844.
    • (1998) IEEE Trans. Pattern Anal. Mach. Intell , vol.20 , pp. 832-844
    • Ho, T.K.1
  • 36
    • 0001492549 scopus 로고    scopus 로고
    • Shape quantization and recognition with randomized trees
    • Amit, Y. and Geman, D. Shape quantization and recognition with randomized trees. Neural. Comput., 1997, 9, 1545-1588.
    • (1997) Neural. Comput , vol.9 , pp. 1545-1588
    • Amit, Y.1    Geman, D.2
  • 38
    • 84896463976 scopus 로고    scopus 로고
    • iNuc-PseKNC: A sequence-based predictor for predicting nucleosome positioning in genomes with pseudo k-tuple nucleotide composition
    • Guo, S. H.; Deng, E. Z.; Xu, L. Q.; Ding, H.; Lin, H.; Chen, W. and Chou, K. C. iNuc-PseKNC: a sequence-based predictor for predicting nucleosome positioning in genomes with pseudo k-tuple nucleotide composition. Bioinformatics, 2014, 30(11), 1522-1529.
    • (2014) Bioinformatics , vol.30 , Issue.11 , pp. 1522-1529
    • Guo, S.H.1    Deng, E.Z.2    Xu, L.Q.3    Ding, H.4    Lin, H.5    Chen, W.6    Chou, K.C.7
  • 39
    • 84903784979 scopus 로고    scopus 로고
    • Identification of bacteriophage virion proteins with the ANOVA feature selection and analysis
    • Ding, H.; Feng, P. M.; Chen, W. and Lin, H. Identification of bacteriophage virion proteins with the ANOVA feature selection and analysis. Mol. Biosys., 2014, 10, 2229-2235.
    • (2014) Mol. Biosys , vol.10 , pp. 2229-2235
    • Ding, H.1    Feng, P.M.2    Chen, W.3    Lin, H.4
  • 40
    • 84885112194 scopus 로고    scopus 로고
    • AcalPred: A sequence-based tool for discriminating between acidic and alkaline enzymes
    • Lin, H.; Chen, W. and Ding, H. AcalPred: A sequence-based tool for discriminating between acidic and alkaline enzymes. PloS One, 2013, 8(10), e75726.
    • (2013) PloS One , vol.8 , Issue.10
    • Lin, H.1    Chen, W.2    Ding, H.3
  • 41
  • 43
    • 84921023793 scopus 로고    scopus 로고
    • PseDNA-Pro: DNA-Binding Protein Identification by Combining Chou's PseAAC and Physicochemical Distance Transformation
    • Liu, B.; Jinghao, X.; Shixi, F.; Ruifeng, X.; Jiyun, Z. and Wang, X. PseDNA-Pro: DNA-Binding Protein Identification by Combining Chou's PseAAC and Physicochemical Distance Transformation. Mol. Inform., DOI: 10. 1002/minf. 201400025.
    • Mol. Inform
    • Liu, B.1    Jinghao, X.2    Shixi, F.3    Ruifeng, X.4    Jiyun, Z.5    Wang, X.6
  • 44
    • 84906975785 scopus 로고    scopus 로고
    • iDNA-Prot|dis: Identifying DNA-Binding Proteins by Incorporating Amino Acid Distance-Pairs and Reduced Alphabet Profile into the General Pseudo Amino Acid Composition
    • Liu, B.; Xu, J.; Lan, X.; Xu, R.; Zhou, J.; Wang, X. and Chou, K.-C. iDNA-Prot|dis: Identifying DNA-Binding Proteins by Incorporating Amino Acid Distance-Pairs and Reduced Alphabet Profile into the General Pseudo Amino Acid Composition. PLoS ONE, 2014, 9, e106691.
    • (2014) PLoS ONE , vol.9
    • Liu, B.1    Xu, J.2    Lan, X.3    Xu, R.4    Zhou, J.5    Wang, X.6    Chou, K.-C.7
  • 45
    • 84901288109 scopus 로고    scopus 로고
    • Using distances between Top-n-gram and residue pairs for protein remote homology detection
    • Liu, B.; Xu, J.; Zou, Q.; Xu, R.; Wang, X. and Chen, Q. Using distances between Top-n-gram and residue pairs for protein remote homology detection. BMC Bioinformatics, 2014, 15, S3.
    • (2014) BMC Bioinformatics , vol.15
    • Liu, B.1    Xu, J.2    Zou, Q.3    Xu, R.4    Wang, X.5    Chen, Q.6
  • 46
    • 84892954329 scopus 로고    scopus 로고
    • Combining evolutionary information extracted from frequency profiles with sequence-based kernels for protein remote homology detection
    • Liu, B.; Zhang, D.; Xu, R.; Xu, J.; Wang, X.; Chen, Q.; Dong, Q.; and Chou, K.-C. Combining evolutionary information extracted from frequency profiles with sequence-based kernels for protein remote homology detection. Bioinformatics, 2014, 30, 472-479.
    • (2014) Bioinformatics , vol.30 , pp. 472-479
    • Liu, B.1    Zhang, D.2    Xu, R.3    Xu, J.4    Wang, X.5    Chen, Q.6    Dong, Q.7    Chou, K.-C.8


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.