



Volume 24, Issue 7, 2013, Pages 597-609

An alignment-free method to find similarity among protein sequences via the general form of Chou's pseudo amino acid composition

Author keywords

alignment free method; cosine distance metric; phylogenetic tree; sequence similarity analysis

Indexed keywords

INDIUM COMPOUNDS; PHYLOGENETIC; PROTEINS; TREES (MATHEMATICS);

EID: 84888000439     PISSN: 1062936X     EISSN: 1029046X     Source Type: Journal    
DOI: 10.1080/1062936X.2013.773378     Document Type: Article
Times cited : (42)

References (37)
  • 1
    • 34548606295
    • Recent progress in protein subcellular location prediction
    • Chou, K.-C. and Shen, H.-B. 2007. Recent progress in protein subcellular location prediction. Anal. Biochem., 370: 1 - 16.
    • (2007) Anal. Biochem. , vol.370 , pp. 1-16
    • Chou, K.-C.1    Shen, H.-B.2
  • 2
    • 77950448057
    • Predicting drug-target interaction networks based on functional groups and biological features
    • He, Z., Zhang, J., Shi, X.-H., Hu, L.-L., Kong, X., Cai, Y.-D. and Chou, K.-C. 2010. Predicting drug-target interaction networks based on functional groups and biological features. PLoS ONE, 5: 1 - 8.
    • (2010) PLoS ONE , vol.5 , pp. 1-8
    • He, Z.1    Zhang, J.2    Shi, X.-H.3    Hu, L.-L.4    Kong, X.5    Cai, Y.-D.6    Chou, K.-C.7
  • 3
    • 79961193016
    • Predicting transcriptional activity of multiple site p53 mutants based on hybrid properties
    • Huang, T., Niu, S., Xu, Z., Huan, Y., Kong, X., Cai, Y.-D. and Chou, K.-C. 2011. Predicting transcriptional activity of multiple site p53 mutants based on hybrid properties. PLoS ONE, 6: 1 - 8.
    • (2011) PLoS ONE , vol.6 , pp. 1-8
    • Huang, T.1    Niu, S.2    Xu, Z.3    Huan, Y.4    Kong, X.5    Cai, Y.-D.6    Chou, K.-C.7
  • 4
    • 84868128310
    • iNuc-PhysChem: A sequence-based predictor for identifying nucleosomes via physicochemical properties
    • Chen, W., Lin, H., Feng, P.-M., Ding, C., Zuo, Y.-C. and Chou, K.-C. 2012. iNuc-PhysChem: A sequence-based predictor for identifying nucleosomes via physicochemical properties. PLoS ONE, 7: 1 - 9.
    • (2012) PLoS ONE , vol.7 , pp. 1-9
    • Chen, W.1    Lin, H.2    Feng, P.-M.3    Ding, C.4    Zuo, Y.-C.5    Chou, K.-C.6
  • 5
    • 0030049315
    • Prediction of human immunodeficiency virus protease cleavage sites in proteins
    • Chou, K.-C. 1996. Prediction of human immunodeficiency virus protease cleavage sites in proteins. Anal. Biochem., 233: 1 - 14.
    • (1996) Anal. Biochem. , vol.233 , pp. 1-14
    • Chou, K.-C.1
  • 6
    • 79960915175
    • Prediction of body fluids where proteins are secreted into based on protein interaction network
    • Hu, L.-L., Huang, T., Cai, Y.-D. and Chou, K.-C. 2011. Prediction of body fluids where proteins are secreted into based on protein interaction network. PLoS ONE, 6: 1 - 8.
    • (2011) PLoS ONE , vol.6 , pp. 1-8
    • Hu, L.-L.1    Huang, T.2    Cai, Y.-D.3    Chou, K.-C.4
  • 7
    • 77955199287
    • Analysis and prediction of the metabolic stability of proteins based on their sequential features, subcellular locations and interaction networks
    • Huang, T., Shi, X.-H., Wang, P., He, Z., Feng, K.-Y., Hu, L.-L., Kong, X., Li, Y.-X., Cai, Y.-D. and Chou, K.-C. 2010. Analysis and prediction of the metabolic stability of proteins based on their sequential features, subcellular locations and interaction networks. PLoS ONE, 5: 1 - 9.
    • (2010) PLoS ONE , vol.5 , pp. 1-9
    • Huang, T.1    Shi, X.-H.2    Wang, P.3    He, Z.4    Feng, K.-Y.5    Hu, L.-L.6    Kong, X.7    Li, Y.-X.8    Cai, Y.-D.9    Chou, K.-C.10
  • 8
    • 79953316878
    • iLoc-Euk: A multi-label classifier for predicting the subcellular localization of singleplex and multiplex eukaryotic proteins
    • Chou, K.-C., Wu, Z.-C. and Xiao, X. 2011. iLoc-Euk: A multi-label classifier for predicting the subcellular localization of singleplex and multiplex eukaryotic proteins. PLoS ONE, 6: 1 - 10.
    • (2011) PLoS ONE , vol.6 , pp. 1-10
    • Chou, K.-C.1    Wu, Z.-C.2    Xiao, X.3
  • 9
    • 84855641685
    • iLoc-Hum: Using the accumulation-label scale to predict subcellular locations of human proteins with both single and multiple sites
    • Chou, K.-C., Wu, Z.-C. and Xiao, X. 2012. iLoc-Hum: Using the accumulation-label scale to predict subcellular locations of human proteins with both single and multiple sites. Mol. BioSyst., 8: 629 - 641.
    • (2012) Mol. BioSyst. , vol.8 , pp. 629-641
    • Chou, K.-C.1    Wu, Z.-C.2    Xiao, X.3
  • 11
    • 80051676719
    • NR-2L: A two-level predictor for identifying nuclear receptor subfamilies based on sequence-derived features
    • Wang, P., Xiao, X. and Chou, K.-C. 2011. NR-2L: A two-level predictor for identifying nuclear receptor subfamilies based on sequence-derived features. PLoS ONE, 6: 1 - 9.
    • (2011) PLoS ONE , vol.6 , pp. 1-9
    • Wang, P.1    Xiao, X.2    Chou, K.-C.3
  • 12
    • 84859298511
    • Identification of colorectal cancer related genes with mRMR and shortest path in protein-protein interaction network
    • Li, B.-Q., Huang, T., Liu, L., Cai, Y.-D. and Chou, K.-C. 2012. Identification of colorectal cancer related genes with mRMR and shortest path in protein-protein interaction network. PLoS ONE, 7: 1 - 12.
    • (2012) PLoS ONE , vol.7 , pp. 1-12
    • Li, B.-Q.1    Huang, T.2    Liu, L.3    Cai, Y.-D.4    Chou, K.-C.5
  • 13
    • 84859499974
    • Hepatitis C virus network based classification of hepatocellular cirrhosis and carcinoma
    • Huang, T., Wang, J., Cai, Y.-D., Yu, H. and Chou, K.-C. 2012. Hepatitis C virus network based classification of hepatocellular cirrhosis and carcinoma. PLoS ONE, 7: 1 - 9.
    • (2012) PLoS ONE , vol.7 , pp. 1-9
    • Huang, T.1    Wang, J.2    Cai, Y.-D.3    Yu, H.4    Chou, K.-C.5
  • 14
    • 84856347308
    • Predict and analyze S-nitrosylation modification sites with the mRMR and IFS approaches
    • Li, B.-Q., Hu, L.-L., Niu, S., Cai, Y.-D. and Chou, K.-C. 2012. Predict and analyze S-nitrosylation modification sites with the mRMR and IFS approaches. J. Proteomics, 75: 1654 - 1665.
    • (2012) J. Proteomics , vol.75 , pp. 1654-1665
    • Li, B.-Q.1    Hu, L.-L.2    Niu, S.3    Cai, Y.-D.4    Chou, K.-C.5
  • 15
    • 3242792729
    • Structural bioinformatics and its impact to biomedical science
    • Chou, K.-C. 2004. Structural bioinformatics and its impact to biomedical science. Curr. Med. Chem., 11: 2105 - 2134.
    • (2004) Curr. Med. Chem. , vol.11 , pp. 2105-2134
    • Chou, K.-C.1
  • 16
    • 0020659327
    • H curves, a novel method of representation of nucleotide series especially suited for long DNA sequences
    • Hamori, E. and Ruskin, J. 1983. H curves, a novel method of representation of nucleotide series especially suited for long DNA sequences. J. Biol. Chem., 258: 1318 - 1327.
    • (1983) J. Biol. Chem. , vol.258 , pp. 1318-1327
    • Hamori, E.1    Ruskin, J.2
  • 17
    • 0022592245
    • A simple way to look at DNA
    • Gates, M.A. 1986. A simple way to look at DNA. J. Theor. Biol., 119: 319 - 328.
    • (1986) J. Theor. Biol. , vol.119 , pp. 319-328
    • Gates, M.A.1
  • 18
    • 0000347441
    • A new graphical representation and analysis of DNA sequence structure: I. Methodology and application to globin genes
    • Nandy, A. 1994. A new graphical representation and analysis of DNA sequence structure: I. Methodology and application to globin genes. Curr. Sci., 66: 309 - 314.
    • (1994) Curr. Sci. , vol.66 , pp. 309-314
    • Nandy, A.1
  • 19
    • 0028889673
    • Random walk and gap plots of DNA sequences
    • Leong, P.M. and Morgenthaler, S. 1995. Random walk and gap plots of DNA sequences. Comput. Appl. Biosci., 11: 503 - 507.
    • (1995) Comput. Appl. Biosci. , vol.11 , pp. 503-507
    • Leong, P.M.1    Morgenthaler, S.2
  • 20
    • 2442509138
    • 2-D Graphical representation of proteins based on virtual genetic code
    • Randić, M. 2004. 2-D Graphical representation of proteins based on virtual genetic code. SAR QSAR Environ. Res., 15: 147 - 157.
    • (2004) SAR QSAR Environ. Res. , vol.15 , pp. 147-157
    • Randić, M.1
  • 23
    • 52149106884
    • Characterization of protein primary sequences based on partial ordering
    • Feng, J. and Wang, T.M. 2008. Characterization of protein primary sequences based on partial ordering. J. Theor. Biol., 254: 752 - 755.
    • (2008) J. Theor. Biol. , vol.254 , pp. 752-755
    • Feng, J.1    Wang, T.M.2
  • 24
    • 47349126419
    • On novel representation of proteins based on amino acid adjacency matrix
    • Randić, M., Novič, M. and Vračko, M. 2008. On novel representation of proteins based on amino acid adjacency matrix. SAR QSAR Environ. Res., 19: 339 - 349.
    • (2008) SAR QSAR Environ. Res. , vol.19 , pp. 339-349
    • Randić, M.1    Novič, M.2    Vračko, M.3
  • 25
    • 79951518208
    • Some remarks on protein attribute prediction and pseudo amino acid composition
    • Chou, K.-C. 2011. Some remarks on protein attribute prediction and pseudo amino acid composition. J. Theor. Biol., 273: 236 - 247.
    • (2011) J. Theor. Biol. , vol.273 , pp. 236-247
    • Chou, K.-C.1
  • 26
    • 0035874091
    • Prediction of protein cellular attributes using pseudo-amino acid composition
    • Chou, K.-C. 2001. Prediction of protein cellular attributes using pseudo-amino acid composition. Proteins: Struct. Funct. Bioinf., 43: 246 - 255.
    • (2001) Proteins: Struct. Funct. Bioinf. , vol.43 , pp. 246-255
    • Chou, K.-C.1
  • 27
    • 12744279642
    • Using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes
    • Chou, K.-C. 2005. Using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes. Bioinformatics, 21: 10 - 19.
    • (2005) Bioinformatics , vol.21 , pp. 10-19
    • Chou, K.-C.1
  • 28
    • 33748168697
    • Clustering DNA sequences by feature vectors
    • Liu, L., Ho, Y.-K. and Yau, S. 2006. Clustering DNA sequences by feature vectors. Mol. Phylogenet. Evol., 41: 64 - 69.
    • (2006) Mol. Phylogenet. Evol. , vol.41 , pp. 64-69
    • Liu, L.1    Ho, Y.-K.2    Yau, S.3
  • 29
    • 0023375195
    • The neighbor-joining method: A new method for reconstructing phylogenetic trees
    • Saitou, N. and Nei, M. 1987. The neighbor-joining method: A new method for reconstructing phylogenetic trees. Mol. Biol. Evol., 4: 406 - 425.
    • (1987) Mol. Biol. Evol. , vol.4 , pp. 406-425
    • Saitou, N.1    Nei, M.2
  • 30
    • 42049104155
    • 2-D graphical representation of protein sequences and its application to coronavirus phylogeny
    • Xing, L., Li, C. and Wang, X. 2008. 2-D graphical representation of protein sequences and its application to coronavirus phylogeny. BMB Rep., 41: 217 - 222.
    • (2008) BMB Rep. , vol.41 , pp. 217-222
    • Xing, L.1    Li, C.2    Wang, X.3
  • 31
    • 45949109669
    • MEGA: A biologist-centric software for evolutionary analysis of DNA and protein sequences
    • Kumar, S., Nei, M., Dudley, J. and Tamura, K. 2008. MEGA: A biologist-centric software for evolutionary analysis of DNA and protein sequences. Brief. Bioinf., 9: 299 - 306.
    • (2008) Brief. Bioinf. , vol.9 , pp. 299-306
    • Kumar, S.1    Nei, M.2    Dudley, J.3    Tamura, K.4
  • 32
    • 0035102453
    • An information-based sequence distance and its application to whole mitochondrial genome phylogeny
    • Li, M., Badger, J.H., Chen, X., Kwong, S., Kearney, P. and Zhang, H. 2001. An information-based sequence distance and its application to whole mitochondrial genome phylogeny. Bioinformatics, 17: 149 - 154.
    • (2001) Bioinformatics , vol.17 , pp. 149-154
    • Li, M.1    Badger, J.H.2    Chen, X.3    Kwong, S.4    Kearney, P.5    Zhang, H.6
  • 33
    • 71849120074
    • A 2D graphical representation of protein sequence and its numerical characterization
    • Wen, J. and Zhang, Y. 2009. A 2D graphical representation of protein sequence and its numerical characterization. Chem. Phys. Lett., 476: 281 - 286.
    • (2009) Chem. Phys. Lett. , vol.476 , pp. 281-286
    • Wen, J.1    Zhang, Y.2
  • 34
    • 77956169760
    • 3D graphical representation of protein sequences and their statistical characterization
    • Abo el Maaty, M.I., Abo-Elkhier, M.M. and Abd Elwahaab, M.A. 2010. 3D graphical representation of protein sequences and their statistical characterization. Physica A, 389: 4668 - 4676.
    • (2010) Physica A , vol.389 , pp. 4668-4676
    • Abo el Maaty, M.I.1    Abo-Elkhier, M.M.2    Abd Elwahaab, M.A.3
  • 35
    • 84886373947
    • Similarity/dissimilarity analysis of protein sequences using the spatial median as a descriptor
    • Abo-Elkhier, M.M. 2012. Similarity/dissimilarity analysis of protein sequences using the spatial median as a descriptor. J. Biophys. Chem., 3: 142 - 148.
    • (2012) J. Biophys. Chem. , vol.3 , pp. 142-148
    • Abo-Elkhier, M.M.1
  • 36
    • 77955673069
    • 2D-MH: A web-server for generating graphic representation of protein sequences based on the physicochemical properties of their constituent amino acids
    • Wu, Z.-C., Xiao, X. and Chou, K.-C. 2010. 2D-MH: A web-server for generating graphic representation of protein sequences based on the physicochemical properties of their constituent amino acids. J. Theor. Biol., 267: 29 - 34.
    • (2010) J. Theor. Biol. , vol.267 , pp. 29-34
    • Wu, Z.-C.1    Xiao, X.2    Chou, K.-C.3
  • 37
    • 77649339280
    • Recent advances in developing web-servers for predicting protein attributes
    • Chou, K.-C. and Shen, H.-B. 2009. Recent advances in developing web-servers for predicting protein attributes. Nat. Sci., 1: 63 - 92.
    • (2009) Nat. Sci. , vol.1 , pp. 63-92
    • Chou, K.-C.1    Shen, H.-B.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.