메뉴 건너뛰기




Volumn 337, Issue , 2013, Pages 61-70

Linear regression model of short k-word: A similarity distance suitable for biological sequences with various lengths

Author keywords

Alignment free; Bootstrap; Phylogenetic tree; Sequence comparison

Indexed keywords

HEMOGLOBIN BETA CHAIN;

EID: 84883333176     PISSN: 00225193     EISSN: 10958541     Source Type: Journal    
DOI: 10.1016/j.jtbi.2013.07.028     Document Type: Article
Times cited : (19)

References (56)
  • 2
    • 42149097173 scopus 로고    scopus 로고
    • Kinetic plasticity and the determination of product ratios for kinetic schemes leading to multiple products without rate laws. new methods based on directed graphs
    • Andraos J. Kinetic plasticity and the determination of product ratios for kinetic schemes leading to multiple products without rate laws. new methods based on directed graphs. Canadian Journal of Chemistry 2008, 86:342-357.
    • (2008) Canadian Journal of Chemistry , vol.86 , pp. 342-357
    • Andraos, J.1
  • 5
    • 84875576158 scopus 로고    scopus 로고
    • Propy. a tool to generate various modes of Chou's PseAAC
    • Cao D.S., Xu Q.S., Liang Y.Z. propy. a tool to generate various modes of Chou's PseAAC. Bioinformatics 2013, 29:960-962.
    • (2013) Bioinformatics , vol.29 , pp. 960-962
    • Cao, D.S.1    Xu, Q.S.2    Liang, Y.Z.3
  • 6
    • 84868128310 scopus 로고    scopus 로고
    • INuc-PhysChem. a sequence-based predictor for identifying nucleosomes via physicochemical properties
    • Chen W., Lin H., Feng P.M., Ding C., Zuo Y.C., Chou K.C. iNuc-PhysChem. a sequence-based predictor for identifying nucleosomes via physicochemical properties. PLoS ONE 2012, 7:e47843.
    • (2012) PLoS ONE , vol.7
    • Chen, W.1    Lin, H.2    Feng, P.M.3    Ding, C.4    Zuo, Y.C.5    Chou, K.C.6
  • 7
    • 84876053736 scopus 로고    scopus 로고
    • IRSpot-PseDNC. identify recombination spots with pseudo dinucleotide composition
    • Chen W., Feng P.M., Lin H., Chou K.C. iRSpot-PseDNC. identify recombination spots with pseudo dinucleotide composition. Nucleic Acids Research 2013, 41:e68.
    • (2013) Nucleic Acids Research , vol.41
    • Chen, W.1    Feng, P.M.2    Lin, H.3    Chou, K.C.4
  • 8
    • 84870215870 scopus 로고    scopus 로고
    • Predicting membrane protein types by incorporating protein topology, domains, signal peptides, and physicochemical properties into the general form of Chou's pseudo amino acid composition
    • Chen Y.K., Li K.B. Predicting membrane protein types by incorporating protein topology, domains, signal peptides, and physicochemical properties into the general form of Chou's pseudo amino acid composition. Journal of Theoretical Biology 2013, 318:1-12.
    • (2013) Journal of Theoretical Biology , vol.318 , pp. 1-12
    • Chen, Y.K.1    Li, K.B.2
  • 9
    • 0024971003 scopus 로고
    • Graphic rules in steady and non-steady enzyme kinetics
    • Chou K.C. Graphic rules in steady and non-steady enzyme kinetics. Journal of Biological Chemistry 1989, 264:12074-12079.
    • (1989) Journal of Biological Chemistry , vol.264 , pp. 12074-12079
    • Chou, K.C.1
  • 10
    • 0034687538 scopus 로고    scopus 로고
    • Prediction of protein subcellular locations by incorporating quasi-sequence-order effect
    • Chou K.C. Prediction of protein subcellular locations by incorporating quasi-sequence-order effect. Biochemical and Biophysical Research Communications 2000, 278:477-483.
    • (2000) Biochemical and Biophysical Research Communications , vol.278 , pp. 477-483
    • Chou, K.C.1
  • 11
    • 0035874091 scopus 로고    scopus 로고
    • Prediction of protein cellular attributes using pseudo amino acid composition
    • (Erratum: Chou, K.C., 2001. Prediction of protein cellular attributes using pseudo amino acid composition. Proteins: Structure, Function, and Genetics 44, 60.)
    • Chou K.C. Prediction of protein cellular attributes using pseudo amino acid composition. Proteins: Structure, Function, and Genetics 2001, 43:246-255. (Erratum: Chou, K.C., 2001. Prediction of protein cellular attributes using pseudo amino acid composition. Proteins: Structure, Function, and Genetics 44, 60.).
    • (2001) Proteins: Structure, Function, and Genetics , vol.43 , pp. 246-255
    • Chou, K.C.1
  • 12
    • 77649339280 scopus 로고    scopus 로고
    • Review. recent advances in developing web-servers for predicting protein attributes
    • Chou K.C., Shen H.B. Review. recent advances in developing web-servers for predicting protein attributes. Natural Science 2009, 2:63-92.
    • (2009) Natural Science , vol.2 , pp. 63-92
    • Chou, K.C.1    Shen, H.B.2
  • 13
    • 77952868004 scopus 로고    scopus 로고
    • Graphic rule for drug metabolism systems
    • Chou K.C. Graphic rule for drug metabolism systems. Current Drug Metabolism 2010, 11:369-378.
    • (2010) Current Drug Metabolism , vol.11 , pp. 369-378
    • Chou, K.C.1
  • 14
    • 84859342767 scopus 로고    scopus 로고
    • Wenxiang. a web-server for drawing Wenxiang diagrams
    • Chou K.C., Lin W.Z., Xiao X. Wenxiang. a web-server for drawing Wenxiang diagrams. Natural Science 2011, 3:862-865.
    • (2011) Natural Science , vol.3 , pp. 862-865
    • Chou, K.C.1    Lin, W.Z.2    Xiao, X.3
  • 15
    • 79951945251 scopus 로고    scopus 로고
    • Numerical characteristics of word frequencies and their application to dissimilarity measure for sequence comparison
    • Dai Q., Liu X., Yao Y., Zhao F. Numerical characteristics of word frequencies and their application to dissimilarity measure for sequence comparison. Journal of Theoretical Biology 2011, 276:174-180.
    • (2011) Journal of Theoretical Biology , vol.276 , pp. 174-180
    • Dai, Q.1    Liu, X.2    Yao, Y.3    Zhao, F.4
  • 16
    • 84868234231 scopus 로고    scopus 로고
    • A simple k-word interval method for phylogenetic analysis of DNA sequences
    • Ding S., Li Y., Yang X., Wang T. A simple k-word interval method for phylogenetic analysis of DNA sequences. Journal of Theoretical Biology 2013, 317:192-199.
    • (2013) Journal of Theoretical Biology , vol.317 , pp. 192-199
    • Ding, S.1    Li, Y.2    Yang, X.3    Wang, T.4
  • 17
    • 84859932176 scopus 로고    scopus 로고
    • PseAAC-Builder. a cross-platform stand-alone program for generating various special Chou's pseudo-amino acid compositions
    • Du P., Wang X., Xu C., Gao Y. PseAAC-Builder. a cross-platform stand-alone program for generating various special Chou's pseudo-amino acid compositions. Analytical Biochemistry 2012, 425:117-119.
    • (2012) Analytical Biochemistry , vol.425 , pp. 117-119
    • Du, P.1    Wang, X.2    Xu, C.3    Gao, Y.4
  • 18
    • 77649337793 scopus 로고    scopus 로고
    • Using the concept of Chou's pseudo amino acid composition for risk type prediction of human papillomaviruses
    • Esmaeili M., Mohabatkar H., Mohsenzadeh S. Using the concept of Chou's pseudo amino acid composition for risk type prediction of human papillomaviruses. Journal of Theoretical Biology 2010, 263:203-209.
    • (2010) Journal of Theoretical Biology , vol.263 , pp. 203-209
    • Esmaeili, M.1    Mohabatkar, H.2    Mohsenzadeh, S.3
  • 19
    • 0000461280 scopus 로고
    • Confidence limits on phylogenies. an approach using the bootstrap
    • Felsenstein J. Confidence limits on phylogenies. an approach using the bootstrap. Evolution 1985, 39:783-791.
    • (1985) Evolution , vol.39 , pp. 783-791
    • Felsenstein, J.1
  • 20
    • 0000122573 scopus 로고
    • PHYLIP-phylogeny inference package (version 3.2)
    • Felsenstein J. PHYLIP-phylogeny inference package (version 3.2). Cladistics 1989, 5:164-166.
    • (1989) Cladistics , vol.5 , pp. 164-166
    • Felsenstein, J.1
  • 21
    • 0000678497 scopus 로고    scopus 로고
    • A novel 2-D graphical representation of DNA sequences of low degeneracy
    • Guo X., Randić M., Basak S.C. A novel 2-D graphical representation of DNA sequences of low degeneracy. Chemical Physics Letters 2001, 350:106-112.
    • (2001) Chemical Physics Letters , vol.350 , pp. 106-112
    • Guo, X.1    Randić, M.2    Basak, S.C.3
  • 22
    • 40049100510 scopus 로고    scopus 로고
    • A new method to analyze the similarity of the DNA sequences
    • Guo Y., Wang T.M. A new method to analyze the similarity of the DNA sequences. Journal of Molecular Structure: THEOCHEM 2008, 853:62-67.
    • (2008) Journal of Molecular Structure: THEOCHEM , vol.853 , pp. 62-67
    • Guo, Y.1    Wang, T.M.2
  • 23
    • 0020659327 scopus 로고
    • H Curves, a novel method of representation of nucleotide series especially suited for long DNA sequences
    • Hamori E., Ruskin J. H Curves, a novel method of representation of nucleotide series especially suited for long DNA sequences. Journal of Biological Chemistry 1983, 258:1318-1327.
    • (1983) Journal of Biological Chemistry , vol.258 , pp. 1318-1327
    • Hamori, E.1    Ruskin, J.2
  • 24
    • 79960892272 scopus 로고    scopus 로고
    • Phylogenetic analysis of DNA sequences with a novel characteristic vector
    • Huang Y., Wang T. Phylogenetic analysis of DNA sequences with a novel characteristic vector. Journal of Mathematical Chemistry 2011, 49:1479-1492.
    • (2011) Journal of Mathematical Chemistry , vol.49 , pp. 1479-1492
    • Huang, Y.1    Wang, T.2
  • 25
    • 78149354443 scopus 로고    scopus 로고
    • Phylogenetic analysis of DNA sequences based on the generalized pseudo-amino acid composition
    • Huang Y., Yang L., Wang T. Phylogenetic analysis of DNA sequences based on the generalized pseudo-amino acid composition. Journal of Theoretical Biology 2011, 269:217-223.
    • (2011) Journal of Theoretical Biology , vol.269 , pp. 217-223
    • Huang, Y.1    Yang, L.2    Wang, T.3
  • 26
    • 84859499974 scopus 로고    scopus 로고
    • Hepatitis C virus network based classification of hepatocellular cirrhosis and carcinoma
    • Huang T., Wang J., Cai Y.D., Yu H., Chou K.C. Hepatitis C virus network based classification of hepatocellular cirrhosis and carcinoma. PLoS ONE 2012, 7:e34460.
    • (2012) PLoS ONE , vol.7
    • Huang, T.1    Wang, J.2    Cai, Y.D.3    Yu, H.4    Chou, K.C.5
  • 27
    • 84876592631 scopus 로고    scopus 로고
    • Signal propagation in protein interaction network during colorectal cancer progression
    • Jiang Y., Huang T., Lei C., Gao Y.F., Chou K.C. Signal propagation in protein interaction network during colorectal cancer progression. BioMed Research International 2013, 287019.
    • (2013) BioMed Research International , pp. 287019
    • Jiang, Y.1    Huang, T.2    Lei, C.3    Gao, Y.F.4    Chou, K.C.5
  • 28
    • 34547844142 scopus 로고    scopus 로고
    • A statistical method for alignment-free comparison of regulatory sequences
    • Kantorovitz M.R., Robinson G.E., Sinha S. A statistical method for alignment-free comparison of regulatory sequences. Bioinformatics 2007, 23:i249-i255.
    • (2007) Bioinformatics , vol.23
    • Kantorovitz, M.R.1    Robinson, G.E.2    Sinha, S.3
  • 29
    • 3242810318 scopus 로고    scopus 로고
    • MEGA3. integrated software for molecular evolutionary genetics analysis and sequence alignment
    • Kumar S., Tamra K., Nei M. MEGA3. integrated software for molecular evolutionary genetics analysis and sequence alignment. Briefings in Bioinformatics 2004, 5:150-163.
    • (2004) Briefings in Bioinformatics , vol.5 , pp. 150-163
    • Kumar, S.1    Tamra, K.2    Nei, M.3
  • 31
    • 79960004119 scopus 로고    scopus 로고
    • Interactive tree of life v2. online annotation and display of phylogenetic trees made easy
    • (Web Server issue)
    • Letunic I., Bork P. Interactive tree of life v2. online annotation and display of phylogenetic trees made easy. Nucleic Acid Research 2011, 39:w475-w478. (Web Server issue).
    • (2011) Nucleic Acid Research , vol.39
    • Letunic, I.1    Bork, P.2
  • 32
    • 84859298511 scopus 로고    scopus 로고
    • Identification of colorectal cancer related genes with mRMR and shortest path in protein-protein interaction network
    • Li B.Q., Huang T., Liu L., Cai Y.D., Chou K.C. Identification of colorectal cancer related genes with mRMR and shortest path in protein-protein interaction network. PLoS ONE 2012, 7:e33393.
    • (2012) PLoS ONE , vol.7
    • Li, B.Q.1    Huang, T.2    Liu, L.3    Cai, Y.D.4    Chou, K.C.5
  • 33
    • 12344296751 scopus 로고    scopus 로고
    • A 4D representation of DNA sequences and its application
    • Liao B., Tan M., Ding K. A 4D representation of DNA sequences and its application. Chemical Physics Letters 2005, 402:380-383.
    • (2005) Chemical Physics Letters , vol.402 , pp. 380-383
    • Liao, B.1    Tan, M.2    Ding, K.3
  • 34
    • 0037055264 scopus 로고    scopus 로고
    • Pika and vole mitochondrial genomes increase support for both rodent monophyly and glires
    • Lin Y., Waddell P.J., Penny D. Pika and vole mitochondrial genomes increase support for both rodent monophyly and glires. Gene 2002, 294:119-129.
    • (2002) Gene , vol.294 , pp. 119-129
    • Lin, Y.1    Waddell, P.J.2    Penny, D.3
  • 35
    • 39549123618 scopus 로고    scopus 로고
    • A novel feature-based method for whole genome phylogenetic analysis without alignment. application to HEV genotyping and subtyping
    • Liu Z., Meng J., Sun X. A novel feature-based method for whole genome phylogenetic analysis without alignment. application to HEV genotyping and subtyping. Biochemical and Biophysical Research Communications 2008, 368:223-230.
    • (2008) Biochemical and Biophysical Research Communications , vol.368 , pp. 223-230
    • Liu, Z.1    Meng, J.2    Sun, X.3
  • 36
    • 31144477556 scopus 로고    scopus 로고
    • Phylogenetic analysis of global hepatitis E virus sequences. genetic diversity, subtypes and zoonosis
    • Lu L., Li C., Hagedorn C.H. Phylogenetic analysis of global hepatitis E virus sequences. genetic diversity, subtypes and zoonosis. Reviews in Medical Virology 2006, 16:5-36.
    • (2006) Reviews in Medical Virology , vol.16 , pp. 5-36
    • Lu, L.1    Li, C.2    Hagedorn, C.H.3
  • 37
    • 84863695726 scopus 로고    scopus 로고
    • Predicting plant protein subcellular multi-localization by Chou's PseAAC formulation based multi-label homolog knowledge transfer learning
    • Mei S. Predicting plant protein subcellular multi-localization by Chou's PseAAC formulation based multi-label homolog knowledge transfer learning. Journal of Theoretical Biology 2012, 310:80-87.
    • (2012) Journal of Theoretical Biology , vol.310 , pp. 80-87
    • Mei, S.1
  • 38
    • 79955564229 scopus 로고    scopus 로고
    • Prediction of GABA(A) receptor proteins using the concept of Chou's pseudo-amino acid composition and support vector machine
    • Mohabatkar H., Mohammad Beigi M., Esmaeili A. Prediction of GABA(A) receptor proteins using the concept of Chou's pseudo-amino acid composition and support vector machine. Journal of Theoretical Biology 2011, 281:18-23.
    • (2011) Journal of Theoretical Biology , vol.281 , pp. 18-23
    • Mohabatkar, H.1    Mohammad Beigi, M.2    Esmaeili, A.3
  • 40
    • 12344295510 scopus 로고    scopus 로고
    • A probabilistic measure for alignment-free sequence comparison
    • Pham T.D., Zuegg J. A probabilistic measure for alignment-free sequence comparison. Bioinformatics 2004, 20:3455-3461.
    • (2004) Bioinformatics , vol.20 , pp. 3455-3461
    • Pham, T.D.1    Zuegg, J.2
  • 41
    • 0001802894 scopus 로고    scopus 로고
    • On characterization of DNA primary sequences by a condensed matrix
    • Randić M. On characterization of DNA primary sequences by a condensed matrix. Chemical Physics Letters 2000, 317:29-34.
    • (2000) Chemical Physics Letters , vol.317 , pp. 29-34
    • Randić, M.1
  • 42
    • 0035324931 scopus 로고    scopus 로고
    • Characterization of DNA primary sequences based on the average distances between bases
    • Randić M., Basak S.C. Characterization of DNA primary sequences based on the average distances between bases. Journal of Chemical Information and Computer Sciences 2001, 41:561-568.
    • (2001) Journal of Chemical Information and Computer Sciences , vol.41 , pp. 561-568
    • Randić, M.1    Basak, S.C.2
  • 43
    • 0037470662 scopus 로고    scopus 로고
    • Analysis of similarity/dissimilarity of DNA sequences based on novel 2-D graphical representation
    • Randić M., Vraîko M., Lerî N., Plavîić D. Analysis of similarity/dissimilarity of DNA sequences based on novel 2-D graphical representation. Chemical Physics Letters 2003, 371:202-207.
    • (2003) Chemical Physics Letters , vol.371 , pp. 202-207
    • Randić, M.1    Vraîko, M.2    Lerî, N.3    Plavîić, D.4
  • 44
    • 0036166508 scopus 로고    scopus 로고
    • Integrated gene and species phylogenies from unaligned whole genome protein sequences
    • Stuart G.W., Moffett K., Baker S. Integrated gene and species phylogenies from unaligned whole genome protein sequences. Bioinformatics 2002, 18:100-108.
    • (2002) Bioinformatics , vol.18 , pp. 100-108
    • Stuart, G.W.1    Moffett, K.2    Baker, S.3
  • 45
    • 0037342499 scopus 로고    scopus 로고
    • Alignment-free sequence comparison-a review
    • Vinga S., Almeida J. Alignment-free sequence comparison-a review. Bioinformatics 2003, 19:513-523.
    • (2003) Bioinformatics , vol.19 , pp. 513-523
    • Vinga, S.1    Almeida, J.2
  • 46
    • 0033085003 scopus 로고    scopus 로고
    • Assessing the cretaceous superordinal divergence times within birds and placental mammals using whole mitochondrial protein sequences and an extended statistical framework
    • Waddell P.J., Cao Y., Hasegawa M., Mindell D.P. Assessing the cretaceous superordinal divergence times within birds and placental mammals using whole mitochondrial protein sequences and an extended statistical framework. Systems Biology 1999, 48:119-137.
    • (1999) Systems Biology , vol.48 , pp. 119-137
    • Waddell, P.J.1    Cao, Y.2    Hasegawa, M.3    Mindell, D.P.4
  • 47
    • 0033085002 scopus 로고    scopus 로고
    • Towards resolving the interordinal relationships of placental mammals
    • Waddell P.J., Okada N., Hasegawa M. Towards resolving the interordinal relationships of placental mammals. Systems Biology 1999, 48:1-5.
    • (1999) Systems Biology , vol.48 , pp. 1-5
    • Waddell, P.J.1    Okada, N.2    Hasegawa, M.3
  • 48
    • 49849083665 scopus 로고    scopus 로고
    • WSE, a new sequence distance measure based on word frequencies
    • Wang J., Zheng X. WSE, a new sequence distance measure based on word frequencies. Mathematical Biosciences 2008, 215:78-83.
    • (2008) Mathematical Biosciences , vol.215 , pp. 78-83
    • Wang, J.1    Zheng, X.2
  • 49
    • 0031437248 scopus 로고    scopus 로고
    • A measure of DNA sequence dissimilarity based on Mahalanobis distance between frequencies of words
    • Wu T.J., Burke J.P., Davison D.B. A measure of DNA sequence dissimilarity based on Mahalanobis distance between frequencies of words. Biometrics 1997, 53:1431-1439.
    • (1997) Biometrics , vol.53 , pp. 1431-1439
    • Wu, T.J.1    Burke, J.P.2    Davison, D.B.3
  • 50
    • 0035013276 scopus 로고    scopus 로고
    • Statistical measures of DNA sequence dissimilarity under Markov chain models of base composition
    • Wu T.J., Hsieh Y.C., Li L.A. Statistical measures of DNA sequence dissimilarity under Markov chain models of base composition. Biometrics 2001, 57:441-448.
    • (2001) Biometrics , vol.57 , pp. 441-448
    • Wu, T.J.1    Hsieh, Y.C.2    Li, L.A.3
  • 51
    • 84872069665 scopus 로고    scopus 로고
    • Large local analysis of the unaligned genome and its application
    • Yang L., Zhang X., Wang T., Zhu H. Large local analysis of the unaligned genome and its application. Journal of Computational Biology 2013, 20:19-29.
    • (2013) Journal of Computational Biology , vol.20 , pp. 19-29
    • Yang, L.1    Zhang, X.2    Wang, T.3    Zhu, H.4
  • 52
    • 70350192279 scopus 로고    scopus 로고
    • A complexity-based measure and its application to phylogenetic analysis
    • Zheng X., Li C., Wang J. A complexity-based measure and its application to phylogenetic analysis. Journal of Mathematical Chemistry 2009, 46:1149-1157.
    • (2009) Journal of Mathematical Chemistry , vol.46 , pp. 1149-1157
    • Zheng, X.1    Li, C.2    Wang, J.3
  • 53
    • 34548439881 scopus 로고    scopus 로고
    • Using Chou's amphiphilic pseudo-amino acid composition and support vector machine for prediction of enzyme subfamily classes
    • Zhou X.B., Chen C., Li Z.C., Zou X.Y. Using Chou's amphiphilic pseudo-amino acid composition and support vector machine for prediction of enzyme subfamily classes. Journal of Theoretical Biology 2007, 248:546-551.
    • (2007) Journal of Theoretical Biology , vol.248 , pp. 546-551
    • Zhou, X.B.1    Chen, C.2    Li, Z.C.3    Zou, X.Y.4
  • 54
    • 0021764092 scopus 로고
    • An extension of Chou's graphic rules for deriving enzyme kinetic equations to systems involving parallel reaction pathways
    • Zhou G.P., Deng M.H. An extension of Chou's graphic rules for deriving enzyme kinetic equations to systems involving parallel reaction pathways. Biochemical Journal 1984, 222:169-176.
    • (1984) Biochemical Journal , vol.222 , pp. 169-176
    • Zhou, G.P.1    Deng, M.H.2
  • 55
    • 79960604768 scopus 로고    scopus 로고
    • The disposition of the LZCC protein residues in Wenxiang diagram provides new insights into the protein-protein interaction mechanism
    • Zhou G.P. The disposition of the LZCC protein residues in Wenxiang diagram provides new insights into the protein-protein interaction mechanism. Journal of Theoretical Biology 2011, 284:142-148.
    • (2011) Journal of Theoretical Biology , vol.284 , pp. 142-148
    • Zhou, G.P.1
  • 56
    • 79960617237 scopus 로고    scopus 로고
    • The structural determinations of the leucine zipper coiled-coil domains of the cGMP-dependent protein kinase I alpha and its interaction with the myosin binding subunit of the myosin light chains phosphase
    • Zhou G.P. The structural determinations of the leucine zipper coiled-coil domains of the cGMP-dependent protein kinase I alpha and its interaction with the myosin binding subunit of the myosin light chains phosphase. Protein and Peptide Letters 2011, 18:966-978.
    • (2011) Protein and Peptide Letters , vol.18 , pp. 966-978
    • Zhou, G.P.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.