Volume 398, 2014, Pages 162-171

Phylogenetic analysis of DNA sequences based on k-word and rough set theory

Author keywords

DNA; Feature selection; k word; Phylogenetic analysis; Rough set theory

Indexed keywords

DNA; DNA SEQUENCES; FEATURE EXTRACTION; LINGUISTICS;

EID: 84891791887     PISSN: 03784371     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.physa.2013.12.025     Document Type: Article
Times cited : (20)

References (55)
  • 1
    • 0242643741
    • A new sequence distance measure for phylogenetic tree construction
    • DOI 10.1093/bioinformatics/btg295
    • H.H. Otu, and K. Sayood A new sequence distance measure for phylogenetic tree construction Bioinformatics 19 2003 2122 2130 (Pubitemid 37408164)
    • (2003) Bioinformatics , vol.19 , Issue.16 , pp. 2122-2130
    • Otu, H.H.1    Sayood, K.2
  • 2
    • 0025340372
    • Chaos game representation of gene structure
    • H.J. Jeffrey Chaos game representation of gene structure Nucleic Acids Res. 18 1990 2163 2170
    • (1990) Nucleic Acids Res. , vol.18 , pp. 2163-2170
    • Jeffrey, H.J.1
  • 3
    • 0013068398
    • Graphical representation of long DNA sequences
    • A. Nandy Graphical representation of long DNA sequences Curr. Sci. 66 1994 821
    • (1994) Curr. Sci. , vol.66 , pp. 821
    • Nandy, A.1
  • 4
    • 79952202492
    • New approaches to drug-DNA interactions based on graphical representation and numerical characterization of DNA sequences
    • A. Nandy, and S.C. Basak New approaches to drug-DNA interactions based on graphical representation and numerical characterization of DNA sequences Curr. Comput. Aided Drug Des. 6 2010 283 289
    • (2010) Curr. Comput. Aided Drug Des. , vol.6 , pp. 283-289
    • Nandy, A.1    Basak, S.C.2
  • 5
    • 79956122631
    • Graphical representation and mathematical characterization of protein sequences and applications to viral proteins
    • A. Ghosh, and A. Nandy Graphical representation and mathematical characterization of protein sequences and applications to viral proteins Adv. Protein Chem. Struct. Biol. 83 2011 1 42
    • (2011) Adv. Protein Chem. Struct. Biol. , vol.83 , pp. 1-42
    • Ghosh, A.1    Nandy, A.2
  • 6
    • 0027995001
    • Z curves, an intuitive tool for visualizing and analyzing the DNA sequences
    • R. Zhang, and C.T. Zhang Z curves, an intuitive tool for visualizing and analyzing DNA sequences J. Biomol. Struct. Dyn. 11 1994 767 782 (Pubitemid 24089977)
    • (1994) Journal of Biomolecular Structure and Dynamics , vol.11 , Issue.4 , pp. 767-782
    • Zhang, R.1    Zhang, C.-T.2
  • 7
    • 0034662286
    • Recognition of protein coding genes in the yeast genome at better than 95% accuracy based on the Z curve
    • C.T. Zhang, and J. Wang Recognition of protein coding genes in the yeast genome at better than 95% accuracy based on the Z curve Nucleic Acids Res. 28 2000 2804 2814 (Pubitemid 30488023)
    • (2000) Nucleic Acids Research , vol.28 , Issue.14 , pp. 2804-2814
    • Zhang, C.-T.1    Wang, J.2
  • 8
    • 0034266159
    • On 3-D graphical representation of DNA primary sequence and their numerical characterization
    • M. Randić, M. Vračko, A. Nandy, and S.C. Basak On 3-D graphical representation of DNA primary sequence and their numerical characterization J. Chem. Inf. Comput. Sci. 40 2000 1235 1244
    • (2000) J. Chem. Inf. Comput. Sci. , vol.40 , pp. 1235-1244
    • Randić, M.1    Vračko, M.2    Nandy, A.3    Basak, S.C.4
  • 9
    • 0037435485
    • Novel 2-D graphical representation of DNA sequences and their numerical characterization
    • DOI 10.1016/S0009-2614(02)01784-0, PII S0009261402017840
    • M. Randić, M. Vračko, N. Lerš, and D. Plavsić Novel 2-D graphical representation of DNA sequences and their numerical characterization Chem. Phys. Lett. 368 2003 1 6 (Pubitemid 36002823)
    • (2003) Chemical Physics Letters , vol.368 , Issue.1-2 , pp. 1-6
    • Randic, M.1    Vracko, M.2    Lers, N.3    Plavsic, D.4
  • 10
    • 0037363676
    • On a four-dimensional representation of DNA primary sequences
    • M. Randić, and A.T. Balaban On a four-dimensional representation of DNA primary sequences J. Chem. Inf. Comput. Sci 43 2003 532 539
    • (2003) J. Chem. Inf. Comput. Sci , vol.43 , pp. 532-539
    • Randić, M.1    Balaban, A.T.2
  • 14
    • 79952303267
    • A novel method of characterizing genetic sequences: Genome space with biological distance and applications
    • M. Deng, C.L. Yu, Q. Liang, R.L. He, and S.T. Yau A novel method of characterizing genetic sequences: genome space with biological distance and applications PLoS One 6 3 2011 e17293
    • (2011) PLoS One , vol.6 , Issue.3 , pp. 17293
    • Deng, M.1    Yu, C.L.2    Liang, Q.3    He, R.L.4    Yau, S.T.5
  • 15
    • 43249107908
    • Visualization of DNA sequences based on 3DD-curves
    • Y.S. Zhang, and M.S. Tan Visualization of DNA sequences based on 3DD-curves J. Math. Chem. 44 2008 206 216
    • (2008) J. Math. Chem. , vol.44 , pp. 206-216
    • Zhang, Y.S.1    Tan, M.S.2
  • 16
    • 65449167165
    • DV-curve: A novel intuitive tool for visualizing and analyzing DNA sequences
    • Z.J. Zhang DV-curve: a novel intuitive tool for visualizing and analyzing DNA sequences Bioinformatics 25 2009 1112 1117
    • (2009) Bioinformatics , vol.25 , pp. 1112-1117
    • Zhang, Z.J.1
  • 17
    • 79959718188
    • A novel graphical representation of protein sequences and its application
    • B. Liao, B.Y. Liao, X.G. Lu, and Z. Cao A novel graphical representation of protein sequences and its application J. Comput. Chem. 32 2011 2539 2544
    • (2011) J. Comput. Chem. , vol.32 , pp. 2539-2544
    • Liao, B.1    Liao, B.Y.2    Lu, X.G.3    Cao, Z.4
  • 18
    • 77958544124
    • Three 3D graphical representations of DNA primary sequences based on the classifications of DNA bases and their applications
    • G.S. Xie, and Z.X. Mo Three 3D graphical representations of DNA primary sequences based on the classifications of DNA bases and their applications J. Theoret. Biol. 269 2011 123 130
    • (2011) J. Theoret. Biol. , vol.269 , pp. 123-130
    • Xie, G.S.1    Mo, Z.X.2
  • 19
    • 33745615095
    • Directed graphs of DNA sequences and their numerical characterization
    • DOI 10.1016/j.jtbi.2005.11.023, PII S0022519305005060
    • C. Li, N. Tang, and J. Wang Directed graphs of DNA sequences and their numerical characterization J. Theoret. Biol. 241 2006 173 177 (Pubitemid 43993954)
    • (2006) Journal of Theoretical Biology , vol.241 , Issue.2 , pp. 173-177
    • Li, C.1    Tang, N.2    Wang, J.3
  • 20
    • 60349115403
    • 3-D maps and coupling numbers for protein sequences
    • C. Li, X.Q. Yu, L. Yang, X.Q. Zheng, and Z.F. Wang 3-D maps and coupling numbers for protein sequences Physica A 388 2009 1967 1972
    • (2009) Physica A , vol.388 , pp. 1967-1972
    • Li, C.1    Yu, X.Q.2    Yang, L.3    Zheng, X.Q.4    Wang, Z.F.5
  • 21
    • 80053179147
    • Using Huffman coding method to visualize and analyze DNA sequences
    • Z.H. Qi, L. Li, and X.Q. Qi Using Huffman coding method to visualize and analyze DNA sequences J. Comput. Chem. 32 2011 3233 3240
    • (2011) J. Comput. Chem. , vol.32 , pp. 3233-3240
    • Qi, Z.H.1    Li, L.2    Qi, X.Q.3
  • 23
    • 84870579444
    • Very efficient search for nucleotide alignments
    • M. Randic Very efficient search for nucleotide alignments J. Comput. Chem. 34 2013 77 82
    • (2013) J. Comput. Chem. , vol.34 , pp. 77-82
    • Randic, M.1
  • 24
    • 0036708519
    • Characteristic sequences for DNA primary sequence
    • P.A. He, and J. Wang Characteristic sequences for DNA primary sequence J. Chem. Inf. Comput. Sci. 42 2002 1008 1085
    • (2002) J. Chem. Inf. Comput. Sci. , vol.42 , pp. 1008-1085
    • He, P.A.1    Wang, J.2
  • 25
    • 0016880887
    • On the complexity of finite sequences
    • A. Lempel, and J. Ziv On the complexity of finite sequences IEEE Trans. Inform. Theory 22 1976 75 81
    • (1976) IEEE Trans. Inform. Theory , vol.22 , pp. 75-81
    • Lempel, A.1    Ziv, J.2
  • 28
    • 84876814776
    • An extension of the Burrows-Wheeler Transform
    • DOI 10.1016/j.tcs.2007.07.014, PII S0304397507005282, The Burrows-Wheeler Transform
    • S. Mantaci, A. Restivo, G. Rosone, and M. Sciortino An extension of the Burrows-Wheeler transform Theoret. Comput. Sci. 387 2007 298 312 (Pubitemid 47633446)
    • (2007) Theoretical Computer Science , vol.387 , Issue.3 , pp. 298-312
    • Mantaci, S.1    Restivo, A.2    Rosone, G.3    Sciortino, M.4
  • 29
    • 36248953301
    • Distance measures for biological sequences: Some recent approaches
    • DOI 10.1016/j.ijar.2007.03.011, PII S0888613X07000382, Approximate Reasoning and Machine Learning for Bioinformatics
    • S. Mantaci, A. Restivo, and M. Sciortino Distance measures for biological sequences: some recent approaches Int. J. Approx. Reason. 47 2008 109 124 (Pubitemid 350138355)
    • (2008) International Journal of Approximate Reasoning , vol.47 , Issue.1 , pp. 109-124
    • Mantaci, S.1    Restivo, A.2    Sciortino, M.3
  • 31
    • 77954458424
    • A generalization of Lempel-Ziv complexity and its application to the comparison of protein sequences
    • C. Li, Z.X. Li, X.Q. Zheng, H. Ma, and X.Q. Yu A generalization of Lempel-Ziv complexity and its application to the comparison of protein sequences J. Math. Chem. 48 2010 330 338
    • (2010) J. Math. Chem. , vol.48 , pp. 330-338
    • Li, C.1    Li, Z.X.2    Zheng, X.Q.3    Ma, H.4    Yu, X.Q.5
  • 32
    • 70350192279
    • A complexity-based measure and its application to phylogenetic analysis
    • X. Zheng, C. Li, and J. Wang A complexity-based measure and its application to phylogenetic analysis J. Math. Chem. 46 2009 1149 1157
    • (2009) J. Math. Chem. , vol.46 , pp. 1149-1157
    • Zheng, X.1    Li, C.2    Wang, J.3
  • 33
    • 77950588824
    • An information-theoretic approach to the prediction of protein structural class
    • X. Zheng, C. Li, and J. Wang An information-theoretic approach to the prediction of protein structural class J. Comput. Chem. 31 2010 1201 1206
    • (2010) J. Comput. Chem. , vol.31 , pp. 1201-1206
    • Zheng, X.1    Li, C.2    Wang, J.3
  • 34
    • 70350165224
    • Normalized Lempel-Ziv complexity and its application in bio-sequence analysis
    • Y. Zhang, J.K. Hao, C.J. Zhou, and K. Chang Normalized Lempel-Ziv complexity and its application in bio-sequence analysis J. Math. Chem. 46 2009 1203 1212
    • (2009) J. Math. Chem. , vol.46 , pp. 1203-1212
    • Zhang, Y.1    Hao, J.K.2    Zhou, C.J.3    Chang, K.4
  • 35
    • 77956897988
    • Use of the Burrows-Wheeler similarity distribution to the comparison of the proteins
    • L.P. Yang, G.S. Chang, and X.D. Zhang Use of the Burrows-Wheeler similarity distribution to the comparison of the proteins Amino Acids 39 2010 887 898
    • (2010) Amino Acids , vol.39 , pp. 887-898
    • Yang, L.P.1    Chang, G.S.2    Zhang, X.D.3
  • 36
    • 0029060923
    • Dinucleotide relative abundance extremes: A genomic signature
    • S. Karlin, and C. Burge Dinucleotide relative abundance extremes: a genomic signature Trends Genet. 11 1995 283 290
    • (1995) Trends Genet. , vol.11 , pp. 283-290
    • Karlin, S.1    Burge, C.2
  • 37
    • 0037428845
    • Prokaryotic phylogeny based on complete genomes without sequence alignment
    • B.L. Hao, J. Qi, and B. Wang Prokaryotic phylogeny based on complete genomes without sequence alignment Modern Phys. Lett. B 17 2003 1 4
    • (2003) Modern Phys. Lett. B , vol.17 , pp. 1-4
    • Hao, B.L.1    Qi, J.2    Wang, B.3
  • 38
    • 1242335920
    • Whole proteome prokaryote phylogeny without sequence alignment: A k-string composition approach
    • DOI 10.1007/s00239-003-2493-7
    • J. Qi, B. Wang, and B.L. Hao Whole proteome prokaryote phylogeny without sequence alignment: a k-string composition approach J. Mol. Evol. 58 2004 1 11 (Pubitemid 38120400)
    • (2004) Journal of Molecular Evolution , vol.58 , Issue.1 , pp. 1-11
    • Qi, J.1    Wang, B.2    Hao, B.-L.3
  • 39
    • 34648815169
    • Prokaryote phylogeny meets taxonomy: An exhaustive comparison of composition vector trees with systematic bacteriology
    • DOI 10.1007/s11427-007-0084-3
    • L. Gao, J. Qi, J.D. Sun, and B.L. Hao Prokaryote phylogeny meets taxonomy: an exhaustive comparison of composition vector trees with systematic bacteriology Sci. China C. Life Sci. 50 2007 587 599 (Pubitemid 47459905)
    • (2007) Science in China, Series C: Life Sciences , vol.50 , Issue.5 , pp. 587-599
    • Gao, L.1    Qi, J.2    Sun, J.3    Hao, B.4
  • 40
    • 85015680543
    • A fungal phylogeny based on 82 complete genomes using the composition vector method
    • H. Wang, Z. Xu, L. Gao, and B.L. Hao A fungal phylogeny based on 82 complete genomes using the composition vector method BMC Evol. Biol. 195 2009 1 13
    • (2009) BMC Evol. Biol. , vol.195 , pp. 1-13
    • Wang, H.1    Xu, Z.2    Gao, L.3    Hao, B.L.4
  • 41
    • 54149091987
    • Comparison study on k-word statistical measures for protein: From sequence to 'sequence space'
    • Q. Dai, and T. Wang Comparison study on k-word statistical measures for protein: from sequence to 'sequence space' BMC Bioinformatics 9 2008 394
    • (2008) BMC Bioinformatics , vol.9 , pp. 394
    • Dai, Q.1    Wang, T.2
  • 42
    • 79951945251
    • Numerical characteristics of word frequencies and their application to dissimilarity measure for sequence comparison
    • Q. Dai, X.Q. Liu, Y.H. Yao, and F. Zhao Numerical characteristics of word frequencies and their application to dissimilarity measure for sequence comparison J. Theoret. Biol. 276 2011 174 180
    • (2011) J. Theoret. Biol. , vol.276 , pp. 174-180
    • Dai, Q.1    Liu, X.Q.2    Yao, Y.H.3    Zhao, F.4
  • 43
    • 34547886848
    • Nucleotide composition string selection in HIV-1 subtyping using whole genomes
    • DOI 10.1093/bioinformatics/btm248
    • X.M. Wu, Z. Cai, X.F. Wan, T. Hoang, R. Goebel, and G.H. Lin Nucleotide composition string selection in HIV-1 subtyping using whole genomes Bioinformatics 23 2007 1744 1752 (Pubitemid 47250306)
    • (2007) Bioinformatics , vol.23 , Issue.14 , pp. 1744-1752
    • Wu, X.1    Cai, Z.2    Wan, X.-F.3    Hoang, T.4    Goebel, R.5    Lin, G.6
  • 44
    • 79952589852
    • A k-mer scheme to predict piRNAs and characterize locust piRNAs
    • Y. Zhang, X. Wang, and L. Kang A k-mer scheme to predict piRNAs and characterize locust piRNAs Bioinformatics 27 2011 771 776
    • (2011) Bioinformatics , vol.27 , pp. 771-776
    • Zhang, Y.1    Wang, X.2    Kang, L.3
  • 45
    • 84870524504
    • A novel statistical measure for sequence comparison on the basis of k-word counts
    • X.W. Yang, and T.M. Wang A novel statistical measure for sequence comparison on the basis of k-word counts J. Theoret. Biol. 2 2013 91 100
    • (2013) J. Theoret. Biol. , vol.2 , pp. 91-100
    • Yang, X.W.1    Wang, T.M.2
  • 46
    • 84861945261
    • A support vector machines based method to predict success for polymerase chain reactions
    • X.Q. Yu, X.Q. Zheng, L.Y. Meng, C. Li, and J. Wang A support vector machines based method to predict success for polymerase chain reactions Comb. Chem. High Throughput Screen 15 2012 486 491
    • (2012) Comb. Chem. High Throughput Screen , vol.15 , pp. 486-491
    • Yu, X.Q.1    Zheng, X.Q.2    Meng, L.Y.3    Li, C.4    Wang, J.5
  • 47
    • 39549123618
    • A novel feature-based method for whole genome phylogenetic analysis without alignment: Application to HEV genotyping and subtyping
    • Z. Liu, J. Meng, and X. Sun A novel feature-based method for whole genome phylogenetic analysis without alignment: application to HEV genotyping and subtyping Biochem. Biophys. Res. Commun. 368 2008 223 230
    • (2008) Biochem. Biophys. Res. Commun. , vol.368 , pp. 223-230
    • Liu, Z.1    Meng, J.2    Sun, X.3
  • 48
    • 77954524003
    • A simple feature representation vector for phylogenetic analysis of DNA sequences
    • S.Y. Ding, Q. Dai, H.M. Liu, and T.M. Wang A simple feature representation vector for phylogenetic analysis of DNA sequences J. Theoret. Biol. 265 2010 618 623
    • (2010) J. Theoret. Biol. , vol.265 , pp. 618-623
    • Ding, S.Y.1    Dai, Q.2    Liu, H.M.3    Wang, T.M.4
  • 49
    • 78149354443
    • Phylogenetic analysis of DNA sequences based on the generalized pseudo-amino acid composition
    • Y.J. Huang, L.P. Yang, and T.M. Wang Phylogenetic analysis of DNA sequences based on the generalized pseudo-amino acid composition J. Theoret. Biol. 269 2011 217 223
    • (2011) J. Theoret. Biol. , vol.269 , pp. 217-223
    • Huang, Y.J.1    Yang, L.P.2    Wang, T.M.3
  • 51
    • 38049119991
    • Consistency based attribute reduction
    • Lecture Notes in Computer Science
    • Q.H. Hu, H. Zhao, Z.X. Xie, and D.R. Yu Consistency based attribute reduction PAKDD Lecture Notes in Computer Science vol. 4426 2007 96 107
    • (2007) PAKDD , vol.4426 , pp. 96-107
    • Hu, Q.H.1    Zhao, H.2    Xie, Z.X.3    Yu, D.R.4
  • 53
    • 77956444134
    • Some invariant properties of ordered information systems under homomorphism
    • C.Z. Wang, D.G. Chen, and Q.H. Hu Some invariant properties of ordered information systems under homomorphism Sci. China Inf. Sci. 53 2010 1816 1825
    • (2010) Sci. China Inf. Sci. , vol.53 , pp. 1816-1825
    • Wang, C.Z.1    Chen, D.G.2    Hu, Q.H.3
  • 55
    • 31144477556
    • Phylogenetic analysis of global hepatitis E virus sequences: Genetic diversity, subtypes and zoonosis
    • DOI 10.1002/rmv.482
    • L. Lu, C. Li, and C.H. Hagedorn Phylogenetic analysis of global hepatitis E virus sequences: genetic diversity, subtypes and zoonosis Rev. Med. Virol. 16 2006 5 36 (Pubitemid 43133591)
    • (2006) Reviews in Medical Virology , vol.16 , Issue.1 , pp. 5-36
    • Lu, L.1    Li, C.2    Hagedorn, C.H.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.