



Volume 12, Issue , 2011, Pages

N-gram analysis of 970 microbial organisms reveals presence of biological language models

Author keywords

[No Author keywords available]

Indexed keywords

COMPARATIVE ANALYSIS; EVOLUTIONARY DISTANCE; EVOLUTIONARY TREE; GAMMAPROTEOBACTERIA; GENOME SEQUENCES; NATURAL LANGUAGE PROCESSING; PHYLOGENETIC TREES; STATISTICAL MEASURES;

EID: 78650997356     PISSN: None     EISSN: 14712105     Source Type: Journal    
DOI: 10.1186/1471-2105-12-12     Document Type: Article
Times cited : (21)

References (44)
  • 2
    • 0034305155
    • Small bugs, big business: the economic power of the microbe
    • 10.1016/S0734-9750(00)00049-5, 14538099
    • Demain AL. Small bugs, big business: the economic power of the microbe. Biotechnology advances 2000, 18(6):499-514. 10.1016/S0734-9750(00)00049-5, 14538099.
    • (2000) Biotechnology advances , vol.18 , Issue.6 , pp. 499-514
    • Demain, A.L.1
  • 4
  • 7
    • 4544229161
    • Phylogenetic trees based on gene content
    • Oxford, England, 10.1093/bioinformatics/bth198, 15044248
    • Huson DH, Steel M. Phylogenetic trees based on gene content. Bioinformatics 2004, 20(13):2044-2049. Oxford, England, 10.1093/bioinformatics/bth198, 15044248.
    • (2004) Bioinformatics , vol.20 , Issue.13 , pp. 2044-2049
    • Huson, D.H.1    Steel, M.2
  • 10
    • 19544379902
    • Whole-genome prokaryotic phylogeny
    • Oxford, England, 10.1093/bioinformatics/bth324, 15166018
    • Henz SR, Huson DH, Auch AF, Nieselt-Struwe K, Schuster SC. Whole-genome prokaryotic phylogeny. Bioinformatics 2005, 21(10):2329-2335. Oxford, England, 10.1093/bioinformatics/bth324, 15166018.
    • (2005) Bioinformatics , vol.21 , Issue.10 , pp. 2329-2335
    • Henz, S.R.1    Huson, D.H.2    Auch, A.F.3    Nieselt-Struwe, K.4    Schuster, S.C.5
  • 11
    • 0037315735
    • Evolutionary implications of microbial genome tetranucleotide frequency biases
    • 10.1101/gr.335003, 420360, 12566393
    • Pride DT, Meinersmann RJ, Wassenaar TM, Blaser MJ. Evolutionary implications of microbial genome tetranucleotide frequency biases. Genome research 2003, 13(2):145-158. 10.1101/gr.335003, 420360, 12566393.
    • (2003) Genome research , vol.13 , Issue.2 , pp. 145-158
    • Pride, D.T.1    Meinersmann, R.J.2    Wassenaar, T.M.3    Blaser, M.J.4
  • 13
    • 3843083229
    • Experiments with syntactic traces in information retrieval
    • Heer TD. Experiments with syntactic traces in information retrieval. Inform Storage Retrieval 10 1974, 133-144.
    • (1974) Inform Storage Retrieval 10 , pp. 133-144
    • Heer, T.D.1
  • 14
  • 18
    • 0029060923
    • Dinucleotide relative abundance extremes: a genomic signature
    • 10.1016/S0168-9525(00)89076-9, 7482779
    • Karlin S, Burge C. Dinucleotide relative abundance extremes: a genomic signature. Trends Genet 1995, 11(7):283-290. 10.1016/S0168-9525(00)89076-9, 7482779.
    • (1995) Trends Genet , vol.11 , Issue.7 , pp. 283-290
    • Karlin, S.1    Burge, C.2
  • 21
    • 75149150883
    • Using genomic signatures for HIV-1 sub-typing
    • 10.1186/1471-2105-11-S1-S26, 3009497, 20122198
    • Pandit A, Sinha S. Using genomic signatures for HIV-1 sub-typing. BMC bioinformatics 11(Suppl 1):S26. 10.1186/1471-2105-11-S1-S26, 3009497, 20122198.
    • BMC bioinformatics , vol.11 , Issue.SUPPL .1
    • Pandit, A.1    Sinha, S.2
  • 22
    • 0027512932
    • A novel method of protein sequence classification based on oligopeptide frequency analysis and its application to search for functional sites and to domain localization
    • Solovyev VV, Makarova KS. A novel method of protein sequence classification based on oligopeptide frequency analysis and its application to search for functional sites and to domain localization. Comput Appl Biosci 1993, 9(1):17-24.
    • (1993) Comput Appl Biosci , vol.9 , Issue.1 , pp. 17-24
    • Solovyev, V.V.1    Makarova, K.S.2
  • 23
    • 13944255457
    • Protein classification based on text document classification techniques
    • 10.1002/prot.20373, 15645499
    • Cheng BY, Carbonell JG, Klein-Seetharaman J. Protein classification based on text document classification techniques. Proteins 2005, 58(4):955-970. 10.1002/prot.20373, 15645499.
    • (2005) Proteins , vol.58 , Issue.4 , pp. 955-970
    • Cheng, B.Y.1    Carbonell, J.G.2    Klein-Seetharaman, J.3
  • 24
    • 0032104588
    • Classification and identification of proteins by means of common and specific amino acid n-tuples in unaligned sequences
    • 10.1016/S0169-2607(98)00031-5, 9725648
    • Daeyaert F, Moereels H, Lewi PJ. Classification and identification of proteins by means of common and specific amino acid n-tuples in unaligned sequences. Computer methods and programs in biomedicine 1998, 56(3):221-233. 10.1016/S0169-2607(98)00031-5, 9725648.
    • (1998) Computer methods and programs in biomedicine , vol.56 , Issue.3 , pp. 221-233
    • Daeyaert, F.1    Moereels, H.2    Lewi, P.J.3
  • 25
    • 34548832558
    • ngLOC: an n-gram-based Bayesian method for estimating the subcellular proteomes of eukaryotes
    • 10.1186/gb-2007-8-5-r68, 1929137, 17472741
    • King BR, Guda C. ngLOC: an n-gram-based Bayesian method for estimating the subcellular proteomes of eukaryotes. Genome biology 2007, 8(5):R68. 10.1186/gb-2007-8-5-r68, 1929137, 17472741.
    • (2007) Genome biology , vol.8 , Issue.5
    • King, B.R.1    Guda, C.2
  • 26
    • 77951958250
    • A visual framework for sequence analysis using n-grams and spectral rearrangement
    • Oxford, England, 10.1093/bioinformatics/btq042, 20130028
    • Maetschke SR, Kassahn KS, Dunn JA, Han SP, Curley EZ, Stacey KJ, Ragan MA. A visual framework for sequence analysis using n-grams and spectral rearrangement. Bioinformatics 26(6):737-744. Oxford, England, 10.1093/bioinformatics/btq042, 20130028.
    • Bioinformatics , vol.26 , Issue.6 , pp. 737-744
    • Maetschke, S.R.1    Kassahn, K.S.2    Dunn, J.A.3    Han, S.P.4    Curley, E.Z.5    Stacey, K.J.6    Ragan, M.A.7
  • 27
    • 0036166508
    • Integrated gene and species phylogenies from unaligned whole genome protein sequences
    • Oxford, England, 10.1093/bioinformatics/18.1.100, 11836217
    • Stuart GW, Moffett K, Baker S. Integrated gene and species phylogenies from unaligned whole genome protein sequences. Bioinformatics 2002, 18(1):100-108. Oxford, England, 10.1093/bioinformatics/18.1.100, 11836217.
    • (2002) Bioinformatics , vol.18 , Issue.1 , pp. 100-108
    • Stuart, G.W.1    Moffett, K.2    Baker, S.3
  • 28
    • 1242335920
    • Whole proteome prokaryote phylogeny without sequence alignment: a K-string composition approach
    • 10.1007/s00239-003-2493-7, 14743310
    • Qi J, Wang B, Hao BI. Whole proteome prokaryote phylogeny without sequence alignment: a K-string composition approach. Journal of molecular evolution 2004, 58(1):1-11. 10.1007/s00239-003-2493-7, 14743310.
    • (2004) Journal of molecular evolution , vol.58 , Issue.1 , pp. 1-11
    • Qi, J.1    Wang, B.2    Hao, B.I.3
  • 29
    • 31344463462
    • N-gram-based classification and unsupervised hierarchical clustering of genome sequences
    • 10.1016/j.cmpb.2005.11.007, 16423423
    • Tomovic A, Janicic P, Keselj V. n-gram-based classification and unsupervised hierarchical clustering of genome sequences. Computer methods and programs in biomedicine 2006, 81(2):137-153. 10.1016/j.cmpb.2005.11.007, 16423423.
    • (2006) Computer methods and programs in biomedicine , vol.81 , Issue.2 , pp. 137-153
    • Tomovic, A.1    Janicic, P.2    Keselj, V.3
  • 30
    • 55649117683
    • Could n-gram analysis contribute to genomic island determination?
    • 10.1016/j.jbi.2008.03.007, 18448392
    • Mitic NS, Pavlovic-Lazetic GM, Beljanski MV. Could n-gram analysis contribute to genomic island determination? Journal of biomedical informatics 2008, 41(6):936-943. 10.1016/j.jbi.2008.03.007, 18448392.
    • (2008) Journal of biomedical informatics , vol.41 , Issue.6 , pp. 936-943
    • Mitic, N.S.1    Pavlovic-Lazetic, G.M.2    Beljanski, M.V.3
  • 31
  • 32
    • 67649649626
    • Analysis of n-gram based promoter recognition methods and application to whole genome promoter prediction
    • Rani TS, Bapi RS. Analysis of n-gram based promoter recognition methods and application to whole genome promoter prediction. In silico biology 2009, 9(1-2):S1-16.
    • (2009) In silico biology , vol.9 , Issue.1-2
    • Rani, T.S.1    Bapi, R.S.2
  • 34
    • 0024135865
    • On large-vocabulary speaker-independent continuous speech recognition
    • Lee K. On large-vocabulary speaker-independent continuous speech recognition. Speech Communication 1988, 7(4):375-379.
    • (1988) Speech Communication , vol.7 , Issue.4 , pp. 375-379
    • Lee, K.1
  • 37
    • 11144231530
    • Application of n-Grams
    • University of Missouri-Rolla
    • Tauritz D. Application of n-Grams. Department of Computer Science 2002, University of Missouri-Rolla.
    • (2002) Department of Computer Science
    • Tauritz, D.1
  • 40
    • 22744438090
    • BLMT: statistical sequence analysis using N-grams
    • 10.2165/00822942-200403020-00013, 15693744
    • Ganapathiraju M, Manoharan V, Klein-Seetharaman J. BLMT: statistical sequence analysis using N-grams. Applied bioinformatics 2004, 3(2-3):193-200. 10.2165/00822942-200403020-00013, 15693744.
    • (2004) Applied bioinformatics , vol.3 , Issue.2-3 , pp. 193-200
    • Ganapathiraju, M.1    Manoharan, V.2    Klein-Seetharaman, J.3
  • 42
    • 67949087176
    • Genomics of Host-Restricted Pathogens of the Genus Bartonella
    • 19696500
    • Engel P, Dehio C. Genomics of Host-Restricted Pathogens of the Genus Bartonella. Genome Dyn 2009, 6:158-169. 19696500.
    • (2009) Genome Dyn , vol.6 , pp. 158-169
    • Engel, P.1    Dehio, C.2
  • 44
    • 39749173921
    • Reduced selection leads to accelerated gene loss in Shigella
    • 10.1186/gb-2007-8-8-r164, 2374995, 17686180
    • Hershberg R, Tang H, Petrov DA. Reduced selection leads to accelerated gene loss in Shigella. Genome biology 2007, 8(8):R164. 10.1186/gb-2007-8-8-r164, 2374995, 17686180.
    • (2007) Genome biology , vol.8 , Issue.8
    • Hershberg, R.1    Tang, H.2    Petrov, D.A.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.