



Volume 330, Issue 1, 2007, Pages 33-48

Primary sequences of proteins from complete genomes display a singular periodicity: Alignment-free N-gram analysis

Author keywords

Alignment free; N gram analysis; Singular periodicity

Indexed keywords

GENOME; PERIODICITY; PROTEIN

EID: 33846300700     PISSN: 16310691     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.crvi.2006.11.001     Document Type: Article
Times cited: 8

References (24)
  • 1
    • 0028911698
    • Gauging similarity via n-grams: text sorting, categorizing and retrieval in any language
    • Damashek M. Gauging similarity via n-grams: text sorting, categorizing and retrieval in any language. Science 267 (1995) 843-848
    • (1995) Science , vol.267 , pp. 843-848
    • Damashek, M.1
  • 2
    • 0022743812
    • A measure of the similarity of sets of sequences not requiring sequence alignment
    • Blaisdell B.E. A measure of the similarity of sets of sequences not requiring sequence alignment. Proc. Natl Acad. Sci. USA 83 (1986) 5155-5159
    • (1986) Proc. Natl Acad. Sci. USA , vol.83 , pp. 5155-5159
    • Blaisdell, B.E.1
  • 3
    • 0024805860
    • Effectiveness of measures requiring and not requiring prior sequence alignment for estimating the dissimilarity of natural sequences
    • Blaisdell B.E. Effectiveness of measures requiring and not requiring prior sequence alignment for estimating the dissimilarity of natural sequences. J. Mol. Evol. 29 (1989) 526-537
    • (1989) J. Mol. Evol. , vol.29 , pp. 526-537
    • Blaisdell, B.E.1
  • 4
    • 0024848121
    • Average values of a dissimilarity measure not requiring sequence alignment are twice the averages of conventional mismatch counts requiring sequence alignment for computer generated system model
    • Blaisdell B.E. Average values of a dissimilarity measure not requiring sequence alignment are twice the averages of conventional mismatch counts requiring sequence alignment for computer generated system model. J. Mol. Evol. 29 (1989) 538-547
    • (1989) J. Mol. Evol. , vol.29 , pp. 538-547
    • Blaisdell, B.E.1
  • 5
    • 0032104588
    • Classification and identification of proteins by means of common and specific amino acid n-tuples in unaligned sequences
    • Daeyaert F., Moereels H., and Lewi P.J. Classification and identification of proteins by means of common and specific amino acid n-tuples in unaligned sequences. Comp. Meth. Progr. Biomed. 56 (1998) 221-233
    • (1998) Comp. Meth. Progr. Biomed. , vol.56 , pp. 221-233
    • Daeyaert, F.1    Moereels, H.2    Lewi, P.J.3
  • 6
    • 0029117163
    • Statistical significance of sequence patterns in proteins
    • Karlin S. Statistical significance of sequence patterns in proteins. Curr. Opin. Struct. Biol. 5 (1995) 360-371
    • (1995) Curr. Opin. Struct. Biol. , vol.5 , pp. 360-371
    • Karlin, S.1
  • 7
    • 0037195172
    • Distributional regimes for the number of k-word matches between two random sequences
    • Lippert R.A., Huang H.Y., and Waterman M.S. Distributional regimes for the number of k-word matches between two random sequences. Proc. Natl Acad. Sci. USA 99 (2002) 13980-13989
    • (2002) Proc. Natl Acad. Sci. USA , vol.99 , pp. 13980-13989
    • Lippert, R.A.1    Huang, H.Y.2    Waterman, M.S.3
  • 8
    • 0036166508
    • Integrated gene and species phylogenies from unaligned whole genome protein sequences
    • Stuart G.W., Moffett K., and Baker S. Integrated gene and species phylogenies from unaligned whole genome protein sequences. Bioinformatics 18 (2002) 100-108
    • (2002) Bioinformatics , vol.18 , pp. 100-108
    • Stuart, G.W.1    Moffett, K.2    Baker, S.3
  • 9
    • 19244371031
    • Vector space classification of DNA sequences
    • Muller H.M., and Koonin S.E. Vector space classification of DNA sequences. J. Theor. Biol. 223 (2003) 161-169
    • (2003) J. Theor. Biol. , vol.223 , pp. 161-169
    • Muller, H.M.1    Koonin, S.E.2
  • 10
    • 0037342499
    • Alignment-free sequence comparison
    • Vinga S., and Almeida J.S. Alignment-free sequence comparison. Bioinformatics 19 (2003) 513-523
    • (2003) Bioinformatics , vol.19 , pp. 513-523
    • Vinga, S.1    Almeida, J.S.2
  • 11
    • 1042269469
    • Comparative evaluation of word composition distances for the recognition of SCOP relationships
    • Vinga S., Gouveia-Oliveira R., and Almeida J.S. Comparative evaluation of word composition distances for the recognition of SCOP relationships. Bioinformatics 20 (2004) 206-215
    • (2004) Bioinformatics , vol.20 , pp. 206-215
    • Vinga, S.1    Gouveia-Oliveira, R.2    Almeida, J.S.3
  • 12
    • 0019887799
    • Identification of common molecular subsequences
    • Smith T.F., and Waterman M.S. Identification of common molecular subsequences. J. Mol. Biol. 147 (1981) 195-197
    • (1981) J. Mol. Biol. , vol.147 , pp. 195-197
    • Smith, T.F.1    Waterman, M.S.2
  • 13
    • 1242320272
    • Local homology recognition and distance measures in linear time using compressed amino acid alphabets
    • Edgar R.C. Local homology recognition and distance measures in linear time using compressed amino acid alphabets. Nucl. Acids Res. 32 (2004) 380-384
    • (2004) Nucl. Acids Res. , vol.32 , pp. 380-384
    • Edgar, R.C.1
  • 14
    • 0035312933
    • Identification of thermophilic species by the amino acid compositions deduced from their genomes
    • Kreil D.P., and Ouzounis C.A. Identification of thermophilic species by the amino acid compositions deduced from their genomes. Nucl. Acids Res. 29 (2001) 1608-1615
    • (2001) Nucl. Acids Res. , vol.29 , pp. 1608-1615
    • Kreil, D.P.1    Ouzounis, C.A.2
  • 15
    • 0034885126
    • Genomic style of proteins: concepts, methods and analysis of ribosomal proteins from 16 microbial species
    • Radomski J.P., and Slonimski P.P. Genomic style of proteins: concepts, methods and analysis of ribosomal proteins from 16 microbial species. FEMS Microbiol. Rev. 25 (2001) 425-435
    • (2001) FEMS Microbiol. Rev. , vol.25 , pp. 425-435
    • Radomski, J.P.1    Slonimski, P.P.2
  • 16
    • 0036606927
    • Evidence for cysteine clustering in thermophilic proteomes
    • Rosato V., Pucello N., and Giuliano G. Evidence for cysteine clustering in thermophilic proteomes. Trends Genet. 18 (2002) 278-281
    • (2002) Trends Genet. , vol.18 , pp. 278-281
    • Rosato, V.1    Pucello, N.2    Giuliano, G.3
  • 17
    • 0014138443
    • Formal analysis of protein sequences. I. Specific long range constraints in pair associations of amino acids
    • Krzywicki A., and Slonimski P.P. Formal analysis of protein sequences. I. Specific long range constraints in pair associations of amino acids. J. Theor. Biol. 17 (1967) 136-158
    • (1967) J. Theor. Biol. , vol.17 , pp. 136-158
    • Krzywicki, A.1    Slonimski, P.P.2
  • 18
    • 0025300402
    • Towards a natural system of organisms: Proposal for the domains Archaea, Bacteria, and Eucarya
    • Woese C.R., Kandler O., and Wheelis M.L. Towards a natural system of organisms: Proposal for the domains Archaea, Bacteria, and Eucarya. Proc. Natl Acad. Sci. USA 87 (1990) 4576-4579
    • (1990) Proc. Natl Acad. Sci. USA , vol.87 , pp. 4576-4579
    • Woese, C.R.1    Kandler, O.2    Wheelis, M.L.3
  • 19
    • 0019065166
    • Hydrophobicity and protein structure
    • Kanehisa M.I., and Tsong T.Y. Hydrophobicity and protein structure. Biopolymers 19 (1980) 1617-1628
    • (1980) Biopolymers , vol.19 , pp. 1617-1628
    • Kanehisa, M.I.1    Tsong, T.Y.2
  • 20
    • 0019887286
    • Periodicity in DNA primary structure is defined by secondary structure of the coded protein
    • Zhurkin V.B. Periodicity in DNA primary structure is defined by secondary structure of the coded protein. Nucl. Acids Res. 9 (1981) 1963-1971
    • (1981) Nucl. Acids Res. , vol.9 , pp. 1963-1971
    • Zhurkin, V.B.1
  • 21
    • 0032891716
    • 10-11-bp periodicities in complete genomes reflect protein structure and DNA folding
    • Herzel H., Weiss O., and Trifonov E.N. 10-11-bp periodicities in complete genomes reflect protein structure and DNA folding. Bioinformatics 15 (1999) 187-193
    • (1999) Bioinformatics , vol.15 , pp. 187-193
    • Herzel, H.1    Weiss, O.2    Trifonov, E.N.3
  • 22
    • 0034141421
    • Structural analysis of DNA sequence: evidence for lateral gene transfer in Thermotoga maritima
    • Worning P., Jensen L.J., Nelson K.E., Brunak S., and Ussery D.W. Structural analysis of DNA sequence: evidence for lateral gene transfer in Thermotoga maritima. Nucl. Acids Res. 28 (2000) 706-709
    • (2000) Nucl. Acids Res. , vol.28 , pp. 706-709
    • Worning, P.1    Jensen, L.J.2    Nelson, K.E.3    Brunak, S.4    Ussery, D.W.5
  • 23
    • 26444614790
    • Sequence periodicity of Escherichia coli is concentrated in intergenic regions
    • Hosid S., Trifonov E.N., and Bolshoy A. Sequence periodicity of Escherichia coli is concentrated in intergenic regions. BMC Mol. Biol. 5 (2004) 14-20
    • (2004) BMC Mol. Biol. , vol.5 , pp. 14-20
    • Hosid, S.1    Trifonov, E.N.2    Bolshoy, A.3
  • 24
    • 33846279418
    • Periodic oscillations of the genomic nucleotide sequences disclose major differences in the way of constructing homologous proteins from different procaryotic species
    • Slonimski P.P. Periodic oscillations of the genomic nucleotide sequences disclose major differences in the way of constructing homologous proteins from different procaryotic species. C. R. Biologies 330 1 (2007)
    • (2007) C. R. Biologies , vol.330 , Issue.1
    • Slonimski, P.P.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.