Volume 285, Issue 5428, 1999, Pages 751-753

Detecting protein function and protein-protein interactions from genome sequences

Author keywords

[No Author keywords available]

Indexed keywords

ANALYTIC METHOD; ARTICLE; DATA BASE; ESCHERICHIA COLI; GENE SEQUENCE; NONHUMAN; PHYLOGENY; PREDICTION; PRIORITY JOURNAL; PROTEIN PROTEIN INTERACTION; SEQUENCE HOMOLOGY; YEAST;

EID: 0033618555     PISSN: 00368075     EISSN: None     Source Type: Journal    
DOI: 10.1126/science.285.5428.751     Document Type: Article
Times cited : (1397)

References (29)
  • 1
    • 0003880161
    • Garland, New York, ed. 3
    • B. Alberts et al., Molecular Biology of the Cell (Garland, New York, ed. 3, 1994); H. Lodish et al., Molecular Cell Biology (Scientific American Books, New York, ed. 3, 1995).
    • (1994) Molecular Biology of the Cell
    • Alberts, B.1
  • 2
    • 0003422388
    • Scientific American Books, New York, ed. 3
    • B. Alberts et al., Molecular Biology of the Cell (Garland, New York, ed. 3, 1994); H. Lodish et al., Molecular Cell Biology (Scientific American Books, New York, ed. 3, 1995).
    • (1995) Molecular Cell Biology
    • Lodish, H.1
  • 5
    • 15444350252
    • F. R. Blattner et al., Science 277, 1453 (1997).
    • (1997) Science , vol.277 , pp. 1453
    • Blattner, F.R.1
  • 6
    • 0345583058
    • The triplets of proteins are found with the aid of protein domain databases such as the ProDom or Pfam databases (17). Here, a list of all ProDom domains in every one of the 64,568 SWISS-PROT proteins was prepared, as well as a list of all proteins that contain each of the 53,597 ProDom domains. Then every protein in ProDom was considered for its ability to be a linking (or Rosetta Stone) member in a triplet. All pairs of domains that are both members of a given protein P were defined as being linked by protein P, if we could find at least one protein with only one of the two domains. By this method, we found 14,899 links between the 7843 ProDom domains. Then in a single genome (such as E. coli), we found all nonhomologous pairs of proteins containing linked domains. These pairs are linked by the Rosetta Stone proteins. For E. coli, this method finds 3531 protein pairs. An alternate method for discovering protein triplets uses amino acid sequence alignment techniques to find two proteins that align to a Rosetta Stone protein such that the alignments do not overlap on the Rosetta Stone protein. For E. coli, this method finds 4487 protein pairs, 1209 of which were also found by the ProDom search method (even though different sequence databases were searched for each method). All predictions are available on the World Wide Web at www.doe-mbi.ucla.edu.
  • 7
    • 0345151209
    • note
    • Two amino acid sequences are said to be similar when the sequences align with a statistically significant alignment score. The significance is described by the probability of obtaining a higher alignment score when comparing shuffled sequences, with the acceptable probability threshold set by considering the total number of sequence comparisons performed. That is, if n proteins in E. coli are compared with m proteins in other genomes, n × m total comparisons are performed. We set a probability of 1/(n × m) as the threshold because this is the lowest value that could be obtained by comparing n × m random sequences. For the ProDom-based identification of homologs, definitions of sequence similarity are as in the ProDom database.
  • 8
    • 0345150848
    • The SWISS-PROT database is available at www.expasy.ch/sprot/.
  • 9
    • 0345151206
    • The Database of Interacting Proteins is available on the Web at http://www.doe-mbi.ucla.edu.
  • 11
    • 0024511257
    • H. P. Erickson, J. Mol. Biol. 206, 465 (1989); A. D. Nagi and L. Regan, Folding Design 2, 67 (1997).
    • (1989) J. Mol. Biol. , vol.206 , pp. 465
    • Erickson, H.P.1
  • 12
    • 0030623398
    • H. P. Erickson, J. Mol. Biol. 206, 465 (1989); A. D. Nagi and L. Regan, Folding Design 2, 67 (1997).
    • (1997) Folding Design , vol.2 , pp. 67
    • Nagi, A.D.1    Regan, L.2
  • 16
    • 0028853544
    • N. Horton and M. Lewis, Protein Sci. 1, 169 (1992); J. Janin, Biochimie 77, 497 (1995).
    • (1995) Biochimie , vol.77 , pp. 497
    • Janin, J.1
  • 18
    • 0031025991
    • W. Xu, S. C. Harrison, M. J. Eck, Nature 385, 595 (1997); F. Sicheri, I. Moarefi, J. Kuriyan, ibid., p. 602.
    • (1997) Nature , vol.385 , pp. 595
    • Xu, W.1    Harrison, S.C.2    Eck, M.J.3
  • 20
    • 0345151196
    • note
    • The error in predicting protein-protein interactions due to the inability to distinguish homologs was estimated as 1-T, where T is the mean percentage of potential true positives calculated for all domain pairs in E. coli. For each domain pair linked by a Rosetta Stone protein, there are n proteins with the first domain but not the second and m proteins with the second domain but not the first. The percentage of true positives T is therefore estimated as the smaller of n or m divided by n times m.
  • 22
    • 0032952229
    • F. Corpet, J. Gouzy, D. Kahn, Nucleic Acids Res. 26, 323 (1998); A. Bateman et al., ibid. 27, 260 (1999).
    • (1999) Nucleic Acids Res. , vol.27 , pp. 260
    • Bateman, A.1
  • 24
    • 0019405352
    • A. Sugino, N. P. Higgins, N. R. Cozzarelli, ibid. 8, 3865 (1980); W. K. Yeh and L. N. Ornston, J. Biol. Chem. 256, 1565 (1981); C. S. McHenry and W. Crow, ibid. 254, 1748 (1979).
    • (1981) J. Biol. Chem. , vol.256 , pp. 1565
    • Yeh, W.K.1    Ornston, L.N.2
  • 25
    • 0018786252
    • A. Sugino, N. P. Higgins, N. R. Cozzarelli, ibid. 8, 3865 (1980); W. K. Yeh and L. N. Ornston, J. Biol. Chem. 256, 1565 (1981); C. S. McHenry and W. Crow, ibid. 254, 1748 (1979).
    • (1979) J. Biol. Chem. , vol.254 , pp. 1748
    • McHenry, C.S.1    Crow, W.2
  • 26
    • 0019443447
    • Table II
    • See Table II of J. S. Richardson, Adv. Protein Chem. 34, 167 (1981). Note also that eukaryotic genes, in contrast to prokaryotic genes, often code for multidomain proteins [W. J. Netzer and F. U. Hartl, Nature 388, 343 (1997)].
    • (1981) Adv. Protein Chem. , vol.34 , pp. 167
    • Richardson, J.S.1
  • 27
    • 0030844281
    • See Table II of J. S. Richardson, Adv. Protein Chem. 34, 167 (1981). Note also that eukaryotic genes, in contrast to prokaryotic genes, often code for multidomain proteins [W. J. Netzer and F. U. Hartl, Nature 388, 343 (1997)].
    • (1997) Nature , vol.388 , pp. 343
    • Netzer, W.J.1    Hartl, F.U.2
  • 29
    • 0344288906
    • note
    • Supported by the following grants: Department of Energy (DOE) DE-FC03-87ER-60615, NIH PO1 GM 31299, and NSF MCB 94 20769. E. M. was supported by a DOE Hollaender fellowship. We thank M. K. Baron for her work with the Database of Interacting Proteins.
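
The domain-based search described in note 6 can be illustrated with a minimal Python sketch. It is not the authors' code: the function names, the toy protein and domain identifiers, and the rule that two proteins sharing any domain count as homologous are all assumptions made for illustration.

```python
from itertools import combinations


def find_domain_links(protein_domains):
    """Map each linked domain pair to the proteins that link it.

    protein_domains: dict of protein name -> set of domain names
    (hypothetical input format). Two domains found together in a
    protein P are linked by P if at least one protein carries
    exactly one of the two domains.
    """
    links = {}
    for protein, domains in protein_domains.items():
        for d1, d2 in combinations(sorted(domains), 2):
            pair = frozenset((d1, d2))
            separated = any(
                len(pair & other) == 1 for other in protein_domains.values()
            )
            if separated:
                links.setdefault(pair, []).append(protein)
    return links


def predict_pairs(genome_domains, links):
    """Return protein pairs in one genome that contain linked domains."""
    pairs = set()
    for a, b in combinations(sorted(genome_domains), 2):
        da, db = genome_domains[a], genome_domains[b]
        if da & db:
            # Assumption: a shared domain means the pair is homologous.
            continue
        if any(frozenset((x, y)) in links for x in da for y in db):
            pairs.add((a, b))
    return pairs


# Toy data (hypothetical names, not taken from ProDom or SWISS-PROT).
database = {
    "fusion": {"dom1", "dom2"},
    "protA": {"dom1"},
    "protB": {"dom2"},
    "protC": {"dom3"},
}
genome = {name: doms for name, doms in database.items() if name != "fusion"}

links = find_domain_links(database)
pairs = predict_pairs(genome, links)
print(pairs)
```

In this toy case the single fused protein plays the Rosetta Stone role for the two proteins that each carry one of its domains.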

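The significance threshold in note 7 is simple arithmetic, sketched below in Python. The function names are hypothetical, and treating a probability equal to the threshold as significant is an assumption, since the note does not say whether the comparison is strict.

```python
def significance_threshold(n, m):
    """Threshold 1/(n * m) for n proteins compared against m proteins."""
    if n <= 0 or m <= 0:
        raise ValueError("n and m must be positive")
    return 1.0 / (n * m)


def is_similar(probability, n, m):
    """True if the alignment probability passes the threshold.

    Assumption: a probability equal to the threshold counts as significant.
    """
    return probability <= significance_threshold(n, m)


# Illustrative numbers only, not taken from the paper.
threshold = significance_threshold(10, 100)
print(threshold, is_similar(1e-6, 10, 100), is_similar(0.01, 10, 100))
```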

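The error estimate in note 20 can be worked through in the same way. The sketch below assumes the per-pair counts n and m are already known; the function names and example counts are hypothetical. Note that the smaller of n and m divided by n times m equals 1 divided by the larger of the two.

```python
def true_positive_fraction(n, m):
    """Estimated fraction of true positives for one linked domain pair.

    n: proteins with the first domain but not the second
    m: proteins with the second domain but not the first
    """
    if n <= 0 or m <= 0:
        raise ValueError("n and m must be positive")
    return min(n, m) / (n * m)


def homolog_error(counts):
    """Return 1 - T, where T is the mean true-positive fraction."""
    fractions = [true_positive_fraction(n, m) for n, m in counts]
    return 1.0 - sum(fractions) / len(fractions)


# Illustrative (n, m) counts only, not taken from the paper.
example_counts = [(1, 1), (2, 4), (3, 3)]
error = homolog_error(example_counts)
print(round(error, 4))
```

A pair with n = m = 1 contributes no ambiguity, while larger protein families lower T and raise the estimated error.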
* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.