-
1
-
-
0017693393
-
-
G. E. Fox et al., Proc. Natl. Acad. Sci. U.S.A. 74, 4537 (1977); C. R. Woese and G. E. Fox, ibid., p. 5088; C. R. Woese et al., ibid. 87, 4576 (1990).
-
(1977)
Proc. Natl. Acad. Sci. U.S.A.
, vol.74
, pp. 4537
-
-
Fox, G.E.1
-
3
-
-
0025300402
-
-
G. E. Fox et al., Proc. Natl. Acad. Sci. U.S.A. 74, 4537 (1977); C. R. Woese and G. E. Fox, ibid., p. 5088; C. R. Woese et al., ibid. 87, 4576 (1990).
-
(1990)
Proc. Natl. Acad. Sci. U.S.A.
, vol.87
, pp. 4576
-
-
Woese, C.R.1
-
4
-
-
0024358140
-
-
N. Iwabe et al, ibid. 86, 9355 (1989); J. P. Gogarten et al., ibid., p. 6651; W. Zillig et al., Endocytobiosis Cell Res. 6, 1 (1989); J. R. Brown and W. F. Doolittle, Proc. Natl. Acad. Sci. U.S.A. 92, 2441 (1995).
-
(1989)
Proc. Natl. Acad. Sci. U.S.A.
, vol.86
, pp. 9355
-
-
Iwabe, N.1
-
5
-
-
0024358140
-
-
N. Iwabe et al, ibid. 86, 9355 (1989); J. P. Gogarten et al., ibid., p. 6651; W. Zillig et al., Endocytobiosis Cell Res. 6, 1 (1989); J. R. Brown and W. F. Doolittle, Proc. Natl. Acad. Sci. U.S.A. 92, 2441 (1995).
-
Proc. Natl. Acad. Sci. U.S.A.
, pp. 6651
-
-
Gogarten, J.P.1
-
6
-
-
0024358140
-
-
N. Iwabe et al, ibid. 86, 9355 (1989); J. P. Gogarten et al., ibid., p. 6651; W. Zillig et al., Endocytobiosis Cell Res. 6, 1 (1989); J. R. Brown and W. F. Doolittle, Proc. Natl. Acad. Sci. U.S.A. 92, 2441 (1995).
-
(1989)
Endocytobiosis Cell Res.
, vol.6
, pp. 1
-
-
Zillig, W.1
-
7
-
-
0028941917
-
-
N. Iwabe et al, ibid. 86, 9355 (1989); J. P. Gogarten et al., ibid., p. 6651; W. Zillig et al., Endocytobiosis Cell Res. 6, 1 (1989); J. R. Brown and W. F. Doolittle, Proc. Natl. Acad. Sci. U.S.A. 92, 2441 (1995).
-
(1995)
Proc. Natl. Acad. Sci. U.S.A.
, vol.92
, pp. 2441
-
-
Brown, J.R.1
Doolittle, W.F.2
-
8
-
-
0029653518
-
-
R. D. Fleischmann et al., Science 269, 496 (1995); C. M. Fraser et al., ibid. 270, 397 (1995).
-
(1995)
Science
, vol.269
, pp. 496
-
-
Fleischmann, R.D.1
-
9
-
-
0028829125
-
-
R. D. Fleischmann et al., Science 269, 496 (1995); C. M. Fraser et al., ibid. 270, 397 (1995).
-
(1995)
Science
, vol.270
, pp. 397
-
-
Fraser, C.M.1
-
11
-
-
0029653613
-
-
M. D. Adams et al., Nature 377, 3 (1995); R. Wilson et al., ibid. 368, 32 (1994).
-
(1995)
Nature
, vol.377
, pp. 3
-
-
Adams, M.D.1
-
12
-
-
0028295681
-
-
M. D. Adams et al., Nature 377, 3 (1995); R. Wilson et al., ibid. 368, 32 (1994).
-
(1994)
Nature
, vol.368
, pp. 32
-
-
Wilson, R.1
-
15
-
-
0000241874
-
-
The statistical prediction of M. jannaschii genes was performed with GeneMark [M. Borodovsky and J. McIninch, Comput. Chem. 17, 123 (1993)]. Regular GeneMark uses nonhomogeneous Markov models derived from a training set of coding sequences and ordinary Markov models derived from a training set of noncoding sequences. Only a single 16S ribosomal RNA sequence of M. Jannaschii was available in the public sequence databases before the whole genome sequence described here. Thus, the initial training set to determine parameters of a coding sequence Markov model was chosen as a set of ORFs >1000 nucleotides (nt). As an initial model for noncoding sequences, a zero-order Markov model with genome-specific nucleotide frequencies was used. The initial models were used at the first prediction step. The results of the first prediction were then used to compile a set of putative genes used at the second training step. Alternate rounds of training aria predicting were continued until the set of predicted genes stabilized and the parameters of the final fourth-order model of coding sequences were derived. The regions predicted as noncoding were then used as a training set for a final model for noncoding regions. Cross-validation simulations demonstrated that the GeneMark program trained as described above was able to correctly identify coding regions of at least 96 nt in 94% of the cases and noncoding regions of the same length in 96% of the cases.: These values assume that the self-training method produced correct sequence annotation for complied control sets. Comparison with the results obtained by searches against a nonredundant protein database (3) demonstrated that almost all genes identified by sequence similarity were predicted by the GeneMark program as well. This observation provides additional confidence in genes predicted by GeneMark whose protein translations did not snow significant similarity to known protein sequences. The predicted protein-coding regions were searched against the Blocks database [S. Henikoff and J. G. Henikoff, Genomics 19, 97 (1994)) by means of BLIMPS [J. C. Wallace and S. Henikoff, Comput. Appl. Biosci. 8, 249 (1992)] to verify putative identifications and to identify potential functional motifs in predicted protein-coding regions that had no database match. Genes were assigned to known metabolic pathways. When a gene appeared to be missing from a pathway, the unassigned ORFs and the complete M. jannaschii genome sequence were searched with specific query sequences or motifs from the Blocks database. Hydrophobicity plots were performed on all predicted protein-coding regions by means of the Kyte-Doolittle algorithm [J. Kyte and R. F. Doolittte, J. Mol. Biol. 157, 105 (1982)] to identify potentially functionally relevant signatures in these sequences. The results of the Blocks and Kyte-Doolittle analyses are available on the World Wide Web (http:/ /www.tigr.org/tdo/mdb/mjdb/mjdb.html).
-
(1993)
Comput. Chem.
, vol.17
, pp. 123
-
-
Borodovsky, M.1
McIninch, J.2
-
16
-
-
0028118434
-
-
The statistical prediction of M. jannaschii genes was performed with GeneMark [M. Borodovsky and J. McIninch, Comput. Chem. 17, 123 (1993)]. Regular GeneMark uses nonhomogeneous Markov models derived from a training set of coding sequences and ordinary Markov models derived from a training set of noncoding sequences. Only a single 16S ribosomal RNA sequence of M. Jannaschii was available in the public sequence databases before the whole genome sequence described here. Thus, the initial training set to determine parameters of a coding sequence Markov model was chosen as a set of ORFs >1000 nucleotides (nt). As an initial model for noncoding sequences, a zero-order Markov model with genome-specific nucleotide frequencies was used. The initial models were used at the first prediction step. The results of the first prediction were then used to compile a set of putative genes used at the second training step. Alternate rounds of training aria predicting were continued until the set of predicted genes stabilized and the parameters of the final fourth-order model of coding sequences were derived. The regions predicted as noncoding were then used as a training set for a final model for noncoding regions. Cross-validation simulations demonstrated that the GeneMark program trained as described above was able to correctly identify coding regions of at least 96 nt in 94% of the cases and noncoding regions of the same length in 96% of the cases.: These values assume that the self-training method produced correct sequence annotation for complied control sets. Comparison with the results obtained by searches against a nonredundant protein database (3) demonstrated that almost all genes identified by sequence similarity were predicted by the GeneMark program as well. This observation provides additional confidence in genes predicted by GeneMark whose protein translations did not snow significant similarity to known protein sequences. The predicted protein-coding regions were searched against the Blocks database [S. Henikoff and J. G. Henikoff, Genomics 19, 97 (1994)) by means of BLIMPS [J. C. Wallace and S. Henikoff, Comput. Appl. Biosci. 8, 249 (1992)] to verify putative identifications and to identify potential functional motifs in predicted protein-coding regions that had no database match. Genes were assigned to known metabolic pathways. When a gene appeared to be missing from a pathway, the unassigned ORFs and the complete M. jannaschii genome sequence were searched with specific query sequences or motifs from the Blocks database. Hydrophobicity plots were performed on all predicted protein-coding regions by means of the Kyte-Doolittle algorithm [J. Kyte and R. F. Doolittte, J. Mol. Biol. 157, 105 (1982)] to identify potentially functionally relevant signatures in these sequences. The results of the Blocks and Kyte-Doolittle analyses are available on the World Wide Web (http:/ /www.tigr.org/tdo/mdb/mjdb/mjdb.html).
-
(1994)
Genomics
, vol.19
, pp. 97
-
-
Henikoff, S.1
Henikoff, J.G.2
-
17
-
-
0026747032
-
-
The statistical prediction of M. jannaschii genes was performed with GeneMark [M. Borodovsky and J. McIninch, Comput. Chem. 17, 123 (1993)]. Regular GeneMark uses nonhomogeneous Markov models derived from a training set of coding sequences and ordinary Markov models derived from a training set of noncoding sequences. Only a single 16S ribosomal RNA sequence of M. Jannaschii was available in the public sequence databases before the whole genome sequence described here. Thus, the initial training set to determine parameters of a coding sequence Markov model was chosen as a set of ORFs >1000 nucleotides (nt). As an initial model for noncoding sequences, a zero-order Markov model with genome-specific nucleotide frequencies was used. The initial models were used at the first prediction step. The results of the first prediction were then used to compile a set of putative genes used at the second training step. Alternate rounds of training aria predicting were continued until the set of predicted genes stabilized and the parameters of the final fourth-order model of coding sequences were derived. The regions predicted as noncoding were then used as a training set for a final model for noncoding regions. Cross-validation simulations demonstrated that the GeneMark program trained as described above was able to correctly identify coding regions of at least 96 nt in 94% of the cases and noncoding regions of the same length in 96% of the cases.: These values assume that the self-training method produced correct sequence annotation for complied control sets. Comparison with the results obtained by searches against a nonredundant protein database (3) demonstrated that almost all genes identified by sequence similarity were predicted by the GeneMark program as well. This observation provides additional confidence in genes predicted by GeneMark whose protein translations did not snow significant similarity to known protein sequences. The predicted protein-coding regions were searched against the Blocks database [S. Henikoff and J. G. Henikoff, Genomics 19, 97 (1994)) by means of BLIMPS [J. C. Wallace and S. Henikoff, Comput. Appl. Biosci. 8, 249 (1992)] to verify putative identifications and to identify potential functional motifs in predicted protein-coding regions that had no database match. Genes were assigned to known metabolic pathways. When a gene appeared to be missing from a pathway, the unassigned ORFs and the complete M. jannaschii genome sequence were searched with specific query sequences or motifs from the Blocks database. Hydrophobicity plots were performed on all predicted protein-coding regions by means of the Kyte-Doolittle algorithm [J. Kyte and R. F. Doolittte, J. Mol. Biol. 157, 105 (1982)] to identify potentially functionally relevant signatures in these sequences. The results of the Blocks and Kyte-Doolittle analyses are available on the World Wide Web (http:/ /www.tigr.org/tdo/mdb/mjdb/mjdb.html).
-
(1992)
Comput. Appl. Biosci.
, vol.8
, pp. 249
-
-
Wallace, J.C.1
Henikoff, S.2
-
18
-
-
0020475449
-
-
The statistical prediction of M. jannaschii genes was performed with GeneMark [M. Borodovsky and J. McIninch, Comput. Chem. 17, 123 (1993)]. Regular GeneMark uses nonhomogeneous Markov models derived from a training set of coding sequences and ordinary Markov models derived from a training set of noncoding sequences. Only a single 16S ribosomal RNA sequence of M. Jannaschii was available in the public sequence databases before the whole genome sequence described here. Thus, the initial training set to determine parameters of a coding sequence Markov model was chosen as a set of ORFs >1000 nucleotides (nt). As an initial model for noncoding sequences, a zero-order Markov model with genome-specific nucleotide frequencies was used. The initial models were used at the first prediction step. The results of the first prediction were then used to compile a set of putative genes used at the second training step. Alternate rounds of training aria predicting were continued until the set of predicted genes stabilized and the parameters of the final fourth-order model of coding sequences were derived. The regions predicted as noncoding were then used as a training set for a final model for noncoding regions. Cross-validation simulations demonstrated that the GeneMark program trained as described above was able to correctly identify coding regions of at least 96 nt in 94% of the cases and noncoding regions of the same length in 96% of the cases.: These values assume that the self-training method produced correct sequence annotation for complied control sets. Comparison with the results obtained by searches against a nonredundant protein database (3) demonstrated that almost all genes identified by sequence similarity were predicted by the GeneMark program as well. This observation provides additional confidence in genes predicted by GeneMark whose protein translations did not snow significant similarity to known protein sequences. The predicted protein-coding regions were searched against the Blocks database [S. Henikoff and J. G. Henikoff, Genomics 19, 97 (1994)) by means of BLIMPS [J. C. Wallace and S. Henikoff, Comput. Appl. Biosci. 8, 249 (1992)] to verify putative identifications and to identify potential functional motifs in predicted protein-coding regions that had no database match. Genes were assigned to known metabolic pathways. When a gene appeared to be missing from a pathway, the unassigned ORFs and the complete M. jannaschii genome sequence were searched with specific query sequences or motifs from the Blocks database. Hydrophobicity plots were performed on all predicted protein-coding regions by means of the Kyte-Doolittle algorithm [J. Kyte and R. F. Doolittte, J. Mol. Biol. 157, 105 (1982)] to identify potentially functionally relevant signatures in these sequences. The results of the Blocks and Kyte-Doolittle analyses are available on the World Wide Web (http:/ /www.tigr.org/tdo/mdb/mjdb/mjdb.html).
-
(1982)
J. Mol. Biol.
, vol.157
, pp. 105
-
-
Kyte, J.1
Doolittte, R.F.2
-
21
-
-
0021714067
-
-
N. Belay et al., Nature 312, 286 (1984).
-
(1984)
Nature
, vol.312
, pp. 286
-
-
Belay, N.1
-
26
-
-
0024321368
-
-
K. Lechner et al., J. Mol. Evol. 29, 20 (1989); A. K. E. Köpke and B. Wittmann-Liebold. Can. J. Microbiol. 35,11 (1989).
-
(1989)
J. Mol. Evol.
, vol.29
, pp. 20
-
-
Lechner, K.1
-
29
-
-
0014633039
-
-
M. Wilcox, Eur. J. Biochem. 11, 405 (1969); N. C. Martin et al., J. Mol. Biol. 101, 285 (1976); N. C. Martin et al., Biochemistry 16, 4672 (1977); A. Schon et al., Biochimie 70, 391 (1988); D. Soll and U. Raj Bhandary, Eds. tRNA: Structure, Biosynthesis, and Function (American Society for Microbiology, Washington, DC, 1995).
-
(1969)
Eur. J. Biochem.
, vol.11
, pp. 405
-
-
Wilcox, M.1
-
30
-
-
0017232902
-
-
M. Wilcox, Eur. J. Biochem. 11, 405 (1969); N. C. Martin et al., J. Mol. Biol. 101, 285 (1976); N. C. Martin et al., Biochemistry 16, 4672 (1977); A. Schon et al., Biochimie 70, 391 (1988); D. Soll and U. Raj Bhandary, Eds. tRNA: Structure, Biosynthesis, and Function (American Society for Microbiology, Washington, DC, 1995).
-
(1976)
J. Mol. Biol.
, vol.101
, pp. 285
-
-
Martin, N.C.1
-
31
-
-
0017661126
-
-
M. Wilcox, Eur. J. Biochem. 11, 405 (1969); N. C. Martin et al., J. Mol. Biol. 101, 285 (1976); N. C. Martin et al., Biochemistry 16, 4672 (1977); A. Schon et al., Biochimie 70, 391 (1988); D. Soll and U. Raj Bhandary, Eds. tRNA: Structure, Biosynthesis, and Function (American Society for Microbiology, Washington, DC, 1995).
-
(1977)
Biochemistry
, vol.16
, pp. 4672
-
-
Martin, N.C.1
-
32
-
-
0023948279
-
-
M. Wilcox, Eur. J. Biochem. 11, 405 (1969); N. C. Martin et al., J. Mol. Biol. 101, 285 (1976); N. C. Martin et al., Biochemistry 16, 4672 (1977); A. Schon et al., Biochimie 70, 391 (1988); D. Soll and U. Raj Bhandary, Eds. tRNA: Structure, Biosynthesis, and Function (American Society for Microbiology, Washington, DC, 1995).
-
(1988)
Biochimie
, vol.70
, pp. 391
-
-
Schon, A.1
-
33
-
-
0014633039
-
-
American Society for Microbiology, Washington, DC
-
M. Wilcox, Eur. J. Biochem. 11, 405 (1969); N. C. Martin et al., J. Mol. Biol. 101, 285 (1976); N. C. Martin et al., Biochemistry 16, 4672 (1977); A. Schon et al., Biochimie 70, 391 (1988); D. Soll and U. Raj Bhandary, Eds. tRNA: Structure, Biosynthesis, and Function (American Society for Microbiology, Washington, DC, 1995).
-
(1995)
tRNA: Structure, Biosynthesis, and Function
-
-
Soll, D.1
Raj Bhandary, U.2
-
35
-
-
16044366148
-
-
E.A. Wagner et al., J. Bacteriol 177, 5179 (1995); D. T. Logan et al., EMBO J. 14, 4156 (1995).
-
(1995)
J. Bacteriol
, vol.177
, pp. 5179
-
-
Wagner, E.A.1
-
36
-
-
0029091055
-
-
E.A. Wagner et al., J. Bacteriol 177, 5179 (1995); D. T. Logan et al., EMBO J. 14, 4156 (1995).
-
(1995)
EMBO J.
, vol.14
, pp. 4156
-
-
Logan, D.T.1
-
37
-
-
0011919685
-
-
Academic Press, New York
-
C. R. Woese and R. S. Wolfe, Eds. The Bacteria (Academic Press, New York, 1985), vol. 8; D. Langer et al., Proc. Natl. Acad Sci. U.S.A. 92, 5768 (1995); M. Lanzendoerfer et al., Syst. Appl. Microbiol. 16, 656 (1994).
-
(1985)
The Bacteria
, vol.8
-
-
Woese, C.R.1
Wolfe, R.S.2
-
38
-
-
0028983409
-
-
C. R. Woese and R. S. Wolfe, Eds. The Bacteria (Academic Press, New York, 1985), vol. 8; D. Langer et al., Proc. Natl. Acad Sci. U.S.A. 92, 5768 (1995); M. Lanzendoerfer et al., Syst. Appl. Microbiol. 16, 656 (1994).
-
(1995)
Proc. Natl. Acad Sci. U.S.A.
, vol.92
, pp. 5768
-
-
Langer, D.1
-
39
-
-
0028295933
-
-
C. R. Woese and R. S. Wolfe, Eds. The Bacteria (Academic Press, New York, 1985), vol. 8; D. Langer et al., Proc. Natl. Acad Sci. U.S.A. 92, 5768 (1995); M. Lanzendoerfer et al., Syst. Appl. Microbiol. 16, 656 (1994).
-
(1994)
Syst. Appl. Microbiol.
, vol.16
, pp. 656
-
-
Lanzendoerfer, M.1
-
41
-
-
0023661346
-
-
A. Bernard et al., EMBO J. 6, 4219 (1987); G. Cullman et al., Mol. Cell, Biol. 15, 4661 (1995); T. Uemori et al, J. Bacteriol. 177, 2164 (1995); M. Delarue et al., Protein Eng. 3, 461 (1990); K. A. Gavin, M. Hidaka, B. Stillman, Science 270, 1667 (1995).
-
(1987)
EMBO J.
, vol.6
, pp. 4219
-
-
Bernard, A.1
-
42
-
-
0029150469
-
-
A. Bernard et al., EMBO J. 6, 4219 (1987); G. Cullman et al., Mol. Cell, Biol. 15, 4661 (1995); T. Uemori et al, J. Bacteriol. 177, 2164 (1995); M. Delarue et al., Protein Eng. 3, 461 (1990); K. A. Gavin, M. Hidaka, B. Stillman, Science 270, 1667 (1995).
-
(1995)
Mol. Cell. Biol.
, vol.15
, pp. 4661
-
-
Cullman, G.1
-
43
-
-
0028912012
-
-
A. Bernard et al., EMBO J. 6, 4219 (1987); G. Cullman et al., Mol. Cell, Biol. 15, 4661 (1995); T. Uemori et al, J. Bacteriol. 177, 2164 (1995); M. Delarue et al., Protein Eng. 3, 461 (1990); K. A. Gavin, M. Hidaka, B. Stillman, Science 270, 1667 (1995).
-
(1995)
J. Bacteriol.
, vol.177
, pp. 2164
-
-
Uemori, T.1
-
44
-
-
0025040632
-
-
A. Bernard et al., EMBO J. 6, 4219 (1987); G. Cullman et al., Mol. Cell, Biol. 15, 4661 (1995); T. Uemori et al, J. Bacteriol. 177, 2164 (1995); M. Delarue et al., Protein Eng. 3, 461 (1990); K. A. Gavin, M. Hidaka, B. Stillman, Science 270, 1667 (1995).
-
(1990)
Protein Eng.
, vol.3
, pp. 461
-
-
Delarue, M.1
-
45
-
-
0029586090
-
-
A. Bernard et al., EMBO J. 6, 4219 (1987); G. Cullman et al., Mol. Cell, Biol. 15, 4661 (1995); T. Uemori et al, J. Bacteriol. 177, 2164 (1995); M. Delarue et al., Protein Eng. 3, 461 (1990); K. A. Gavin, M. Hidaka, B. Stillman, Science 270, 1667 (1995).
-
(1995)
Science
, vol.270
, pp. 1667
-
-
Gavin, K.A.1
Hidaka, M.2
Stillman, B.3
-
48
-
-
0030065617
-
-
L. Rothfield and C.-R Zhao, Cell 84, 183 (1996); J. Lutkenhaus, Curr. Opin. Genet. Dev. 3, 783 (1993).
-
(1996)
Cell
, vol.84
, pp. 183
-
-
Rothfield, L.1
Zhao, C.-R.2
-
51
-
-
0024720303
-
-
B.P. Kaine and V.L. Merkel, J. Bacteriol. 171, 4261 (1989); M. A. Porltz et al, Cell 55, 4 (1988).
-
(1988)
Cell
, vol.55
, pp. 4
-
-
Porltz, M.A.1
-
52
-
-
0028182415
-
-
D. M. Faguy et al., Can. J. Microbiol. 40, 67 (1994): M. L. Kalmokoff et al., Arch. Microbiol. 157, 481 (1992).
-
(1994)
Can. J. Microbiol.
, vol.40
, pp. 67
-
-
Faguy, D.M.1
-
53
-
-
0026507201
-
-
D. M. Faguy et al., Can. J. Microbiol. 40, 67 (1994): M. L. Kalmokoff et al., Arch. Microbiol. 157, 481 (1992).
-
(1992)
Arch. Microbiol.
, vol.157
, pp. 481
-
-
Kalmokoff, M.L.1
-
55
-
-
0025226104
-
-
2 terminal extein-intein junction, so it was necessary to rely on consensus sequences to select the putative site. The inteins in MJ1042 and MJ0542 have previously uncharacterized COOH-terminal splice junctions, GNC and FNC, respectively.
-
(1990)
Science
, vol.250
, pp. 651
-
-
Kane, P.M.1
-
56
-
-
0025240361
-
-
2 terminal extein-intein junction, so it was necessary to rely on consensus sequences to select the putative site. The inteins in MJ1042 and MJ0542 have previously uncharacterized COOH-terminal splice junctions, GNC and FNC, respectively.
-
(1990)
J. Biol. Chem.
, vol.265
, pp. 6726
-
-
Hirata, R.1
-
57
-
-
0029116947
-
-
2 terminal extein-intein junction, so it was necessary to rely on consensus sequences to select the putative site. The inteins in MJ1042 and MJ0542 have previously uncharacterized COOH-terminal splice junctions, GNC and FNC, respectively.
-
(1995)
Trends Biochem. Sci.
, vol.20
, pp. 351
-
-
Cooper, A.A.1
Stevens, T.2
-
58
-
-
0027753790
-
-
2 terminal extein-intein junction, so it was necessary to rely on consensus sequences to select the putative site. The inteins in MJ1042 and MJ0542 have previously uncharacterized COOH-terminal splice junctions, GNC and FNC, respectively.
-
(1993)
Cell
, vol.75
, pp. 1371
-
-
Xu, M.-Q.1
-
59
-
-
0026718843
-
-
2 terminal extein-intein junction, so it was necessary to rely on consensus sequences to select the putative site. The inteins in MJ1042 and MJ0542 have previously uncharacterized COOH-terminal splice junctions, GNC and FNC, respectively.
-
(1992)
Proc. Natl. Acad. Sci. U.S.A.
, vol.89
, pp. 5577
-
-
Perler, F.1
-
60
-
-
0027315426
-
-
2 terminal extein-intein junction, so it was necessary to rely on consensus sequences to select the putative site. The inteins in MJ1042 and MJ0542 have previously uncharacterized COOH-terminal splice junctions, GNC and FNC, respectively.
-
(1993)
EMBO J
, vol.12
, pp. 2575
-
-
Cooper1
-
61
-
-
0020198881
-
-
2 terminal extein-intein junction, so it was necessary to rely on consensus sequences to select the putative site. The inteins in MJ1042 and MJ0542 have previously uncharacterized COOH-terminal splice junctions, GNC and FNC, respectively.
-
(1982)
Biochimie
, vol.64
, pp. 867
-
-
Michel, F.1
-
62
-
-
0028607524
-
-
2 terminal extein-intein junction, so it was necessary to rely on consensus sequences to select the putative site. The inteins in MJ1042 and MJ0542 have previously uncharacterized COOH-terminal splice junctions, GNC and FNC, respectively.
-
(1994)
Protein Sci.
, vol.3
, pp. 2340
-
-
Pietrokovski, S.1
-
65
-
-
33947470407
-
-
G. Felsenfeld et al., J. Am. Chem. Soc. 79, 2023 (1957); A. G. Letai et al., Biochemistry 27, 9108 (1988).
-
(1957)
J. Am. Chem. Soc.
, vol.79
, pp. 2023
-
-
Felsenfeld, G.1
-
66
-
-
0024298583
-
-
G. Felsenfeld et al., J. Am. Chem. Soc. 79, 2023 (1957); A. G. Letai et al., Biochemistry 27, 9108 (1988).
-
(1988)
Biochemistry
, vol.27
, pp. 9108
-
-
Letai, A.G.1
-
68
-
-
16044364281
-
-
note
-
Supported in part by Department of Energy Cooperative Agreements DE-FC02-95ER61962 (J.C.V.) and DEFC02-95ER61963 (C.R.W. and G.J.O), NASA grant NAGW 2554 (C.R.W,), and a core grant to TIGR from Human Genome Sciences. G.J.O. is the recipient of the National Science Foundation Presidential Young Investigator Award (DIR 89-57026). M.B. is supported by National Institutes of Health grant GM00783. We thank M. Heaney, C. Gnehm, R. Shirley, J. Slagel, and W. Hayes for software and database support; T. Dixon and V. Sapiro for computer system support; K. Hong and B. Stader for laboratory assistance; and B. Mukhopadhyay for helpful discussions. The M. jannaschii source accession number is DSM 2661. and the cells were a gift from P. Haney (Department of Microbiology, University of Illinois).
-
-
-
|