We choose our synthetases based on a good total score and a good ligand score. We checked the corresponding PDB-files, and rated the ligand and the binding pocket
as satisfying, so that the ligand assumedly does not collide with residues in the near environment.
The total scores for CBT are not as good as the scores for NPA. However, the ligand scores are acceptable in both cases. A visual evaluation confirms that the ligand
fits into the binding pocket.
Our results for this step
We used this algorithm to simulate the evolution of the tyrosyl-tRNA with the amino acids Nitrophenylalanine(NPA) and N
γ‑2‑cyanobenzothiazol‑6‑yl‑L‑asparagine (CBT-asparagine).
NPA simulation:
The .cst-file contained two blocks for the nitrogroup of NPA. Since there are two oxygen-atoms in the nitrogroup,
we defined two atom nametags. As several possibilities are useful, we defined two possible constraint partners
for the hydrogen bonds. The first is asparagine (N) or glutamine (Q) and the second is glycine (G). We set the
possible distance to 2.8 Å, as it is the optimal distance for hydrogenbonds, and a tolerance level of 0.5 Å.
We set the angles to 120° with a tolerance of 40°, as recommended by Florian Richter during our discussion in cologne.
The torsion angles were set to 180° with a tolerance of 180° and a penalty of 0, such that the torsion angles can rotate
completely freely.(Richter, unpublished data)
CBT-Asparagine simulation:
CBT-Asparagine can build hydrogen bonds in two ways. The first is a weak hydrogen bond on the
sulfur atom and the other possibility is a normal hydrogen bond on the nitrogen (N
2)
after the Cγ. We wrote three cst-files, one for a possible bond with sulfur, one for a
possible bond with nitrogen, and one for both bonds. As possible corresponding amino acids, we chose serine,
threonine, tyrosine, asparagine, glutamine, and glycine.
It is recommended to write a “.flags”-file, because there are several input parameters to be defined,
but it is also possible to define them via console user interface.
For the categorization of the scaffold, we chose the automatic determination and set the following cuts: cut1: 6 Å, cut2: 8 Å, cut3: 10 Å and cut4: 12 Å, like the Baker-lab commonly used.
We used this algrithm to simulate the evolution of the tyrosyl-tRNA with the amino acids Nitrophenylalanine and CBT-ASP
We obtained 13 synthetase sequences for CBT-ASP, and 43 sequences for NPA, which fit well into the binding site according to the ROSETTA score.
The sequences for the best synthetases for NPA is available
here
Sequence Number |
Total Score |
Ligand Score |
ncAA |
15 |
124.88 |
-3.77 |
NPA |
19 |
23.55 |
-3.93 |
NPA |
31 |
-3.40 |
-2.47 |
NPA |
32 |
-1.57 |
-3.82 |
NPA |
40 |
11.67 |
-4.33 |
NPA |
41 |
11.55 |
-2.98 |
NPA |
43 |
66.36 |
-5.05 |
NPA |
2 |
38.01 |
-6.56 |
CBT-Asparagine |
4 |
58.45 |
-4.37 |
CBT-Asparagine |
5 |
109.13 |
-4.25 |
CBT-Asparagine |
Table 2: ROSETTA Enzyme Design Protocol Results
ROSETTA scores of the best modeled synthetases for CBT-Asparagine and NPA.
Position |
Synthetase Number |
Original Amino Acid |
Mutation Amino Acid |
30 |
5 |
Serine |
Asparagine |
32 |
5 |
Tyrosine |
Threonine |
34 |
2, 4 |
Glycine |
Alanine |
36 |
2 |
Glutamine acid |
Isoleucine |
61 |
5 |
Asparagine acid |
Arginine |
63 |
5 |
Isoleucine |
Alanine |
65 |
4 |
Leucine |
Glycine |
65 |
5 |
Leucine |
Threonine |
68 |
4 |
Asparagine acid |
Alanine |
69 |
4 |
Leucine |
Alanine |
70 |
2 |
Histidine |
Asparagine acid |
70 |
4 |
Histidine |
Glycine |
72 |
4 |
Tyrosine |
Glutamine acid |
73 |
2 |
Leucine |
Alanine |
73 |
4 |
Leucine |
Methionine |
74 |
2 |
Asparagine |
Asparagine acid |
76 |
2 |
Lysine |
Serine |
79 |
4 |
Leucine |
Arginine |
101 |
5 |
Lysine |
Glutamine acid |
103 |
5 |
Valine |
Triptophane |
104 |
5 |
Tyrosine |
Valine |
105 |
4, 5 |
Glycine |
Serine |
107 |
5 |
Glutamine acid |
Lysine |
108 |
4 |
Phenylalanine |
Lysine |
108 |
5 |
Phenylalanine |
Arginine |
109 |
4, 5 |
Glutamine |
Alanine |
114 |
4 |
Tyrosine |
Alanine |
115 |
4 |
Threonine |
Triptophane |
118 |
4 |
Valine |
Serine |
134 |
2 |
Methionine |
Asparagine |
137 |
2 |
Isoleucine |
Alanine |
139 |
2 |
Arginine |
Serine |
147 |
2, 4 |
Alanine |
Serine |
148 |
2 |
Glutamine |
Lysine |
149 |
2 |
Valine |
Threonine |
150 |
2, 4 |
Isoleucine |
Leucine |
151 |
2 |
Tyrosine |
Serine |
152 |
2 |
Proline |
Threonine |
153 |
2 |
Isoleucine |
Leucine |
153 |
4 |
Isoleucine |
Threonine |
154 |
2 |
Methionine |
Asparagine |
154 |
4, 5 |
Methionine |
Serine |
155 |
2 |
Glutamine |
Glycine |
155 |
4, 5 |
Glutamine |
Alanine |
156 |
4 |
Valine |
Alanine |
157 |
4 |
Asparagine |
Alanine |
158 |
4, 5 |
Asparagine |
Tyrosine |
159 |
4 |
Isoleucine |
Glycine |
161 |
4 |
Tyrosine |
Glutamine acid |
161 |
5 |
Tyrosine |
Asparagine acid |
162 |
4 |
Leucine |
Methionine |
162 |
5 |
Leucine |
Alanine |
164 |
5 |
Valine |
Alanine |
172 |
2 |
Glutamine acid |
Lysine |
173 |
2 |
Glutamine |
Serine |
176 |
2 |
Isoleucine |
Serine |
204-210 |
2, 4, 5 |
varying |
- |
307 |
2 |
- |
Alanine |
307 |
4, 5 |
- |
Tyrosine |
In order to test the functionality and specificity of our modeled aaRS, we selected the most promising 11 amino acid sequences and ordered seven synthetase sequences of NPA via IDT, and four
synthetase sequences of CBT-Asparagine by courtesy of Genscript, where we had previously won a grant of 500€.
Since the DNA synthesis by GenScript and IDT were delayed by several weeks, we could not perform the best practice characterization of these parts.
Finally, we received only three of the ordered syntheses in sufficient quality for further experiments. All of them encode predicted CBT-tRNA-synthetases.
We subjected the sequences to a positive selection as initial characterization. Plasmids encoding the predicted best candidates were cotransformed with our
positive selection plasmid into
E. coli(BL21 DE3). Due to the IPTG induced promoter, we tested different IPTG concentrations, including 0 mM, 5 mM,
10 mM, and 15 mM. In addition, we tried different concentration of the antibiotics: kan15, cm15/kan15, and cm15/kan15/tet5. The number of resulting colonies for
each sample is summarized in figure X. Our
in vivo results show that our
in silico designed enzymes kept their native function and are
able to integrate amino acids through an amber codon matching tRNA. Since these results only indicate the acceptance and transfer of the non-canonical
amino acid, additional experiment are required to demonstrate a high specificity of these enzymes.
We offer the predicted sequences to the community for further characterization via the parts-reg (LINKS).