|
|
Line 131: |
Line 131: |
| <div class="third double"> | | <div class="third double"> |
| <div class ="article"> | | <div class ="article"> |
− | As part of our iGEM project, we are faced with the challenge of adapting the tRNA synthetase to non-canonical amino acids. For this purpose, modelled possible candidates for synthetases as a preparation for carrying out a positive-negative selection according to (Liu <i>et al.</i>, 2007) in the laboratory. | + | As part of our iGEM project, we are faced with the challenge of adapting the tRNA synthetase |
| + | to non-canonical amino acids. For this purpose, modelled possible candidates for synthetases as a |
| + | preparation for carrying out a positive-negative selection according to (Liu <i>et al.</i>, 2007) in the laboratory. |
| | | |
− | Due to the rapid development in the field of protein and molecular structure analysis, there has been an increase in the availability of molecular 3D structure data. These data are organized in publicly available databases which provide a foundation for the modeling and simulation of chemical-biological processes in bioinformatics. As our non-canonical amino acid has been synthetized by ourselves, no such comprehensive information is available, yet. However, information of similarly structured amino acids can potentially serve as a basis for our modeling. | + | Due to the rapid development in the field of protein and molecular structure analysis, there has been an |
| + | increase in the availability of molecular 3D structure data. These data are organized in publicly available |
| + | databases which provide a foundation for the modeling and simulation of chemical-biological processes in bioinformatics. |
| + | As our non-canonical amino acid has been synthetized by ourselves, no such comprehensive information is available, yet. |
| + | However, information of similarly structured amino acids can potentially serve as a basis for our modeling. |
| | | |
− | As evaluating an expanded genetic code is a complex task, the practical laboratory work of our project is supplemented by a theoretical approach, involving modeling, simulation, and evaluation on the computerin silico. Specifically, we focused on simulation to designaimed at designing an aaRS tRNA synthetase for the new non-canonical amino acid CBT-ASP. Additionally to CBT, we also simulated the evolution process for the non-canonical amino acid NPA as a validation of our modeling procedure, altough as synthases for this ncAA are known and thus comparable to our in silico result, we can evaluate our modeling procedure. (Vielleicht hier ein wenig schöner) For this purposeOur core challenge was to evolve, the binding pocket must be evolved in a manner which effectively charges the tRNA with the amino acid, thus also recognizing this amino acid specifically. | + | As evaluating an expanded genetic code is a complex task, the practical laboratory work of our project is supplemented by a |
| + | theoretical approach, involving modeling, simulation, and evaluation on the computerin silico. Specifically, we focused on simulation to |
| + | designaimed at designing an aaRS tRNA synthetase for the new non-canonical amino acid CBT-ASP. Additionally to CBT, we also simulated the |
| + | evolution process for the non-canonical amino acid NPA as a validation of our modeling procedure, altough as synthases for this ncAA are known and |
| + | thus comparable to our in silico result, we can evaluate our modeling procedure. (Vielleicht hier ein wenig schöner) For this purposeOur core challenge was to evolve, |
| + | the binding pocket must be evolved in a manner which effectively charges the tRNA with the amino acid, thus also recognizing this amino acid specifically. |
| </div> | | </div> |
| </div> | | </div> |
Line 338: |
Line 349: |
| </div> | | </div> |
| </div> | | </div> |
− | <h4>Our results for this step</h4> | + | |
− | We choose our synthetases because of a good total score and a good ligand score. We checked the corresponding PDB-files, and rated the ligand and the binding pocket
| + | |
| + | </div> |
| + | </div> |
| + | <div class="bevel bl"></div> |
| + | </div> |
| + | |
| + | |
| + | <div class="contentbox"> |
| + | <div class="bevel tr"></div> |
| + | <div class="content"> |
| + | <h3> Results </h3> |
| + | <h4> Results in silico </h4> |
| + | <div class="article"> |
| + | We choose our synthetases because of a good total score and a good ligand score. We checked the corresponding PDB-files, and rated the ligand and the binding pocket |
| as satisfying, so that the ligand assumedly does not collide with residues in the near environment. | | as satisfying, so that the ligand assumedly does not collide with residues in the near environment. |
| The total scores for CBT are not as good as the scores for NPA. However, the ligand scores are acceptable in both cases. A visual evaluation confirms that the ligand | | The total scores for CBT are not as good as the scores for NPA. However, the ligand scores are acceptable in both cases. A visual evaluation confirms that the ligand |
Line 360: |
Line 384: |
| It is recommended to write a “.flags”-file, because there are several input- parameters to be defined, but it is also possible to define them via console user interface. </br> | | It is recommended to write a “.flags”-file, because there are several input- parameters to be defined, but it is also possible to define them via console user interface. </br> |
| For the categorization of the scaffold, we chose the automatic determination and set the following cuts: cut1: 6 A, cut2: 8 A, cut3: 10 A and cut4: 12 A, like the baker-lab commonly used. | | For the categorization of the scaffold, we chose the automatic determination and set the following cuts: cut1: 6 A, cut2: 8 A, cut3: 10 A and cut4: 12 A, like the baker-lab commonly used. |
− | </div>
| |
− | </div>
| |
− | <div class="bevel bl"></div>
| |
− | </div>
| |
− |
| |
− |
| |
− | <div class="contentbox">
| |
− | <div class="bevel tr"></div>
| |
− | <div class="content">
| |
− | <h3> Results </h3>
| |
− | <h4> Results in silico </h4>
| |
− | <div class="article">
| |
| We used this algrithm to simulate the evolution of the tyrosyl-tRNA with the amino acids Nitrophenylalanine and CBT-ASP | | We used this algrithm to simulate the evolution of the tyrosyl-tRNA with the amino acids Nitrophenylalanine and CBT-ASP |
| We obtained 13 synthetase sequences for CBT-ASP, and 43 sequences for NPA, which fit well into the binding site according to the ROSETTA score. | | We obtained 13 synthetase sequences for CBT-ASP, and 43 sequences for NPA, which fit well into the binding site according to the ROSETTA score. |
| + | |
| + | table { |
| + | font-family: arial, sans-serif; |
| + | border-collapse: collapse; |
| + | width: 100%; |
| + | } |
| + | |
| + | td, th { |
| + | border: 1px solid #dddddd; |
| + | text-align: left; |
| + | padding: 8px; |
| + | } |
| + | |
| + | tr:nth-child(even) { |
| + | background-color: #dddddd; |
| + | } |
| + | </style> |
| + | </head> |
| + | <body> |
| + | |
| + | <table> |
| + | <tr> |
| + | <th>Sequence Number</th> |
| + | <th>Total Score</th> |
| + | <th>Ligand Score</th> |
| + | <th>Sequence</th> |
| + | </tr> |
| + | <tr> |
| + | <td>15</td> |
| + | <td>124.88</td> |
| + | <td>-3.77</td> |
| + | <td>MDEFEMIKRNTSEIISEEELREVLKKDEKWAQAGFEPSGKIHLGHYLQIKKMIDLQNAGFGTAITLADLHAYLNQKGELDEIRKIGDYNKKVFEAMGLKAIYVYGSEFQLDKDYTLNVYRLALKTTLKRARRSMELIAREDENPKVAEVIYPIMQVNSLHYAGADVAVGGMEQRKIHMLARELLPKKVVCIHNPVLTGLDGEGNFIAVDDSPEEIRAKIKKAYCPAGVVEGNPIMEIAKYFLEYPLTIKRPEKFGGDLTVNSYEELESLFKNKELHPMDLKNAVAEELIKILEPIRKRLY |
| + | </td> |
| + | </tr> |
| + | <tr> |
| + | <td>19</td> |
| + | <td>23.55</td> |
| + | <td>-3.93</td> |
| + | <td>MDEFEMIKRNTSEIISEEELREVLKKDEKAGQNGFEPSGKIHLGHYLQIKKMIDLQNAGFDMAVVLDDLGAYLNQKGELDEIRKIGDYNKKVFEAMGLKAKYVYGSEFQLDKDYTLNVYRLALKTTLKRARRSMELIAREDENPKVAEVIYPIESVNALHYSGVDVAVGGMEQRKITMLARELLPKKVVAIHNPVLTGLDGEGNFIAVDDSPEEIRAKIKKAYCPAGVVEGNPIMEIAKYFLEYPLTIKRPEKFGGDLTVNSYEELESLFKNKELHPMDLKNAVAEELIKILEPIRKRLA |
| + | </td> |
| + | </tr> |
| + | <tr> |
| + | <td>31</td> |
| + | <td>-3.40</td> |
| + | <td>-2.47</td> |
| + | <td>MDEFEMIKRNTSEIISEEELREVLKKDEKSAYIVFAPSGKIHLGHYLQIKKMIDLQNAGFDIIILLADLNAYMAQKGELDEIRKIGDYNKKVFEAMGLKAKYVYGSEFQLDKDYTLNVYRLALKTTLKRARRSNELVAPEDENPKVVHVIAATNNANNAHYLGVDVAVGGMENRKSHMLARELLPKKVVCIHNPVLTGLDGEGNFIAVDDSPEEIRAKIKKAYCPAGVVEGNPIMEIAKYFLEYPLTIKRPEKFGGDLTVNSYEELESLFKNKELHPMDLKNAVAEELIKILEPIRKRLA |
| + | </td> |
| + | </tr> |
| + | <tr> |
| + | <td>32</td> |
| + | <td>-1.57</td> |
| + | <td>-3.82 </td> |
| + | <td>MDEFEMIKRNTSEIISEEELREVLKKDEKSAYIAMAPSGKIHLGHYLQIKKMIDLQNAGFDIIIILDDLGAYLNQKGELDEIRKIGDYNKKVFEAMGLKAKYVYGSEFQLDKDYTLNVYRLALKTTLKRARRANELLAREDENPKVAQVIAPTTNVNAAHYLGVDVAVGGMENRKAHMLARELLPKKVVCIHNPVLTGLDGEGNFIAVDDSPEEIRAKIKKAYCPAGVVEGNPIMEIAKYFLEYPLTIKRPEKFGGDLTVNSYEELESLFKNKELHPMDLKNAVAEELIKILEPIRKRLS |
| + | </td> |
| + | </tr> |
| + | <tr> |
| + | <td>40</td> |
| + | <td>11.67</td> |
| + | <td>-4.33</td> |
| + | <td>MDEFEMIKRNTSEIISEEELREVLKKDEKSAYIAHAPSGKIHLGHYLQIKKMIDLQNAGFDIIIILGDSVAYLNQKGELDEIRKIGDYNKKVFEAMGLKAKYVYSSELNLDKDYTLNVYRLALKTTLKRARRSAELNAREDENPKVAEVISSILSVAQTHGLGVDVAVGGMEARKAHMLARELLPKKVVCIHNPVLTGLDGEGNFIAVDDSPEEIRAKIKKAYCPAGVVEGNPIMEIAKYFLEYPLTIKRPEKFGGDLTVNSYEELESLFKNKELHPMDLKNAVAEELIKILEPIRKRLH |
| + | </td> |
| + | </tr> |
| + | <tr> |
| + | <td>41</td> |
| + | <td>11.55</td> |
| + | <td>-2.98</td> |
| + | <td>MDEFEMIKRNTSEIISEEELREVLKKDEKSAYIAFKPSGKIHLGHYLQIKKMIDLQNAGFDIIIFLGDSAAYDNQKGELDEIRKIGDYNKKVFEAMGLKAKYVYSSEINLDKDYTLNVYRLALKTTLKRARRANELNAREDENPKVAEVIAAIKGVGQGHKAGVDVAVGGMEARKMHMLARELLPKKVVCIHNPVLTGLDGEGNFIAVDDSPEEIRAKIKKAYCPAGVVEGNPIMEIAKYFLEYPLTIKRPEKFGGDLTVNSYEELESLFKNKELHPMDLKNAVAEELIKILEPIRKRLQ |
| + | </td> |
| + | </tr> |
| + | <tr> |
| + | <td>43</td> |
| + | <td>66.36</td> |
| + | <td>-5.05</td> |
| + | <td>MDEFEMIKRNTSEIISEEELREVLKKDEKSASIGFEPSGKIHLGHYLQIKKMIDLQNAGFDIYIQLADLHAYLNQKGELDEIRKIGDYNKKVFEAMGLKAKYHYSSQYQLSKKSTLDTYRLALKTTLKRARRSMELIAREDENPKVAEVIYPINAVDQAGNSGSDVAVGGMEQRKIHMLARELLPKKVVCIHNPVLTGLDGEGNFIAVDDSPEEIRAKIKKAYCPAGVVEGNPIMEIAKYFLEYPLTIKRPEKFGGDLTVNSYEELESLFKNKELHPMDLKNAVAEELIKILEPIRKRLY</td> |
| + | </tr> |
| + | </table> |
| + | |
| + | |
| + | |
| + | |
| + | |
| + | <style> |
| + | table { |
| + | font-family: arial, sans-serif; |
| + | border-collapse: collapse; |
| + | width: 100%; |
| + | } |
| + | |
| + | td, th { |
| + | border: 1px solid #dddddd; |
| + | text-align: left; |
| + | padding: 8px; |
| + | } |
| + | |
| + | tr:nth-child(even) { |
| + | background-color: #dddddd; |
| + | } |
| + | </style> |
| + | </head> |
| + | <body> |
| + | |
| + | <table> |
| + | <tr> |
| + | <th>Position</th> |
| + | <th>Synthetase Number</th> |
| + | <th>Original Amino Acid</th> |
| + | <th>Mutation Amino Acid</th> |
| + | </tr> |
| + | <tr> |
| + | <td>30</td> |
| + | <td>5</td> |
| + | <td>Serine</td> |
| + | <td>Asparagine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>32</td> |
| + | <td>5</td> |
| + | <td>Tyrosine</td> |
| + | <td>Threonine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>34</td> |
| + | <td>2, 4</td> |
| + | <td>Glycine</td> |
| + | <td>Alanine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>36</td> |
| + | <td>2</td> |
| + | <td>Glutamine Acid</td> |
| + | <td>Isoleucine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>61</td> |
| + | <td>5</td> |
| + | <td>Asparagine Acid</td> |
| + | <td>Arginine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>63</td> |
| + | <td>5</td> |
| + | <td>Isoleucine</td> |
| + | <td>Alanine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>65</td> |
| + | <td>4</td> |
| + | <td>Leucine</td> |
| + | <td>Glycine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>65</td> |
| + | <td>5</td> |
| + | <td>Leucine</td> |
| + | <td>Threonine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>68</td> |
| + | <td>4</td> |
| + | <td>Asparagine Acid</td> |
| + | <td>Alanine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>69</td> |
| + | <td>4</td> |
| + | <td>Leucine</td> |
| + | <td>Alanine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>70</td> |
| + | <td>2</td> |
| + | <td>Histidine</td> |
| + | <td>Asparagine Acid</td> |
| + | </tr> |
| + | <tr> |
| + | <td>70</td> |
| + | <td>4</td> |
| + | <td>Histidine</td> |
| + | <td>Glycine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>72</td> |
| + | <td>4</td> |
| + | <td>Tyrosine</td> |
| + | <td>Glutamine Acid</td> |
| + | </tr> |
| + | <tr> |
| + | <td>73</td> |
| + | <td>2</td> |
| + | <td>Leucine</td> |
| + | <td>Alanine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>73</td> |
| + | <td>4</td> |
| + | <td>Leucine</td> |
| + | <td>Methionine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>74</td> |
| + | <td>2</td> |
| + | <td>Asparagine</td> |
| + | <td>Asparagine Acid</td> |
| + | </tr> |
| + | <tr> |
| + | <td>76</td> |
| + | <td>2</td> |
| + | <td>Lysine</td> |
| + | <td>Serine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>79</td> |
| + | <td>4</td> |
| + | <td>Leucine</td> |
| + | <td>Arginine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>101</td> |
| + | <td>5</td> |
| + | <td>Lysine</td> |
| + | <td>Glutamine Acid</td> |
| + | </tr> |
| + | <tr> |
| + | <td>103</td> |
| + | <td>5</td> |
| + | <td>Valine</td> |
| + | <td>Triptophane</td> |
| + | </tr> |
| + | <tr> |
| + | <td>104</td> |
| + | <td>5</td> |
| + | <td>Tyrosine</td> |
| + | <td>Valine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>105</td> |
| + | <td>4, 5</td> |
| + | <td>Glycine</td> |
| + | <td>Serine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>107</td> |
| + | <td>5</td> |
| + | <td>Glutamine Acid</td> |
| + | <td>Lysine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>108</td> |
| + | <td>4</td> |
| + | <td>Phenylalanine</td> |
| + | <td>Lysine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>108</td> |
| + | <td>5</td> |
| + | <td>Phenylalanine</td> |
| + | <td>Arginine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>109</td> |
| + | <td>4, 5</td> |
| + | <td>Glutamine</td> |
| + | <td>Alanine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>114</td> |
| + | <td>4</td> |
| + | <td>Tyrosine</td> |
| + | <td>Alanine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>115</td> |
| + | <td>4</td> |
| + | <td>Threonine</td> |
| + | <td>Triptophane</td> |
| + | </tr> |
| + | <tr> |
| + | <td>118</td> |
| + | <td>4</td> |
| + | <td>Valine</td> |
| + | <td>Serine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>134</td> |
| + | <td>2</td> |
| + | <td>Methionine</td> |
| + | <td>Asparagine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>137</td> |
| + | <td>2</td> |
| + | <td>Isoleucine</td> |
| + | <td>Alanine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>139</td> |
| + | <td>2</td> |
| + | <td>Arginine</td> |
| + | <td>Serine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>147</td> |
| + | <td>2, 4</td> |
| + | <td>Alanine</td> |
| + | <td>Serine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>148</td> |
| + | <td>2</td> |
| + | <td>Glutamine</td> |
| + | <td>Lysine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>149</td> |
| + | <td>2</td> |
| + | <td>Valine</td> |
| + | <td>Threonine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>150</td> |
| + | <td>2, 4</td> |
| + | <td>Isoleucine</td> |
| + | <td>Leucine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>151</td> |
| + | <td>2</td> |
| + | <td>Tyrosine</td> |
| + | <td>Serine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>152</td> |
| + | <td>2</td> |
| + | <td>Proline</td> |
| + | <td>Threonine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>153</td> |
| + | <td>2</td> |
| + | <td>Isoleucine</td> |
| + | <td>Leucine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>153</td> |
| + | <td>4</td> |
| + | <td>Isoleucine</td> |
| + | <td>Threonine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>154</td> |
| + | <td>2</td> |
| + | <td>Methionine</td> |
| + | <td>Asparagine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>154</td> |
| + | <td>4, 5</td> |
| + | <td>Methionine</td> |
| + | <td>Serine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>155</td> |
| + | <td>2</td> |
| + | <td>Glutamine</td> |
| + | <td>Glycine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>155</td> |
| + | <td>4, 5</td> |
| + | <td>Glutamine</td> |
| + | <td>Alanine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>156</td> |
| + | <td>4</td> |
| + | <td>Valine</td> |
| + | <td>Alanine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>157</td> |
| + | <td>4</td> |
| + | <td>Asparagine</td> |
| + | <td>Alanine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>158</td> |
| + | <td>4, 5</td> |
| + | <td>Asparagine</td> |
| + | <td>Tyrosine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>159</td> |
| + | <td>4</td> |
| + | <td>Isoleucine</td> |
| + | <td>Glycine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>161</td> |
| + | <td>4</td> |
| + | <td>Tyrosine</td> |
| + | <td>Glutamine Acid</td> |
| + | </tr> |
| + | <tr> |
| + | <td>161</td> |
| + | <td>5</td> |
| + | <td>Tyrosine</td> |
| + | <td>Asparagine Acid</td> |
| + | </tr> |
| + | <tr> |
| + | <td>162</td> |
| + | <td>4</td> |
| + | <td>Leucine</td> |
| + | <td>Methionine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>162</td> |
| + | <td>5</td> |
| + | <td>Leucine</td> |
| + | <td>Alanine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>164</td> |
| + | <td>5</td> |
| + | <td>Valine</td> |
| + | <td>Alanine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>172</td> |
| + | <td>2</td> |
| + | <td>Glutamine Acid</td> |
| + | <td>Lysine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>173</td> |
| + | <td>2</td> |
| + | <td>Glutamine</td> |
| + | <td>Serine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>176</td> |
| + | <td>2</td> |
| + | <td>Isoleucine</td> |
| + | <td>Serine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>204-210</td> |
| + | <td>2, 4, 5</td> |
| + | <td>varying</td> |
| + | <td>-</td> |
| + | </tr> |
| + | <tr> |
| + | <td>307</td> |
| + | <td>2</td> |
| + | <td>-</td> |
| + | <td>Alanine</td> |
| + | </tr> |
| + | <tr> |
| + | <td>307</td> |
| + | <td>4, 5</td> |
| + | <td>-</td> |
| + | <td>Tyrosine</td> |
| + | </tr> |
| + | </table> |
| </div> | | </div> |
| <h4> Results in vivo </h4> | | <h4> Results in vivo </h4> |
Results
Results in silico
We choose our synthetases because of a good total score and a good ligand score. We checked the corresponding PDB-files, and rated the ligand and the binding pocket
as satisfying, so that the ligand assumedly does not collide with residues in the near environment.
The total scores for CBT are not as good as the scores for NPA. However, the ligand scores are acceptable in both cases. A visual evaluation confirms that the ligand
fits into the binding pocket.
Our results for this step
We used this algorithm to simulate the evolution of the tyrosyl-tRNA with the amino acids Nitrophenylalanine(NPA) and N
γ‑2‑cyanobenzothiazol‑6‑yl‑L‑asparagine (CBT-asparagine).
NPA simulation:
We created one .cst-file-block for the nitrogroup of NPA. Since there are two oxygen-atoms in the nitrogroup,
we defined two atom nametags. As several possibilities are useful, we defined two possible constraint partners
for the hydrogen bonds. The first is asparagine (N) or glutamine (Q) and the second is glycine (G). We set the
possible distance to 2.8 A, as it is the optimal distance for hydrogenbonds, and a tolerance level of 0.5 A.
We set the angles to 120° with a tolerance of 40°, as recommended by Florian Richter during our talk in cologne.
The torsion angles were set to 180° with a tolerance of 180° and a penalty of 0, such that the torsion angles can rotate
completely freely.(Richter, unpublished data)
CBT-ASP simulation:
CBT-ASP can build hydrogen bonds in two ways. The first is a weak hydrogen bond on the
sulphur atom and the other possibility is a normal hydrogen bond on the nitrogen (N2)
after the Cγ. We wrote three cst-files, one for a possible bond with sulpur, one for a
possible bond with nitrogen, and one for both bonds. As possible corresponding amino acids, we chose serine, threonine, tyrosine, asparagine, glutamine, and glycine.
It is recommended to write a “.flags”-file, because there are several input- parameters to be defined, but it is also possible to define them via console user interface.
For the categorization of the scaffold, we chose the automatic determination and set the following cuts: cut1: 6 A, cut2: 8 A, cut3: 10 A and cut4: 12 A, like the baker-lab commonly used.
We used this algrithm to simulate the evolution of the tyrosyl-tRNA with the amino acids Nitrophenylalanine and CBT-ASP
We obtained 13 synthetase sequences for CBT-ASP, and 43 sequences for NPA, which fit well into the binding site according to the ROSETTA score.
table {
font-family: arial, sans-serif;
border-collapse: collapse;
width: 100%;
}
td, th {
border: 1px solid #dddddd;
text-align: left;
padding: 8px;
}
tr:nth-child(even) {
background-color: #dddddd;
}
Sequence Number |
Total Score |
Ligand Score |
Sequence |
15 |
124.88 |
-3.77 |
MDEFEMIKRNTSEIISEEELREVLKKDEKWAQAGFEPSGKIHLGHYLQIKKMIDLQNAGFGTAITLADLHAYLNQKGELDEIRKIGDYNKKVFEAMGLKAIYVYGSEFQLDKDYTLNVYRLALKTTLKRARRSMELIAREDENPKVAEVIYPIMQVNSLHYAGADVAVGGMEQRKIHMLARELLPKKVVCIHNPVLTGLDGEGNFIAVDDSPEEIRAKIKKAYCPAGVVEGNPIMEIAKYFLEYPLTIKRPEKFGGDLTVNSYEELESLFKNKELHPMDLKNAVAEELIKILEPIRKRLY
|
19 |
23.55 |
-3.93 |
MDEFEMIKRNTSEIISEEELREVLKKDEKAGQNGFEPSGKIHLGHYLQIKKMIDLQNAGFDMAVVLDDLGAYLNQKGELDEIRKIGDYNKKVFEAMGLKAKYVYGSEFQLDKDYTLNVYRLALKTTLKRARRSMELIAREDENPKVAEVIYPIESVNALHYSGVDVAVGGMEQRKITMLARELLPKKVVAIHNPVLTGLDGEGNFIAVDDSPEEIRAKIKKAYCPAGVVEGNPIMEIAKYFLEYPLTIKRPEKFGGDLTVNSYEELESLFKNKELHPMDLKNAVAEELIKILEPIRKRLA
|
31 |
-3.40 |
-2.47 |
MDEFEMIKRNTSEIISEEELREVLKKDEKSAYIVFAPSGKIHLGHYLQIKKMIDLQNAGFDIIILLADLNAYMAQKGELDEIRKIGDYNKKVFEAMGLKAKYVYGSEFQLDKDYTLNVYRLALKTTLKRARRSNELVAPEDENPKVVHVIAATNNANNAHYLGVDVAVGGMENRKSHMLARELLPKKVVCIHNPVLTGLDGEGNFIAVDDSPEEIRAKIKKAYCPAGVVEGNPIMEIAKYFLEYPLTIKRPEKFGGDLTVNSYEELESLFKNKELHPMDLKNAVAEELIKILEPIRKRLA
|
32 |
-1.57 |
-3.82 |
MDEFEMIKRNTSEIISEEELREVLKKDEKSAYIAMAPSGKIHLGHYLQIKKMIDLQNAGFDIIIILDDLGAYLNQKGELDEIRKIGDYNKKVFEAMGLKAKYVYGSEFQLDKDYTLNVYRLALKTTLKRARRANELLAREDENPKVAQVIAPTTNVNAAHYLGVDVAVGGMENRKAHMLARELLPKKVVCIHNPVLTGLDGEGNFIAVDDSPEEIRAKIKKAYCPAGVVEGNPIMEIAKYFLEYPLTIKRPEKFGGDLTVNSYEELESLFKNKELHPMDLKNAVAEELIKILEPIRKRLS
|
40 |
11.67 |
-4.33 |
MDEFEMIKRNTSEIISEEELREVLKKDEKSAYIAHAPSGKIHLGHYLQIKKMIDLQNAGFDIIIILGDSVAYLNQKGELDEIRKIGDYNKKVFEAMGLKAKYVYSSELNLDKDYTLNVYRLALKTTLKRARRSAELNAREDENPKVAEVISSILSVAQTHGLGVDVAVGGMEARKAHMLARELLPKKVVCIHNPVLTGLDGEGNFIAVDDSPEEIRAKIKKAYCPAGVVEGNPIMEIAKYFLEYPLTIKRPEKFGGDLTVNSYEELESLFKNKELHPMDLKNAVAEELIKILEPIRKRLH
|
41 |
11.55 |
-2.98 |
MDEFEMIKRNTSEIISEEELREVLKKDEKSAYIAFKPSGKIHLGHYLQIKKMIDLQNAGFDIIIFLGDSAAYDNQKGELDEIRKIGDYNKKVFEAMGLKAKYVYSSEINLDKDYTLNVYRLALKTTLKRARRANELNAREDENPKVAEVIAAIKGVGQGHKAGVDVAVGGMEARKMHMLARELLPKKVVCIHNPVLTGLDGEGNFIAVDDSPEEIRAKIKKAYCPAGVVEGNPIMEIAKYFLEYPLTIKRPEKFGGDLTVNSYEELESLFKNKELHPMDLKNAVAEELIKILEPIRKRLQ
|
43 |
66.36 |
-5.05 |
MDEFEMIKRNTSEIISEEELREVLKKDEKSASIGFEPSGKIHLGHYLQIKKMIDLQNAGFDIYIQLADLHAYLNQKGELDEIRKIGDYNKKVFEAMGLKAKYHYSSQYQLSKKSTLDTYRLALKTTLKRARRSMELIAREDENPKVAEVIYPINAVDQAGNSGSDVAVGGMEQRKIHMLARELLPKKVVCIHNPVLTGLDGEGNFIAVDDSPEEIRAKIKKAYCPAGVVEGNPIMEIAKYFLEYPLTIKRPEKFGGDLTVNSYEELESLFKNKELHPMDLKNAVAEELIKILEPIRKRLY |
Position |
Synthetase Number |
Original Amino Acid |
Mutation Amino Acid |
30 |
5 |
Serine |
Asparagine |
32 |
5 |
Tyrosine |
Threonine |
34 |
2, 4 |
Glycine |
Alanine |
36 |
2 |
Glutamine Acid |
Isoleucine |
61 |
5 |
Asparagine Acid |
Arginine |
63 |
5 |
Isoleucine |
Alanine |
65 |
4 |
Leucine |
Glycine |
65 |
5 |
Leucine |
Threonine |
68 |
4 |
Asparagine Acid |
Alanine |
69 |
4 |
Leucine |
Alanine |
70 |
2 |
Histidine |
Asparagine Acid |
70 |
4 |
Histidine |
Glycine |
72 |
4 |
Tyrosine |
Glutamine Acid |
73 |
2 |
Leucine |
Alanine |
73 |
4 |
Leucine |
Methionine |
74 |
2 |
Asparagine |
Asparagine Acid |
76 |
2 |
Lysine |
Serine |
79 |
4 |
Leucine |
Arginine |
101 |
5 |
Lysine |
Glutamine Acid |
103 |
5 |
Valine |
Triptophane |
104 |
5 |
Tyrosine |
Valine |
105 |
4, 5 |
Glycine |
Serine |
107 |
5 |
Glutamine Acid |
Lysine |
108 |
4 |
Phenylalanine |
Lysine |
108 |
5 |
Phenylalanine |
Arginine |
109 |
4, 5 |
Glutamine |
Alanine |
114 |
4 |
Tyrosine |
Alanine |
115 |
4 |
Threonine |
Triptophane |
118 |
4 |
Valine |
Serine |
134 |
2 |
Methionine |
Asparagine |
137 |
2 |
Isoleucine |
Alanine |
139 |
2 |
Arginine |
Serine |
147 |
2, 4 |
Alanine |
Serine |
148 |
2 |
Glutamine |
Lysine |
149 |
2 |
Valine |
Threonine |
150 |
2, 4 |
Isoleucine |
Leucine |
151 |
2 |
Tyrosine |
Serine |
152 |
2 |
Proline |
Threonine |
153 |
2 |
Isoleucine |
Leucine |
153 |
4 |
Isoleucine |
Threonine |
154 |
2 |
Methionine |
Asparagine |
154 |
4, 5 |
Methionine |
Serine |
155 |
2 |
Glutamine |
Glycine |
155 |
4, 5 |
Glutamine |
Alanine |
156 |
4 |
Valine |
Alanine |
157 |
4 |
Asparagine |
Alanine |
158 |
4, 5 |
Asparagine |
Tyrosine |
159 |
4 |
Isoleucine |
Glycine |
161 |
4 |
Tyrosine |
Glutamine Acid |
161 |
5 |
Tyrosine |
Asparagine Acid |
162 |
4 |
Leucine |
Methionine |
162 |
5 |
Leucine |
Alanine |
164 |
5 |
Valine |
Alanine |
172 |
2 |
Glutamine Acid |
Lysine |
173 |
2 |
Glutamine |
Serine |
176 |
2 |
Isoleucine |
Serine |
204-210 |
2, 4, 5 |
varying |
- |
307 |
2 |
- |
Alanine |
307 |
4, 5 |
- |
Tyrosine |
Results in vivo
In order to test the functionality and specificity of our modeled aaRS, we translated a selection of the most promising amino acid sequences into DNA sequences optimized for E.coli and ordered them via gene synthesis. We then used a positive-negative selection system for characterization. The experiment proceeds as follows:
Due to problems with regards to the protein- and salt-concentration, we retransformed the gensyntheses which had been cloned into pSB1C3. In a next step, these syntheses were cotransformed in E.coli(BL21) with our positive selection plasmid.
With regards to CBT2, only the original colony could be transformed. From CBT4 and CBT5, were used each originally isolated clone and its retransformed counterpart.
Due to the IPTC-induced promoter, we used variants without IPTG, and with 5 mM, 10 mM, and 15 mM added IPTS for all plasmids for the kanamycine resistance.
We chose additional variants with regards to the antibiotics; one variant each of kanamycine, kanamycine and chloramphenicole, and kanamycine, chloramphenicole and tetracycline. The number of resulting colonies for each variant are summarized in figure X. Our in vivo results show that our in silico designed enzymes did not lead to a loss of functioning.
We ordered seven synthetase sequences of NPA via IDT, and four synthetase sequences of CBT-Asparagine by courtesy of Genscript, where we had previously
won a grant of 500€. Due to problems on the part of IDT, the sequences for NPA could not be synthetized. We still provide the sequences for further use below.
All further descriptions therefore refer to the synthetase sequences for CBT-Asparagine.
Unfortunately, we were not able to amplify the four sequences or clone them into the detection system pSB3T5. A test digestion revealed that the length of the
sequence in the plasmid puk57, which was the plasmid delivered by Genscript, did not correspond to the sequence length ordered. Therefore, we disregarded synthetase candidate number 1
for further tests.
For the remaining three sequences, we instead utilized the positive-negative selection plasmids for validation of our syntheses.
After the first, positive selection cycle, colonies formed only if the non-canonical amino acid was present.
Thus, through the positive selection, we could show that the synthetase did not lead to the loss of functioning of the enzyme.
To show specificity, we conducted a negative selection as well. We managed to clone the synthetases into the negative selection plasmid,
but were not able to verify this selection cycle. Therefore, further tests are needed to validate the specificity of the synthetases.
References
Liu, W., Brock, A., Chen, S., Chen, S., Schultz, P. G. ,(2007). Genetic incorporation of unnatural amino acids into proteins in mammalian cells. Nature methods, 4(3), 239-244.
Richter, F., Leaver-Fay, A., Khare, S. D., Bjelic, S., Baker, D. (2011). De novo enzyme design using Rosetta3. PloS one, 6(5): e19230.
Simons, K. T., Kooperberg, C., Huang, E., Baker, D. (1997). Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. Journal of molecular biology, 268(1), 209-225.