|
|
Line 25: |
Line 25: |
| However, besides this complex modeling, we also conducted and applied several straight-forward stochastic and statistical models | | However, besides this complex modeling, we also conducted and applied several straight-forward stochastic and statistical models |
| | | |
− | to support and guide our laboratory work. These modeling projects are briefly described here; for further information, please check | + | to support and guide numerous steps of our laboratory work. Some of these modeling projects are briefly described in this box; however, |
− | | + | we recommend reading the linked pages for further information. </br> |
− | the corresponding link leading to the part of our project the model has been an element of. </br> | + | |
| | | |
| <h4>Discriminant function model for the ICG prediction: </h4> | | <h4>Discriminant function model for the ICG prediction: </h4> |
| | | |
− | We conducted a discriminant function analysis for the recognition of which base – natural or unnatural – is present at a specified position of a base sequence. This model is part of our ICG model software, found | + | We conducted a discriminant function analysis for the classification of nucleotides in Oxford Nanopore sequencing reads |
| + | at a specific position. This model is part of our |
| + | <a target="_blank"href="https://2017.igem.org/Team:Bielefeld-CeBiTec/Software#iCG">iCG</a> software module and enabled the successful detection of unnatural bases. |
| | | |
− | <a target="_blank"href="https://2017.igem.org/Team:Bielefeld-CeBiTec/Software#iCG">here. </a> | + | |
| + | <h4>Calculation of the required library size for the selection system: </h4> |
| | | |
− | <h4>Calculation of an effective library size for the selection system: </h4> | + | We applied combinatorics and statistics to calculate the optimal library size for the <a target="_blank"href="https://2017.igem.org/Team:Bielefeld-CeBiTec/Results/translational_system/library_and_selection">tRNA synthetase selection process.</a> |
| | | |
− | We used a combination of combinatorics and statistics to calculate the optimal library size for the selection process, | + | This was a trade off between putting lots of efforts into constructing a very huge library and missing diversity in a too small library. |
| + | Therefore, we predicted the optimal library size. Experimental validation of this prediction was done via MiSeq analysis of the diversity |
| + | of a subset of this library. |
| | | |
− | such that it is expected to contain all possible sequence mutations, and therefore easily all possible resulting amino acids, at least once. | + | <h4>Strength prediction for a transcription signal amplification system <a target="_blank"href="http://parts.igem.org/Part:BBa_K2201373">(BBa_K2201373)</a>: <!--den part gibts nicht --> </h4> |
| | | |
− | This calculation is part of the translational system, found <a target="_blank"href="https://2017.igem.org/Team:Bielefeld-CeBiTec/Results/translational_system/library_and_selection">here</a>. | + | We modeled and visually compared the mRFP production over time for a normal mRFP reporter system and compared it to our <a target="_blank"href="https://2017.igem.org/Team:Bielefeld-CeBiTec/Composite_Part">enhanced signaling.</a>. |
| + | Validity was done as part of the positive selection process for an adapted tRNA synthetase. |
| | | |
− | <h4>Comparison of mRFP production for the positive selection system <a target="_blank"href="http://parts.igem.org/Part:BBa_K2201373">(BBa_K2201373)</a>: <!--den part gibts nicht --> </h4> | + | |
− | | + | |
− | We modeled and visually compared the mRFP production over time for the normal signaling and the enhanced signaling circuit of the positive selection system.
| + | |
− | | + | |
− | The system and plot can be found <a target="_blank"href="https://2017.igem.org/Team:Bielefeld-CeBiTec/Composite_Part">here</a>.
| + | |
| | | |
| </div class="article"> | | </div class="article"> |
Line 61: |
Line 62: |
| <div class="article"> | | <div class="article"> |
| As our project explores possibilities of an expanded genetic code via unnatural bases and non-canonical amino acids, | | As our project explores possibilities of an expanded genetic code via unnatural bases and non-canonical amino acids, |
− | we set out to complement our lab work via modeling of novel amino acyl tRNA synthetases (aaRS) for a non-canonical amino | + | we set out to complement and improve our lab work via modeling of novel amino acyl tRNA synthetases (aaRS) for a non-canonical amino |
− | acids we synthetized in the lab. In order to incorporate non-canonical amino acids into proteins via the translational | + | acids, which were synthetized in our lab. In order to incorporate non-canonical amino acids into proteins via the translational |
− | process, the aaRS has to attach the amino acid to the respective tRNA. Thus, we designed aaRS sequences which were meant | + | process, the aaRS has to attach the amino acid to the respective tRNA. Thus, we designed aaRS sequences which were adjusted |
− | to link our own non-canonical amino acid to a fitting tRNA. As a result, we obtained a couple of sequences of possible aaRS | + | to link our own non-canonical amino acid to a fitting tRNA. Candidates were evaluated and selected, based on a ROSETTA score. |
− | candidates, which we evaluated, based on a ROSETTA score, and ordered via gene synthesis.
| + | Most promising sequences were ordered via gene synthesis for the experimental validation. |
− | The following figure provides a rough overview of our modeling project. The table below summarizes the realization in practice. | + | Figure 1 provides a rough overview of our modeling project. Table 1 below summarizes the realization in practice. |
| </div> | | </div> |
| | | |
Line 102: |
Line 103: |
| </tr> | | </tr> |
| <tr> | | <tr> |
− | <td>1. Ligand Preparation</td> | + | <td>1. Ligand preparation</td> |
| <td>Manually via <a href="https://avogadro.cc/">Avogadro</a></td> | | <td>Manually via <a href="https://avogadro.cc/">Avogadro</a></td> |
| <td>Due to the novelty of our amino acid, no information on the ligand is available in databases. | | <td>Due to the novelty of our amino acid, no information on the ligand is available in databases. |
− | Therefore, all information has to be provided manually and then generate a conformer ensemble, containing | + | Therefore, manual generation of a conformer ensemble, containing |
− | for example all energetically useful arrangements of atoms within the molecule.</td> | + | for example all energetically useful arrangements of atoms within the molecule, was required.</td> |
| </tr> | | </tr> |
| <tr> | | <tr> |
Line 122: |
Line 123: |
| </tr> | | </tr> |
| <tr> | | <tr> |
− | <td>4. <a href="https://www.rosettacommons.org/manuals/archive/rosetta3.5_user_guide/d7/dfc/match.html">Enzyme Matching</a></td> | + | <td>4. <a href="https://www.rosettacommons.org/manuals/archive/rosetta3.5_user_guide/d7/dfc/match.html">Enzyme matching</a></td> |
| <td>ROSETTA protocol</td> | | <td>ROSETTA protocol</td> |
| <td>ROSETTA combines information about the ligand and constrains to find possible hydrogen bonding partners and propose the shape | | <td>ROSETTA combines information about the ligand and constrains to find possible hydrogen bonding partners and propose the shape |
Line 128: |
Line 129: |
| </tr> | | </tr> |
| <tr> | | <tr> |
− | <td>5. <a href="https://www.rosettacommons.org/manuals/archive/rosetta3.5_user_guide/d6/dbc/enzyme_design.html">Enzyme Design</a></td> | + | <td>5. <a href="https://www.rosettacommons.org/manuals/archive/rosetta3.5_user_guide/d6/dbc/enzyme_design.html">Enzyme design</a></td> |
| <td>ROSETTA protocol</td> | | <td>ROSETTA protocol</td> |
| <td>An algorithm uses the information from the previous step and information on the ligand to simulate the mutation process and | | <td>An algorithm uses the information from the previous step and information on the ligand to simulate the mutation process and |
Line 134: |
Line 135: |
| </tr> | | </tr> |
| <tr> | | <tr> |
− | <td>6. Evaluate results in silico</td> | + | <td>6. Evaluate results <i>in silico</i></td> |
| <td>Manually</td> | | <td>Manually</td> |
− | <td>We evaluate the visual output and the score values and order the sequences with the most promising results via gene synthesis. </td> | + | <td>Based on the score values, we ordered the synthesis of the most promising sequences. </td> |
| </tr> | | </tr> |
| <tr> | | <tr> |
− | <td>7. Evaluate results in vivo</td> | + | <td>7. Evaluate results <i>in vivo</i></td> |
| <td>Manually</td> | | <td>Manually</td> |
| <td>The synthetases are validated in the lab with the corresponding ncAA via a | | <td>The synthetases are validated in the lab with the corresponding ncAA via a |
Line 166: |
Line 167: |
| <div class="third double"> | | <div class="third double"> |
| <div class ="article"> | | <div class ="article"> |
− | As part of our iGEM project, we are faced with the challenge of adapting the tRNA synthetase | + | As part of our iGEM project, we faced the challenge of adapting the tRNA synthetase |
− | to non-canonical amino acids. For this purpose, we modeled possible candidates for synthetases as a | + | to non-canonical amino acids. For this purpose, we modeled possible candidates for synthetases as an |
− | preparation for carrying out a positive-negative selection according to (Liu <i>et al.</i>, 2007) in the laboratory. | + | alternative to carrying out a positive-negative selection according to (Liu <i>et al.</i>, 2007) in the laboratory. |
| | | |
| Due to the rapid development in the field of protein and molecular structure analysis, there has been an | | Due to the rapid development in the field of protein and molecular structure analysis, there has been an |
| increase in the availability of molecular 3D structure data. These data are organized in publicly available | | increase in the availability of molecular 3D structure data. These data are organized in publicly available |
| databases which provide a foundation for the modeling and simulation of chemical-biological processes in bioinformatics. | | databases which provide a foundation for the modeling and simulation of chemical-biological processes in bioinformatics. |
− | As our ncAA has been synthetized by ourselves, no such comprehensive information is available, yet. | + | As our ncAA has been synthetized in our lab, no such comprehensive information is available, yet. |
| However, information of similarly structured amino acids can potentially serve as a basis for our modeling. | | However, information of similarly structured amino acids can potentially serve as a basis for our modeling. |
| | | |
− | As evaluating an expanded genetic code is a complex task, the practical laboratory work of our project is supplemented by a | + | Specifically, we focused on simulation to |
− | theoretical approach, involving modeling, simulation, and evaluation on the computer in silico. Specifically, we focused on simulation to
| + | design an aaRS for the new ncAA CBT-Asparagine. |
− | design an aaRS for the new ncAA CBT-Asparagine. Additionally to CBT-Asparagine, we also generated aaRS sequences for the ncAA Nitrophenylalanine (NPA) as | + | |
− | a validation of our modeling procedure; as synthases for this ncAA are known and
| + | |
− | thus comparable to our in silico result, we can evaluate our modeling procedure.
| + | |
− | Our core challenge was to evolve the binding pocket in a manner which effectively charges the tRNA with the amino acid, thus also recognizing this amino acid specifically.
| + | |
| </div> | | </div> |
| </div> | | </div> |
Line 190: |
Line 187: |
| <h4>Method</h4> | | <h4>Method</h4> |
| <div class ="article"> | | <div class ="article"> |
− | We used the open-source software <a href="https://www.rosettacommons.org/docs/latest/getting_started/Getting-Started">"Rosetta"</a> for the main part of our modeling project, which was introduced at the University of Washington by David Baker in 1997 (Simon <i>et al.</i>,1997), initially in the context of protein structure prediction. Since then, Rosetta has grown to include numerous modules and is currently widely used in research. In our application, we focus on the Rosetta module called the "Rosetta Enzyme Design Protocol" | + | We used the open-source software <a href="https://www.rosettacommons.org/docs/latest/getting_started/Getting-Started">"ROSETTA"</a> |
| + | for the main part of our modeling project, which was introduced at the University of Washington by David Baker in 1997 |
| + | (Simon <i>et al.</i>,1997), initially in the context of protein structure prediction. ROSETTA has grown through the addition of numerous modules and is currently widely used in research. In our application, we focus on the Rosetta module called the "Rosetta Enzyme Design Protocol" |
| | | |
| </div> | | </div> |
Line 204: |
Line 203: |
| <div class ="article"> | | <div class ="article"> |
| Since the non-canonical amino acid synthesized in the laboratory is completely novel, there is no corresponding tRNA synthetase which can load the tRNA, yet. For this reason, we use the enzyme design protocol to design the binding pocket in a way that allows it to form an effective and specific enzyme. The protocol consists of two main steps: matching and designing (Richter <i>et al.</i>, 2011) | | Since the non-canonical amino acid synthesized in the laboratory is completely novel, there is no corresponding tRNA synthetase which can load the tRNA, yet. For this reason, we use the enzyme design protocol to design the binding pocket in a way that allows it to form an effective and specific enzyme. The protocol consists of two main steps: matching and designing (Richter <i>et al.</i>, 2011) |
− | The enzyme design algorithm basically is summarized in Fig. B | + | The enzyme design algorithm is briefly summarized in Fig. 2 |
| </div> | | </div> |
| | | |
Line 238: |
Line 237: |
| <li>a “.pdb”-file providing a rough scaffold layout</li> | | <li>a “.pdb”-file providing a rough scaffold layout</li> |
| <li>a “.cst”-file to define the bindings between ligand and scaffold</li> | | <li>a “.cst”-file to define the bindings between ligand and scaffold</li> |
− | <li>a “.pos”-file to define the positions of the amino acids of the scaffold </li> | + | <li>a “.pos”-file to define the positions of the amino acids in the scaffold </li> |
| <li>a “.flags” file to control all inputs. These files are necessary, as they describe the ligand and backbone and specify the parameters of the algorithm </li> | | <li>a “.flags” file to control all inputs. These files are necessary, as they describe the ligand and backbone and specify the parameters of the algorithm </li> |
| </ul> | | </ul> |
Line 297: |
Line 296: |
| The output generated in the matching step is the layout of the scaffold as well as one or more states of the amino acid which enable interaction with the ligand. This information is stored as a “.pdb” file and becomes part of the input for the design step. </br> | | The output generated in the matching step is the layout of the scaffold as well as one or more states of the amino acid which enable interaction with the ligand. This information is stored as a “.pdb” file and becomes part of the input for the design step. </br> |
| <h4>Our results for this step </h4> | | <h4>Our results for this step </h4> |
− | We used the <a href="http://www.rcsb.org/pdb/explore.do?structureId=1j1u">“1J1U”</a>-scaffold from PDB for our matching step. The “1J1U.pdb”-file contains the Tyrosyl-tRNA-synthetase, which is labeld under “Chain A”, the orthogonol tRNA under “Chain B” and the natural ligand Tyrosyl. For our project, we deleted the natural ligand and “Chain B”, because it was not neccerary to change their structure or sequence and it was a way to save computer time and power. | + | We used the <a href="http://www.rcsb.org/pdb/explore.do?structureId=1j1u">“1J1U”</a>-scaffold from PDB for our matching step. |
| + | The “1J1U.pdb”-file contains the tyrosyl-tRNA-synthetase, which is labeld under “Chain A”, the orthogonol tRNA under “Chain B” |
| + | and the natural ligand tyrosyl. For our project, we deleted the natural ligand and “Chain B”, because it was not necessary to |
| + | change their structure or sequence and it was a way to save compute time. |
| | | |
− | We designed the ligands manually by using <a href="https://avogadro.cc/">Avogadro</a>, and for the .cst-file, we choose the default matching algorithm for simulations of both amino acids. | + | We designed the ligands manually via <a href="https://avogadro.cc/">Avogadro</a> based on a default matching algorithm for both |
| + | amino acids, thus creating useful .cst-files. |
| </article> | | </article> |
| </div> | | </div> |
Line 315: |
Line 318: |
| <p>The design step applies an algorithm such that the binding pocket and the near environment are mutated and the remaining scaffold is | | <p>The design step applies an algorithm such that the binding pocket and the near environment are mutated and the remaining scaffold is |
| repacked. Additionally, a badness-of-fit score is generated which indicates how well the mutation fits the amino acid. For every file from the | | repacked. Additionally, a badness-of-fit score is generated which indicates how well the mutation fits the amino acid. For every file from the |
− | matching step, a model with a score and a “.pdb-file” will be generated, specifying where the sequence can be located, and the 3D-structure can be | + | matching step, a model with a score and a “.pdb-file” was generated, specifying where the sequence can be located. Additionally, the ".pdb-file" |
− | analyzed. Notably, the amino acid structure can be extracted separately. | + | makes visual analysis of the 3D-structure possible. Notably, the amino acid structure can be extracted separately. |
− | The following section describes the structure of the design step. For further details on each step, click the technical details button. </br> | + | The following section describes the structure of the design step. Further details on each step can be obtaind by showing the Technical Details Section. </br> |
| 1. Optimizing the catalytic interactions </br> | | 1. Optimizing the catalytic interactions </br> |
| For the first alternative, the file can be generated either by the Rosetta standard or a manually created .”res”- file. For more details, we refer to the | | For the first alternative, the file can be generated either by the Rosetta standard or a manually created .”res”- file. For more details, we refer to the |
Line 334: |
Line 337: |
| </article> | | </article> |
| <article> <p>2. Cycles of sequence design and minimazation within constrains </br> | | <article> <p>2. Cycles of sequence design and minimazation within constrains </br> |
− | To optimize the structure we used applied an iterative optimization algorithm. This algorithm mutates all residues from the backbone, which are not part | + | To optimize the structure we applied an iterative optimization algorithm. This algorithm mutates all residues from the backbone, which are not part |
− | #of the catalytic center, to alanine, and a small energy function refraction will place the ligand in an optimal position to the backbone. </br> | + | of the catalytic center, to alanine, and a small energy function refraction will place the ligand in an optimal position to the backbone. </br> |
| </p> <a class="hidden-expand">TECHNICAL DETAILS</a></article> | | </p> <a class="hidden-expand">TECHNICAL DETAILS</a></article> |
| <article class="hidden-block"> | | <article class="hidden-block"> |
Line 362: |
Line 365: |
| <p><b>Design step outputs </b></p> | | <p><b>Design step outputs </b></p> |
| <article>The output for the design step is a “.pdb”-file containing the mutated scaffold and a “.score”-file. | | <article>The output for the design step is a “.pdb”-file containing the mutated scaffold and a “.score”-file. |
− | For every PDB-file, a line in the score-file is generated, so it is easy to evaluate the given structure. | + | For every .pdb-file, a line in the score-file is generated, so it is easy to evaluate the given structure. |
| The first score in the file is the total score of the model. After that, the number of hydrogen bonds | | The first score in the file is the total score of the model. After that, the number of hydrogen bonds |
− | in the protein as a whole and in the constraints is listed, followed by the number of dismissed polars in the catalytic residues as well in the whole protein and in the constraints. | + | in the protein as a whole and in the constraints is listed, followed by the number of dismissed solutions in the |
| + | catalytic residues as well in the whole protein and in the constraints. |
| See the technical details below for a full overview of the output information </br> | | See the technical details below for a full overview of the output information </br> |
| <a class="hidden-expand">TECHNICAL DETAILS</a></article> | | <a class="hidden-expand">TECHNICAL DETAILS</a></article> |
Line 396: |
Line 400: |
| <div class="content"> | | <div class="content"> |
| <h3> Results </h3> | | <h3> Results </h3> |
− | <h4> Results in silico </h4> | + | <h4> Results <i>in silico</i> </h4> |
| <div class="article"> | | <div class="article"> |
− | We choose our synthetases because of a good total score and a good ligand score. We checked the corresponding PDB-files, and rated the ligand and the binding pocket | + | We choose our synthetases based on a good total score and a good ligand score. We checked the corresponding PDB-files, and rated the ligand and the binding pocket |
| as satisfying, so that the ligand assumedly does not collide with residues in the near environment. | | as satisfying, so that the ligand assumedly does not collide with residues in the near environment. |
| The total scores for CBT are not as good as the scores for NPA. However, the ligand scores are acceptable in both cases. A visual evaluation confirms that the ligand | | The total scores for CBT are not as good as the scores for NPA. However, the ligand scores are acceptable in both cases. A visual evaluation confirms that the ligand |
Line 405: |
Line 409: |
| We used this algorithm to simulate the evolution of the tyrosyl-tRNA with the amino acids Nitrophenylalanine(NPA) and N<sup>γ</sup>‑2‑cyanobenzothiazol‑6‑yl‑L‑asparagine (CBT-asparagine). </br> | | We used this algorithm to simulate the evolution of the tyrosyl-tRNA with the amino acids Nitrophenylalanine(NPA) and N<sup>γ</sup>‑2‑cyanobenzothiazol‑6‑yl‑L‑asparagine (CBT-asparagine). </br> |
| NPA simulation: </br> | | NPA simulation: </br> |
− | We created one .cst-file-block for the nitrogroup of NPA. Since there are two oxygen-atoms in the nitrogroup, | + | The .cst-file contained two blocks for the nitrogroup of NPA. Since there are two oxygen-atoms in the nitrogroup, |
| we defined two atom nametags. As several possibilities are useful, we defined two possible constraint partners | | we defined two atom nametags. As several possibilities are useful, we defined two possible constraint partners |
| for the hydrogen bonds. The first is asparagine (N) or glutamine (Q) and the second is glycine (G). We set the | | for the hydrogen bonds. The first is asparagine (N) or glutamine (Q) and the second is glycine (G). We set the |
| possible distance to 2.8 Å, as it is the optimal distance for hydrogenbonds, and a tolerance level of 0.5 Å. | | possible distance to 2.8 Å, as it is the optimal distance for hydrogenbonds, and a tolerance level of 0.5 Å. |
− | We set the angles to 120° with a tolerance of 40°, as recommended by Florian Richter during our talk in cologne. | + | We set the angles to 120° with a tolerance of 40°, as recommended by Florian Richter during our discussion in cologne. |
| The torsion angles were set to 180° with a tolerance of 180° and a penalty of 0, such that the torsion angles can rotate | | The torsion angles were set to 180° with a tolerance of 180° and a penalty of 0, such that the torsion angles can rotate |
| completely freely.(Richter, unpublished data) </br> | | completely freely.(Richter, unpublished data) </br> |
− | CBT-ASP simulation: </br> | + | CBT-Asparagine simulation: </br> |
− | CBT-ASP can build hydrogen bonds in two ways. The first is a weak hydrogen bond on the | + | CBT-Asparagine can build hydrogen bonds in two ways. The first is a weak hydrogen bond on the |
− | sulphur atom and the other possibility is a normal hydrogen bond on the nitrogen (N<sub>2</sub>) | + | sulfur atom and the other possibility is a normal hydrogen bond on the nitrogen (N<sub>2</sub>) |
− | after the Cγ. We wrote three cst-files, one for a possible bond with sulpur, one for a | + | after the Cγ. We wrote three cst-files, one for a possible bond with sulfur, one for a |
− | possible bond with nitrogen, and one for both bonds. As possible corresponding amino acids, we chose serine, threonine, tyrosine, asparagine, glutamine, and glycine. </br> | + | possible bond with nitrogen, and one for both bonds. As possible corresponding amino acids, we chose serine, |
− | It is recommended to write a “.flags”-file, because there are several input- parameters to be defined, but it is also possible to define them via console user interface. </br> | + | threonine, tyrosine, asparagine, glutamine, and glycine. </br> |
− | For the categorization of the scaffold, we chose the automatic determination and set the following cuts: cut1: 6 Å, cut2: 8 Å, cut3: 10 Å and cut4: 12 Å, like the Baker-lab commonly used. | + | It is recommended to write a “.flags”-file, because there are several input parameters to be defined, |
| + | but it is also possible to define them via console user interface. </br> |
| + | For the categorization of the scaffold, we chose the automatic determination and set the following cuts: cut1: 6 Å,</br> cut2: 8 Å,</br> cut3: 10 Å</br> and cut4: 12 Å,</br> like the Baker-lab commonly used. |
| We used this algrithm to simulate the evolution of the tyrosyl-tRNA with the amino acids Nitrophenylalanine and CBT-ASP | | We used this algrithm to simulate the evolution of the tyrosyl-tRNA with the amino acids Nitrophenylalanine and CBT-ASP |
| We obtained 13 synthetase sequences for CBT-ASP, and 43 sequences for NPA, which fit well into the binding site according to the ROSETTA score. | | We obtained 13 synthetase sequences for CBT-ASP, and 43 sequences for NPA, which fit well into the binding site according to the ROSETTA score. |
− | The sequences for the best synthetases for NPA is avaible | + | The sequences for the best synthetases for NPA is available |
| <a target="_blank"href="https://static.igem.org/mediawiki/2017/1/12/T--Bielefeld-CeBiTec--DKE_NPAseq.pdf">here</a> | | <a target="_blank"href="https://static.igem.org/mediawiki/2017/1/12/T--Bielefeld-CeBiTec--DKE_NPAseq.pdf">here</a> |
| </style> | | </style> |
Line 621: |
Line 627: |
| <td>36</td> | | <td>36</td> |
| <td>2</td> | | <td>2</td> |
− | <td>Glutamine Acid</td> | + | <td>Glutamine acid</td> |
| <td>Isoleucine</td> | | <td>Isoleucine</td> |
| </tr> | | </tr> |
Line 627: |
Line 633: |
| <td>61</td> | | <td>61</td> |
| <td>5</td> | | <td>5</td> |
− | <td>Asparagine Acid</td> | + | <td>Asparagine acid</td> |
| <td>Arginine</td> | | <td>Arginine</td> |
| </tr> | | </tr> |
Line 651: |
Line 657: |
| <td>68</td> | | <td>68</td> |
| <td>4</td> | | <td>4</td> |
− | <td>Asparagine Acid</td> | + | <td>Asparagine acid</td> |
| <td>Alanine</td> | | <td>Alanine</td> |
| </tr> | | </tr> |
Line 664: |
Line 670: |
| <td>2</td> | | <td>2</td> |
| <td>Histidine</td> | | <td>Histidine</td> |
− | <td>Asparagine Acid</td> | + | <td>Asparagine acid</td> |
| </tr> | | </tr> |
| <tr> | | <tr> |
Line 676: |
Line 682: |
| <td>4</td> | | <td>4</td> |
| <td>Tyrosine</td> | | <td>Tyrosine</td> |
− | <td>Glutamine Acid</td> | + | <td>Glutamine acid</td> |
| </tr> | | </tr> |
| <tr> | | <tr> |
Line 694: |
Line 700: |
| <td>2</td> | | <td>2</td> |
| <td>Asparagine</td> | | <td>Asparagine</td> |
− | <td>Asparagine Acid</td> | + | <td>Asparagine acid</td> |
| </tr> | | </tr> |
| <tr> | | <tr> |
Line 712: |
Line 718: |
| <td>5</td> | | <td>5</td> |
| <td>Lysine</td> | | <td>Lysine</td> |
− | <td>Glutamine Acid</td> | + | <td>Glutamine acid</td> |
| </tr> | | </tr> |
| <tr> | | <tr> |
Line 735: |
Line 741: |
| <td>107</td> | | <td>107</td> |
| <td>5</td> | | <td>5</td> |
− | <td>Glutamine Acid</td> | + | <td>Glutamine acid</td> |
| <td>Lysine</td> | | <td>Lysine</td> |
| </tr> | | </tr> |
Line 892: |
Line 898: |
| <td>4</td> | | <td>4</td> |
| <td>Tyrosine</td> | | <td>Tyrosine</td> |
− | <td>Glutamine Acid</td> | + | <td>Glutamine acid</td> |
| </tr> | | </tr> |
| <tr> | | <tr> |
Line 898: |
Line 904: |
| <td>5</td> | | <td>5</td> |
| <td>Tyrosine</td> | | <td>Tyrosine</td> |
− | <td>Asparagine Acid</td> | + | <td>Asparagine acid</td> |
| </tr> | | </tr> |
| <tr> | | <tr> |
Line 921: |
Line 927: |
| <td>172</td> | | <td>172</td> |
| <td>2</td> | | <td>2</td> |
− | <td>Glutamine Acid</td> | + | <td>Glutamine acid</td> |
| <td>Lysine</td> | | <td>Lysine</td> |
| </tr> | | </tr> |
Line 956: |
Line 962: |
| </table> | | </table> |
| </div> | | </div> |
− | <h4> Results in vivo </h4> | + | <h4> <i>in vivo</i> validation of predicted tRNA synthetase structures </h4> |
| <div class="article"> | | <div class="article"> |
− | In order to test the functionality and specificity of our modeled aaRS, we translated a selection of the most promising amino acid sequences into DNA sequences optimized for E.coli and ordered them via gene synthesis. We then used a positive-negative selection system for characterization. The experiment proceeds as follows: | + | In order to test the functionality and specificity of our modeled aaRS, we selected the most promising 11 amino acid sequences and ordered seven synthetase sequences of NPA via IDT, and four |
− | Due to problems with regards to the protein- and salt-concentration, we retransformed the gensyntheses which had been cloned into pSB1C3. In a next step, these syntheses were cotransformed in E.coli(BL21) with our positive selection plasmid. | + | synthetase sequences of CBT-Asparagine by courtesy of Genscript, where we had previously won a grant of 500€. |
− | With regards to CBT2, only the original colony could be transformed. From CBT4 and CBT5, were used each originally isolated clone and its retransformed counterpart. | + | Since the DNA synthesis by GenScript and IDT were delayed by several weeks, we could not perform the best practice characterization of these parts. |
− | Due to the IPTC-induced promoter, we used variants without IPTG, and with 5 mM, 10 mM, and 15 mM added IPTS for all plasmids for the kanamycine resistance. | + | Finally, we received only three of the ordered syntheses in sufficient quality for further experiments. All of them encode predicted CBT-tRNA-synthetases. |
− | We chose additional variants with regards to the antibiotics; one variant each of kanamycine, kanamycine and chloramphenicole, and kanamycine, chloramphenicole and tetracycline. The number of resulting colonies for each variant are summarized in figure X. Our in vivo results show that our in silico designed enzymes did not lead to a loss of functioning.
| + | We subjected the sequences to a positive selection as initial characterization. Plasmids encoding the predicted best candidates were cotransformed with our |
− | We ordered seven synthetase sequences of NPA via IDT, and four synthetase sequences of CBT-Asparagine by courtesy of Genscript, where we had previously
| + | positive selection plasmid into <i>E. coli(BL21 DE3)</i>. Due to the IPTG induced promoter, we tested different IPTG concentrations, including 0 mM, 5 mM, |
− | won a grant of 500€. Due to problems on the part of IDT, the sequences for NPA could not be synthetized. We still provide the sequences for further use below.
| + | 10 mM, and 15 mM. In addition, we tried different concentration of the antibiotics: kan15, cm15/kan15, and cm15/kan15/tet5. The number of resulting colonies for |
− | All further descriptions therefore refer to the synthetase sequences for CBT-Asparagine. </br>
| + | each sample is summarized in figure X. Our <i>in vivo</i> results show that our <i>in silico</i> designed enzymes kept their native function and are |
− | Unfortunately, we were not able to amplify the four sequences or clone them into the detection system pSB3T5. A test digestion revealed that the length of the
| + | able to integrate amino acids through an amber codon matching tRNA. Since these results only indicate the acceptance and transfer of the non-canonical |
− | sequence in the plasmid pUC57, which was the plasmid delivered by Genscript, did not correspond to the sequence length ordered. Therefore, we disregarded synthetase candidate number 1
| + | amino acid, additional experiment are required to demonstrate a high specificity of these enzymes. </br> |
− | for further tests. </br>
| + | We offer the predicted sequences to the community for further characterization via the parts-reg (LINKS). |
− | For the remaining three sequences, we instead utilized the positive-negative selection plasmids for validation of our syntheses.
| + | |
− | After the first, positive selection cycle, colonies formed only if the non-canonical amino acid was present.
| + | |
− | Thus, through the positive selection, we could show that the synthetase did not lead to the loss of functioning of the enzyme.
| + | |
− | To show specificity, we conducted a negative selection as well. We managed to clone the synthetases into the negative selection plasmid,
| + | |
− | but were not able to verify this selection cycle. Therefore, further tests are needed to validate the specificity of the synthetases.
| + | |
− |
| + | |
− |
| + | |
| <div class="contentline"> | | <div class="contentline"> |
| <div class="third"> | | <div class="third"> |