Difference between revisions of "Team:BOKU-Vienna/Description"

Line 99: Line 99:
 
         <div class="row">
 
         <div class="row">
 
             <div class="col-lg-8 col-lg-offset-2">
 
             <div class="col-lg-8 col-lg-offset-2">
                <h2>Description.</h2>
 
 
                
 
                
<p>
 
<br>
 
<p><h4>Operation: PETase - only the tip of the iceberg</h4></p>
 
  
<p class="justify">Over the past few decades the use of plastic has become exceedingly crucial in our daily lives. Plastic can be found almost everywhere, from common household objects to the undeniably beneficial medical applications. The reason why plastic has become that widely utilized is due to its many different properties. However, while all petro-based polymers show diverse attributes, there is one fundamental rule that binds them together - they need a considerably long time to decompose which can lead to devastating environmental consequences. PETase is a newly discovered enzyme able to break down highly resistant PET (polyethylene terephthalate) by hydrolyzation. It presents a promising alternative towards solving the ongoing plastic pollution problem. Still, the enzyme itself is not efficient enough to be applied in the field yet.
+
<p style="text-align: justify;"><strong>Directed Evolution &ndash; a (very) short overview</strong></p>
<br>
+
<p style="text-align: justify;">After decades of continuous methodological advancement and increasing understanding of biomolecule structure and functionality directed evolution nevertheless is still the most powerful method in protein or aptamer engineering. Classic <em>in vitro</em> strategies, however, require substantial effort in terms of lab work and time investment to perform several consecutive rounds of evolution<sup>1,2</sup>. In order to automate the laborious process scientists have tried to devise systems that traverse the four steps of Darwinian evolution (mutation, expression, selection, replication) in a continuous cycle <em>in vivo</em> (reviewed in <sup>3</sup>). In the earliest approaches this was achieved by simply cloning the gene of interest into mutator <em>E. coli</em> strains<sup>4</sup> showing reduced DNA replication fidelity or into <em>E. coli </em>strains carrying inducible mutator plasmids<sup>5</sup>. While these settings proved to be useful for the generation of complex multifactorial phenotypes like organic solvent tolerance<sup>5</sup> they cannot provide the regional selectivity that is desired in single-gene protein evolution as globally enhanced mutagenesis leads to slow growth and reduced transformation efficiency<sup>6</sup> in addition to obscured phenotypic expression due to unwanted off-target mutations<sup>7,8</sup>.&nbsp;</p>
With our newly devised method D.I.V.E.R.T. we intend to enhance the catalytic activity of PETase in pursuit of unlocking its full potential. Since D.I.V.E.R.T. is a generalized concept, it can also be applied to other enzymes as well as binding proteins in any host organism. Presumably, it has the potential to revolutionize the face of directed evolution. </p>    
+
<p style="text-align: justify;">Therefore, the ideal system for <em>in vivo</em> directed evolution would avoid those side effects by subjecting the host organism to locally confined hypermutation. Such an arrangement would allow the researcher to rapidly mutate and evolve a defined single sequence or gene while leaving the rest of the genome unchanged. Several strategies to locally constrain enhanced mutation rates have been devised so far; including plasmids harboring regions with low replication fidelity<sup>9</sup>, elaborate phage-assisted systems confining the accumulation of mutations to the phage genome while keeping the overall mutational load in the cell population in a steady state<sup>10</sup> or retrotransposon-based methods<sup>11</sup>. Although definitely representing a large step towards the right direction, still none of the designs mentioned allow the continuous mutation of a single copy of a single gene <em>in vivo</em>. With D.I.V.E.R.T. we want to build a system that does.</p>
 +
<p><span style="font-size: 11.0pt; line-height: 107%; font-family: Calibri;"><br style="page-break-before: always;" /> </span></p>
 +
<p>&nbsp;</p>
 +
<p style="text-align: justify;"><strong>The D.I.V.E.R.T. concept</strong></p>
 +
<p style="text-align: justify;">In their work published in 2016 Crook et al. probably generated the very first retrotransposon-based system for <em>in vivo </em>directed evolution by inserting a gene of interest into a truncated version of the native Ty1 retrotransposon in <em>S. cerevisiae </em>(<em>Figure 1</em>)<sup>11</sup><em>.</em> Thus, the GOI is subjected to the retrotransposon life cycle and continuously mutated due to the error-prone nature of Ty1 reverse transcriptase (low fidelity is inherent to most reverse transcriptases<sup>12</sup>).&nbsp;</p>
 +
<p style="text-align: justify;">Figure 1: Scheme of the design used by Crook et al. in <sup>11</sup>.</p>
 +
<p style="text-align: justify;">Although having delivered impressive results such as a mutation rate of 0.15&nbsp;kb<sup>-1&shy;</sup> the concept still holds room for improvement as argued by Zheng et al. in their review of targeted mutagenesis<sup>13</sup>.&nbsp; For example, Ty1 as a mobile genetic element can reintegrate anywhere in the genome, preferred upstream of genes transcribed by RNA polymerase III<sup>14</sup> possibly reducing selection efficiency due to the presence of multiple copies of the heterologous gene in a single cell. Additionally, in this setting the reverse transcriptase likewise is continuously mutated increasing the risk of inactivation as time carries on.</p>
 +
<p style="text-align: justify;">Drawing inspiration from this retroelement-based approach and having its potential weaknesses in mind we started thinking about the benefits of a more universal hypermutation strategy relying on reverse transcription of the gene of interest and reintegration of the generated cDNA such as:</p>
 +
<ol>
 +
<li style="text-align: justify; text-indent: -18.0pt;">Site-specific reintegration would make sure to replace the original gene variant maintaining single-copy status for optimal selection behavior.</li>
 +
<li style="text-align: justify; text-indent: -18.0pt;">Expression of the required reverse transcriptase <em>in trans</em> rather than within the synthetic retroelement eliminates the risk of inactivating the enzyme by mutation.</li>
 +
</ol>
 +
<p style="text-align: justify;">In such a system only the GOI would be mutated and remain being present in just a single copy. The overall scheme of our D.I.V.E.R.T. (<u>d</u>irected <em><u>i</u>n <u>v</u>ivo</em> <u>e</u>volution via <u>r</u>everse <u>t</u>ranscription) concept is depicted in <em>Figure 2</em>.</p>
 +
<p style="text-align: justify;">&nbsp;</p>
 +
<p style="text-align: justify;">Figure 2: General scheme of the D.I.V.E.R.T. cycle</p>
 +
<p>&nbsp;</p>
 +
<p style="text-align: justify;"><strong>Theoretical considerations</strong></p>
 +
<p style="text-align: justify;">To sum up: we wanted to build a fully synthetic retrotransposon-like genetic element that would allow our gene of interest to continuously undergo the retrotransposon life cycle accumulating mutations over time. The main events that need to be functionally implemented for such a system to work would be the <em>in vivo </em>reverse transcription carried out by a heterologous reverse transcriptase as well as site-specific recombination. For both processes several options are available. The reasoning that goes into our choices as well as some other thoughts are briefly explained below.</p>
 +
<p style="text-align: justify;"><em>&nbsp;</em></p>
 +
<p style="text-align: justify;"><strong>Host range</strong></p>
 +
<p style="text-align: justify;">Not being reliant on host-specific factors (like the Ty1 retrotransposon in yeast) but only on heterologous components (RT and recombinase) D.I.V.E.R.T. &ndash; at least in general &ndash; should be applicable in a broad range of organisms as we wanted to show by performing our proof of concept experiment in yeast as well as in <em>E. coli</em>. Some minor adjustments in regard to the host, however, need to be made. For example, the relevant proteins have to be tagged with a NLS in eukaryotic systems and ribosome vs. reverse transcriptase interactions might play a role in prokaryotes. More details can be found in the D.I.V.E.R.T. experiment section. (<strong><span style="color: red;">LINK)</span></strong></p>
 +
<p style="text-align: justify;">&nbsp;</p>
 +
<p style="text-align: justify;"><strong>RT choice and priming conditions:</strong></p>
 +
<p style="text-align: justify;">In the early phase of project planning we spent quite some time on gathering relevant information about a variety of reverse transcriptases including processivity, mutation rate, mutational spectra and RNase H activity. Finally, we chose to use Moloney murine leukemia virus (MMLV) RT mainly due to it being the best characterized monomeric<sup>15</sup> reverse transcriptase while the active forms of many other RTs (e.g. HIV<sup>16</sup> and ASLV<sup>17</sup> RTs) are heterodimers. Picking a monomeric enzyme allowed us to save some IDT DNA synthesis credits; a wise decision, since we managed to use up all the credits for gBlocks, primers and DNA oligos during lab work in the summer months. As an additional requirement MMLV RT also shows RNase H activity which might help to synthesize double-stranded cDNA by degrading the RNA template after the first cDNA strand has been generated and we knew that active enzyme could be expressed in <em><span style="font-size: 10.0pt; line-height: 107%;">E. coli</span></em><sup><span style="font-size: 10.0pt; line-height: 107%;">18</span></sup><span style="font-size: 10.0pt; line-height: 107%;">. </span></p>
 +
<p style="text-align: justify;">In its natural context MMLV RT uses a mouse tRNA<sup>Pro</sup> for priming<sup>19,20</sup>. It has been shown, though, that MMLV RT shows not very stringent preferences and that MMLV can replicate using different tRNAs as long as the PBS is complementary to the 3&rsquo; end of the tRNA<sup>21</sup>. From <em>in vitro</em> studies (and cDNA synthesis kits for RT-PCRs) we furthermore know that MMLV RT can also initiate replication using DNA or RNA oligonucleotides with the general sequence of efficiency being DNA-oligo, tRNA, RNA-oligo<sup>22</sup>. Unfortunately, finding data on <em>in vivo </em>priming conditions for any heterologous reverse transcriptase in <em>E. coli</em> is difficult as apparently only one study has performed reverse transcription in <em>E. coli</em> so far<sup>23</sup>. In this work that focused on the generation of ssDNA for the creation of DNA nanostructures <em>in vivo</em> Elbaz et al. designed the 3&rsquo; end of their mRNA in a way so that it formed a distinct structure that on one hand acted as an transcription terminator while on the other hand it promoted efficient priming of reverse transcription by dimeric HIV RT.</p>
 +
<p style="text-align: justify;">So, after an extensive literature research, those were the priming conditions we found to be worthwhile to consider:</p>
 +
<ul>
 +
<li style="text-align: justify; text-indent: -18.0pt;">RNA oligos transcribed <em>in trans</em> featuring a defined 3&rsquo; end</li>
 +
<li style="text-align: justify; text-indent: -18.0pt;">DNA oligos generated using a native retroelement (e.g. retrons in <em> coli</em>; used in a similar fashion in <sup>24</sup>)</li>
 +
<li style="text-align: justify; text-indent: -18.0pt;">The tRNA corresponding to the heterologous RT (in case of MMLV tRNA<sup>Pro</sup> from mouse; transcribed <em>in trans</em> with a defined 3&rsquo; end)</li>
 +
<li style="text-align: justify; text-indent: -18.0pt;">A tRNA of the host organism (is already present in the cell, no need for worrying about primer production)</li>
 +
<li style="text-align: justify; text-indent: -18.0pt;">Self-priming using a simple hairpin or a more sophisticated structure at the mRNA 3&rsquo; end like in <sup>23</sup>.</li>
 +
</ul>
 +
<p style="text-align: justify;">For ease of implementation and broad host generality we opted for using RNA oligos in our &ldquo;lucky shot&rdquo; D.I.V.E.R.T. experiment (<strong><span style="color: red;">LINK</span></strong>). Nevertheless, being able to prime with a native tRNA would be most convenient. Hence, in our priming condition assay (<strong><span style="color: red;">LINK)</span></strong> we tested 5 of the most abundant tRNAs in <em>E. coli</em> for initiating reverse transcription with MMLV RT.</p>
 +
<p style="text-align: justify;"><strong>&nbsp;</strong></p>
 +
<p style="text-align: justify;"><strong>Reintegration of the generated cDNA</strong></p>
 +
<p style="text-align: justify;">Once the mRNA has been reverse transcribed and potentially mutated it has to replace the original copy of the gene of interest. Again, different methods are available:</p>
 +
<ul>
 +
<li style="text-align: justify; text-indent: -18.0pt;">Site-specific recombinase-mediated recombination (analogous to recombinase-mediated cassette exchange)<sup>25,26</sup></li>
 +
<li style="text-align: justify; text-indent: -18.0pt;">Homologous recombination</li>
 +
</ul>
 +
<p style="text-align: justify;">For our D.I.V.E.R.T. experiment (<strong><span style="color: red;">LINK</span></strong>) we decided to employ the Flp/FRT site-specific recombination system as described in <sup>25</sup>. However, extended FRT sites needed for efficient recombination are palindromic and form stable hairpins possibly acting as transcription (or reverse transcription) terminators. To evaluate this potential problem we determined the bidirectional termination efficiency of extended FRT sites using the classic terminator strength assay. (<strong><span style="color: red;">LINK)</span></strong></p>
 +
<p style="text-align: justify;">&nbsp;</p>
 +
<p style="text-align: justify;"><strong>Does it work? Find a rigorous proof</strong></p>
 +
<p style="text-align: justify;">In the early phase of development the mutation rates achieved might be very low as the full D.I.V.E.R.T. cycle will rarely be completed. Hence, using mutation frequency as an indicator or test on whether the system works might be problematic. Crook et al.<sup>11</sup> solved this problem by using a very elegant and rigorous proof of concept assay (<em>Figure 1</em><em>, b</em>) that was inspired by a test on recombination frequency of the Ty1 element<sup>27</sup>. Basically, they chose a selectable marker as cargo (i.e. GOI) that was inserted into the retrotransposon in reverse orientation. The GOI was disrupted by an intron which again was reversely oriented with respect to the cargo (i.e. oriented in the retrotransposon&rsquo;s sense direction). Hence, upon transcription of the cargo the mRNA would contain the reverse complement of the intron which would not be spliced leading to nonfunctional protein. Upon transcription of the retroelement, however, the intron would end up in the right orientation and going to be spliced. After reverse transcription and reintegration the intron would not be present in the new version of the retroelement leading to functional protein and selectable cells.</p>
 +
<p style="text-align: justify;">For our D.I.V.E.R.T. experiments we used a similar concept, details can be found in the experiments section. (<strong><span style="color: red;">LINK</span></strong>)</p>
 +
<p>&nbsp;</p>
  
 
 
 
        </div>
 
        </div>
 
    </section>
 
    <section id="design" class="container content-section text-center">
 
        <div class="row">
 
            <div class="col-lg-8 col-lg-offset-2">
 
                <h2>Design.</h2>
 
 
                
 
                
<br>
+
</div></div>
<p><h4> D.I.V.E.R.T. – <u>d</u>irected <i> <u>i</u>n <u>v</u>ivo </i> <u>e</u>volution via <u>r</u>everse <u>t</u>ranscription </h4></p>
+
 
+
<p><h4> Abstract </h4></p>
+
 
+
<p><i>In vivo</i> continuous directed evolution offers significant advantages over classic <i>in vitro</i> methods as it drastically reduces the amount of time and actual lab work that needs to be invested. Most current approaches, however, are based on globally enhanced mutagenesis rates eventually leading to unwanted off-target mutations that interfere with the experiment. Here, we present the concept of a new continuous <i>in vivo</i> evolution strategy that allows complete spatial control of mutagenesis by cyclically using an RNA intermediate which finally replaces the original DNA cargo at the respective locus in the genome after it has been reverse-transcribed in an error prone way.</p>
+
 
+
 
+
 
+
<p><h4>Introduction:</h4></p>
+
 
+
<p>Finding and enhancing desired properties via directed evolution remains to be a key strategy in the engineering of biomolecules as rational design approaches are still limited due to deficient understanding of sequence to structure and function relations. The drawback of traditional <i>in vitro</i> laboratory evolution techniques, though, is that most steps in the procedure involve substantial human intervention and a single round of mutation, transformation, protein expression and selection usually requires several days to be completed<sup>1</sup> while for optimal results multiple rounds need to be performed<sup>2</sup>.</p>
+
 
+
<p>Most of the <i>in vivo</i> methods that have been devised by scientists to overcome these difficulties so far either rely on global mutagenesis providing no spatial control, employ fusion proteins of DNA binding motifs and mutating enzymes like AID that only utilize a small section of the available sequence space or involve the assistance of phages imposing limits for the selection of phenotypes not well linkable to phage propagation<sup>3</sup>. Other approaches utilize engineered error-prone DNA polymerase I for replication of a cargo-carrying plasmid<sup>4</sup>. This way the mutational load on the genome is minimized, but regulatory elements on the plasmid are mutated just as likely as the coding sequence leading to differences in expression levels rather than protein activity. Also, since cells contain multiple copies of the gene of interest another pre-selection transformation step is necessary to finally generate the library that can be screened.</p>
+
 
+
<p>The iGEM project presented here allows for spatially fully controlled hypermutation in a means that is theoretically generalizable to prokaryotic as well as eukaryotic cells by creating an “artificial retroelement”. The idea takes the work of Crook et al.<sup>5</sup> who inserted genetic cargo into a yeast retrotransposon to achieve continuous mutagenesis a step further by generating a system that undergoes the retrotransposon life cycle using an error prone reverse transcriptase (RT) that is encoded somewhere in trans rather than attaching the cargo to an already existing retroelement. This way, the concept can be extended to host species that lack suitable own retroelements like bacteria. Furthermore, the gene encoding the necessary reverse transcriptase is not mutated and regulatory sequences can be protected by placing them outside of the mutation cassette.</p>
+
 
+
 
+
<p><h4>Detailed project description:</h4></p>
+
 
+
<p><h5>General:</h5></p>
+
 
+
<p>One D.I.V.E.R.T. cycle comprises transcription, reverse transcription and reintegration of the gene of interest into the genome replacing the original copy (<i>Figure 1</i>). Since reverse transcriptases generally show poor fidelity mutations accumulate within the cargo while cycling. This overall scheme serves as an underlying principle for more elaborate arrangements described below. The individual approaches mainly vary in primer usage as well as the recombination mechanism applied. To find the optimal setup we plan on implementing all of them in <i>E. coli</i> and <i>S. cerevisiae</i> during the lab work phase starting in July.</p>
+
 
+
<img class="invert" src="https://static.igem.org/mediawiki/2017/2/21/T--BOKU-Vienna--Describtion-Figure1.jpeg" width="750" height="347">
+
<br><i><div style="font-size:80%;"> Figure 1: General scheme of D.I.V.E.R.T. RNA is transparent. </div></i></br>
+
 
+
+
<p><h5>Host difference considerations:</h5></p>
+
 
+
<p>The following measures are taken in respect to the differences in mRNA processing between yeast and bacteria:
+
 
+
 
+
<br>• In <i>S. cerevisiae</i> all relevant proteins are fused to a NLS to facilitate transport into the nucleus.</br>
+
 
+
<br>• To avoid interference between RT and ribosomes in <i>E. coli</i> the gene cassettes used are designed in a way that the transcript that is subject to reverse transcription is generated from the non-template DNA strand. This leads to a somewhat more complicated bidirectional cassette design.</br>
+
 
+
<br>• The <i>in vivo</i> generation of short RNAs in eukaryotes imposes some difficulties that also arise when transcribing gRNAs required by Cas9 in the CRISPR/Cas9 technology (lack of inducible promotors for RNA Pol III, unfavorable processing of transcripts from RNA Pol II). These can be overcome by using self-cleaving ribozymes enclosing the actual functional small RNA<sup>6</sup>. To have temporary control over primer generation in yeast we also intend to use this technique.</br>
+
<br></br>
+
 
+
Also, in <i>E. coli</i> all functions necessary for D.I.V.E.R.T. (RT, Flp, primers) are encoded on a plasmid in contrast to being integrated into the genome via homologous recombination in yeast.</p>
+
 
+
 
+
<p><h5>Approach I – Flp-FRT mediated recombination:</h5></p>
+
 
+
<p>Just like any other DNA polymerase RT relies on a double stranded priming segment to kick off DNA synthesis. In Approach I an exogenous RNA primer is transcribed in trans and binds to the cargo mRNA enabling reverse transcription of the first strand (<i>Figure 2</i>).</p>
+
 
+
<img class="invert" src="https://static.igem.org/mediawiki/2017/5/5f/T--BOKU-Vienna--Describtion-Figure2.jpg" width="750" height="347">
+
<br><i><div style="font-size:80%;"> Figure 2: D.I.V.E.R.T. Approach I; RNA is represented transparently. </div></i></br>
+
+
<p>The mRNA template then is degraded by the RNase H activity of the reverse transcriptase before the next primer can bind to finish reverse transcription which leads to a double-stranded pre-reintegration intermediate containing eventual point mutations that have been acquired in the process. The newly generated dsDNA as well as the original copy on the chromosome contain FRT sequences resulting in recombination through Flp-recombinase.</p>
+
 
+
<p>Drawbacks of this approach are that it relies on many components to work (primer, RT, Flp) and that mutations can occur within the FRT sequence, potentially impeding recombination.</p>
+
 
+
<p><h5>Approach II – homologous recombination:</h5></p>
+
 
+
<p>Employing homologous recombination instead of the Flp-FRT system allows to cut down to two components that need to be expressed in the host aside from the cargo. This is only true for yeast, though, since proteins of the lambda Red system are required to facilitate homologous recombination in <i>E. coli</i><sup>7</sup>. The dsDNA produced by reverse transcription, however, should display at least one 3’ overhang (depending on when in the process the RNA primers are degraded by RNase H, <i>Figure 3</i>) so that only the beta protein from phage lambda is absolutely necessary.</p>
+
 
+
<p>To facilitate recombination it might even be sufficient to work in a <i>RecA</i><sup>+</sup> strain hence relying solely on the cell’s endogenous HR system equivalent to what is done in yeast. Both cases require the genes of the exogenous factors (primers and RT) to be integrated into the genome since plasmids cannot be stably sustained in recombination positive <i>E. coli</i> strains. For the project we will just go with beta, though.</p>
+
 
+
<img class="invert" src="https://static.igem.org/mediawiki/2017/3/3b/T--BOKU-Vienna--Describtion-Figure3.jpg" width="750" height="347">
+
<br><i><div style="font-size:80%;"> Figure 3: D.I.V.E.R.T. Approach II; the possible forms of the dsDNA prior recombination are depicted on the right. RNA is represented transparently. </div></i></br>
+
+
<p>An additional advantage of this strategy is that it is more robust against unwanted mutations compared to Approach I where propagation of one single SNP in the FRT sequence might disable efficient recombination while it takes a higher number of point mutations in the primer binding site to actually prevent binding.</p>
+
 
+
 
+
 
+
<p><h5>Approach III – ssDNA mediated recombination and self-priming:</h5></p>
+
 
+
<p>Recombination in yeast<sup>8,9</sup> as well as in <i>E. coli</i><sup>10</sup> can be promoted by ssDNA oligonucleotides and there is even research suggesting that Red mediated dsDNA homologous recombination in <i>E. coli</i> involves a fully single-stranded intermediate<sup>11</sup>. This is exploited in Approach III where (like Approach II) recombination only depends on the lambda beta protein (in <i>E. coli</i>) or does not need any exogenous factor at all (in yeast). To further reduce the number of components needed and since RT only requires one priming event to generate ssDNA, the cargo features complementary sequences leading to a dumbbell structure that can act as a double stranded priming site at the mRNA’s 3’ end (<i>Figure 4</i>). Due to RNase H activity only the ssDNA remains and is ready for recombination.</p>
+
 
+
<img class="invert" src="https://static.igem.org/mediawiki/2017/5/52/T--BOKU-Vienna--Describtion-Figure4_2.jpg" width="750" height="347">
+
<br><i><div style="font-size:80%;"> Figure 4: D.I.V.E.R.T. Approach III; RNA is represented transparently. </div></i></br>
+
 
+
<p>This concept allows D.I.V.E.R.T. to only require one component (the reverse transcriptase) in addition to the cargo cassette but also holds some disadvantages: To facilitate effective recombination, for example, one must make sure that the resulting ssDNA is complementary to the lagging strand of DNA replication. Moreover, ssDNA mediated recombineering is usually done using small oligonucleotides up to 100 nt and there is hardly any literature on the efficiency of incorporating longer DNA strands. Lastly, the secondary structure at the 3’ end might interfere with the secondary structure involved in transcription termination in <i>E. coli</i> and in yeast mRNA processing of the 3’ end most probably acts as a competing process. Thus, the feasibility of Approach III in yeast is highly speculative. We plan on trying to form the dumbbell structure by adding a sufficient number of Ts at the 3’ end of the cargo that should interact with the poly(A)tail but are skeptical.</p>
+
 
+
 
+
 
+
<p><h4>Applications and Implications:</h4></p>
+
 
+
<p>Rapidly mutating a single gene, pathway, regulatory sequence or any other cargo within a stable genome by just setting up the experiment and walking away sounds too good to be true. By allowing just this, a system like D.I.V.E.R.T. could drastically reduce the time and labor needed for carrying out traditional directed evolution experiments and also come in handy in many other fields of synthetic biology. Moreover, in spite of reverse transcriptases not offering error rates as high as those provided by some <i>in vitro</i> mutagenesis methods (e.g. error-prone PCR), <i>in vivo</i>-based techniques nonetheless enable the generation of larger libraries due to limits in transformation efficiency inherent to <i>in vitro</i> procedures. After optimizing their experimental design Crook et al. reported a potential library size of 3.7 × 10<sup>−2</sup> per round and cell. Since yeast and especially <i>E. coli</i> can reach cell densities of above 10<sup>12</sup> L<sup>-1</sup> (yeast only in controlled fermentation) theoretically infinite library sizes are only a matter of time and medium volume.</p>
+
 
+
<p>However, there is still considerable potential for optimization. For instance, mutation rates could be increased by engineering RTs or the RNA polymerase used for cargo-transcription towards lower fidelity. Also, the possible occurrence of unwanted mutations in sequences necessary for cycling (e.g. the FRT sequence in Approach I or primer binding sites in Approach II), although not posing an immediate problem due to the relatively short length of those sequences (and the robustness in primer binding in respect to single point mutations), could still be eradicated in future applications.</p>
+
 
+
<p>To summarize, we strongly believe that current methods, although finding broad application (e.g. in enzyme or antibody engineering), by far do not exploit the full potential of directed evolution. The ideal technique would enable researchers to unrestrictedly mutate distinct sequences without having to leave the living system. On the path towards this ultimate goal something like D.I.V.E.R.T. can be the next step.</p>
+
+
<p><h5>References:</h5></p>
+
 
+
<a href="https://www.ncbi.nlm.nih.gov/pubmed/16148303" target="_blank"><p style="font-size:70%;">1.Yuan L, Kurek I, English J, Keenan R. Laboratory-directed protein evolution. Microbiol Mol Biol Rev. 2005;69(3):373-392. doi:10.1128/MMBR.69.3.373-392.2005.</a>
+
 
+
<a href="https://www.ncbi.nlm.nih.gov/pubmed/11050933" target="_blank"><p style="font-size:70%;">2. Voigt C a, Kauffman S, Wang ZG. Rational evolutionary design: the theory of in vitro protein evolution. Adv Protein Chem. 2000;55:79-160. doi:10.1016/S0065-3233(01)55003-2.</a>
+
 
+
<a href="https://www.ncbi.nlm.nih.gov/pubmed/25461718" target="_blank"><p style="font-size:70%;">3. Badran AH, Liu DR. In vivo continuous directed evolution. Curr Opin Chem Biol. 2015;24:1-10. doi:10.1016/j.cbpa.2014.09.040.</a>
+
 
+
<a href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4112372/" target="_blank"><p style="font-size:70%;">4. Alexander DL, Lilly J, Hernandez J, Romsdahl J, Troll CJ, Camps M. Random mutagenesis by error-prone Pol I plasmid replication in Escherichia coli. Methods Mol Biol. 2014;1179:31-44. doi:10.1007/978-1-4939-1053-3_3.</a>
+
 
+
<a href="http://www.nature.com/articles/ncomms13051" target="_blank"><p style="font-size:70%;">5. Crook N, Abatemarco J, Sun J, Wagner JM, Schmitz A, Alper HS. In vivo continuous evolution of genes and pathways in yeast. Nat Commun. 2016;7:13051. doi:10.1038/ncomms13051.</a>
+
 
+
<a href="http://onlinelibrary.wiley.com/doi/10.1111/jipb.12152/full" target="_blank"><p style="font-size:70%;">6. Gao Y, Zhao Y. Self-processing of ribozyme-flanked RNAs into guide RNAs in vitro and in vivo for CRISPR-mediated genome editing. J Integr Plant Biol. 2014;56(4):343-349. doi:10.1111/jipb.12152.</a>
+
 
+
<a href="https://www.ncbi.nlm.nih.gov/pubmed/19180090" target="_blank"><p style="font-size:70%;">7. Sharan SK, Thomason LC, Kuznetsov SG, Court DL. Recombineering: a homologous recombination-based method of genetic engineering. Nat Protoc. 2009;4(2):206-223. doi:10.1038/nprot.2008.227.</a>
+
 
+
<a href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC365363" target="_blank"><p style="font-size:70%;">8. Simon JR, Moore PD. Homologous recombination between single-stranded DNA and chromosomal genes in Saccharomyces cerevisiae. Mol Cell Biol. 1987;7(7):2329-2334. doi:10.1128/MCB.7.7.2329.Updated.</a>
+
 
+
<a href="https://www.ncbi.nlm.nih.gov/pubmed/24160921" target="_blank"><p style="font-size:70%;">9. Dicarlo JE, Conley AJ, PenttiläM, JäJ, Wang HH, Church GM. Yeast Oligo-Mediated Genome Engineering (YOGE). 2013. doi:10.1021/sb400117c.</a>
+
 
+
<a href="https://www.ncbi.nlm.nih.gov/pubmed/11381128" target="_blank"><p style="font-size:70%;">10. Ellis HM, Yu D, DiTizio T, Court DL. High efficiency mutagenesis, repair, and engineering of chromosomal DNA using single-stranded oligonucleotides. Proc Natl Acad Sci. 2001;98(12):6742-6746. doi:10.1073/pnas.121164898.</a>
+
 
+
<a href="https://www.ncbi.nlm.nih.gov/pubmed/20813883" target="_blank"><p style="font-size:70%;">11. Mosberg JA, Lajoie MJ, Church GM. Lambda red recombineering in Escherichia coli occurs through a fully single-stranded intermediate. Genetics. 2010;186(3):791-799. doi:10.1534/genetics.110.120782.</a>
+
           
+
            </div>
+
        </div>
+
    </section>
+
 
+
    <section id="experiments" class="container content-section text-center">
+
        <div class="row">
+
            <div class="col-lg-8 col-lg-offset-2">
+
                <h2>Experiments.</h2>
+
                <p>Under construction... </p>
+
            </div>
+
        </div>
+
    </section>
+
 
+
 
+
    <section id="results" class="container content-section text-center">
+
        <div class="row">
+
            <div class="col-lg-8 col-lg-offset-2">
+
                <h2>Results.</h2>
+
                <p>Under construction... </p>
+
            </div>
+
        </div>
+
 
     </section>
 
     </section>
  

Revision as of 10:36, 30 October 2017

Theory

V

Directed Evolution – a (very) short overview

After decades of continuous methodological advancement and increasing understanding of biomolecule structure and functionality directed evolution nevertheless is still the most powerful method in protein or aptamer engineering. Classic in vitro strategies, however, require substantial effort in terms of lab work and time investment to perform several consecutive rounds of evolution1,2. In order to automate the laborious process scientists have tried to devise systems that traverse the four steps of Darwinian evolution (mutation, expression, selection, replication) in a continuous cycle in vivo (reviewed in 3). In the earliest approaches this was achieved by simply cloning the gene of interest into mutator E. coli strains4 showing reduced DNA replication fidelity or into E. coli strains carrying inducible mutator plasmids5. While these settings proved to be useful for the generation of complex multifactorial phenotypes like organic solvent tolerance5 they cannot provide the regional selectivity that is desired in single-gene protein evolution as globally enhanced mutagenesis leads to slow growth and reduced transformation efficiency6 in addition to obscured phenotypic expression due to unwanted off-target mutations7,8

Therefore, the ideal system for in vivo directed evolution would avoid those side effects by subjecting the host organism to locally confined hypermutation. Such an arrangement would allow the researcher to rapidly mutate and evolve a defined single sequence or gene while leaving the rest of the genome unchanged. Several strategies to locally constrain enhanced mutation rates have been devised so far; including plasmids harboring regions with low replication fidelity9, elaborate phage-assisted systems confining the accumulation of mutations to the phage genome while keeping the overall mutational load in the cell population in a steady state10 or retrotransposon-based methods11. Although definitely representing a large step towards the right direction, still none of the designs mentioned allow the continuous mutation of a single copy of a single gene in vivo. With D.I.V.E.R.T. we want to build a system that does.


 

The D.I.V.E.R.T. concept

In their work published in 2016 Crook et al. probably generated the very first retrotransposon-based system for in vivo directed evolution by inserting a gene of interest into a truncated version of the native Ty1 retrotransposon in S. cerevisiae (Figure 1)11. Thus, the GOI is subjected to the retrotransposon life cycle and continuously mutated due to the error-prone nature of Ty1 reverse transcriptase (low fidelity is inherent to most reverse transcriptases12). 

Figure 1: Scheme of the design used by Crook et al. in 11.

Although having delivered impressive results such as a mutation rate of 0.15 kb-1­ the concept still holds room for improvement as argued by Zheng et al. in their review of targeted mutagenesis13.  For example, Ty1 as a mobile genetic element can reintegrate anywhere in the genome, preferred upstream of genes transcribed by RNA polymerase III14 possibly reducing selection efficiency due to the presence of multiple copies of the heterologous gene in a single cell. Additionally, in this setting the reverse transcriptase likewise is continuously mutated increasing the risk of inactivation as time carries on.

Drawing inspiration from this retroelement-based approach and having its potential weaknesses in mind we started thinking about the benefits of a more universal hypermutation strategy relying on reverse transcription of the gene of interest and reintegration of the generated cDNA such as:

  1. Site-specific reintegration would make sure to replace the original gene variant maintaining single-copy status for optimal selection behavior.
  2. Expression of the required reverse transcriptase in trans rather than within the synthetic retroelement eliminates the risk of inactivating the enzyme by mutation.

In such a system only the GOI would be mutated and remain being present in just a single copy. The overall scheme of our D.I.V.E.R.T. (directed in vivo evolution via reverse transcription) concept is depicted in Figure 2.

 

Figure 2: General scheme of the D.I.V.E.R.T. cycle

 

Theoretical considerations

To sum up: we wanted to build a fully synthetic retrotransposon-like genetic element that would allow our gene of interest to continuously undergo the retrotransposon life cycle accumulating mutations over time. The main events that need to be functionally implemented for such a system to work would be the in vivo reverse transcription carried out by a heterologous reverse transcriptase as well as site-specific recombination. For both processes several options are available. The reasoning that goes into our choices as well as some other thoughts are briefly explained below.

 

Host range

Not being reliant on host-specific factors (like the Ty1 retrotransposon in yeast) but only on heterologous components (RT and recombinase) D.I.V.E.R.T. – at least in general – should be applicable in a broad range of organisms as we wanted to show by performing our proof of concept experiment in yeast as well as in E. coli. Some minor adjustments in regard to the host, however, need to be made. For example, the relevant proteins have to be tagged with a NLS in eukaryotic systems and ribosome vs. reverse transcriptase interactions might play a role in prokaryotes. More details can be found in the D.I.V.E.R.T. experiment section. (LINK)

 

RT choice and priming conditions:

In the early phase of project planning we spent quite some time on gathering relevant information about a variety of reverse transcriptases including processivity, mutation rate, mutational spectra and RNase H activity. Finally, we chose to use Moloney murine leukemia virus (MMLV) RT mainly due to it being the best characterized monomeric15 reverse transcriptase while the active forms of many other RTs (e.g. HIV16 and ASLV17 RTs) are heterodimers. Picking a monomeric enzyme allowed us to save some IDT DNA synthesis credits; a wise decision, since we managed to use up all the credits for gBlocks, primers and DNA oligos during lab work in the summer months. As an additional requirement MMLV RT also shows RNase H activity which might help to synthesize double-stranded cDNA by degrading the RNA template after the first cDNA strand has been generated and we knew that active enzyme could be expressed in E. coli18.

In its natural context MMLV RT uses a mouse tRNAPro for priming19,20. It has been shown, though, that MMLV RT shows not very stringent preferences and that MMLV can replicate using different tRNAs as long as the PBS is complementary to the 3’ end of the tRNA21. From in vitro studies (and cDNA synthesis kits for RT-PCRs) we furthermore know that MMLV RT can also initiate replication using DNA or RNA oligonucleotides with the general sequence of efficiency being DNA-oligo, tRNA, RNA-oligo22. Unfortunately, finding data on in vivo priming conditions for any heterologous reverse transcriptase in E. coli is difficult as apparently only one study has performed reverse transcription in E. coli so far23. In this work that focused on the generation of ssDNA for the creation of DNA nanostructures in vivo Elbaz et al. designed the 3’ end of their mRNA in a way so that it formed a distinct structure that on one hand acted as an transcription terminator while on the other hand it promoted efficient priming of reverse transcription by dimeric HIV RT.

So, after an extensive literature research, those were the priming conditions we found to be worthwhile to consider:

  • RNA oligos transcribed in trans featuring a defined 3’ end
  • DNA oligos generated using a native retroelement (e.g. retrons in coli; used in a similar fashion in 24)
  • The tRNA corresponding to the heterologous RT (in case of MMLV tRNAPro from mouse; transcribed in trans with a defined 3’ end)
  • A tRNA of the host organism (is already present in the cell, no need for worrying about primer production)
  • Self-priming using a simple hairpin or a more sophisticated structure at the mRNA 3’ end like in 23.

For ease of implementation and broad host generality we opted for using RNA oligos in our “lucky shot” D.I.V.E.R.T. experiment (LINK). Nevertheless, being able to prime with a native tRNA would be most convenient. Hence, in our priming condition assay (LINK) we tested 5 of the most abundant tRNAs in E. coli for initiating reverse transcription with MMLV RT.

 

Reintegration of the generated cDNA

Once the mRNA has been reverse transcribed and potentially mutated it has to replace the original copy of the gene of interest. Again, different methods are available:

  • Site-specific recombinase-mediated recombination (analogous to recombinase-mediated cassette exchange)25,26
  • Homologous recombination

For our D.I.V.E.R.T. experiment (LINK) we decided to employ the Flp/FRT site-specific recombination system as described in 25. However, extended FRT sites needed for efficient recombination are palindromic and form stable hairpins possibly acting as transcription (or reverse transcription) terminators. To evaluate this potential problem we determined the bidirectional termination efficiency of extended FRT sites using the classic terminator strength assay. (LINK)

 

Does it work? Find a rigorous proof

In the early phase of development the mutation rates achieved might be very low as the full D.I.V.E.R.T. cycle will rarely be completed. Hence, using mutation frequency as an indicator or test on whether the system works might be problematic. Crook et al.11 solved this problem by using a very elegant and rigorous proof of concept assay (Figure 1, b) that was inspired by a test on recombination frequency of the Ty1 element27. Basically, they chose a selectable marker as cargo (i.e. GOI) that was inserted into the retrotransposon in reverse orientation. The GOI was disrupted by an intron which again was reversely oriented with respect to the cargo (i.e. oriented in the retrotransposon’s sense direction). Hence, upon transcription of the cargo the mRNA would contain the reverse complement of the intron which would not be spliced leading to nonfunctional protein. Upon transcription of the retroelement, however, the intron would end up in the right orientation and going to be spliced. After reverse transcription and reintegration the intron would not be present in the new version of the retroelement leading to functional protein and selectable cells.

For our D.I.V.E.R.T. experiments we used a similar concept, details can be found in the experiments section. (LINK)