Difference between revisions of "Team:IISc-Bangalore/Design"

 
(63 intermediate revisions by 3 users not shown)
Line 2: Line 2:
 
<html>
 
<html>
 
     <ol id="inPageNav">
 
     <ol id="inPageNav">
<li><a href="#t7-expression-system">T7 expression system</a></li>
+
<li><a href="#t7-expression-system">T7 expression system<img src="https://static.igem.org/mediawiki/2017/6/68/T--IISc-Bangalore--navbar_bullet.png" /></a></li>
<li><a href="#sfgfp-spycatcher">sfGFP-SpyCatcher</a></li>
+
<li><a href="#sfgfp-spycatcher">sfGFP-SpyCatcher<img src="https://static.igem.org/mediawiki/2017/6/68/T--IISc-Bangalore--navbar_bullet.png" /></a></li>
<li><a href="#mcherry-spytag">mCherry-SpyTag</a></li>
+
<li><a href="#mcherry-spytag">mCherry-SpyTag<img src="https://static.igem.org/mediawiki/2017/6/68/T--IISc-Bangalore--navbar_bullet.png" /></a></li>
<li><a href="#gvpc">GvpC fusion</a></li>
+
<li><a href="#gvpc">GvpC<img src="https://static.igem.org/mediawiki/2017/6/68/T--IISc-Bangalore--navbar_bullet.png" /></a></li>
 +
        <li><a href="#other">Miscellaneous<img src="https://static.igem.org/mediawiki/2017/6/68/T--IISc-Bangalore--navbar_bullet.png" /></a></li>
 
     </ol>
 
     </ol>
  
 
<div id="contentMain">
 
<div id="contentMain">
  
<h1 id="t7-expression-system">Designing a T7 expression backbone</h1>
+
<img src="https://static.igem.org/mediawiki/2017/9/9f/T--IISc-Bangalore--Header--Des.svg" id="headerImg" />
  
<p>Our third method to induce gas vesicle aggregation (SpyCatcher-SpyTag binding) involves protein overexpression, and no system is better than <i>E. coli</i> strain BL21 (DE3)'s T7 expression system for this purpose: BL21 (DE3) is deficient in <i>lon</i> and <i>ompT</i> proteases. BL21 (DE3) has the T7 RNA polymerase gene integrated into its genome under the lac operon; adding IPTG induces expression of T7 RNA polymerase, which recognizes the T7 promoter sequence. Any gene inserted downstream of the T7 promoter can thus be expressed.</p>
+
 
 +
<h1 id="t7-expression-system">T7 expression backbone</h1>
 +
 
 +
<p>Aggregating gas vesicles using SpyCatcher-SpyTag binding involves protein overexpression, and no system is better for protein expression than <i>E. coli</i> BL21 (DE3) as its <i>lon</i> and <i>ompT</i> protease deficiency yields a huge amount of protein.</p>
 +
 
 +
<p>BL21 (DE3) is a lysogenic strain that has the T7 RNA polymerase gene integrated into its genome under the lac operon; adding IPTG induces expression of T7 RNA polymerase, which recognizes the T7 promoter sequence. Any gene inserted downstream of the T7 promoter can thus be expressed.</p>
  
 
<p>Using <a href="http://parts.igem.org/Part:BBa_K525998">BBa_K525998</a> (T7 promoter+RBS) and <a href="http://parts.igem.org/Part:BBa_K731721">BBa_K731721</a> (T7 terminator), we have designed a T7 expression backbone that can be used to assemble and express fusion proteins easily.</p>
 
<p>Using <a href="http://parts.igem.org/Part:BBa_K525998">BBa_K525998</a> (T7 promoter+RBS) and <a href="http://parts.igem.org/Part:BBa_K731721">BBa_K731721</a> (T7 terminator), we have designed a T7 expression backbone that can be used to assemble and express fusion proteins easily.</p>
  
 
<figure>
 
<figure>
<img src="https://static.igem.org/mediawiki/2017/d/dc/T--IISc-Bangalore--assembly-BBa_K525998.png" width="30%" height="30%">
+
<img src="https://static.igem.org/mediawiki/2017/8/82/T--IISc-Bangalore--P1-P2.png" width="50%">
</figure>
+
 
+
<figure>
+
<img src="https://static.igem.org/mediawiki/2017/3/35/T--IISc-Bangalore--assembly-BBa_K731721.png" width="30%" height="30%">
+
 
</figure>
 
</figure>
  
 
<h2>Choice of BioBricks</h2>
 
<h2>Choice of BioBricks</h2>
  
<p><a href="http://parts.igem.org/Part:BBa_K525998">BBa_K525998</a> (T7 promoter+RBS) was chosen since a strong RBS <a href="http://parts.igem.org/Part:BBa_B0034">B0034</a> is used, and this allows for maximal expression of our proteins of interest. <a href="http://parts.igem.org/Part:BBa_K731721">BBa_K731721</a> (T7 terminator) was chosen instead of the standard <a href="">B0015</a> double terminator as its <i>in vivo</i> termination efficiency is greater, as characterized by <a href="http://parts.igem.org/Part:BBa_K731700">BBa_K731700</a>.</p>
+
<p><a href="http://parts.igem.org/Part:BBa_K525998">BBa_K525998</a> (T7 promoter+RBS) was chosen as the strong RBS <a href="http://parts.igem.org/Part:BBa_B0034">B0034</a> used allows for maximal protein expression. <a href="http://parts.igem.org/Part:BBa_K731721">BBa_K731721</a> (T7 terminator) was chosen instead of the standard <a href="">B0015</a> double terminator as its <i>in vivo</i> termination efficiency is greater, as characterized by <a href="http://parts.igem.org/Part:BBa_K731700">BBa_K731700</a>.</p>
  
 
<h2>Our First Modification — <a href="http://parts.igem.org/wiki/index.php?title=Part:BBa_K2319001">BBa_K2319001</a> (HindIII+ATG+AgeI scar)</h2>
 
<h2>Our First Modification — <a href="http://parts.igem.org/wiki/index.php?title=Part:BBa_K2319001">BBa_K2319001</a> (HindIII+ATG+AgeI scar)</h2>
  
<p>A HindIII restriction site, a start codon and an AgeI restriction site (<a href="http://parts.igem.org/wiki/index.php?title=Part:BBa_K2319001">BBa_K2319001</a>) are added immediately downstream of the T7 promoter+RBS. A number of design considerations went into the choice of these restriction sites. The HindIII site (A\AGCTT) — sandwiched between the RBS and the start codon — has a sequence very similar to the optimal sequence predicted by the <a href="https://static.igem.org/mediawiki/parts/d/d1/Ribologo-small.gif">sequence logo</a> of <i>E. coli</i> ribosome binding sites. In fact, the HindIII sequence is closer to the optimal sequence than the typical 5'-TACTAG-3' mixed SpeI-XbaI restriction site formed by BioBrick assembly!</p>
+
<p>A HindIII restriction site, a start codon and an AgeI restriction site (<a href="http://parts.igem.org/wiki/index.php?title=Part:BBa_K2319001">BBa_K2319001</a>) are added immediately downstream of the T7 promoter+RBS. A number of design considerations motivated the choice of these restriction sites. The HindIII site (A\AGCTT) — sandwiched between the RBS and the start codon — has a sequence very similar to the optimal sequence predicted by the <a href="https://static.igem.org/mediawiki/parts/d/d1/Ribologo-small.gif">sequence logo</a> of <i>E. coli</i> ribosome binding sites. In fact, the HindIII sequence is closer to the optimal sequence than the typical 5'-TACTAG-3' mixed SpeI-XbaI restriction site formed by BioBrick assembly, improving the initial ribosome-mRNA binding and thereby increasing the rate of translation.</p>
  
 
<figure>
 
<figure>
<img src="https://static.igem.org/mediawiki/2017/7/78/T--IISc-Bangalore--threonine.png" width="40%">
+
<img src="https://static.igem.org/mediawiki/2017/f/f6/T--IISc-Bangalore--amino-acids.png" width="80%">
 +
<br>
 +
<figurecaption>
 +
<b>Figure 1</b>: the amino acids glycine (left), serine (center) and threonine (right)
 +
</figurecaption>
 
</figure>
 
</figure>
  
<figure>
+
<p>The AgeI site was chosen to simplify assembly of fusion proteins in this backbone: by inserting a protein coding sequence at the N-terminus of the existing protein using the HindIII and AgeI sites, a fusion protein can be formed with a benign scar. The AgeI site (A\CCGGT) is translated in-frame to Thr-Gly, amino acids commonly used in linker sequences for fusion proteins. Threonine's hydroxyl group makes it hydrophilic — allowing stabilizing interactions with the aqueous cellular environment — while glycine's small size makes the linker more flexible, allowing both protein domains to fold independently. In addition, the AgeI site is useful if the user wishes to transfer an RFC25-compatible fusion protein (Freiburg format) into our expression system.</p>
<img src="https://static.igem.org/mediawiki/2017/0/00/T--IISc-Bangalore--glycine.png" width="40%">
+
</figure>
+
 
+
<p>The AgeI site was chosen to simplify assembly of fusion proteins in this backbone: by inserting a protein coding sequence at the N-terminus of the existing protein using the HindIII and AgeI sites, a fusion protein can be formed with a benign scar — the AgeI sequence (A\CCGGT) is translated in-frame to Thr-Gly, amino acids commonly used in linker sequences for fusion proteins. Threonine's hydroxyl group makes it hydrophilic — allowing stabilizing interactions with the aqueous cellular environment — while glycine's small size makes the linker more flexible, allowing independent folding of both protein domains. In addition, the AgeI site is useful if the user wishes to transfer a fusion protein in RFC25 (Freiburg format) into our expression system.</p>
+
  
 
<h2>Our Second Modification — <a href="http://parts.igem.org/wiki/index.php?title=Part:BBa_K2319004">BBa_K2319004</a> (TAAG)</h2>
 
<h2>Our Second Modification — <a href="http://parts.igem.org/wiki/index.php?title=Part:BBa_K2319004">BBa_K2319004</a> (TAAG)</h2>
  
<p>A stop codon (5'-TAA-3') and the nucleotide G are added immediately upstream of <a href="http://parts.igem.org/Part:BBa_K731721">BBa_K731721</a> (T7 terminator). This extra stop codon (assuming the fusion protein sequence has its own) ensures that translation is halted and prevents any translational read-through. The extra nucleotide G is added for a more subtle purpose: when added just before the T7 terminator sequence (5'-CTAGC...TTTTG-3'), it forms the NheI restriction site (G\CTAGC). This allows any fusion protein to be inserted into our expression backbone using the HindIII and NheI sites.</p>
+
<p>A stop codon (TAA) and the nucleotide G are added immediately upstream of <a href="http://parts.igem.org/Part:BBa_K731721">BBa_K731721</a> (T7 terminator). This extra stop codon (assuming the fusion protein sequence has its own) ensures that translation is halted and prevents any translational read-through. The extra nucleotide G is added for a more subtle purpose: when placed just before the T7 terminator sequence (5'-CTAGC...TTTTG-3'), it forms the NheI restriction site (G\CTAGC). This allows any fusion protein to be inserted into our expression backbone using the HindIII and NheI sites.</p>
  
<p>Note that all the restriction sites we have chosen to insert are supplied by NEB as High Fidelity (HF) versions for optimal double digestions and downstream processes in NEB's CutSmart buffer.</p>
+
<h2>Using the T7 expression backbone</h2>
  
<table>
+
<figure>
<tr>
+
<img src="https://static.igem.org/mediawiki/2017/6/66/T--IISc-Bangalore--L12.png" width="100%">
<th colspan="8">Restriction enzymes used in our assembly</th>
+
<br>
</tr>
+
<figurecaption>
<tr>
+
<b>Figure 2</b>: T7 expression backbone showing HindIII, AgeI and NheI sites
<th rowspan="2">Restriction Enzyme</th>
+
</figurecaption>
<th rowspan="2">Sequence</th>
+
</figure>
<th colspan="4">Activity in NEBuffers (%)</th>
+
<th rowspan="2">Incubation temperature</th>
+
<th rowspan="2">Heat inactivation</th>
+
</tr>
+
<tr>
+
<th>1.1</th>
+
<th>2.1</th>
+
<th>3.1</th>
+
<th>CutSmart</th>
+
</tr>
+
+
<tr>
+
<td>AgeI</td>
+
<td>A\CCGGT</td>
+
<td>100</td>
+
<td>75</td>
+
<td>25</td>
+
<td>75</td>
+
<td>37°C</td>
+
<td>65°C</td>
+
</tr>
+
<tr>
+
<td>AgeI-HF</td>
+
<td>A\CCGGT</td>
+
<td>100</td>
+
<td>50</td>
+
<td>10</td>
+
<td>100</td>
+
<td>37°C</td>
+
<td>65°C</td>
+
</tr>
+
<tr>
+
<td>BamHI</td>
+
<td>G\GATCC</td>
+
<td>75*</td>
+
<td>100*</td>
+
<td>100</td>
+
<td>100*</td>
+
<td>37°C</td>
+
<td>—</td>
+
</tr>
+
<tr>
+
<td>BamHI-HF</td>
+
<td>G\GATCC</td>
+
<td>100</td>
+
<td>50</td>
+
<td>10</td>
+
<td>100</td>
+
<td>37°C</td>
+
<td>—</td>
+
</tr>
+
<tr>
+
<td>HindIII</td>
+
<td>A\AGCTT</td>
+
<td>25</td>
+
<td>100</td>
+
<td>50</td>
+
<td>50</td>
+
<td>37°C</td>
+
<td>80°C</td>
+
</tr>
+
<tr>
+
<td>HindIII-HF</td>
+
<td>A\AGCTT</td>
+
<td>10</td>
+
<td>100</td>
+
<td>10</td>
+
<td>100</td>
+
<td>37°C</td>
+
<td>80°C</td>
+
</tr>
+
<tr>
+
<td>HindIII</td>
+
<td>A\AGCTT</td>
+
<td>25</td>
+
<td>100</td>
+
<td>50</td>
+
<td>50</td>
+
<td>37°C</td>
+
<td>80°C</td>
+
</tr>
+
<tr>
+
<td>HindIII-HF</td>
+
<td>A\AGCTT</td>
+
<td>10</td>
+
<td>100</td>
+
<td>10</td>
+
<td>100</td>
+
<td>37°C</td>
+
<td>80°C</td>
+
</tr><tr>
+
<td>NcoI</td>
+
<td>C\CATGG</td>
+
<td>100</td>
+
<td>100</td>
+
<td>100</td>
+
<td>100</td>
+
<td>37°C</td>
+
<td>80°C</td>
+
</tr>
+
<tr>
+
<td>NcoI-HF</td>
+
<td>C\CATGG</td>
+
<td>50</td>
+
<td>100</td>
+
<td>10</td>
+
<td>100</td>
+
<td>37°C</td>
+
<td>80°C</td>
+
</tr>
+
<tr>
+
<td>NheI</td>
+
<td>G\CTAGC</td>
+
<td>100</td>
+
<td>100</td>
+
<td>10</td>
+
<td>100</td>
+
<td>37°C</td>
+
<td>65°C</td>
+
</tr>
+
<tr>
+
<td>NheI-HF</td>
+
<td>G\CTAGC</td>
+
<td>100</td>
+
<td>25</td>
+
<td>10</td>
+
<td>100</td>
+
<td>37°C</td>
+
<td>80°C</td>
+
</tr>
+
<tr>
+
<td colspan="8">* denotes star activity</td>
+
</tr>
+
+
</table>
+
  
 +
<h3>Expressing any protein of interest</h3>
 +
<p>This T7 expression backbone can be used to express any protein if its coding sequence (with a start codon) is inserted using the HindIII and NheI sites. These sites can be added to the coding sequence using PCR with primers having 5'-overhangs.</p>
  
<h1 id="sfgfp-spycatcher">Designing a fusion protein: sfGFP-SpyCatcher</h1>
+
<h3>Fusing a protein domain at the N-terminus of an existing protein</h3>
 +
<p>By inserting the coding sequence of a protein domain (including the start codon) using the HindIII and AgeI sites into the T7 expression backbone (which already contains a protein coding sequence), an N-terminal fusion can be performed.</p>
  
<p>Using <a href="http://parts.igem.org/Part:BBa_K1321337">BBa_K1321337</a> (sfGFP in Freiburg format) and <a href="http://parts.igem.org/Part:BBa_K731721">BBa_K1650037</a> (SpyCatcher), we make the fusion protein sfGFP-SpyCatcher.</p>
+
<h1 id="sfgfp-spycatcher">sfGFP-SpyCatcher</h1>
 +
 
 +
<p>Using <a href="http://parts.igem.org/Part:BBa_K1321337">BBa_K1321337</a> (sfGFP in Freiburg format) and <a href="http://parts.igem.org/Part:BBa_K731721">BBa_K1650037</a> (SpyCatcher), we make the fusion protein sfGFP-SpyCatcher to be expressed on the gas vesicle surface after fusion to GvpC.</p>
 +
 
 +
<figure>
 +
<img src="https://static.igem.org/mediawiki/2017/c/c3/T--IISc-Bangalore--P3-P4.png" width="50%">
 +
</figure>
  
 
<h2>Choice of BioBricks</h2>
 
<h2>Choice of BioBricks</h2>
Line 200: Line 76:
 
<p><a href="http://parts.igem.org/Part:BBa_K1321337">BBa_K1321337</a> (sfGFP in Freiburg format) was chosen because superfolder GFP exhibits intense bright green fluorescence (making it easy to assay) and folds easily into an extremely stable structure — its half-life for unfolding at room temperature is estimated at 28 years! <a href="http://parts.igem.org/Part:BBa_K1650037">BBa_K1650037</a> (SpyCatcher) was chosen as it possesses a 6xHis tag near the N-terminus, which allows the fusion protein to be easily purified using a Ni-NTA column.</p>
 
<p><a href="http://parts.igem.org/Part:BBa_K1321337">BBa_K1321337</a> (sfGFP in Freiburg format) was chosen because superfolder GFP exhibits intense bright green fluorescence (making it easy to assay) and folds easily into an extremely stable structure — its half-life for unfolding at room temperature is estimated at 28 years! <a href="http://parts.igem.org/Part:BBa_K1650037">BBa_K1650037</a> (SpyCatcher) was chosen as it possesses a 6xHis tag near the N-terminus, which allows the fusion protein to be easily purified using a Ni-NTA column.</p>
  
<figure>
+
<h2>Our Modification — <a href="http://parts.igem.org/wiki/index.php?title=Part:BBa_K23190032">BBa_K2319002</a> (GGSGSGSS linker)</h2>
<img src="https://static.igem.org/mediawiki/2017/1/1d/T--IISc-Bangalore--assembly-BBa_K1321337.png" width="49%" height="49%">
+
 
</figure>
+
<p>Between the sfGFP and SpyCatcher protein domains, we have inserted a short (8 aa), flexible, hydrophilic linker comprising Gly and Ser residues that allows sfGFP and SpyCatcher to fold independently of each other. Serine, like threonine, has a hydroxyl group that contributes to stability by interacting with the aqueous cellular environment. Glycine, again due to its small size, maintains flexibility of the linker.</p>
 +
 
 +
<p>This linker region also contains a BamHI restriction site (G\GATCC) whose in-frame translation is Gly-Ser. This BamHI site is used to link these two protein domains during assembly of the fusion construct.</p>
 +
 
 +
<h1 id="mcherry-spytag">mCherry-SpyTag</h1>
 +
 
 +
<p>Using <a href="http://parts.igem.org/Part:BBa_J18932">BBa_J18932</a> (mCherry RFP) and two well-designed oligos, we plan to produce mCherry-SpyTag.<p>
  
 
<figure>
 
<figure>
<img src="https://static.igem.org/mediawiki/2017/0/0e/T--IISc-Bangalore--assembly-BBa_K1650037.png" width="49%" height="49%">
+
<img src="https://static.igem.org/mediawiki/2017/0/0f/T--IISc-Bangalore--P5.png" width="50%">
 
</figure>
 
</figure>
  
<h2>Our Modification — <a href="http://parts.igem.org/wiki/index.php?title=Part:BBa_K23190032">BBa_K2319002</a> (GGSGSGSS linker)</h2>
+
<h2>Choice of BioBricks</h2>
 +
 
 +
<p><a href="http://parts.igem.org/Part:BBa_J18932">BBa_J18932</a> (mCherry RFP) has an interesting flaw: an internal ATG near the N-terminus has a RBS-like sequence preceding it; this hidden translation start site leads to ~50% truncation of the produced mCherry protein!</p>
 +
 
 +
<h2>Our First Modification — Improved mCherry </h2>
 +
<p>Using an <i>in silico</i> analysis of RBS strengths using an online RBS Calculator, we modified the nucleotide sequence preceding the translation start site to become a far weaker RBS while maintaining the same amino acid sequence. This inhibits translation initiation at that position by almost 75% (predicted) and so reduces the truncation of the protein.</p>
  
<p>Between the sfGFP and SpyCatcher protein domains, we have inserted a short (8 aa), flexible, hydrophilic linker comprising glycine and serine residues that allows sfGFP and SpyCatcher to fold independently of each other. Serine, like threonine, has a hydroxyl group that contributes to stability by interacting with the aqueous cellular environment. Glycine, again due to its small size, maintains flexibility of the linker.</p>
+
<pre align="middle">BBa_J18932_mCherry      1 GTGAGCAAAGGCGAGGAAGATAACATG    27
 +
                  |||...|||||.||.||||||||.|||
 +
Improved_mCherry        1 GTGTCTAAAGGTGAAGAAGATAATATG    27</pre>
  
<p>This linker region also contains a BamHI restriction site (5'-G\GATCC-3') whose in-frame translation is Gly-Ser. This BamHI site is used to link these two protein domains during assembly of the fusion construct.</p>
+
<p>By exploiting a natural NdeI site (CA\TATG) occurring right after the modified sequence, we insert <a href="http://parts.igem.org/Part:BBa_K2319006">BBa_K2319006</a> comprising a HindIII site (for insertion into the T7 expression backbone), a start codon (ATG), and a 6xHis-tag (for easy Ni-NTA column-based protein purification), and the modified sequence preceding the hidden translation start site.</p>
  
<h1 id="mcherry-spytag">Designing a fusion protein: mCherry-SpyTag</h1>
+
<h2>Our Second Modification — SpyTag Linker</h2>
<img src="https://static.igem.org/mediawiki/2017/4/40/T--IISc-Bangalore--assembly-BBa_J18932.png" width="49%" height="49%">
+
<p>Using another oligo, we insert <a href="http://parts.igem.org/Part:BBa_K2319008">BBa_K2319008</a> comprising a BamHI site (for insertion after the mCherry sequence), a GSGGGGS linker (for independent folding) and the SpyTag peptide sequence AHIVMVDAYKPTK. By doing this, we add SpyTag functionality to the mCherry protein, allowing it to bind to our sfGFP-SpyCatcher.</p>
  
 
<h1 id="gvpc">GvpC fusion</h1>
 
<h1 id="gvpc">GvpC fusion</h1>
 +
 +
<p>After assembling our sfGFP-SpyCatcher and mCherry-SpyTag fusion proteins, we have to express them on the gas vesicle surface: this is where our AgeI sites come in handy — by using HindIII and AgeI, we can insert the <i>gvpC</i> sequence into this expression backbone to make the GvpC fusion proteins. Our project involves gas vesicles extracted from <i>Anabaena flos-aquae</i> and <i>Halobacterium salinarum</i> NRC-1 — we need the <i>gvpC</i> genes from both these organisms, but as always, there are design considerations to keep in mind.</p>
 +
 +
<h2>Our Modification — GvpC of <i>Halobacterium salinarum</i> NRC-1</h2>
 +
 +
<p><i>H. salinarum</i> is a halophile that tolerates hypersaline environments with ease — using a "salt-in" strategy, its cellular environment has evolved to be hypersaline itself! The vast majority of proteins synthesized by H. salinarum have negatively-charged acidic side chains and require 4-5 M salt concentrations to function. GvpC is one such protein. GvpC contains seven internal repeats of the domain that interlocks between GvpA "ribs" and strengthens the gas vesicle, followed by an acidic tail at the C-terminus which stabilizes the protein through effective solvation.</p>
 +
 +
<pre>
 +
<b>GvpC protein sequence from <i>Halobacterium salinarum</i> NRC-1, showing seven internal repeats and acidic tail</b>
 +
 +
                        MSVTDKRDEMSTARDKFAESQQEFESYADEFAADITAKQDDVSDLVDAITDFQAEMTNTT
 +
                                                |  | |||||      |    | |  |     
 +
                                              DAFHTYGDEFAAEVDHLRADIDAQRDVIREMQ     
 +
                                              |||  | | ||        ||      |         
 +
                                              DAFEAYADIFATDIADKQ-DIGNLLAAIEALRTEMNSTH
 +
                                              ||||||| || | |    ||  | |||    |   
 +
                                              GAFEAYADDFAADVAALR-DISDLVAAIDDFQEEFIAVQ
 +
                                              ||  || || |        |  | ||| |    | | 
 +
                                              DAFDNYAGDFDAE-------IDQLHAAIADQHDSFDATA
 +
                                              |||  |  |                             
 +
                                              DAFAEYRDEFYRIEVEALLEAINDFQQDIGDFRAEFETTE
 +
                                              |||      ||  |  |  |                  |
 +
                                              DAFVAFARDFYGHEITAEEGAAEAEAEPVEADADVEAEAE*
 +
                                            #VSPDEAGGESAGTEEEETEPAEVETAAPEVEGSPADTADE
 +
                                              AEDTEAEEETEEEAPEDMVQCRVCGEYYQAITEPHLQTHD
 +
                                              MTIQEYRDEYGEDVPLRPDDKT
 +
 +
* denotes the truncation site (denoted C3 truncation site)
 +
# denotes the start of the acidic tail (note the abundance of aspartate and glutamate residues)
 +
Adapted from DasSarma et al (2013)
 +
</pre>
 +
 +
<p>For our proposed fusions, this is a problem: haloarchaeal gas vesicles "shrug off" their surface GvpC if high salt concentrations are not maintained. As a result, we removed this acidic tail that destabilizes the GvpC-gas vesicle binding at low salt concentrations, and used this truncated GvpC for our fusions.</p>
 +
 +
<h2>Our Modification — (GGGGS)2 linker + AgeI</h2>
 +
 +
<p>Our second modification to both <i>gvpC</i> sequences is an addition of a C-terminal (GGGGS)2 linker (for proper folding of the GvpC domain) and an AgeI site (for insertion into our sfGFP-SpyCatcher and mCherry-SpyTag backbones).</p>
 +
 +
<h1 id="other">Miscellaneous</h1>
 +
 +
<h2>Codon optimization</h2>
 +
 +
<p>As far as possible, we have manually codon-optimized the fusion proteins for the amino acids we are adding using our primers — His, Gly, Ser, Ala — using the codon table of <i>E. coli</i> and an online codon usage tool.</p>
 +
 +
<figure>
 +
<img src="https://static.igem.org/mediawiki/2017/f/f4/T--IISc-Bangalore--assembly-codon-usage.png" width="100%">
 +
<br>
 +
<figurecaption>
 +
<b>Figure 3</b>: Codon usage in E. coli genes
 +
</figurecaption>
 +
</figure>
 +
 +
<h2>Choice of restriction enzymes</h2>
 +
 +
<p>Note that all the restriction sites we have chosen to insert are supplied by NEB as High Fidelity (HF) versions for optimal double digestions and downstream processes in NEB's CutSmart buffer.</p>
  
 
</div>
 
</div>
Line 225: Line 169:
 
   changeHash: true     
 
   changeHash: true     
 
});
 
});
 +
 +
var height = $('#headerImg').height();
 +
    window.onscroll = function() {myFunction()};
 +
 +
    function myFunction() {
 +
        if (document.body.scrollTop > height || document.documentElement.scrollTop > height) {
 +
            $("#inPageNav").fadeIn(200);
 +
        } else {
 +
            $("#inPageNav").fadeOut(200);
 +
        }
 +
    }
 
</script>
 
</script>
  
 
</html>
 
</html>

Latest revision as of 02:06, 2 November 2017

  1. T7 expression system
  2. sfGFP-SpyCatcher
  3. mCherry-SpyTag
  4. GvpC
  5. Miscellaneous

T7 expression backbone

Aggregating gas vesicles using SpyCatcher-SpyTag binding involves protein overexpression, and no system is better for protein expression than E. coli BL21 (DE3) as its lon and ompT protease deficiency yields a huge amount of protein.

BL21 (DE3) is a lysogenic strain that has the T7 RNA polymerase gene integrated into its genome under the lac operon; adding IPTG induces expression of T7 RNA polymerase, which recognizes the T7 promoter sequence. Any gene inserted downstream of the T7 promoter can thus be expressed.

Using BBa_K525998 (T7 promoter+RBS) and BBa_K731721 (T7 terminator), we have designed a T7 expression backbone that can be used to assemble and express fusion proteins easily.

Choice of BioBricks

BBa_K525998 (T7 promoter+RBS) was chosen as the strong RBS B0034 used allows for maximal protein expression. BBa_K731721 (T7 terminator) was chosen instead of the standard B0015 double terminator as its in vivo termination efficiency is greater, as characterized by BBa_K731700.

Our First Modification — BBa_K2319001 (HindIII+ATG+AgeI scar)

A HindIII restriction site, a start codon and an AgeI restriction site (BBa_K2319001) are added immediately downstream of the T7 promoter+RBS. A number of design considerations motivated the choice of these restriction sites. The HindIII site (A\AGCTT) — sandwiched between the RBS and the start codon — has a sequence very similar to the optimal sequence predicted by the sequence logo of E. coli ribosome binding sites. In fact, the HindIII sequence is closer to the optimal sequence than the typical 5'-TACTAG-3' mixed SpeI-XbaI restriction site formed by BioBrick assembly, improving the initial ribosome-mRNA binding and thereby increasing the rate of translation.


Figure 1: the amino acids glycine (left), serine (center) and threonine (right)

The AgeI site was chosen to simplify assembly of fusion proteins in this backbone: by inserting a protein coding sequence at the N-terminus of the existing protein using the HindIII and AgeI sites, a fusion protein can be formed with a benign scar. The AgeI site (A\CCGGT) is translated in-frame to Thr-Gly, amino acids commonly used in linker sequences for fusion proteins. Threonine's hydroxyl group makes it hydrophilic — allowing stabilizing interactions with the aqueous cellular environment — while glycine's small size makes the linker more flexible, allowing both protein domains to fold independently. In addition, the AgeI site is useful if the user wishes to transfer an RFC25-compatible fusion protein (Freiburg format) into our expression system.

Our Second Modification — BBa_K2319004 (TAAG)

A stop codon (TAA) and the nucleotide G are added immediately upstream of BBa_K731721 (T7 terminator). This extra stop codon (assuming the fusion protein sequence has its own) ensures that translation is halted and prevents any translational read-through. The extra nucleotide G is added for a more subtle purpose: when placed just before the T7 terminator sequence (5'-CTAGC...TTTTG-3'), it forms the NheI restriction site (G\CTAGC). This allows any fusion protein to be inserted into our expression backbone using the HindIII and NheI sites.

Using the T7 expression backbone


Figure 2: T7 expression backbone showing HindIII, AgeI and NheI sites

Expressing any protein of interest

This T7 expression backbone can be used to express any protein if its coding sequence (with a start codon) is inserted using the HindIII and NheI sites. These sites can be added to the coding sequence using PCR with primers having 5'-overhangs.

Fusing a protein domain at the N-terminus of an existing protein

By inserting the coding sequence of a protein domain (including the start codon) using the HindIII and AgeI sites into the T7 expression backbone (which already contains a protein coding sequence), an N-terminal fusion can be performed.

sfGFP-SpyCatcher

Using BBa_K1321337 (sfGFP in Freiburg format) and BBa_K1650037 (SpyCatcher), we make the fusion protein sfGFP-SpyCatcher to be expressed on the gas vesicle surface after fusion to GvpC.

Choice of BioBricks

BBa_K1321337 (sfGFP in Freiburg format) was chosen because superfolder GFP exhibits intense bright green fluorescence (making it easy to assay) and folds easily into an extremely stable structure — its half-life for unfolding at room temperature is estimated at 28 years! BBa_K1650037 (SpyCatcher) was chosen as it possesses a 6xHis tag near the N-terminus, which allows the fusion protein to be easily purified using a Ni-NTA column.

Our Modification — BBa_K2319002 (GGSGSGSS linker)

Between the sfGFP and SpyCatcher protein domains, we have inserted a short (8 aa), flexible, hydrophilic linker comprising Gly and Ser residues that allows sfGFP and SpyCatcher to fold independently of each other. Serine, like threonine, has a hydroxyl group that contributes to stability by interacting with the aqueous cellular environment. Glycine, again due to its small size, maintains flexibility of the linker.

This linker region also contains a BamHI restriction site (G\GATCC) whose in-frame translation is Gly-Ser. This BamHI site is used to link these two protein domains during assembly of the fusion construct.

mCherry-SpyTag

Using BBa_J18932 (mCherry RFP) and two well-designed oligos, we plan to produce mCherry-SpyTag.

Choice of BioBricks

BBa_J18932 (mCherry RFP) has an interesting flaw: an internal ATG near the N-terminus has a RBS-like sequence preceding it; this hidden translation start site leads to ~50% truncation of the produced mCherry protein!

Our First Modification — Improved mCherry

Using an in silico analysis of RBS strengths using an online RBS Calculator, we modified the nucleotide sequence preceding the translation start site to become a far weaker RBS while maintaining the same amino acid sequence. This inhibits translation initiation at that position by almost 75% (predicted) and so reduces the truncation of the protein.

BBa_J18932_mCherry      1 GTGAGCAAAGGCGAGGAAGATAACATG     27
                   |||...|||||.||.||||||||.|||
Improved_mCherry        1 GTGTCTAAAGGTGAAGAAGATAATATG     27

By exploiting a natural NdeI site (CA\TATG) occurring right after the modified sequence, we insert BBa_K2319006 comprising a HindIII site (for insertion into the T7 expression backbone), a start codon (ATG), and a 6xHis-tag (for easy Ni-NTA column-based protein purification), and the modified sequence preceding the hidden translation start site.

Our Second Modification — SpyTag Linker

Using another oligo, we insert BBa_K2319008 comprising a BamHI site (for insertion after the mCherry sequence), a GSGGGGS linker (for independent folding) and the SpyTag peptide sequence AHIVMVDAYKPTK. By doing this, we add SpyTag functionality to the mCherry protein, allowing it to bind to our sfGFP-SpyCatcher.

GvpC fusion

After assembling our sfGFP-SpyCatcher and mCherry-SpyTag fusion proteins, we have to express them on the gas vesicle surface: this is where our AgeI sites come in handy — by using HindIII and AgeI, we can insert the gvpC sequence into this expression backbone to make the GvpC fusion proteins. Our project involves gas vesicles extracted from Anabaena flos-aquae and Halobacterium salinarum NRC-1 — we need the gvpC genes from both these organisms, but as always, there are design considerations to keep in mind.

Our Modification — GvpC of Halobacterium salinarum NRC-1

H. salinarum is a halophile that tolerates hypersaline environments with ease — using a "salt-in" strategy, its cellular environment has evolved to be hypersaline itself! The vast majority of proteins synthesized by H. salinarum have negatively-charged acidic side chains and require 4-5 M salt concentrations to function. GvpC is one such protein. GvpC contains seven internal repeats of the domain that interlocks between GvpA "ribs" and strengthens the gas vesicle, followed by an acidic tail at the C-terminus which stabilizes the protein through effective solvation.

GvpC protein sequence from Halobacterium salinarum NRC-1, showing seven internal repeats and acidic tail

                         MSVTDKRDEMSTARDKFAESQQEFESYADEFAADITAKQDDVSDLVDAITDFQAEMTNTT
                                                |  | |||||       |     | |   |       
                                              DAFHTYGDEFAAEVDHLRADIDAQRDVIREMQ       
                                              |||  | | ||        ||      |           
                                              DAFEAYADIFATDIADKQ-DIGNLLAAIEALRTEMNSTH
                                               ||||||| || | |    ||  | |||     |     
                                              GAFEAYADDFAADVAALR-DISDLVAAIDDFQEEFIAVQ
                                               ||  || || |        |  | ||| |    | |  
                                              DAFDNYAGDFDAE-------IDQLHAAIADQHDSFDATA
                                              |||  |   |                              
                                              DAFAEYRDEFYRIEVEALLEAINDFQQDIGDFRAEFETTE
                                              |||      ||  |  |   |                  |
                                              DAFVAFARDFYGHEITAEEGAAEAEAEPVEADADVEAEAE*
                                             #VSPDEAGGESAGTEEEETEPAEVETAAPEVEGSPADTADE
                                              AEDTEAEEETEEEAPEDMVQCRVCGEYYQAITEPHLQTHD
                                              MTIQEYRDEYGEDVPLRPDDKT

* denotes the truncation site (denoted C3 truncation site)
# denotes the start of the acidic tail (note the abundance of aspartate and glutamate residues)
Adapted from DasSarma et al (2013)

For our proposed fusions, this is a problem: haloarchaeal gas vesicles "shrug off" their surface GvpC if high salt concentrations are not maintained. As a result, we removed this acidic tail that destabilizes the GvpC-gas vesicle binding at low salt concentrations, and used this truncated GvpC for our fusions.

Our Modification — (GGGGS)2 linker + AgeI

Our second modification to both gvpC sequences is an addition of a C-terminal (GGGGS)2 linker (for proper folding of the GvpC domain) and an AgeI site (for insertion into our sfGFP-SpyCatcher and mCherry-SpyTag backbones).

Miscellaneous

Codon optimization

As far as possible, we have manually codon-optimized the fusion proteins for the amino acids we are adding using our primers — His, Gly, Ser, Ala — using the codon table of E. coli and an online codon usage tool.


Figure 3: Codon usage in E. coli genes

Choice of restriction enzymes

Note that all the restriction sites we have chosen to insert are supplied by NEB as High Fidelity (HF) versions for optimal double digestions and downstream processes in NEB's CutSmart buffer.