Difference between revisions of "Team:UNOTT/Modelling"

 
(5 intermediate revisions by the same user not shown)
Line 175: Line 175:
 
a{
 
a{
 
     color: #ffffff;
 
     color: #ffffff;
  font-size: 18px;
+
    font-size: 18px;
 
}
 
}
 
html {
 
html {
Line 251: Line 251:
 
<br> </br>
 
<br> </br>
  
<p style="text-align: center;" >  Figure 1 </p>
 
 
$$ \color{white}{ p \underset{t_{1} }{\rightarrow} m \underset{t_{2}}{\rightarrow} p  } $$
 
$$ \color{white}{ p \underset{t_{1} }{\rightarrow} m \underset{t_{2}}{\rightarrow} p  } $$
 
<p> The equation above describes the process of which the gene undergoes transcription to produce mRNA. The mRNA carries the genetic information copied from the DNA which codes for protein. The expression of protein lead to fluorescence which is the desired output of the system. </p>
 
<p> The equation above describes the process of which the gene undergoes transcription to produce mRNA. The mRNA carries the genetic information copied from the DNA which codes for protein. The expression of protein lead to fluorescence which is the desired output of the system. </p>
  
<p style="text-align: center;" >  Figure 2 </p>
 
 
$$ \color{white}{ m \underset{Degradation}{\rightarrow} \oslash  } $$
 
$$ \color{white}{ m \underset{Degradation}{\rightarrow} \oslash  } $$
  
Line 264: Line 262:
 
<p>  The team applied Law of Mass Action, combining both equations for the concentration of protein and mRNA over time. This model can be described as: </p>
 
<p>  The team applied Law of Mass Action, combining both equations for the concentration of protein and mRNA over time. This model can be described as: </p>
  
  <p style="text-align: center;" >  Figure 3 </p>
+
 
 
$$ \color{white}{  \frac{dm}{dt} = k_{1} -d _{1 } m } $$
 
$$ \color{white}{  \frac{dm}{dt} = k_{1} -d _{1 } m } $$
 
$$ \color{white}{ \frac{dp}{dt} = k_{2} \cdot  m - d_{2} \cdot p } $$
 
$$ \color{white}{ \frac{dp}{dt} = k_{2} \cdot  m - d_{2} \cdot p } $$
Line 297: Line 295:
 
     <div class="expandable-box">
 
     <div class="expandable-box">
  
       <h4 style="color: #ffffff; font-weight: bold; font-size: 30px;">Gene Transcription Regulation by Repressors (CRISPRi) - Concentration over Time</h4><center></center>
+
       <h4 style="color: #ffffff; font-weight: bold; font-size: 30px;">Gene Transcription Regulation by Repressors (CRISPRi)</h4><center></center>
 
<h5 style="color: #C0C0C0; font-weight: bold; font-size: 20px;"> Calculating how much protein is produced over time when a gene is inhibited </h5>
 
<h5 style="color: #C0C0C0; font-weight: bold; font-size: 20px;"> Calculating how much protein is produced over time when a gene is inhibited </h5>
  
Line 305: Line 303:
 
   <p style="text-align: center;" >  <img src="https://static.igem.org/mediawiki/2017/0/00/T--UNOTT--Formualasdas.png"; </p>
 
   <p style="text-align: center;" >  <img src="https://static.igem.org/mediawiki/2017/0/00/T--UNOTT--Formualasdas.png"; </p>
 
<br></br>
 
<br></br>
<p> This system can be described as above. Where gRNA(i), Cas9, and mRNA are produced constitutively with their associated rates of production kc, kg, and k0i respectively. The Cas9 and gRNA(i) will undergo an irreversible association to form Cas9:gRNA(i) at rate kf, which in turn inhibits the production of mRNA and reduce the production of Fluorescent protein (k1). All molecules spontaneously degrade and diffuse away at their own associated rate. (i) will account for us having multiple gRNAs and just as many fluorescent proteins i.e. i=3 with three fluorescent proteins and subsequent set of three gRNAs. It is asumed that all gRNAs have the same binding affinity and their productions are the same. The varying strengths of promoters for mRNA (koi) will be assigned to each corresponding gRNA in the set of (i). </p>
+
<p> This system can be described as above. Where gRNA(i), dCas9, and mRNA are produced constitutively with their associated rates of production kc, kg, and k0i respectively. The dCas9 and gRNA(i) will undergo an irreversible association to form dCas9:gRNA(i) at rate kf, which in turn inhibits the production of mRNA and reduce the production of Fluorescent protein (k1). All molecules spontaneously degrade and diffuse away at their own associated rate. (i) will account for us having multiple gRNAs and just as many fluorescent proteins i.e. i=3 with three fluorescent proteins and subsequent set of three gRNAs. It is asumed that all gRNAs have the same binding affinity and their productions are the same. The varying strengths of promoters for mRNA (koi) will be assigned to each corresponding gRNA in the set of (i). </p>
 
<br>
 
<br>
<p>The system can be described by the following 5 ordinary differential equations, defining how the concentration of each variable will change at any given change in time using mass action kinetics. Equations 1, 2 and 3 are derived from Farasat <i>et al.</i>(2016), which comprehensively investigated the rates at which CRISPR-Cas9 can cleave DNA targets.</p><br><br>
+
<p>The system can be described by the following 5 ordinary differential equations, defining how the concentration of each variable will change at any given change in time using mass action kinetics. Equations 1, 2 and 3 are derived from Farasat <i>et al.</i>(2016), which comprehensively investigated the rates at which CRISPR-dCas9 can cleave DNA targets.</p><br><br>
 
    
 
    
  
     $$  \color{white}{(1) \frac{dgRNA,i}{dt} = k_{g,i} – δ_{dg} \cdot gRNA,i – k_{f} \cdot Cas9 \cdot  gRNA,i}  $$
+
     $$  \color{white}{(1) \frac{dgRNA,i}{dt} = k_{g,i} – δ_{dg} \cdot gRNA,i – k_{f} \cdot dCas9 \cdot  gRNA,i}  $$
 
     <p style="text-align: center;" >   
 
     <p style="text-align: center;" >   
The above equation details the change in gRNA concentration per unit time, also extending along index i. At any given time, the concentration of gRNA(i) will be increased by its production (kgi), and decreased by its association with cas9 at rate kf, relative to it's concentration, and it will also degrade and diffuse away at rate δdg.</p><br><br>
+
The above equation details the change in gRNA concentration per unit time, also extending along index i. At any given time, the concentration of gRNA(i) will be increased by its production (kgi), and decreased by its association with dcas9 at rate kf, relative to it's concentration, and it will also degrade and diffuse away at rate δdg.</p><br><br>
  
     $$  \color{white}{(2) \frac{dCas9}{dt} = k_{c} – δ_{dc} \cdot Cas9 – k_{f} \cdot Cas9 \cdot \underset{i}{∑}gRNA,i} $$
+
     $$  \color{white}{(2) \frac{dCas9}{dt} = k_{c} – δ_{dc} \cdot dCas9 – k_{f} \cdot dCas9 \cdot \underset{i}{∑}gRNA,i} $$
     <p style="text-align: center;" >  This equation details the change in Cas9 protein per unit time. It will be increased by its production (kc) and reduced by its degradation (δdc), and again it's association to gRNA(s). This will be proportioal the sum of all the gRNA's along i, accounting for the competition for Cas9.</p><br><br>
+
     <p style="text-align: center;" >  This equation details the change in dCas9 protein per unit time. It will be increased by its production (kc) and reduced by its degradation (δdc), and again it's association to gRNA(s). This will be proportioal the sum of all the gRNA's along i, accounting for the competition for dCas9.</p><br><br>
  
     $$  \color{white}{(3) \frac{dCas9:gRNA,i}{dt} = k_{f} \cdot Cas9:gRNA,i – δ_{dcg} } $$  
+
     $$  \color{white}{(3) \frac{dCas9:gRNA,i}{dt} = k_{f} \cdot dCas9:gRNA,i – δ_{dcg} } $$  
     <p style="text-align: center;" >  This equation details the change in concentration of the Cas9 associated with gRNA(i). This is simply the rate of formation from Equation 2, minus its degredation. </p><br><br>
+
     <p style="text-align: center;" >  This equation details the change in concentration of the dCas9 associated with gRNA(i). This is simply the rate of formation from Equation 2, minus its degredation. </p><br><br>
  
     $$  \color{white}{(4) \frac{dmRNA,i}{dt} = k_{0i} \cdot \frac{1}{1+k{m} \cdot Cas9:gRNA,i} −δ_{dm} \cdot mRNA,i} $$  
+
     $$  \color{white}{(4) \frac{dmRNA,i}{dt} = k_{0i} \cdot \frac{1}{1+k{m} \cdot ds9:gRNA,i} −δ_{dm} \cdot mRNA,i} $$  
     <p style="text-align: center;" > This equation details the change in mRNA(i), which is very similar to the equation seen earlier when describing transciption. This is produced at a rate k0i, but it is also inhibited by Cas9:grna(i), so there is a standard inhibition function which will reduce rate k0 as Cas9:gRNA(i) increases. It is also simply reduced by it's degradation and diffusion rate δdm. </p><br><br>
+
     <p style="text-align: center;" > This equation details the change in mRNA(i), which is very similar to the equation seen earlier when describing transciption. This is produced at a rate k0i, but it is also inhibited by dCas9:grna(i), so there is a standard inhibition function which will reduce rate k0 as dCas9:gRNA(i) increases. It is also simply reduced by it's degradation and diffusion rate δdm. </p><br><br>
  
 
     $$  \color{white}{(5) \frac{dFP,i}{dt} = k_{1} \cdot mRNA,i – δ_{dp} \cdot FP,i} $$
 
     $$  \color{white}{(5) \frac{dFP,i}{dt} = k_{1} \cdot mRNA,i – δ_{dp} \cdot FP,i} $$
Line 364: Line 362:
 
<p> These graphs show the relationship between protein concentration of a certain type of protein and the intensity that can be expected of it. By integrating real life data into the models, we can have accurate representation of how the bacteria would behave in real life. This suggests that when comparing the modelled data to real life data, there is a strong fit. However, this is not necessarily true for all cases: we simply only had data for the conditions we were using, which suggests that more data would be required for the models to be truly representative of real world data.</p>
 
<p> These graphs show the relationship between protein concentration of a certain type of protein and the intensity that can be expected of it. By integrating real life data into the models, we can have accurate representation of how the bacteria would behave in real life. This suggests that when comparing the modelled data to real life data, there is a strong fit. However, this is not necessarily true for all cases: we simply only had data for the conditions we were using, which suggests that more data would be required for the models to be truly representative of real world data.</p>
 
<br> </br>  
 
<br> </br>  
<p> On evaluation, the fit for the CFP appears quite strange! Insight from the wet lab suggested there were mistakes made with reading from the fluorescence reader, which can be attributed to this behaviour. One way to fix this is set the spectro-photometer at a more restrictive wavelength that would minimise the cross-interference from GFP, like 375nm, as suggested by the Absorption and Emission Wavelength models developed earlier. Furthermore, due to time constraints, rather than implementing the relationship directly from lab data, the data was fitted using a Polynomial Fit of Order 3 using Excel and an equation was calculated from these. These equations were directly plugged into the simulation. However, this is inaccurate as the R squared value was ... , suggesting that it doesn't fully capture the data trend. In order to improve this situation, if there was more data available for different scenarios such as with using different wavelengths and concentration of proteins, the model could be validated against more data and refined. Once done, this could substitute the polynomial fit. Lastly, to improve the data, rather than having to use another model to estimate the protein concentration, the team could read for protein concentration during fluorescence readings. This means there is a separate data set to validate the model from, to check whether our protein calculations were correct. </p>
+
<p> On evaluation, the fit for the CFP appears quite strange! Insight from the wet lab suggested there were mistakes made with reading from the fluorescence reader, which can be attributed to this behaviour. One way to fix this is set the spectro-photometer at a more restrictive wavelength that would minimise the cross-interference from GFP, like 375nm, as suggested by the Absorption and Emission Wavelength models developed earlier. Furthermore, due to time constraints, rather than implementing the relationship directly from lab data, the data was fitted using a Polynomial Fit of Order 3 using Excel and an equation was calculated from these. These equations were directly plugged into the simulation. However, this is inaccurate as the R squared value was 0.9148 for RFP, 0.9922 for CFP and 0.9478 for GFP, suggesting that it doesn't fully capture the data trend. Furthermore, on the plot themselves, they don't match the trend at all, suggesting using this method follows the trend poorly. In order to improve this situation, if there was more data available for different scenarios such as with using different wavelengths and concentration of proteins, the model could be validated against more data and refined. Once done, this could substitute the polynomial fit. Lastly, to improve the data, rather than having to use another model to estimate the protein concentration, the team could read for protein concentration during fluorescence readings. This means there is a separate data set to validate the model from, to check whether our protein calculations were correct. </p>
 
<br></br>
 
<br></br>
 
<p> These relationships were implemented into the simulation to give the expected spectra produced by each protein. This highlights another use: by adding or subtracting values from our fit, we can create a threshold for our Keys. This was essential when developing the Raw Data Simulator. <sup> 2 </sup></p>
 
<p> These relationships were implemented into the simulation to give the expected spectra produced by each protein. This highlights another use: by adding or subtracting values from our fit, we can create a threshold for our Keys. This was essential when developing the Raw Data Simulator. <sup> 2 </sup></p>
Line 387: Line 385:
 
<p> In order to save time and program a model, the team used Semrock's Online Fluorescence graph maker <sup> 1 </sup> which operated by taking in the expected Absorption wavelengths and emitting the Emission wavelengths expected by sfGFP (green), mRFP (red) and ECFP (blue) proteins. This was done through the Web App on the website. Furthermore, they provided the raw data in a text file format which was useful as it allows the team to read the data into a stand alone program. </p>
 
<p> In order to save time and program a model, the team used Semrock's Online Fluorescence graph maker <sup> 1 </sup> which operated by taking in the expected Absorption wavelengths and emitting the Emission wavelengths expected by sfGFP (green), mRFP (red) and ECFP (blue) proteins. This was done through the Web App on the website. Furthermore, they provided the raw data in a text file format which was useful as it allows the team to read the data into a stand alone program. </p>
 
<br> </br>  
 
<br> </br>  
<br> </br>
+
<p style="text-align: center;" >  <img src="https://static.igem.org/mediawiki/2017/8/8f/T--UNOTT--SpectrumAbsoprtionEM.png" class="border" width="1000" height="500" style= position: fixed; align=center;> </p>  
<p style="text-align: center;" >  <img src="https://static.igem.org/mediawiki/2017/8/8f/T--UNOTT--SpectrumAbsoprtionEM.png" class="border" width="500" height="600" style= position: fixed; align=center;> </p>  
+
 
<p style="text-align: center;" > The absorption and emission spectra from RFP, GFP and ECHP. The dotted lines show absorption wavelengths, and the solid lines show emission wavelengths. </p>  
 
<p style="text-align: center;" > The absorption and emission spectra from RFP, GFP and ECHP. The dotted lines show absorption wavelengths, and the solid lines show emission wavelengths. </p>  
 
<br> </br>
 
<br> </br>
<p>This graph tells us the emitted light is expected to be at a higher wavelength than the absorbed wavelength. This must be considered in the model as there is overlap between emitted and absorbed wavelengths implying emitted light may be absorbed and re-emitted at a higher wavelength.</p>
+
<p>This graph tells us the emitted light is expected to be at a higher wavelength than the absorbed wavelength. This must be considered in the model as there is overlap between emitted and absorbed wavelengths implying emitted light may be absorbed and re-emitted at a higher wavelength by different fluorescent proteins, which might dramatically alter the reading.</p>
  
<p> Fortunately, the data points used to graph the spectra is available on the website as a raw data text file which was very useful as it meant we could read the data directly into our simulator when it was being implemented. </p>
+
<p> This model was important as it guided us for the spectro-photometer setup to determine what wavelength range to produce different fluorescence spectra. This was especially crucial selecting the wavelengths so only one type of protein would be expressed, which was useful when working with the random constructions.</p>
<p> This model is important as it guides us when using wavelengths as parameters so we know which wavelengths to use, especially when trying to create a specific color as well as what wavelengths to look out for as they might cause overlap. This was very useful to the wet-lab as it informed them of what wavelengths to use as well as what wavelength range they should use to produce different fluorescence spectra.</p>
+
 
<br> </br>
 
<br> </br>
 
<br> </br>
 
<br> </br>
<p style="text-align: center;" >  <sup> 1 </sup> <a href=" https://www.semrock.com/searchlight-welcome.aspx ">Semrock Fluorescence Spectra Chart</a></p>
+
<p style="text-align: center;" >  <sup> 1 </sup> <a href=" https://www.semrock.com/searchlight-welcome.aspx ">Semrock Fluorescence Spectra Chart</a> </p>
 
</div>
 
</div>
 
     </div>
 
     </div>
Line 419: Line 415:
 
         <p> Members of the public are encouraged to try it out and can use it to help with identifying how their spectra would look if they used the same proteins the project used </p>
 
         <p> Members of the public are encouraged to try it out and can use it to help with identifying how their spectra would look if they used the same proteins the project used </p>
  
        <a href="https://github.com/BurgundyIsAPublicEnemy/iGEMNotts2017/tree/master/Models">Excel Spreadsheet</a>
+
  <p style="text-align: center;" >        <a href="https://github.com/BurgundyIsAPublicEnemy/iGEMNotts2017/tree/master/Models">Excel Spreadsheet</a> </p>
  
 
         <br> </br>
 
         <br> </br>
Line 432: Line 428:
 
     <h5 style="color: #C0C0C0; font-weight: bold; font-size: 20px;"> What iGEM Nottingham 2017 learnt from modelling and how modelling impacted the project. </h5>
 
     <h5 style="color: #C0C0C0; font-weight: bold; font-size: 20px;"> What iGEM Nottingham 2017 learnt from modelling and how modelling impacted the project. </h5>
 
           <div id="clear7" style="display: none;">
 
           <div id="clear7" style="display: none;">
     <p> The main objectives of modelling were met: the simulation for calculating the fluorescence spectra was completed and was not only extensively used in the lab to generate spectra when the parameters consisted of different protein concentrations, but was used to produce dummy data for the comparison software to produce a demo for when industry contacts came to visit the labs. Furthermore, the models allowed for parameters we couldn't test for in the lab for example, what the spectra would look like if one protein was inhibited but the others weren't.</p>
+
     <p> The main objectives of modelling were met: the <b> simulation for calculating the fluorescence spectra was completed </b> and was not only extensively used in the lab to generate spectra when the parameters consisted of different protein concentrations, but was used to produce dummy data for the <b> comparison software </b> to produce a demo for when industry contacts came to visit the labs. Furthermore, the models allowed for parameters we couldn't test for in the lab for example, what the spectra would look like if one protein was inhibited but the others weren't.</p>
 
  <br> </br>   
 
  <br> </br>   
     <p> The main reason the team undertook a rigorous approach to modelling was because it wouldn't have been accurate to construct a single model to show how the fluorescence spectra would vary with protein concentration without taking into account elements such protein degradation, the impact of CRISPRi and whether wavelengths would impact how the strong the intensity is. The simulation simply allowed the team to combine all the models produced to give a desired output in a programming fashion so the model could be used by anyone without a maths and programming background. </p>  
+
     <p> The main reason the team undertook a rigorous approach to modelling was because it wouldn't have been accurate to construct a single model to show how the fluorescence spectra would vary with protein concentration without taking into account elements such protein degradation, the impact of CRISPRi and whether wavelengths would impact how the strong the intensity is. The simulation simply allowed the team to combine all the models produced to give a desired output in a programming fashion so the model could be used by anyone <b>  without a maths and programming background.</b>  </p>  
 
   <br> </br>   
 
   <br> </br>   
<p> Overall, the models showed that given a specific wavelength and a certain concentration of protein (ug/mol), a spectra will be produced. Furthermore, beyond helping to validate real world data, it helped to solve practical issues with the wet lab. The biggest issue modelling helped to solve was that the wet lab weren't able to produce any CFP fluorescence. The models showed that after 500nm, the CFP proteins wouldn't fluoresce, which suggested the solution to this problem would be to use a lower wavelength, such as 490nm (whereas the team had used 584nm.) Unfortunately, due to time constraints, this fix couldn't be implemented but nevertheless, modelling helped to reveal the complication. Issues like these were real world problems modelling helped to solve. </p>  
+
<p> Overall, the models showed that given a specific wavelength and a certain concentration of protein (ug/mol), a <b> spectra </b>  will be produced. Furthermore, beyond helping to validate real world data, it helped to solve practical issues with the wet lab. The biggest issue modelling helped to solve was that the wet lab weren't able to produce any CFP fluorescence. The models showed that after 375nm, the GFP proteins would fluoresce alongside the CFP proteins, which suggested the solution to this problem would be to use a lower wavelength, such as 375nm so only the CFP proteins would fluoresce without interference from the GFP. Unfortunately, due to time constraints, this fix couldn't be implemented but nevertheless, modelling helped to reveal the complication. Issues like these were real world problems modelling helped to solve. </p>  
 
     </div>
 
     </div>
 
         </div>
 
         </div>

Latest revision as of 03:53, 2 November 2017





MODELLING

Constitutive Gene Expression

The general gene expression equation showing the process of protein synthesis

Gene Transcription Regulation by Repressors (CRISPRi)

Calculating how much protein is produced over time when a gene is inhibited

Relationship between Fluorescence and Protein Concentration

Using our models to estimate the amount of fluorescence expected from a certain concentration of protein synthesized

Absorption and Emission Wavelengths of sfGFP, mRFP & ECFP

Working out which wavelengths are required to produce a fluorescence spectra.

Are Our Constructions Random?

Showing that our constructions are random and why they are random

Conclusion

What iGEM Nottingham 2017 learnt from modelling and how modelling impacted the project.