Difference between revisions of "Team:UNOTT/Modelling"

 
(One intermediate revision by the same user not shown)
Line 175: Line 175:
 
a{
 
a{
 
     color: #ffffff;
 
     color: #ffffff;
  font-size: 18px;
+
    font-size: 18px;
 
}
 
}
 
html {
 
html {
Line 251: Line 251:
 
<br> </br>
 
<br> </br>
  
<p style="text-align: center;" >  Figure 1 </p>
 
 
$$ \color{white}{ p \underset{t_{1} }{\rightarrow} m \underset{t_{2}}{\rightarrow} p  } $$
 
$$ \color{white}{ p \underset{t_{1} }{\rightarrow} m \underset{t_{2}}{\rightarrow} p  } $$
 
<p> The equation above describes the process of which the gene undergoes transcription to produce mRNA. The mRNA carries the genetic information copied from the DNA which codes for protein. The expression of protein lead to fluorescence which is the desired output of the system. </p>
 
<p> The equation above describes the process of which the gene undergoes transcription to produce mRNA. The mRNA carries the genetic information copied from the DNA which codes for protein. The expression of protein lead to fluorescence which is the desired output of the system. </p>
  
<p style="text-align: center;" >  Figure 2 </p>
 
 
$$ \color{white}{ m \underset{Degradation}{\rightarrow} \oslash  } $$
 
$$ \color{white}{ m \underset{Degradation}{\rightarrow} \oslash  } $$
  
Line 264: Line 262:
 
<p>  The team applied Law of Mass Action, combining both equations for the concentration of protein and mRNA over time. This model can be described as: </p>
 
<p>  The team applied Law of Mass Action, combining both equations for the concentration of protein and mRNA over time. This model can be described as: </p>
  
  <p style="text-align: center;" >  Figure 3 </p>
+
 
 
$$ \color{white}{  \frac{dm}{dt} = k_{1} -d _{1 } m } $$
 
$$ \color{white}{  \frac{dm}{dt} = k_{1} -d _{1 } m } $$
 
$$ \color{white}{ \frac{dp}{dt} = k_{2} \cdot  m - d_{2} \cdot p } $$
 
$$ \color{white}{ \frac{dp}{dt} = k_{2} \cdot  m - d_{2} \cdot p } $$
Line 297: Line 295:
 
     <div class="expandable-box">
 
     <div class="expandable-box">
  
       <h4 style="color: #ffffff; font-weight: bold; font-size: 30px;">Gene Transcription Regulation by Repressors (CRISPRi) - Concentration over Time</h4><center></center>
+
       <h4 style="color: #ffffff; font-weight: bold; font-size: 30px;">Gene Transcription Regulation by Repressors (CRISPRi)</h4><center></center>
 
<h5 style="color: #C0C0C0; font-weight: bold; font-size: 20px;"> Calculating how much protein is produced over time when a gene is inhibited </h5>
 
<h5 style="color: #C0C0C0; font-weight: bold; font-size: 20px;"> Calculating how much protein is produced over time when a gene is inhibited </h5>
  
Line 305: Line 303:
 
   <p style="text-align: center;" >  <img src="https://static.igem.org/mediawiki/2017/0/00/T--UNOTT--Formualasdas.png"; </p>
 
   <p style="text-align: center;" >  <img src="https://static.igem.org/mediawiki/2017/0/00/T--UNOTT--Formualasdas.png"; </p>
 
<br></br>
 
<br></br>
<p> This system can be described as above. Where gRNA(i), Cas9, and mRNA are produced constitutively with their associated rates of production kc, kg, and k0i respectively. The Cas9 and gRNA(i) will undergo an irreversible association to form Cas9:gRNA(i) at rate kf, which in turn inhibits the production of mRNA and reduce the production of Fluorescent protein (k1). All molecules spontaneously degrade and diffuse away at their own associated rate. (i) will account for us having multiple gRNAs and just as many fluorescent proteins i.e. i=3 with three fluorescent proteins and subsequent set of three gRNAs. It is asumed that all gRNAs have the same binding affinity and their productions are the same. The varying strengths of promoters for mRNA (koi) will be assigned to each corresponding gRNA in the set of (i). </p>
+
<p> This system can be described as above. Where gRNA(i), dCas9, and mRNA are produced constitutively with their associated rates of production kc, kg, and k0i respectively. The dCas9 and gRNA(i) will undergo an irreversible association to form dCas9:gRNA(i) at rate kf, which in turn inhibits the production of mRNA and reduce the production of Fluorescent protein (k1). All molecules spontaneously degrade and diffuse away at their own associated rate. (i) will account for us having multiple gRNAs and just as many fluorescent proteins i.e. i=3 with three fluorescent proteins and subsequent set of three gRNAs. It is asumed that all gRNAs have the same binding affinity and their productions are the same. The varying strengths of promoters for mRNA (koi) will be assigned to each corresponding gRNA in the set of (i). </p>
 
<br>
 
<br>
<p>The system can be described by the following 5 ordinary differential equations, defining how the concentration of each variable will change at any given change in time using mass action kinetics. Equations 1, 2 and 3 are derived from Farasat <i>et al.</i>(2016), which comprehensively investigated the rates at which CRISPR-Cas9 can cleave DNA targets.</p><br><br>
+
<p>The system can be described by the following 5 ordinary differential equations, defining how the concentration of each variable will change at any given change in time using mass action kinetics. Equations 1, 2 and 3 are derived from Farasat <i>et al.</i>(2016), which comprehensively investigated the rates at which CRISPR-dCas9 can cleave DNA targets.</p><br><br>
 
    
 
    
  
     $$  \color{white}{(1) \frac{dgRNA,i}{dt} = k_{g,i} – δ_{dg} \cdot gRNA,i – k_{f} \cdot Cas9 \cdot  gRNA,i}  $$
+
     $$  \color{white}{(1) \frac{dgRNA,i}{dt} = k_{g,i} – δ_{dg} \cdot gRNA,i – k_{f} \cdot dCas9 \cdot  gRNA,i}  $$
 
     <p style="text-align: center;" >   
 
     <p style="text-align: center;" >   
The above equation details the change in gRNA concentration per unit time, also extending along index i. At any given time, the concentration of gRNA(i) will be increased by its production (kgi), and decreased by its association with cas9 at rate kf, relative to it's concentration, and it will also degrade and diffuse away at rate δdg.</p><br><br>
+
The above equation details the change in gRNA concentration per unit time, also extending along index i. At any given time, the concentration of gRNA(i) will be increased by its production (kgi), and decreased by its association with dcas9 at rate kf, relative to it's concentration, and it will also degrade and diffuse away at rate δdg.</p><br><br>
  
     $$  \color{white}{(2) \frac{dCas9}{dt} = k_{c} – δ_{dc} \cdot Cas9 – k_{f} \cdot Cas9 \cdot \underset{i}{∑}gRNA,i} $$
+
     $$  \color{white}{(2) \frac{dCas9}{dt} = k_{c} – δ_{dc} \cdot dCas9 – k_{f} \cdot dCas9 \cdot \underset{i}{∑}gRNA,i} $$
     <p style="text-align: center;" >  This equation details the change in Cas9 protein per unit time. It will be increased by its production (kc) and reduced by its degradation (δdc), and again it's association to gRNA(s). This will be proportioal the sum of all the gRNA's along i, accounting for the competition for Cas9.</p><br><br>
+
     <p style="text-align: center;" >  This equation details the change in dCas9 protein per unit time. It will be increased by its production (kc) and reduced by its degradation (δdc), and again it's association to gRNA(s). This will be proportioal the sum of all the gRNA's along i, accounting for the competition for dCas9.</p><br><br>
  
     $$  \color{white}{(3) \frac{dCas9:gRNA,i}{dt} = k_{f} \cdot Cas9:gRNA,i – δ_{dcg} } $$  
+
     $$  \color{white}{(3) \frac{dCas9:gRNA,i}{dt} = k_{f} \cdot dCas9:gRNA,i – δ_{dcg} } $$  
     <p style="text-align: center;" >  This equation details the change in concentration of the Cas9 associated with gRNA(i). This is simply the rate of formation from Equation 2, minus its degredation. </p><br><br>
+
     <p style="text-align: center;" >  This equation details the change in concentration of the dCas9 associated with gRNA(i). This is simply the rate of formation from Equation 2, minus its degredation. </p><br><br>
  
     $$  \color{white}{(4) \frac{dmRNA,i}{dt} = k_{0i} \cdot \frac{1}{1+k{m} \cdot Cas9:gRNA,i} −δ_{dm} \cdot mRNA,i} $$  
+
     $$  \color{white}{(4) \frac{dmRNA,i}{dt} = k_{0i} \cdot \frac{1}{1+k{m} \cdot ds9:gRNA,i} −δ_{dm} \cdot mRNA,i} $$  
     <p style="text-align: center;" > This equation details the change in mRNA(i), which is very similar to the equation seen earlier when describing transciption. This is produced at a rate k0i, but it is also inhibited by Cas9:grna(i), so there is a standard inhibition function which will reduce rate k0 as Cas9:gRNA(i) increases. It is also simply reduced by it's degradation and diffusion rate δdm. </p><br><br>
+
     <p style="text-align: center;" > This equation details the change in mRNA(i), which is very similar to the equation seen earlier when describing transciption. This is produced at a rate k0i, but it is also inhibited by dCas9:grna(i), so there is a standard inhibition function which will reduce rate k0 as dCas9:gRNA(i) increases. It is also simply reduced by it's degradation and diffusion rate δdm. </p><br><br>
  
 
     $$  \color{white}{(5) \frac{dFP,i}{dt} = k_{1} \cdot mRNA,i – δ_{dp} \cdot FP,i} $$
 
     $$  \color{white}{(5) \frac{dFP,i}{dt} = k_{1} \cdot mRNA,i – δ_{dp} \cdot FP,i} $$
Line 364: Line 362:
 
<p> These graphs show the relationship between protein concentration of a certain type of protein and the intensity that can be expected of it. By integrating real life data into the models, we can have accurate representation of how the bacteria would behave in real life. This suggests that when comparing the modelled data to real life data, there is a strong fit. However, this is not necessarily true for all cases: we simply only had data for the conditions we were using, which suggests that more data would be required for the models to be truly representative of real world data.</p>
 
<p> These graphs show the relationship between protein concentration of a certain type of protein and the intensity that can be expected of it. By integrating real life data into the models, we can have accurate representation of how the bacteria would behave in real life. This suggests that when comparing the modelled data to real life data, there is a strong fit. However, this is not necessarily true for all cases: we simply only had data for the conditions we were using, which suggests that more data would be required for the models to be truly representative of real world data.</p>
 
<br> </br>  
 
<br> </br>  
<p> On evaluation, the fit for the CFP appears quite strange! Insight from the wet lab suggested there were mistakes made with reading from the fluorescence reader, which can be attributed to this behaviour. One way to fix this is set the spectro-photometer at a more restrictive wavelength that would minimise the cross-interference from GFP, like 375nm, as suggested by the Absorption and Emission Wavelength models developed earlier. Furthermore, due to time constraints, rather than implementing the relationship directly from lab data, the data was fitted using a Polynomial Fit of Order 3 using Excel and an equation was calculated from these. These equations were directly plugged into the simulation. However, this is inaccurate as the R squared value was ... , suggesting that it doesn't fully capture the data trend. In order to improve this situation, if there was more data available for different scenarios such as with using different wavelengths and concentration of proteins, the model could be validated against more data and refined. Once done, this could substitute the polynomial fit. Lastly, to improve the data, rather than having to use another model to estimate the protein concentration, the team could read for protein concentration during fluorescence readings. This means there is a separate data set to validate the model from, to check whether our protein calculations were correct. </p>
+
<p> On evaluation, the fit for the CFP appears quite strange! Insight from the wet lab suggested there were mistakes made with reading from the fluorescence reader, which can be attributed to this behaviour. One way to fix this is set the spectro-photometer at a more restrictive wavelength that would minimise the cross-interference from GFP, like 375nm, as suggested by the Absorption and Emission Wavelength models developed earlier. Furthermore, due to time constraints, rather than implementing the relationship directly from lab data, the data was fitted using a Polynomial Fit of Order 3 using Excel and an equation was calculated from these. These equations were directly plugged into the simulation. However, this is inaccurate as the R squared value was 0.9148 for RFP, 0.9922 for CFP and 0.9478 for GFP, suggesting that it doesn't fully capture the data trend. Furthermore, on the plot themselves, they don't match the trend at all, suggesting using this method follows the trend poorly. In order to improve this situation, if there was more data available for different scenarios such as with using different wavelengths and concentration of proteins, the model could be validated against more data and refined. Once done, this could substitute the polynomial fit. Lastly, to improve the data, rather than having to use another model to estimate the protein concentration, the team could read for protein concentration during fluorescence readings. This means there is a separate data set to validate the model from, to check whether our protein calculations were correct. </p>
 
<br></br>
 
<br></br>
 
<p> These relationships were implemented into the simulation to give the expected spectra produced by each protein. This highlights another use: by adding or subtracting values from our fit, we can create a threshold for our Keys. This was essential when developing the Raw Data Simulator. <sup> 2 </sup></p>
 
<p> These relationships were implemented into the simulation to give the expected spectra produced by each protein. This highlights another use: by adding or subtracting values from our fit, we can create a threshold for our Keys. This was essential when developing the Raw Data Simulator. <sup> 2 </sup></p>
Line 395: Line 393:
 
<br> </br>
 
<br> </br>
 
<br> </br>
 
<br> </br>
<p style="text-align: center;" >  <sup> 1 </sup> <a href=" https://www.semrock.com/searchlight-welcome.aspx ">Semrock Fluorescence Spectra Chart</a></p>
+
<p style="text-align: center;" >  <sup> 1 </sup> <a href=" https://www.semrock.com/searchlight-welcome.aspx ">Semrock Fluorescence Spectra Chart</a> </p>
 
</div>
 
</div>
 
     </div>
 
     </div>
Line 417: Line 415:
 
         <p> Members of the public are encouraged to try it out and can use it to help with identifying how their spectra would look if they used the same proteins the project used </p>
 
         <p> Members of the public are encouraged to try it out and can use it to help with identifying how their spectra would look if they used the same proteins the project used </p>
  
        <a href="https://github.com/BurgundyIsAPublicEnemy/iGEMNotts2017/tree/master/Models">Excel Spreadsheet</a>
+
  <p style="text-align: center;" >        <a href="https://github.com/BurgundyIsAPublicEnemy/iGEMNotts2017/tree/master/Models">Excel Spreadsheet</a> </p>
  
 
         <br> </br>
 
         <br> </br>

Latest revision as of 03:53, 2 November 2017





MODELLING

Constitutive Gene Expression

The general gene expression equation showing the process of protein synthesis

Gene Transcription Regulation by Repressors (CRISPRi)

Calculating how much protein is produced over time when a gene is inhibited

Relationship between Fluorescence and Protein Concentration

Using our models to estimate the amount of fluorescence expected from a certain concentration of protein synthesized

Absorption and Emission Wavelengths of sfGFP, mRFP & ECFP

Working out which wavelengths are required to produce a fluorescence spectra.

Are Our Constructions Random?

Showing that our constructions are random and why they are random

Conclusion

What iGEM Nottingham 2017 learnt from modelling and how modelling impacted the project.