Team:UNOTT/Modelling

Constitutive Gene Expression For Protein and mRNA Expression over Time

The general gene expression equation showing the process of protein synthesis

Biological insight had told us we need a model with constant gene expression. Investigating models from literature ¹ so see which model would satisfy these conditions, and it was found the constitutive gene expression model was suitable to guide the model.

The first step was to take the general model from literature and apply it in our scenario using the proteins (GFP, ECHP, RFP.)

^{Figure 1} $$ \color{white}{ sfGFP \underset{Transcriptin}{\rightarrow} mRNA \underset{Translation}{\rightarrow} sfGFP } $$

The equation above describes the process of which the gene undergoes transcription to produce mRNA. The mRNA carries the genetic information copied from the DNA which codes for protein. The expression of protein, can therefore, be measured by the fluorescence which is the desired output of the system.

^{Figure 2} $$ \color{white}{ mRNA \underset{Degradation}{\rightarrow} \oslash } $$ $$ \color{white}{ sfGFP \underset{Degradation}{\rightarrow} \oslash } $$

The two equations above state the same time, the concentration of protein and mRNA would undergo degradation which means the concentration would drop. However, since there is always protein and mRNA being created, over time, the creation and degradation keep the concentration constant. ²

We can apply Law of Mass Action combine both equations for the concentration of protein and mRNA over time. This model can be described as:

^{Figure 3} $$ \color{white}{ mRNA = k_{1} -d _{1 } mRNA } $$ $$ \color{white}{ Protein = k_{2} \cdot mRNA - d_{2} \cdot Protein } $$

Where...

mRNA is the concentration of mRNA
Protein is the concentration of Protein
k₁ is the constitutive transcription rate. This represents the number of mRNA molecules produced per gene, per unit of time.
d ₁ is the mRNA degradation rate
k₂ is the translation rate. This represents the number of protein molecules produced per mRNA molecule, per unit of time.
d ₂ is the protein degradation rate.

This is important because we can use this model to calculate the concentration of proteins we can expect over time. This is useful as we can use this information to calculate the total emitted light spectra during the time period which is what we are looking for in our system. However, the constants and variables are individual for each protein and which means parameters for each protein would need to be found. These constants were found using literature ³ (for GFP) and lab results (the rest.)

¹ GB Stan, 20137. Modeling in Biology. London, the United Kingdom: Imperial College London. p, pp.59-65.

² See Non-Inhibited conditions from Figure 5 Gene Transcription Regulation by Repressors (CRISPRi) - Concentration over Time

³ See Relationship between Max Fluorescence and Protein Concentration for more details

Gene Transcription Regulation by Repressors (CRISPRi) - Concentration over Time

Calculating how much protein is produced over time when a gene is inhibited

The next step in developing our simulation was to calculate our protein concentration at any given time when using CRISPRi. Discussion with wet-lab revealed our method would be using CRISPRi as a repressor, which works by inhibiting the expression of one or more genes by binding to the promoter region ¹. The expanded mRNA and Protein concentration models from the Constitutive Gene Expression Model ² were modified to include the element of repression from the CRISPRi inhibition.

$$ \color{white}{ \frac{dgRNA,i}{dt} = k_{g,i} – δ_{dg} \cdot gRNA,i – k_{f} \cdot Cas9 \cdot gRNA,i} $$

The above equation details the change in gRNA concentration extending along index i, i will account for us perhaps having multiple gRNAs which will compete with one another. At any given time, the concentration of gRNA,i will be increased by its production (kgi), and decreased by its association with cas9 at rate kf, relative to it's concentration, and it will also degrade and diffuse away at rate δdg, ³ :

$$ \color{white}{ \frac{dCas9}{dt} = k_{c} – δ_{dc} \cdot Cas9 – k_{f} \cdot Cas9 \cdot \underset{i}{∑}gRNA,i} $$

This equation details the change in Cas9 protein. It will ³ :

$$ \color{white}{ \frac{dCas9}{dt} = k_{c} – δ_{dc} \cdot Cas9 – k_{f} \cdot Cas9 \cdot \underset{i}{∑}gRNA,i} $$

This change can be applied to the Law of Mass Action ³ :

$$ \color{white}{ \frac{dmRNA,i}{dt} = k_{0} \cdot \frac{1}{1+k{m} \cdot Cas9:gRNA,i} −δ_{dm} \cdot mRNA,i} $$

This change can be applied to the Law of Mass Action ³ :

$$ \color{white}{ \frac{dmRNA,i}{dt} = k_{0} \cdot \frac{1}{1+k{m} \cdot Cas9:gRNA,i} −δ_{dm} \cdot mRNA,i} $$

This change can be applied to the Law of Mass Action ³ :

Where...

m is mRNA concentration, p is Protein concentration, R is Repressor, k1 is Max Transcription Rate, k is the Repression Coefficient, n is number of repressors that need to cooperatively bind the promoter to trigger the inhibition of gene expression (Hill Coefficient), R is Repressor, d1 is mRNA degradation rate, d2 is Protein degradation rate

The value for these constants and variables were taken from literature and calculating them ⁴ but later, adjusted to the lab results.

Figure 6

Figure 6 shows the structure which underwent CRISPRi inhibition are expected to produce lower concentration of the protein whose expression were are inhibiting. This is important as it means the team can calculate concentration of proteins which are inhibited and compare them to the control conditions as well as giving the correct concentration for the simulation.

Furthermore, by having a model which can calculate the protein concentration at any given time, we can deduce how much fluorescence is being emitted at that time period by the bacteria

⁴ See Relationship between Max Fluorescence and Protein Concentration

Relationship between Max Fluorescence and Protein Concentration

Using our models to estimate the amount of fluorescence expected from a certain concentration of protein synthesized

A problem the team faced was identify the level of fluorescence at any given time as it is expected that the proteins would be expressed. This can be confirmed by looking at the bacteria after being constructed and observing that they are giving off light.

To solve this issue, the team required an equation which could estimate the intensity of fluorescence at any certain time. This consisted of calculating the protein concentration in a time period mapping that intensity to the protein concentration at that time provided by real world data.

When the fluorescence data was received from the wet lab, a model was constructed from the data gained. Originally, the data from the lab was the Fluorescence against Time but by using the Gene Transcription Regulation by Repressors model developed earlier ¹, the team was able to estimate the protein concentration at that time.

^{Figure 7}

These graphs show the relationship between protein concentration and fluorescence intensity; as the concentration increases, the intensity increases greatly. The only exception to this is CFP however, it was revealed that there was an error in reading CFP identifeid by the wet lab. Due to time constraints, rather than implementing the relationship directly from lab data, the data was fitted using a Polynomial Fit of Order 3 using Excel and an equation was calculated from these. These equations were directly plugged into the simulation. However, this is inaccurate as the R squared value was ... , suggesting that it doesn't fully capture the data trend.

These relationships were implemented into the simulation to give the expected spectra produced by each protein. This highlights another use: by adding or subtracting values from our fit, we can create a threshold for our Keys. This was essential when developing the Raw Data Simulator. ²

¹ See Gene Transcription Regulation by Repressors (CRISPRi) - Concentration over Time

² See Software

MODELING

Overview

About modeling and why iGEM Nottingham chose to do it

Constitutive Gene Expression For Protein and mRNA Expression over Time

The general gene expression equation showing the process of protein synthesis

Gene Transcription Regulation by Repressors (CRISPRi) - Concentration over Time

Calculating how much protein is produced over time when a gene is inhibited

Relationship between Max Fluorescence and Protein Concentration

Using our models to estimate the amount of fluorescence expected from a certain concentration of protein synthesized

Are Our Constructions Random?

Showing that our constructions are random and why they are random