Difference between revisions of "Team:ETH Zurich/Model/Environment Sensing/parameter fitting"

 
(11 intermediate revisions by 4 users not shown)
Line 16: Line 16:
 
     <h1>Goal</h1>
 
     <h1>Goal</h1>
 
     <p>Once the initial guidelines for the genetic design given <a href="https://2017.igem.org/Team:ETH_Zurich/Model/Environment_Sensing/parameter_space">in a first step</a>, we need to measure on our obtained biological system whether it behaves as intended. Therefore, we will submit it to carefully designed experiments, in order to fit our model on the obtained data and retrieve crucial parameters. The tricky part here is that in a test tube, our system will behave differently than in a tumor, as the repartition of bacteria is not the same and diffusion of AHL does not take place in the lab experiments. We therefore have to extend our model to take this discrepancy into account.</p>
 
     <p>Once the initial guidelines for the genetic design given <a href="https://2017.igem.org/Team:ETH_Zurich/Model/Environment_Sensing/parameter_space">in a first step</a>, we need to measure on our obtained biological system whether it behaves as intended. Therefore, we will submit it to carefully designed experiments, in order to fit our model on the obtained data and retrieve crucial parameters. The tricky part here is that in a test tube, our system will behave differently than in a tumor, as the repartition of bacteria is not the same and diffusion of AHL does not take place in the lab experiments. We therefore have to extend our model to take this discrepancy into account.</p>
 +
 
<p><strong>We then fit three of the most important parameters of our system: the expression levels of luxI and luxR, and the P<sub>lux</sub> promoter leakiness.</strong></p>
 
<p><strong>We then fit three of the most important parameters of our system: the expression levels of luxI and luxR, and the P<sub>lux</sub> promoter leakiness.</strong></p>
 
</section>
 
</section>
Line 27: Line 28:
 
<span class="math">\[\begin{aligned}
 
<span class="math">\[\begin{aligned}
 
         \frac{\mathrm{d} [\text{AHL}]}{\mathrm{d} t} &amp;= P_{\text{AHL}} - k_{\text{deg}} [\text{AHL}]\end{aligned}\]</span>
 
         \frac{\mathrm{d} [\text{AHL}]}{\mathrm{d} t} &amp;= P_{\text{AHL}} - k_{\text{deg}} [\text{AHL}]\end{aligned}\]</span>
<p>Therefore, once reached the steady state, and thanks to the still valid equation describing AHL production<span class="math">\begin{aligned}\[P_{\text{AHL}} &amp;= d_{\text{cell}} a_{\text{AHL}} [\text{luxI}]\end{aligned}</span> the level of AHL is now:</p>
+
<p>Therefore, once reached the steady state, and thanks to the still valid equation describing AHL production<span class="math">\begin{aligned}P_{\text{AHL}} &amp;= d_{\text{cell}} a_{\text{AHL}} [\text{luxI}]\end{aligned}</span> the level of AHL is now:</p>
 
<span class="math">\[\begin{aligned}
 
<span class="math">\[\begin{aligned}
 
         \text{[AHL]} &amp;= \frac{d_{\text{cell}} a_{\text{AHL}} [\text{luxI}]} {k_{\text{deg}}}\end{aligned}\]</span>
 
         \text{[AHL]} &amp;= \frac{d_{\text{cell}} a_{\text{AHL}} [\text{luxI}]} {k_{\text{deg}}}\end{aligned}\]</span>
Line 35: Line 36:
 
</div>
 
</div>
  
<h1>LuxR expression fitting</h1>
+
<h2 id="od005">Switching OD target for our <im>in vitro</im> experiments</h2>
 +
<p>As an example and guide for our experiments, here is are the two different responses that our system should exhibit according to the <a href="https://2017.igem.org/Team:ETH_Zurich/Model/Environment_Sensing/parameter_space">design guidelines</a> that we produced:</p>
 +
 
 +
<figure class="fig-nonfloat" style="maxwidth: 600px;">
 +
        <img alt="Responses"
 +
        src="https://static.igem.org/mediawiki/2017/e/e3/Responses_vitro_vivo.png"/>
 +
<figcaption>Expected activation behavior of the quorum sensing module, in the tumor environment (in vivo) or in a tube (in vitro). The observed difference comes from the fact that no diffusion of AHL happens in the tubes during our experiments, making the activation of the quorum sensing taking place at lower OD.</figcaption>
 +
    </figure>
 +
 
 +
<p><strong>We can see that the correction of the model to stick better to the reality of our experiments is essential to correctly lead our experimentations and subsequent data fits. In the in vitro situation, we should aim for an activation of the quorum sensing at OD 0.05.</strong></p>
 +
 
 +
<h1 id="luxr_fit">LuxR expression fitting</h1>
 
<h2>What experiment do we need to fit the expression of LuxR?</h2>
 
<h2>What experiment do we need to fit the expression of LuxR?</h2>
 
<p>LuxR is the regulatory protein binding AHL and subsequently activating a promoter where LuxR-AHL tetramers can bind:</p>
 
<p>LuxR is the regulatory protein binding AHL and subsequently activating a promoter where LuxR-AHL tetramers can bind:</p>
Line 59: Line 71:
 
<details>
 
<details>
 
     <summary>Cost function based on the chi2 test</summary>
 
     <summary>Cost function based on the chi2 test</summary>
<p>The most simple way to measure the distance between simulated model data and experimental data is the sum of square of the difference between each couple of point. However, the statistical meaning of this value is not obvious, and it is hard to say from it whether the fit is statistically significant or not. This is why we have implemented a more sophisticated cost function based on the <a href="https://en.wikipedia.org/wiki/Chi-squared_test">chi2 test</a> of goodness of fit, using the the <a href="https://ch.mathworks.com/help/stats/chi2inv.html">chi2inv function</a> provided by Matlab. This cost function takes into account the experimental uncertainty to evaluate whether a fit is significant or not. The cost function is then, with <span class="math">theo</span> being the vector of simulated data, <span class="math">exp</span> the vector of experimental data, and <span class="math">\sigma exp</span> the vector of the standard deviation for each data point (computed from triplicates):
+
<p>The most simple way to measure the distance between simulated model data and experimental data is the sum of square of the difference between each couple of point. However, the statistical meaning of this value is not obvious, and it is hard to say from it whether the fit is statistically significant or not. This is why we have implemented a more sophisticated cost function based on the <a href="https://en.wikipedia.org/wiki/Chi-squared_test">chi2 test</a> of goodness of fit, using the the <a href="https://ch.mathworks.com/help/stats/chi2inv.html">chi2inv function</a> provided by Matlab. This cost function takes into account the experimental uncertainty to evaluate whether a fit is significant or not. The cost function is then, with <em>theo</em> being the vector of simulated data, <em>exp</em> the vector of experimental data, and <em>&sigma;exp</em> the vector of the standard deviation for each data point (computed from triplicates):
 
</p>
 
</p>
 
<p><span class="math">\[\begin{aligned}
 
<p><span class="math">\[\begin{aligned}
       cost(theo, exp, \sigma exp) = \frac{\sum_i^N\frac{(exp_i-theo_i)^2}{\sigma exp_i^2}}{chi2inv(0.95,N)}
+
       \text{cost}(theo, exp, \sigma exp) = \frac{\sum_i^N\frac{(exp_i-theo_i)^2}{\sigma exp_i^2}}{\text{chi2inv}(0.95,N)}
 
       \end{aligned}\]</span>
 
       \end{aligned}\]</span>
 
     </p>
 
     </p>
Line 110: Line 122:
 
     </figure>
 
     </figure>
  
<h1>NEEDS A BIG HIGHLIGHT THERE</h1>
+
<section class = "emphasize">
 +
 
 
<p>From our statistically significant fit and under the assumptions made, we can gather with a 95% confidence that:</p>
 
<p>From our statistically significant fit and under the assumptions made, we can gather with a 95% confidence that:</p>
 
<ol>
 
<ol>
       <li>The expression level of luxR in our system is equal to 41 &#177; 7 nM.min<sup>-1</sup></li>
+
       <li><p>The expression level of luxR in our system is equal to 41 &#177; 7 nM.min<sup>-1</sup></p><p>Moreover, this expression level is very close to the expression level we aimed at (100 nM.min<sup>-1</sup>) during our preliminary parameter space analysis.</p>
 +
    </li>
 
       <li>The leakiness of the P<sub>Lux</sub> promoter in our system is equal to 3.9 &#177; 1.5 %</li>
 
       <li>The leakiness of the P<sub>Lux</sub> promoter in our system is equal to 3.9 &#177; 1.5 %</li>
 
</ol>
 
</ol>
 +
</section>
 
</section>
 
</section>
 
</section>
 
</section>
Line 183: Line 198:
  
 
</details>
 
</details>
<details>
+
<details id="QS_fitting">
   <summary>Disclaimer: Strong assumptions about the interpretation of the experimental data</summary>
+
   <summary>Interpretation of the experimental data</summary>
<p>We were not able to find any significant fit on <a href="https://2017.igem.org/Team:ETH_Zurich/Experiments/Tumor_Sensor#QS_end_point">the raw experimental data</a>. To be able to still get information out of them, we had to consider that the maximal activation of our system was reached by the most dense and most fluorescent bacteria (which grew in high-glucose-content cultures), even though a clear plateau cannot be directly observed on our data (see Figure 3 below). This rather liberal interpretation of the data was motivated by the hypothesis that fully activated bacteria were experiencing a high metabolic burden preventing them from reaching the OD they should have reached without this burden.</p>
+
<p>We were not able to find any significant fit on <a href="https://2017.igem.org/Team:ETH_Zurich/Experiments/Tumor_Sensor#QS_end_point">the raw experimental data</a>. To be able to still get information out of them, we had to consider that the maximal activation of our system was reached by the most dense and most fluorescent bacteria (which grew in high-glucose-content cultures), even though a clear plateau cannot be directly observed on our data (see Figure 3 below). This interpretation of the data was motivated by the hypothesis that fully activated bacteria were experiencing a high metabolic burden preventing them from reaching the OD they should have reached without this burden.</p>
  
 
<figure class="fig-nonfloat" style="maxwidth:700px;">
 
<figure class="fig-nonfloat" style="maxwidth:700px;">
Line 192: Line 207:
 
</figure>
 
</figure>
  
<p>In practice, what we did to get exploitable data for our fit is the following: we fitted first a linear model on the first data points (unaffected by any burden) of the A600 vs. glucose to get the OD that each culture should have reached without burden. We used this adjusted OD value in the (therefore adjusted) data to fit our model. Here is how the adjusted look like:
+
<p>In practice, what we did to get exploitable data for our fit is the following: we fitted first a linear model on the first data points (unaffected by any burden) of the A600 vs. glucose to get the OD that each culture should have reached without burden. We used this adjusted OD value in the (therefore adjusted) data to fit our model. Here is how the adjusted data look like:
 
</p>
 
</p>
  
<h1>FIGURE HERE</h1>
+
<figure class="fig-nonfloat" style="maxwidth:400px;">
 +
        <img src="https://static.igem.org/mediawiki/2017/b/ba/T--ETH_Zurich--adjusted_exp_data.png">
 +
        <figcaption>Figure 4. Adjusted data used for the fit of the expression of luxI.</figcaption>
 +
</figure>
  
 
<p>Finally, because of the absence of replicates in the data, we had to estimate a meaningful standard deviation to be able to use our cost function for the fitting: we assumed a 25% standard deviation for each measurement point, which is the standard deviation that was observed in the data used to fit the expression of luxR.</p>
 
<p>Finally, because of the absence of replicates in the data, we had to estimate a meaningful standard deviation to be able to use our cost function for the fitting: we assumed a 25% standard deviation for each measurement point, which is the standard deviation that was observed in the data used to fit the expression of luxR.</p>
Line 209: Line 227:
 
     </figure>
 
     </figure>
  
<h1>NEEDS A BIG HIGHLIGHT THERE</h1>
+
<section class = "emphasize">
 
<p>From our statistically significant fit and under the assumptions made, we can gather with a 95% confidence that:</p>
 
<p>From our statistically significant fit and under the assumptions made, we can gather with a 95% confidence that:</p>
 
<ol>
 
<ol>
Line 216: Line 234:
 
</ol>
 
</ol>
 
<p><strong>In addition, we can verify that the leakiness of the P<sub>Lux</sub> promoter was successfully fitted two times with excellent consistency on two completely different experiments, supporting the validity of our approach and the assumptions we made.</strong></p>
 
<p><strong>In addition, we can verify that the leakiness of the P<sub>Lux</sub> promoter was successfully fitted two times with excellent consistency on two completely different experiments, supporting the validity of our approach and the assumptions we made.</strong></p>
 +
</section>
 
</section>
 
</section>
  

Latest revision as of 14:52, 23 November 2017

Fit the quorum sensing parameters of our system from experiments

Goal

Once the initial guidelines for the genetic design given in a first step, we need to measure on our obtained biological system whether it behaves as intended. Therefore, we will submit it to carefully designed experiments, in order to fit our model on the obtained data and retrieve crucial parameters. The tricky part here is that in a test tube, our system will behave differently than in a tumor, as the repartition of bacteria is not the same and diffusion of AHL does not take place in the lab experiments. We therefore have to extend our model to take this discrepancy into account.

We then fit three of the most important parameters of our system: the expression levels of luxI and luxR, and the Plux promoter leakiness.

In vitro model extension

Compared to the real-life situation happening in the tumor, during our experiments bacteria are in a very different state: they are in bulk in the liquid culture and not in an open solid medium like they would be around the tumor. Therefore, the diffusion of AHL in the test tube does not take place, and some of our equations used before should be adapted to describe the behavior of our circuit when bacteria are in the situation of our lab experiments: only AHL production and degradation happen in the test tubes.

\[\begin{aligned} \frac{\mathrm{d} [\text{AHL}]}{\mathrm{d} t} &= P_{\text{AHL}} - k_{\text{deg}} [\text{AHL}]\end{aligned}\]

Therefore, once reached the steady state, and thanks to the still valid equation describing AHL production\begin{aligned}P_{\text{AHL}} &= d_{\text{cell}} a_{\text{AHL}} [\text{luxI}]\end{aligned} the level of AHL is now:

\[\begin{aligned} \text{[AHL]} &= \frac{d_{\text{cell}} a_{\text{AHL}} [\text{luxI}]} {k_{\text{deg}}}\end{aligned}\]

The rest of the model remain unchanged, as it concerns intracellular phenomena that are assumed to happen the same way in vitro than in vivo.

Switching OD target for our in vitro experiments

As an example and guide for our experiments, here is are the two different responses that our system should exhibit according to the design guidelines that we produced:

Responses
Expected activation behavior of the quorum sensing module, in the tumor environment (in vivo) or in a tube (in vitro). The observed difference comes from the fact that no diffusion of AHL happens in the tubes during our experiments, making the activation of the quorum sensing taking place at lower OD.

We can see that the correction of the model to stick better to the reality of our experiments is essential to correctly lead our experimentations and subsequent data fits. In the in vitro situation, we should aim for an activation of the quorum sensing at OD 0.05.

LuxR expression fitting

What experiment do we need to fit the expression of LuxR?

LuxR is the regulatory protein binding AHL and subsequently activating a promoter where LuxR-AHL tetramers can bind:

LuxR-AHL binding

The amount of LuxR present in the cell is crucial to know how much AHL is necessary to fully activate the promoter. And inversely, knowing the AHL dose response curve of the very simple system constituted of the constitutively luxR gene and a fluorescent reporter under the Lux promoter is enough to deduce the amount of LuxR in the bacteria, and therefore their expression level. This is why we have designed and performed an experiment to precisely measure a dose response curve of such a system to be able to fit our luxR expression parameter.

Fitting method

We used the the following process to fit our model to experimental data:

  1. From an initial set of parameters for our model, we simulate the response of the system that we should obtain
  2. We quantify the difference (via a cost function) between the model-based response and the actual experimental data
  3. We change the set of parameters in a way that is expected to reduce the cost (with gradient-based method) the and go back to step 1. and iterate

Thanks to the MEIGO toolbox already used for our parameter search, we do not have to implement this whole process de novo but we can focus on the cost function that will be used for the fitting. We built it taking the chi2 test of goodness of fit as a basis, to be able to tell with precision whether our fit is significant or not, obtaining on top of that the uncertainty of the fit of our parameters which arise from experimental variability.

Cost function based on the chi2 test

The most simple way to measure the distance between simulated model data and experimental data is the sum of square of the difference between each couple of point. However, the statistical meaning of this value is not obvious, and it is hard to say from it whether the fit is statistically significant or not. This is why we have implemented a more sophisticated cost function based on the chi2 test of goodness of fit, using the the chi2inv function provided by Matlab. This cost function takes into account the experimental uncertainty to evaluate whether a fit is significant or not. The cost function is then, with theo being the vector of simulated data, exp the vector of experimental data, and σexp the vector of the standard deviation for each data point (computed from triplicates):

\[\begin{aligned} \text{cost}(theo, exp, \sigma exp) = \frac{\sum_i^N\frac{(exp_i-theo_i)^2}{\sigma exp_i^2}}{\text{chi2inv}(0.95,N)} \end{aligned}\]

When this cost is below 1, the fit is considered to be significant, taking into account the experimental uncertainty of the measures. Therefore, the set of parameters corresponding to the lowest obtained cost value will be the center value of the parameter, while the maximum and minimum possible values of the parameter will be the maximal and minimal ones satisfying the condition that the cost is below 1.

Assumptions

We assumed fixed the following parameters of our model, which are considered known well enough from previous characterizations:

Constant Description Value Reference
K_LuxRAHL LuxR-AHL quadrimer binding constant 5.10-10 nM-3 [2]
d_luxR LuxR degradation rate 0.023 min-1 [3]
K_luxR Half-activation LuxR-AHL concentration of the hybrid promoter 5 nM iGEM ETH 2013

From the experimental data of the AHL dose response curve, we could fit two parameters of our model: the expression level of luxR, and the leakiness of our promoter (both in log scale):

LuxR fit
Figure 1. Fit of the expression level of luxR. Parameter space fitting the experimental data. Each point represent a parameter vector that significantly fit the experimental data. The blue points fit the data the best (least sum of square) while the yellow ones represent parameters combinations that barely fit the data (but still significant according to the chi2 test of goodness of fit). Fitted parameters are annotated in red. Unfortunately, a very large values of the Hill coefficient of the promoter in regards to LuxR-AHL were possible, thus it could not be fitted.

From our statistically significant fit and under the assumptions made, we can gather with a 95% confidence that:

  1. The expression level of luxR in our system is equal to 41 ± 7 nM.min-1

    Moreover, this expression level is very close to the expression level we aimed at (100 nM.min-1) during our preliminary parameter space analysis.

  2. The leakiness of the PLux promoter in our system is equal to 3.9 ± 1.5 %

LuxI expression fitting

What experiment do we need to fit the expression of LuxI?

LuxI is the enzyme participating to the positive feedback loop by producing AHL that will bind to LuxR and lead to more activation of the promoter controlling the very own expression of luxI, closing the loop of positive feedback.

AHL synthesis

The strength of the expression of luxI (especially when at the basal activation, without AHL) is a parameter which strongly influences the bacterial cell density threshold at which our system activates. The same way we could deduce the expression level of luxR from the AHL dose response, we will here infer the expression level of luxI by testing over which bacterial cell density our system gets activated. This would not have been possible without knowing luxR expression first, as both parameters have an influence on the response of our system toward bacterial cell density. To see the response of our system to different cells concentration inputs, we have designed and performed a quorum sensing end-point experiment to be able to fit our luxI expression parameter.

Assumptions on parameters

Parameters

We assumed fixed the following parameters of our model, which are considered known well enough from previous characterizations, including our own characterization of luxR expression:

Constant Description Value Reference
K_LuxRAHL LuxR-AHL quadrimer binding constant 5.10-10 nM-3 [2]
d_luxR LuxR degradation rate 0.023 min-1 [3]
K_luxR Half-activation LuxR-AHL concentration of the hybrid promoter 5 nM iGEM ETH 2013
a_AHL AHL synthesis rate by LuxI 0.01 min-1 [2]
k_deg AHL degradation rate 5.10-4 min-1 [4]
d_luxI LuxI degradation rate 0.017 min-1 [4]

We will let the value of a_luxR (expression level of luxR) vary between the two bounds that our first characterization has issued. We don't give any bound for k_luxI, the leakiness, as we should be able to fit it again independently.

Interpretation of the experimental data

We were not able to find any significant fit on the raw experimental data. To be able to still get information out of them, we had to consider that the maximal activation of our system was reached by the most dense and most fluorescent bacteria (which grew in high-glucose-content cultures), even though a clear plateau cannot be directly observed on our data (see Figure 3 below). This interpretation of the data was motivated by the hypothesis that fully activated bacteria were experiencing a high metabolic burden preventing them from reaching the OD they should have reached without this burden.

Figure 3. A) Final absorbance obtained depending on initial glucose concentrations of each culture. B) GFP fluorescence per A600 in response to population density. Colonies were grown over night in media with varying glucose concentrations that lead to different final population densities.

In practice, what we did to get exploitable data for our fit is the following: we fitted first a linear model on the first data points (unaffected by any burden) of the A600 vs. glucose to get the OD that each culture should have reached without burden. We used this adjusted OD value in the (therefore adjusted) data to fit our model. Here is how the adjusted data look like:

Figure 4. Adjusted data used for the fit of the expression of luxI.

Finally, because of the absence of replicates in the data, we had to estimate a meaningful standard deviation to be able to use our cost function for the fitting: we assumed a 25% standard deviation for each measurement point, which is the standard deviation that was observed in the data used to fit the expression of luxR.

From the experimental data of the quorum sensing end-point experiment, we could fit two parameters of our model: the expression level of luxI, and once again the leakiness of our promoter (both in log scale):

LuxR fit
Figure 2. Fit of the expression level of luxI. Parameter space fitting the experimental data. Each point represent a parameter vector that significantly fit the experimental data. The blue points fit the data the best (least sum of square) while the yellow ones represent parameters combinations that barely fit the data (but still significant according to the chi2 test of goodness of fit). Fitted parameters are annotated in red.

From our statistically significant fit and under the assumptions made, we can gather with a 95% confidence that:

  1. The expression level of luxI in our system is comprised between 1.0 and 6.8 µM.min-1, with its most probable value being 3.2 µM.min-1
  2. The leakiness of the PLux promoter in our system is comprised between 2.0 and 6.2 %, with its most probable value being 3.7 %

In addition, we can verify that the leakiness of the PLux promoter was successfully fitted two times with excellent consistency on two completely different experiments, supporting the validity of our approach and the assumptions we made.

References

  1. ^ Vander Heiden, Matthew G., Lewis C. Cantley, and Craig B. Thompson. “Understanding the Warburg Effect: The Metabolic Requirements of Cell Proliferation.” Science (New York, N.Y.) 324.5930 (2009): 1029–1033. PMC. Web. 18 Oct. 2017.
  2. A synthetic multicellular system for programmed pattern formation Subhayu Basu, Yoram Gerchman, Cynthia H. Collins, Frances H. Arnold & Ron WeissNature 434, 1130-1134 (28 April 2005) | doi:10.1038/nature03461
  3. A.B. Goryachev, D.J. Toh T.Lee, Systems analysis of a quorum sensing network: Design constraints imposed by the functional requirements, network topology and kinetic constants Biosystems, Volume 83, Issues 2–3, February–March 2006, Pages 178-187