Model-based design guidelines used to choose our parts rationally
Why do we need to search for functional parameters?
A biological circuit often has different functioning regimes and can only achieve a particular and interesting behavior for a few given combinations of parameters (among which protein expression levels, or promoter sensitivity and leakiness for example). This is why we had to get some insights into the sets of parameters of our circuit and determine a target combination that would make our circuit work, to give out design guidelines for the choice of the parts we would use in the lab.
Please check the full detailed model if you are interested in knowing how our whole model works.
Parameter search
Even before we got our first parts cloned and characterized, we attempted to predict the requirements that they should meet to achieve the criteria previously established from literature data. For this, we extensively explored the parameter space controlling our model, and simulated the response of potential systems to find subsets of parameter combinations satisfying the performance specifications needed to get a sensitive as well as specific tumor sensing circuit.
The parameters satisfying the specifications are called functional space (green ellipse on Figure 1). We selected the biological parts that were the most likely to fall in the functional parameter space.
Different categories of parameters
Our model is relying on a dozen of parameters, some of which we can have a leverage on (typically maximal expression of the proteins, via RBS tuning), and others not (binding constants, promoter leakiness...). Some of these latter parameters have been precisely characterized and others are not very well known. This is why we have chosen to set some parameters to a certain value when we could find a reasonably reliable source in the literature, or when their influence would be redundant with other parameters (such as protein degradation rates and maximal expression rates that can alleviate each other's influence when co-varied), and leave other parameters free to vary to check their influence on our system.
Fixed parameters, because well known
Symbol | Description | Value | Reference |
---|---|---|---|
aAHL | AHL synthesis rate by LuxI | 0.01 min-1 | [2] |
dAHL | AHL degradation rate | 5x10-4 min-1 | [4] |
D | AHL diffusion coefficient in water | 3x10-8 m2min-1 | [4] |
KLuxR-AHL | LuxR-AHL quadrimer binding constant | 5x10-10 nM-3 | [5] |
w | Width of the colonized shell area | 5x10-10 nM-3 | [5] |
Fixed parameters, not very well known but redundant with other parameters
Symbol | Description | Value | Reference |
---|---|---|---|
dLuxI | LuxI degradation rate | 0.017 min-1 | [4] |
dLuxR | LuxR degradation rate | 0.023 min-1 | [4] |
dAzu | Azurin degradation rate | 0.1 min-1 | estimated |
Parameters allowed to vary because not very well known and which may have a significant effect on our circuit
Symbol | Description | Typical value (initial value in the parameter search) | Reference | Lower bound | Higher bound |
---|---|---|---|---|---|
aLuxR | Maximum expression of luxR | 5 nM min-1 | iGEM ETH 2014 | 1x10-2 nM min-1 | 1x104 nM min-1 |
aLuxI | Maximum expression of luxI | 1x103 nM min-1 | [5] | 1x10-2 nM min-1 | 1x104 nM min-1 |
KLac | Half-activation lactate concentration of the hybrid promoter | 2x106 nM | Characterized lactate sensing part on which our AND-gate is based | 1x104 nM | 1x108 nM |
kLuxI | Leakiness of the hybrid promoter | 0.01 | Characterized lactate sensing part on which our AND-gate is based | 0.0001 | 0.1 |
KLuxR | Half-activation LuxR-AHL concentration of the hybrid promoter | 5 nM | iGEM ETH 2013 | 1 nM | 100 nM |
nLuxR | Hill coefficient of the hybrid promoter regarding LuxR-AHL concentration | 1.7 | iGEM ETH 2015 | 1.1 | 1.9 |
nLac | Hill coefficient of the hybrid promoter regarding lactate concentration | 1.7 | iGEM ETH 2015 | 1.1 | 1.9 |
kAzu-LuxI | Relative expression of azurin compared to LuxR | 10 times the luxI expression | estimated | 10-5 | 105 |
How do we quantitatively define the functional space?
Cost function
To be able to distinguish systems satisfying the criteria about specificity and azurin production and the ones that do not, we need to use a numerically evaluable condition quantifying how well the criteria are met. Based on this, the script will either accept or discard the parameter set. For this, we will use the following cost function, taking a parameter vector as argument:
\[\text{cost}(p) = \max \left( \frac{10\cdot {[Azu]_{\mathrm{(low\; lac,HIGH\; d_{cell})}}}}{[Azu]_{\mathrm{(HIGH\; lac,HIGH\; d_{cell})}}}, \frac{10\cdot {[Azu]_{\mathrm{(HIGH\; lac,low\; d_{cell})}}}}{[Azu]_{\mathrm{(HIGH\; lac,HIGH\; d_{cell})}}}, \frac{1\times10^{6}}{[Azu]_{\mathrm{(HIGH\; lac,HIGH\; d_{cell})}}}\right)\]
Each argument of the max function represents in the same order the following criteria:
- Specificity of the sensing for lactate
- Specificity of the sensing for bacterial cell density
- Achieving a large amount of produced azurin
Interpretation of the result of the cost function goes as follows: the smaller the value the better better the criterium is met. If the highest value (e.g. the value for the criterium is met the worst) is below 1, the paramter set is accepted. Over 1, a ratio is not good enough. This monotonicity enables us to rely on optimization algorithms to reach the best combination of parameters available. Also, we can say that every system that has a cost function value below 1 is good enough for us, while "the smaller the better" still applies.
Intermediate modeling result: Analysis of the functional parameter space
Using an optimization toolbox developed for biological systems, MEIGO [6], followed by a package exploring parameter spaces, HYPERSPACE [7], we could obtain the following graphs describing, in the high-dimension space of all possible circuits, a subset of systems satisfying our performance criteria:
On this figure are shown the systems suitable for our application. All the axis are logarithmic, except for nLac and nLuxR. The yellow points are good systems, the blue ones are even better and surpass the specifications that we demand. From this figure, we can draw the following interpretations (see corresponding sub-graphs referred to on the figure).
- Only some given combination of expression of LuxI and LuxR are suitable for our needs. This is expected as the tuning of the bacterial cell density at which the quorum sensing is triggered is mainly done with these two proteins
- High amounts of azurin are more easily achieved when LuxI maximal expression is high: then the expression of azurin does not need to be that much more compared to luxI to reach the desired level.
- The tipping point of the lactate sensing must be either around or above the lactate levels to be distinguished (1 mM in healthy tissues and 5mM in tumors). The first possibility makes sense as the promoter should ideally be unactivated at low lactate level and activated above. However, the combination of this lactate sensing and quorum sensing into the hybrid promoter seems to allow for a second possibility: that the full activation of the promoter happens at much higher concentrations. In both cases, the differential expression at 1 mM and 5 mM plays the role of "increasing the leakiness" of the promoter in regard to luxR so that the quorum sensing is more easily activated in presence of lactate.
- The leakiness is a very important parameter to be able to achieve a good performance for our system. The smaller the leakiness, the more probable it is to find a good system.
- The Hill coefficient of our hybrid promoter in regard to lactate will allow more or less possibilities of systems: when over 1.5, a population of systems is present (more on the yellow side) that allows for a larger set of aLuxR/aLuxI combinations (see also nLac vs aLuxR and nLac vs aLuxI graphs). As we won't be able to tune it, we should prepare for the worst and try to aim for the best systems (the blue ones) on graph 1 to keep a security margin
- KLuxR and nLuxR don't have a significant influence on our system, we can stop studying them
Final modeling result: Experimental guidelines used for circuit design
From these observations, we can deduce guidelines regarding the parameters on which we can exert an active control, that is to say the expression level of the genes LuxI and LuxR (aLuxR and aLuxI here) as well as a judicious choice of a previously characterized lactate sensor circuit (comprising lldR and lldP genes) among the iGEM ETH 2015 part collection .
Target parameters and restrictions
To translate these insights into experimental results in the lab, we need to chose a target in the range of parameters that works for our application. With the help of the previously characterized initial values for aLuxR (5 nMmin-1) and aLuxI (1x103 nMmin-1), we can hope to tune our system and reach our target in the parameter space via simple RBS tuning which is supported by the Salis Lab RBS Calculator [7].
As it turned out, the regulatory sequence in front of the luxR gene on the part at our disposal induced already a relatively high expression level. It was hard to get more than 10 times more expression for this gene on the Salis calculator, this is why the range aLuxR > 1x102 nMmin-1 is inaccessible to us (grey area), and that we have to choose LuxI in consequence. We also get to chose KLac among the ones available in the promoter collection of parts ranging from BBa_K1847002 to BBa_K1847009: between 0.3 mM and 2.4mM.
Taking into account the experimental constraints (forbidden grey area), the targeted parameters (red squares) were chosen on the following plot, with an extensive compatibility for different potential leakiness of our hybrid promoter (red frame):
With aLuxR = 1x102 nMmin-1, aLuxI = 1x104 nMmin-1 and KLac = 1x106 nM, we should be at a suitable operating point for our system and still have some security margin in case the genetic design does not yield the exact expression levels that we would expect from it. To achieve these parameters, we gave the following directions for the design of our parts:
- Use a 10 times stronger RBS than on the piG0047 sequence of iGEM ETH 2014 team for the expression of LuxR
- Use a 10 times stronger RBS than on the piG0050 sequence of iGEM ETH 2014 team for the expression of LuxI
- Use the BBa_K1847008 part with J23118-B0034 regulatory sequences, giving Klac = 1.8 mM
These value were the basis for the design of our parts and the subsequent experimentations.
Sanity check: in silico behavior for the chosen target parameters
We can validate on our model that they would work well to distinguish the specific levels dictated by our application:We can confirm that the obtained parameter set target would lead to a functioning circuit.
Going further: a 3D tumor model
To assess the validity of this analytical model, and especially validate the assumptions regarding the AHL diffusion model, we developed a comprehensive 3D model of a colonized tumor on the software COMSOL, on which we implemented the same equations governing our bacterial circuit.
References
- Kaplan HB, Greenberg EP. Diffusion of autoinducer is involved in regulation of the Vibrio fischeri luminescence system. 1985;163(3):1210-1214.
- Jordi Garcia-Ojalvo, Michael B. Elowitz, and Steven H. Strogatz Modeling a synthetic multicellular clock: Repressilators coupled by quorum sensing 2004 101 (30) 10955-10960
- Fekete, A., Kuttler, C., Rothballer, M., Hense, B. A., Fischer, D., Buddrus-Schiemann, K., Lucio, M., Müller, J., Schmitt-Kopplin, P. and Hartmann, A. (2010), Dynamic regulation of N-acyl-homoserine lactone production and degradation in Pseudomonas putida IsoF. , 72: 22–34. doi:10.1111/j.1574-6941.2009.00828.x
- A.B. Goryachev, D.J. Toh T.Lee, Systems analysis of a quorum sensing network: Design constraints imposed by the functional requirements, network topology and kinetic constants , Volume 83, Issues 2–3, February–March 2006, Pages 178-187
- A synthetic multicellular system for programmed pattern formation Subhayu Basu, Yoram Gerchman, Cynthia H. Collins, Frances H. Arnold & Ron Weiss 434, 1130-1134 (28 April 2005) | doi:10.1038/nature03461
- Egea JA, Henriques D, Cokelaer T, Villaverde AF, MacNamara A, Danciu DP, Banga JR and Saez-Rodriguez J. (2014) MEIGO: an open-source software suite based on metaheuristics for global optimization in systems biology and bioinformatics. 15:136.
- Zamora-Sillero E, Hafner M, Ibig A, Stelling J, Wagner A (2011) Efficient characterization of high-dimensional parameter spaces for systems biology. 5: 142.http://doi.org/10.1186/1752-0509-5-142
- Salis, Howard M., Ethan A. Mirsky, and Christopher A. Voigt. "Automated design of synthetic ribosome binding sites to control protein expression." 27.10 (2009): 946-950. http://dx.doi.org/10.1038/nbt.1568