Modeling| Phage Titer| https://static.igem.org/mediawiki/2017/3/38/T--Heidelberg--2017_Background_Owl.jpg%7Cblue%7C
While developing PREDCEL in the lab, we simultaneously developed it in silico so that both sides could benefit from each other. One of the most important parameter of phage assisted directed evolution experiments like PREDCEL and PACE is the phage titer itself. If the phage titer drops washout can occur and the experiment has to be restarted with the disadvantage of loosing library complexity. If the phage titer increases too much, the multiplicity of infection (MOI), that means the amount of phage relative the the amount of E. coli rises too. If fore example the MOI is 10 and an E. coli can only be infected by one phage, nine out of ten phages will not infect an E. coli and thus will not evolve, but still make up most of the phage population.
Comparing the plots for 20 % and 100 % fitness show that a higher fitness increases the final phage titer and decreases the amount of uninfected E. coli earlier.Modeling concentrations in one Lagoon
Here the concentrations \(c\) of uninfected E. coli, infected E. coli and phage producing E. coli as well as the M13 phage are modeled. They are denoted with the subscripts \(_{u}\), \(_{i}\), \(_{p}\) and \(_{P}\). If the whole E. coli population is referred to, \(c_{E}\) is used. If an arbitrary E. coli population is meant, the subscript \(_{e}\) is used. The phage concentration \(c_{P}\) refers to the free phage only, phage that are contained in an E. coli they infected are not included. The used parameters include the time \(t\), the affinity of phage for E. coli \(k\), the duration between infection of an E. coli and the first phage leaving the E. coli \(t_{P}\). The three different E. coli populations each have a generation time \(t\) that is denoted with their subscript. The fitness of a phage population is \(f\).Table X: Title Subtitle
TIme Date Operator Sample Action DONE ToDo: OD Turbidostat 2 Stock ID TO DOs: 12:00 24/10/2017 MP - Started filling the turbidostat (T2) with medium 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 13:30 24/10/2017 MP - Inoculation of turbidostat 2 with 50 ml Stock culture 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 14:00 24/10/2017 MP - Measured OD "0 18" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 15:00 24/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 16:00 24/10/2017 MP - Measured OD "0 511" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 17:00 24/10/2017 MP "Measured OD started filling the lagoon and the arabinose pump" "0 548" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 18:00 24/10/2017 MP 1 Inoculation with 1 ml phage supernatant "0 514" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 19:00 24/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 20:00 24/10/2017 MS 1 "Measured OD Took samples L1 T1 - 1" "0 637" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 21:00 24/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 22:00 24/10/2017 MP "Measured OD Flow rate for lagoon was to high from previous filling ( 7 ml/min in contrast to 1 666 ml/min) - potential washout " "0 791" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 23:00 24/10/2017 MP "Lagoon was infected with 0 8 ml 10^8 PFU/ml phages to counteract the potential washout " "0 818" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 00:00 25/10/2017 MP 2 "Measured OD Took samples L1 T1 - 2" "0 833" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) TO DO: Phage-PCR 01:00 25/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 02:00 25/10/2017 MS Measured OD "0 789" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 03:00 25/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 04:00 25/10/2017 MS 3 "Measured OD Took samples L1 T1 - 3" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 05:00 25/10/2017 "0 792" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 06:00 25/10/2017 MS Measured OD 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 07:00 25/10/2017 "0 817" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 08:00 25/10/2017 TB 4 "Measured OD Took samples L1 T1 - 4" Took samples 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 09:00 25/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 10:00 25/10/2017 TB Measured OD "0 816" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 11:00 25/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 12:00 25/10/2017 TB Measured OD Done PCR "0 81" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) TO DO: Phage-PCR 13:00 25/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 14:00 25/10/2017 PP Measured OD "0 843" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 15:00 25/10/2017 inoculated growth for plaque assay 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 16:00 25/10/2017 PP 5 "Measured OD Took samples L1 T1 - 5" "0 833" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 17:00 25/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 18:00 25/10/2017 PP Measured OD "0 836" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 19:00 25/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 20:00 25/10/2017 MP Measured OD "0 862" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) TO DO: Plaque Assay 21:00 25/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 22:00 25/10/2017 MP Measured OD "0 81" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 23:00 25/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 00:00 26/10/2017 LP 6 "Measured OD Took samples L1 T1 - 6" Done PCR "0 809" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) TO DO: Phage-PCR 01:00 26/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 02:00 26/10/2017 CG Measured OD "0 845" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 03:00 26/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 04:00 26/10/2017 CG Measured OD "0 864" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 05:00 26/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 06:00 26/10/2017 CG Measured OD "0 836" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 07:00 26/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 08:00 26/10/2017 TB 7 "Measured OD Took samples L1 T1 - 7" "0 831" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 09:00 26/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 10:00 26/10/2017 TB Measured OD "0 827" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 11:00 26/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 12:00 26/10/2017 TB Measured OD 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) TO DO: Phage-PCR 13:00 26/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 14:00 26/10/2017 TB Measured OD "0 738" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 15:00 26/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 16:00 26/10/2017 TB 8 "Measured OD Took samples L1 T1 - 8" "Took samples L1 T1" "0 652" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 17:00 26/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 18:00 26/10/2017 Measured OD 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 19:00 26/10/2017 MS "0 582" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 20:00 26/10/2017 Measured OD Plaque assay was performed successfully 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) TO DO: Plaque Assay 21:00 26/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 22:00 26/10/2017 MS Measured OD "0 633" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 23:00 26/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 00:00 27/10/2017 PP 9 "Measured OD Took samples L1 T1 - 9" "0 511" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) TO DO: Phage-PCR 01:00 27/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 02:00 27/10/2017 MS Measured OD "0 692" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 03:00 27/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 04:00 27/10/2017 MS Measured OD "0 937" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 05:00 27/10/2017 "0 945" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 06:00 27/10/2017 MP Measured OD "could not be measured as lagoon medium level was too high and therefore pump where set off for a short time " 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 07:00 27/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 08:00 27/10/2017 MS 10 "Measured OD Took samples L1 T1 - 10" "Phage detection PCR with samples 8 9 and 10 for Dickinson-Phage and PI-Phage" "0 912" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 09:00 27/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 10:00 27/10/2017 PP Measured OD "0 903" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 11:00 27/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 12:00 27/10/2017 LP Measured OD see above "0 924" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) TO DO: Phage-PCR 13:00 27/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 14:00 27/10/2017 MP Measured OD "0 942" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 15:00 27/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 16:00 27/10/2017 PP 11 "Measured OD Took samples L1 T1 - 11" took samples "0 942" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 17:00 27/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 18:00 27/10/2017 MP Measured OD "0 822" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 19:00 27/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 20:00 27/10/2017 MK Measured OD "0 59" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) TO DO: Plaque Assay 21:00 27/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 22:00 27/10/2017 CG Measured OD "0 61" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 23:00 27/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 00:00 28/10/2017 MP 12 "Measured OD Took samples L1 T1 - 12" Phage detection PCR with sample 11 "0 664" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) TO DO: Phage-PCR 01:00 28/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 02:00 28/10/2017 CG Measured OD "0 809" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 03:00 28/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 04:00 28/10/2017 MP Measured OD PCR: negative - phage washout "0 820" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 05:00 28/10/2017 "Flow rate was adjusted: 1 ml/min lagoon was infected by pooled samples 6 7 and 8 " 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 06:00 28/10/2017 MP Measured OD 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 07:00 28/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 08:00 28/10/2017 MP 13 "Measured OD Took samples L1 T1 - 13" "0 667" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 09:00 28/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 10:00 28/10/2017 PP Measured OD "0 62" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 11:00 28/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 12:00 28/10/2017 PP Measured OD Phage PCR was performed "0 768" 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) TO DO: Phage-PCR 13:00 28/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 14:00 28/10/2017 PP Measured OD 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 15:00 28/10/2017 199 (pMax69 aka AP-SplitT7 + GenIII+YFP + MP4) 16:00 28/10/2017 PP 14 "Measured OD Took samples L1 T1 - 14" FINAL PROBES
Bacterial Growth
Each term describing the change of an E. coli concentration contains its growth, \(g_{e}\). The growth rate of an E. coli population can be modeled by exponential growth or by logistic growth. Especially, when long durations per lagoon are modeled, the logistic growth model is more exact. [source]. In the exponential case the growth rate \(g_{e}\) is modeled as $$ g_{e} (t_{e}) = c_{e} \cdot \frac{log(2)}{t_{e} } $$ Note that the growth rate in the model increases over time, while in the modeled culture, the nutrient concentration decreases. That makes the logistic model more plausible, it models \(g_{e}\) as $$ g_{e} (t_{e}, \: c_{e}(t), \: c_{c}) = \frac{c_{c} - c_{e} (t)}{c_{c} } \cdot \frac{log(2)}{t_{e} } $$ In this case the learning rate decreases as the current concentration \(c_{e}\) approaches the maximum capacity for E. coli in the given setup \(c_{c}\). With this model \(c_{e} \leq c_{c}\) is true for any point in time.Transitions between uninfected, infected and phage-producing
Change of concentration of uninfected E. coli, \(\frac{\partial c_{u} }{\partial t} \: [cfu/min]\) $$ \frac{\partial c_{u} }{\partial t}(t) = g_{u} (t_{u}, \: c_{u}(t), \: c_{c}) - k \cdot c_{u}(t) \cdot c_{p}(t) $$ In addition to the growth term, the concentration of uninfected E. coli is described by a term for infection that takes into account the concentration of uninfected E. coli and the concentration of free phage and reduces the conentration of uninfected E. coli. Change of concentration of uninfected E. coli, \(\frac{\partial c_{i} }{\partial t} \: [cfu/min]\) $$ \frac{\partial c_{i} }{\partial t}(t) = \begin{cases} g_{i} (t_{i}, \: c_{i}(t), \:c_{c}) + k \cdot c_{i}(t) \cdot c_{p}(t) - c_{i}(t - t_{P}), \quad \text{for} \: t > t_{P} \\ g_{i} (t_{i}, \: c_{i}(t), \: c_{c}) + k \cdot c_{i}(t) \cdot c_{p}(t), \quad \text{otherwise} \end{cases} $$ Until \(t > t_{P}\) the concentration of infected E. coli increases by growth and infection of previouly uninfected E. coli. When \(t > t_{P}\), a third term describing that infected E. coli turn into phage-producing E. coli is subtracted. Change of concentration of phage producing E. coli, \(\frac{\partial c_{p} }{\partial t} \: [cfu/min]\) $$ \frac{\partial c_{p} }{\partial t}(t) = \begin{cases} g_{p} (t_{p}, \: c_{p}(t), \: c_{c}) - c_{i}(t - t_{P}), \quad \text{for} \: t > t_{P} \\ g_{p} (t_{p}, \: c_{p}(t), \: c_{c}), \quad \text{otherwise} \end{cases} $$ The population of phage producing E. coli only increases by growth until \(t > t_{P}\). When infected E. coli drop their first phage they turn into producing E. coli as described by the second term.Phage propagation
Change of concentration of M13 phage, \(\frac{\partial c_{P} }{\partial t} \: [cpu/min]\) $$ \frac{\partial c_{P} }{\partial t}(t) = c_{P}(t) \cdot \mu_{max} \cdot f - k \cdot c_{u}(t)\cdot c_{P}(t) $$ The phage concentration is only increased by phage that leave phage-producing E. coli, which happens at a rate of \(f \cdot \mu_{max}\) per time unit, with f being the fitness, a value between 0 and 1, equal to the share of the wildtype M13 phages fitness and \(\mu_{max}\) being the wildtype phages production rate. We assume that the only negative influence on the free phage titer is phage infecting E. coli, which depends on both the phage titer \(c_{P}\) and the titer of uninfected E. coli, \(c_{i}\). When the fitness is assumed to be the same for all phages, it is modeled to be constant during the time in one lagoon.
Numeric solutions
The problem described above is a system of four differential equations, of which two ( \(\frac{\partial c_{i} }{\partial t} \:, \: \frac{\partial c_{p} }{\partial t}\) ) are so called delayed differential equations. They contain a term that needs to be evaluated at a timepoint in the past \(t - t_{P}\). A python script was used to solve the problem numerically, using the explicit Euler method. The basic idea is that from a point in time with all values and all derivatives values given, the next point in time can be calculated by assuming a linear progress between the two points. $$ f(t_{n+1}) = f(t_{n}) + (t_{n+1} - t_{n}) \cdot f'(t_{n}) $$ This is performed for \(c_{u}(t)\), \(c_{i}(t)\), \(c_{p}(t)\) and \(c_{P}(t)\) rotatory, to always have the needed values from \(t_{n}\) ready for \(t_{n+1}\).Table 2: Additional Variables and Parameters used in the numeric solution of the model List of all additional paramters and variables used in the numeric solution of this model. When possible values are given.
Symbol Name in Source code Value and Unit Explanation \(v_{l}\) vl
[ml] Volume of lagoon \(t_{l} \) tl
[min] Duration until transfer to the next lagoon \(c_{u}(t_{0})\) ceu0
[cfu] Concentration of E. coli in a lagoon when M13 phages are transfered to it \(c_{P}(t_{0})\) cp0
[pfu] Initial concentration of M13 phage in the first lagoon \(n\) epochs
- Number of epochs that are modeled, one epoch being everything that happens in one particular lagoon \(s\) tsteps
- Number of time steps for which numeric solutions are calculated, counted per epoch \(c_{P}^{min}\) min_cp
[pfu] Lower threshold for valid phage titers \(c_{P}^{max}\) max_cp
[pfu] Upper threshold for valid phage titers
Fitness distributions
If a fitness distribution is assumed, the fitness does not necessarily stay constant in one lagoon. The distribution is initialized with 100 % of the phages having the starting fitness. First changes in the fitness distribution occur after the first E. coli start to release phages.
The fitness distribution is modeled to change by mutation and selection. The selection is implemented in the following equation: $$ s_{i}(t_{n + 1}) = f_{i}(t_{n}) \cdot s_{i}(t_{n}) $$ \(f_{i}\) is one of \(N\) fitness values, \(s_{i}\) is the share of phages with that fitness value relative to the total phage population. Since the fitness is interpreted as a percentage of the wildtype fitness in this model, the new shares can be calculated als the product of the fitness value \(f_{i}\) and its old share \(s_{i}(t_{n})\). The resulting values are then normalized by divsion with $$ \sum_{i = 0}^{N} f_{i}(t_{n + 1}) $$ in order to always have shares that sum up to 1.
The mutation is implemented as a function that manipulates the fitness shares and applied to the fitness shares after selection. Here, \(r_{M}\) is the mutation rate, that means the amount of the fitness share \(i\) that changed during the mutation. \(w\) is the width of a fitness bin: $$ w = \frac{1}{N - 1} $$ The new shares are calculated as $$ s_{i}(t_{n + 1}) = \big(1 - r_{M}\big) \cdot s_{i}(t_{n}) + r_{M} \sum_{i = 0}^{N} \int_{f_{i} - \frac{1}{2} \cdot w}^{f_{i} + \frac{1}{2} \cdot w} \mathcal{N}_{\mu=f_{i}, \: \sigma}(x) \: dx $$ Special cases:
For \(f_{0}\) the integral is changed to $$ \int_{- \infty }^{f_{0} + \frac{1}{2} \cdot w} \mathcal{N}_{\mu=f_{i}, \: \sigma}(x) \: dx $$ For \(f_{0}\) the integral is changed to $$ \int_{f_{N} - \frac{1}{2} \cdot w}^{\infty} \mathcal{N}_{\mu=f_{i}, \: \sigma}(x) \: dx $$ to have the integrals over the normal distribution add up to 1. The normal distribution \(\mathcal{N}\) is used to model that many mutations have only small effects on the fitness, while some have a larger effect. \(\sigma\) is a parameter of the model that influences how much the new scores differ from the previous ones. In this model it is theoretically possible that mutation turns any given fitness into any other fitness, however small values for can \(\sigma\) prevent this.
When the the concentrations are modeled over a longer period of time the fitness of the wildtype is reached. To more accurate model evolution in which the wildtype fitness is never reached, \(\mu_{max}\) can be set to a smaller value. This value will be reached by most of the phages, if enough time is given. Finally almost no phages with the lowest fitness can be found in the lagoon.
Since most mutations decrease fitness in reality, a symmetric function like the normal distributions density function may not be the best choice. Consequently we also implemented a mutation function with a skew normal distribution \(\mathcal{S}\), that replaces \(\mathcal{N}\). It has an additional parameter \(\alpha\) that modulates the skewness. $$ s_{i}(t_{n + 1}) = \big(1 - r_{M}\big) \cdot s_{i}(t_{n}) + r_{M} \sum_{i = 0}^{N} \int_{f_{i} - \frac{1}{2} \cdot w}^{f_{i} + \frac{1}{2} \cdot w} \mathcal{S}_{\mu=f_{i}, \: \sigma, \: \alpha}(x) \: dx $$ Special cases:
For \(f_{0}\) the integral is changed to $$ \int_{- \infty }^{f_{0} + \frac{1}{2} \cdot w} \mathcal{S}_{\mu=f_{i}, \: \sigma, \: \alpha}(x) \: dx $$ For \(f_{0}\) the integral is changed to $$ \int_{f_{N} - \frac{1}{2} \cdot w}^{\infty} \mathcal{S}_{\mu=f_{i}, \: \sigma, \: \alpha}(x) \: dx $$ to have the integrals over the normal distribution add up to 1.
The equation for \(\frac{\partial c_{P} (t)}{\partial t}\) is changed into the following to incorporate the distributed fitness. $$ \frac{\partial c_{P} (t)}{\partial t} = -k \cdot c_{u}(t) \cdot c_{P} (t) + \sum_{i = 0}^N f_{i} \cdot s_{i} \cdot \mu \cdot c_{p} (t) $$
Other assumptions for the mutation function are possible, for example a function that randomly samples from a normal distribution but is biased towards mutations that decrease fitness.
Since this model of fitness and mutation is heavily abstracted it is most likely limeted to qualitative statements. Still it provides an intuitive understanding on how directed evolution can work and illustrates the role of selection and mutation. If the model without mutation (Fig. 3) is compared to one with mutation and selection (Fig. 6) role of mutation, diversificating the pool of phages can be seen. When comparing a model without selection (Fig. 9) to the one with mutation and selection (Fig. 6) the role of selection can be described as channeling the variants caused by mutation into a specific direction.
Effects of noise
Errors in the parameters of the model can be due discrepancies of the modeled experiments conditions and the conditions under which parameters were determined. To estimate how problematic the resulting errors of the model are, we compared the model to data that was obtained from our other experiments.To make general statements on how sensitive the model reacts to noise in its parameters, a function was introduced that adds adjustable gaussian noise to all parameters. It was scaled relative to each parameters value. The function \(n(v)\) returns the noisy value of \(v\), the random argument \(r\) is sampled from a uniform distribution over \([0, 1]\). $$ n(v) = v + \big(1 - 2r\big) \cdot \sigma_{G} \cdot \sigma_{v} \cdot v, \quad r \in [0, 1] $$ \(\sigma_{G}\) is a global factor that is the same for all \(v\), \(\sigma_{v}\) is specific for \(v\). This makes it possible to have one parameter being noisier than another, while being able to tune the noise globally.
When different levels of noise are added to the same model (Fig. 10), the comparison of predicted phage titer shows almost no difference. While the predicted E. coli titers show some differences, the total E. coli titer is similar between the different versions. That can hint at errors in the E. coli concentrations cancelling each other out.
The model seems to be sufficiently robust against random errors. However systematic errors in the parameters can still have a large effect on predicted phage and E. coli titers.
Modeling concentrations over multiple Lagoons
PREDCEL is based on transferring phage from one lagoon into the next. This substitutes the constant supply with fresh medium, fresh E. coli and the constant dilution by a turbidostat. Because transferring decreases the phage by the relation of the transfer volume \(V_{t}\) to the lagoon volume \(v_{L}\), there is an increased risk of washout. This model is partly motivated by the goal to minimize that risk. The basic idea is to enable the user to react to a diminishig phage titer early enough while maintaining a reasonable level of monitoring.
In all following model versions, a distributional fitness, transfer of phage and E. coli and logistic growth are assumed. Due to their lower plausibility the alternatives are not shown, but are available as functions in the script.
When transfer from one volume to the next is performed, the new lagoon can be modeled with starting values calculated from the last lagoons end values. For each concentration from the previous lagoon \(c_{t}\), the concentration in the next lagoon \(c_{t+1}\) is calculated as $$ c_{t+1} = \frac{v_{t} }{v_{l} } \cdot c_{t} $$ with \(v_{L}\), the volume of a lagoon and \(v_{t}\), the volume that is transferred.
If the transfered volume is spinned down before it is added to the new lagoon, the initial value for \(c_{P}\) is calculated this way. The initial concentration of uninfected E. coli is set to the initial cell density. Initial concentrations of infected and phage-producing E. coli are set to zero, because before the transfer, no phages are present in the new lagoon. If the transfer volume is not spinned down, the concentration of infected and phage-producing E. coli are calculated, using the above formula. The initial concentration of uninfected E. coli is the calculated the same way, but the initial cell density is added.
Comparison to data from CYP1A2-PREDCEL
To asses the model in comparison with real data, phage titers from CYP1A2-PREDCEL were modeled with different values for the unkown parameters and the set of parameters that described the data best was chosen.
The phage titer data consists of two sets, a first round of PREDCEL with 1 transfer, followed by amplification of the resulting library on a strain that allows the phage to propagate independent of the performance of their CYP1A2 sequence. Then a second round of PREDCEL with one transfer was done. The time between two transfers \(t_{L}\) was 3 h, the lagoon volume \(v_{L}\) 20 ml and the transferred volume \(v_{T}\) 3 ml. Unknown parameters are the initial fitness, how the fitness changes and the phage degradation rate \(r_{D}\).
Both round were best described an initial fitness of \(f(t_{0}) = 0.001\), and a degration rate of \(r_{D} = 0.075\). For the first round constant fitness was assumed, since the initial library complexity is 1 and a sufficiently sized library is a maijor requirement for fitness to rise. For the second round the starting fitness was assumed to be identical to the fitness of the first round, then increasing to \(f(t_{0}) = 0.025\) after the transfer.
The initial phage titers for both rounds were given in the data. The full conditions for both models and the full data is available below.
Finding Parameters
Combinations of parameters that result in maintaing an acceptable phage titer most easily be found by calculating the phage titers for different sets parameters systematically. For this purpose a function was implemented that systematically tries all combinations of a given number \(n\) different values for the initial fitness \(f_{0}\), the time in one lagoon \(t_{L}\) and the transferred volume \(v_{t}\). A tunable parameter \(\delta\) defines how the range over wich the \(n\) values for one of the parameters are spread relative to the value of the parameter. For each parameter \(p\) the tried values are in $$ \Big\{p + \big(1 - \delta \big) + i \frac{2 \delta}{n - 1} \cdot p \: \Big\vert \: i = 0, ..., (n - 1)\Big\} $$
In directed evolution the fitness should increase over time. A linear increase in fitness between to given values was implemented to show this. The problem with this approach is its basic assumption being that all phage-producing E. coli are infected by phages with the same fitness. To make the model more plausible, a distribution of fitness was introduced. For a set of discrete fitness values each fitness values share of the phage-producing E. coli population is calculated. That changes the equation for the change in the concentration of phage-producing E. coli to The calculation is for \(N\) different fitness values \(f_{i}\) and their share of the total phage-producing E. coli population \(s_{i}\).
Parameters used for the figures
Figure 1, 3 - 20 % fitness'capacity': 1000000000.0, 'ceu0': 100000000.0, 'cp0': 100000000.0, 'epochs': 1, 'f0': 0.2, 'f_prec': 21, 'fend': 1.0, 'fitnessmode': 'dist', 'ftype': 'const', 'growth_mode': 'logistic', 'k': 3e-11, 'max_cp': 2000000000.0, 'min_cp': 100000.0, 'mumax': 16.667, 'mutation_dist': 'norm', 'noisy': 0.0, 'phageonly': 'True', 'plot_dist': 'True', 'sigma': 0.0, 'skewness': 1.0, 'ti': 30, 'tl': 60, 'to_mutate': 0.0, 'tp': 40, 'tpp': 10, 'tsteps': 100, 'tu': 20, 'vl': 20, 'vt': 1Figure 2 - 100 % fitness'capacity': 1000000000.0, 'ceu0': 100000000.0, 'cp0': 100000000.0, 'epochs': 1, 'f0': 1.0, 'f_prec': 21, 'fend': 1.0, 'fitnessmode': 'const', 'ftype': 'const', 'growth_mode': 'logistic', 'k': 3e-11, 'max_cp': 2000000000.0, 'min_cp': 100000.0, 'mumax': 16.667, 'mutation_dist': 'norm', 'noisy': 0.0, 'phageonly': 'True', 'plot_dist': 'False', 'sigma': 0.001, 'skewness': 1.0, 'ti': 30, 'tl': 60, 'to_mutate': 0.0, 'tp': 40, 'tpp': 10, 'tsteps': 100, 'tu': 20, 'vl': 20, 'vt': 1Figure 4 - Distributional fitness 1 h'capacity': 1000000000.0, 'ceu0': 100000000.0, 'cp0': 100000000.0, 'epochs': 1, 'f0': 0.2, 'f_prec': 21, 'fend': 1.0, 'fitnessmode': 'dist', 'ftype': 'const', 'growth_mode': 'logistic', 'k': 3e-11, 'max_cp': 2000000000.0, 'min_cp': 100000.0, 'mumax': 16.667, 'mutation_dist': 'norm', 'noisy': 0.0, 'phageonly': 'True', 'plot_dist': 'True', 'sigma': 0.0, 'skewness': 1.0, 'ti': 30, 'tl': 60, 'to_mutate': 0.0, 'tp': 40, 'tpp': 10, 'tsteps': 100, 'tu': 20, 'vl': 20, 'vt': 1Figure 5, 6, 7 - Distributional fitness 6.5 h'capacity': 1000000000.0, 'ceu0': 100000000.0, 'cp0': 100000000.0, 'epochs': 1, 'f0': 0.2, 'f_prec': 21, 'fend': 1.0, 'fitnessmode': 'dist', 'ftype': 'const', 'growth_mode': 'logistic', 'k': 3e-11, 'max_cp': 2000000000.0, 'min_cp': 100000.0, 'mumax': 16.667, 'mutation_dist': 'norm', 'noisy': 0.0, 'phageonly': 'True', 'plot_dist': 'True', 'sigma': 0.1, 'skewness': 1.0, 'ti': 30, 'tl': 400, 'to_mutate': 0.1, 'tp': 40, 'tpp': 10, 'tsteps': 2000, 'tu': 20, 'vl': 20, 'vt': 1Figure 8 - Distributional fitness skew'capacity': 1000000000.0, 'ceu0': 100000000.0, 'cp0': 100000000.0, 'epochs': 1, 'f0': 0.2, 'f_prec': 21, 'fend': 1.0, 'fitnessmode': 'dist', 'ftype': 'const', 'growth_mode': 'logistic', 'k': 3e-11, 'max_cp': 2000000000.0, 'min_cp': 100000.0, 'mumax': 16.667, 'mutation_dist': 'skew', 'noisy': 0.0, 'phageonly': 'True', 'plot_dist': 'True', 'sigma': 0.1, 'skewness': -1.0, 'ti': 30, 'tl': 400, 'to_mutate': 0.1, 'tp': 40, 'tpp': 10, 'tsteps': 2000, 'tu': 20, 'vl': 20, 'vt': 1Figure 9 - Distributional fitness without selection'capacity': 1000000000.0, 'ceu0': 100000000.0, 'cp0': 100000000.0, 'epochs': 1, 'f0': 0.2, 'f_prec': 21, 'fend': 1.0, 'fitnessmode': 'dist', 'ftype': 'const', 'growth_mode': 'logistic', 'k': 3e-11, 'max_cp': 2000000000.0, 'min_cp': 100000.0, 'mumax': 16.667, 'mutation_dist': 'norm', 'noisy': 0.0, 'phageonly': 'True', 'plot_dist': 'True', 'sigma': 0.1, 'skewness': -1.0, 'ti': 30, 'tl': 400, 'to_mutate': 0.1, 'tp': 40, 'tpp': 10, 'tsteps': 500, 'tu': 20, 'vl': 20, 'vt': 1Figure 10 - Effects of different noise levels'capacity': 1000000000.0, 'ceu0': 100000000.0, 'cp0': 100000000.0, 'epochs': 1, 'f0': 0.2, 'f_prec': 21, 'fend': 1.0, 'fitnessmode': 'dist', 'ftype': 'const', 'growth_mode': 'logistic', 'k': 3e-11, 'max_cp': 2000000000.0, 'min_cp': 100000.0, 'mumax': 16.667, 'mutation_dist': 'skew', 'noisy': 0.05, # or 0.1, 0.2, 0.5 'phageonly': 'True', 'plot_dist': 'True', 'sigma': 0.1, 'skewness': -1.0, 'ti': 30, 'tl': 60, 'to_mutate': 0.1, 'tp': 40, 'tpp': 10, 'tsteps': 500, 'tu': 20, 'vl': 20, 'vt': 1Figure 11A - Constant Phage Titer'capacity': 1000000000.0, 'ceu0': 100000000.0, 'cp0': 1000000.0, 'epochs': 5, 'f0': 0.1, 'f_prec': 11, 'fend': 1.0, 'fitnessmode': 'dist', 'ftype': 'const', 'growth_mode': 'logistic', 'k': 3e-11, 'max_cp': 2000000000.0, 'min_cp': 100000.0, 'mumax': 16.667, 'mutation_dist': 'norm', 'noisy': 0.0, 'phageonly': 'False', 'plot_dist': 'True', 'sigma': 0.025, 'skewness': -1.0, 'ti': 30, 'tl': 45, 'to_mutate': 0.025, 'tp': 40, 'tpp': 10, 'tsteps': 200, 'tu': 20, 'vl': 20, 'vt': 0.5Figure 11B - Washout'capacity': 1000000000.0, 'ceu0': 100000000.0, 'cp0': 1000000.0, 'epochs': 5, 'f0': 0.1, 'f_prec': 11, 'fend': 1.0, 'fitnessmode': 'dist', 'ftype': 'const', 'growth_mode': 'logistic', 'k': 3e-11, 'max_cp': 2000000000.0, 'min_cp': 100000.0, 'mumax': 16.667, 'mutation_dist': 'norm', 'noisy': 0.0, 'phageonly': 'False', 'plot_dist': 'True', 'sigma': 0.025, 'skewness': -1.0, 'ti': 30, 'tl': 20, 'to_mutate': 0.025, 'tp': 40, 'tpp': 10, 'tsteps': 200, 'tu': 20, 'vl': 20, 'vt': 0.5Figure 11C - Increase in Phage Titer'capacity': 1000000000.0, 'ceu0': 100000000.0, 'cp0': 1000000.0, 'epochs': 5, 'f0': 0.1, 'f_prec': 11, 'fend': 1.0, 'fitnessmode': 'dist', 'ftype': 'const', 'growth_mode': 'logistic', 'k': 3e-11, 'max_cp': 2000000000.0, 'min_cp': 100000.0, 'mumax': 16.667, 'mutation_dist': 'norm', 'noisy': 0.0, 'phageonly': 'False', 'plot_dist': 'True', 'sigma': 0.025, 'skewness': -1.0, 'ti': 30, 'tl': 45, 'to_mutate': 0.025, 'tp': 40, 'tpp': 10, 'tsteps': 200, 'tu': 20, 'vl': 20, 'vt': 3Figure 11D - Decrease in Phage Titer'capacity': 1000000000.0, 'ceu0': 100000000.0, 'cp0': 100000000.0, 'epochs': 1, 'f0': 0.2, 'f_prec': 21, 'fend': 1.0, 'fitnessmode': 'dist', 'ftype': 'const', 'growth_mode': 'logistic', 'k': 3e-11, 'max_cp': 2000000000.0, 'min_cp': 100000.0, 'mumax': 16.667, 'mutation_dist': 'skew', 'noisy': 0.05, # or 0.1, 0.2, 0.5 'phageonly': 'True', 'plot_dist': 'True', 'sigma': 0.1, 'skewness': -1.0, 'ti': 30, 'tl': 60, 'to_mutate': 0.1, 'tp': 40, 'tpp': 10, 'tsteps': 500, 'tu': 20, 'vl': 20, 'vt': 1Figure 12 - Effects of transferring phage only'capacity': 1000000000.0, 'ceu0': 100000000.0, 'cp0': 1000000.0, 'epochs': 5, 'f0': 0.1, 'f_prec': 11, 'fend': 1.0, 'fitnessmode': 'dist', 'ftype': 'const', 'growth_mode': 'logistic', 'k': 3e-11, 'max_cp': 2000000000.0, 'min_cp': 100000.0, 'mumax': 16.667, 'mutation_dist': 'norm', 'noisy': 0.0, 'phageonly': 'False', 'plot_dist': 'True', 'sigma': 0.025, 'skewness': -1.0, 'ti': 30, 'tl': 30, 'to_mutate': 0.025, 'tp': 40, 'tpp': 10, 'tsteps': 200, 'tu': 20, 'vl': 20, 'vt': 0.5Figure 13 - Comparison modeled and real Phage TitersFirst Round Model
{ 'vl': 20, 'tl': 180, 'vt': 3, 'ceu0': 1e8, 'cp0': 3e8, 'k': 3e-11, 'tu': 20, 'ti': 30, 'tp': 40, 'mumax': 16.667, 'tpp': 10, 'f0': 0.001, 'fend': 0.6, 'tsteps': 180, 'epochs': 5, 'capacity': 1e9, 'min_cp': 1e7, 'max_cp': 1e9, 'growth_mode': 'logistic', 'phageonly': 'True', 'noisy': 0.0, 'f_prec': 21, 'fitnessmode': 'const', 'to_mutate': 0.0, 'sigma': 0.01, 'mutation_dist': 'norm', 'skewness': -1.0, 'plot_dist': 'False', 'phage_degradation': 0.075 }Second Round Model
{ 'vl': 20, 'tl': 180, 'vt': 3, 'ceu0': 1e8, 'cp0': 6e8, 'k': 3e-11, 'tu': 20, 'ti': 30, 'tp': 40, 'mumax': 16.667, 'tpp': 10, 'f0': 0.001, 'fend': 0.025, 'tsteps': 180, 'epochs': 2, 'capacity': 1e9, 'min_cp': 1e7, 'max_cp': 1e9, 'growth_mode': 'logistic', 'phageonly': 'True', 'noisy': 0.0, 'f_prec': 21, 'fitnessmode': 'lin', 'to_mutate': 0.0, 'sigma': 0.01, 'mutation_dist': 'norm', 'skewness': -1.0, 'plot_dist': 'False', 'phage_degradation': 0.075 }Plottet Comma-Separated File
t; Theophylline 1; Theophylline 2\n 0; 3.0e8; NaN 180; 1.5e5; NaN 360; 2.35e4; NaN 540; NaN; 6.0e8 720; NaN; 2.0e7 900; NaN; 2.55e7