Line 312: | Line 312: | ||
<H2>Intention of the Model</H2><br> | <H2>Intention of the Model</H2><br> | ||
− | + | According to our design of biological flip-flops, a recombinase-based device must be | |
− | + | constructed, characterized and standardized. Recombinases can operate on sequences with | |
+ | recombination sites thus altering the direction of transcription. In this section we propose a | ||
+ | model that characterize the process of gene recombination with ordinary differential equations | ||
+ | (ODEs). Each reaction in the process is described with Hill or mas action kinetics. The aims of | ||
+ | this model are listed as follows:<br> | ||
+ | 1. Estimate parameters in the model from experimental data and | ||
+ | fine-tune the model.<br> | ||
+ | 2. Simulate the sequence recombination process via recombinases. Show the | ||
+ | changing of quantities of original and flipped sequences over time. Prove that the device can | ||
+ | work as expected.<br> | ||
+ | 3. Analyze the effects of varying recombinase production and degradation rates | ||
+ | on the overall performance of the device. 4. Assist in designing of the genetic circuit.<br> | ||
</div> | </div> | ||
Line 322: | Line 333: | ||
<h2>Biological Basis and Assumptions</h2><br> | <h2>Biological Basis and Assumptions</h2><br> | ||
− | The | + | The sequence-flipping device is based on the integrase family of recombinases. On this page we |
− | + | focus only on the Bxb1 serine integrase, but the same dynamics model and analysis applies to two | |
− | + | other types TP901 and PhiC31. Mechanism of serine integrases is well recorded in | |
− | + | literature.[Footnote1] | |
− | + | ||
− | + | ||
− | + | ||
− | + | ||
− | + | ||
− | + | ||
− | + | ||
<br><br> | <br><br> | ||
− | + | <h3 id="set-process">SET Process</h3> | |
− | + | After transcription and translation of the Bxb1 recombinase gene, the proteins are expressed. | |
− | + | Two Bxb1 monomers form a dimer. In dimerized form, the recombinase proteins can bind to attL | |
− | + | and attR binding sites around the constitutive promoter J23119 on the reporter sequence. | |
− | + | When both sites are occupied, synapse between the two sites can happen. This can enable | |
− | + | flipping of the sequence and alter the direction of the promoter. The flipping process | |
− | + | changes the direction of the reporter promoter and transforms the attB and attP sites in | |
− | + | attL and attR. In its initial direction, the promoter starts GFP transcription; after | |
+ | flipping, it turns to RFP. By measuring cell fluorescence with flow cytometry, we can | ||
+ | quantify the ratio of flipped sequences (attL/R ratio). The data-fitting procedure relies on | ||
+ | this key indicator. | ||
<br><br> | <br><br> | ||
− | + | <h3 id="reset-process-with-integrase-rdf-fusion-protein">RESET Process with Integrase-RDF Fusion | |
− | + | Protein</h3> | |
− | + | Another kind of recombinases - excisionases are able to recognize and bind attL and attR | |
− | + | sequences. With the help of excisionases, the state transitions become reversible. | |
− | expression | + | Recombination Directionality Factors (RDF) are a kind of protein that achieves reverse |
− | + | flipping to recover the sequence flipped by recombinases. By co-expression of the integrase | |
+ | with the corresponding RDF, or expression of the integrase-RDF fusion protein, recombination | ||
+ | between the attL and attR sites can be induced. Thus, the flip-flop can restore to the | ||
+ | previous state. | ||
+ | <br> | ||
</div> | </div> | ||
</div> | </div> | ||
Line 355: | Line 366: | ||
<div class="mdl-card__supporting-text"> | <div class="mdl-card__supporting-text"> | ||
− | <h2 id=" | + | <h2 id="Model Compositions">Model Compositions</h2><br> |
− | < | + | <h2 id="model-composition">Model Composition</h2> |
− | The following | + | The following part describes species and reactions involved in different components of the |
− | + | model. | |
− | + | ||
− | + | ||
− | + | ||
− | + | ||
− | + | ||
<br><br> | <br><br> | ||
+ | <h3 id="pBAD promoter">Arabinose-Induced Protein Expression</h3> | ||
+ | We have decided to use an arabinose-induced promoter (pBAD) for the expression of | ||
+ | recombinases on ColE1 vector. This promoter can be modelled as the following chemical | ||
+ | system. <br> | ||
+ | <span class="math inline">\((AraB:AraC) \xrightarrow[]{K_{transcription}} mRNA \xrightarrow[]{K_{translation}} Recombinase\)</span> | ||
+ | <br> | ||
+ | <span class="math inline">\(mRNA \xrightarrow[]{\gamma_{mRNA}} \emptyset\)</span> | ||
+ | <br> | ||
+ | <span | ||
+ | class="math inline">\(Recombinase \xrightarrow[]{\gamma_{prot}} \emptyset\)</span> | ||
+ | <br> | ||
+ | The promoter pBAD binds to AraC and this represses transcription of mRNA. Arabinose will | ||
+ | bind to AraC and form the Arab:AraC compound, allowing transcription to occur. | ||
+ | <br><br> | ||
+ | <h4 id="assumptions">Assumptions</h4> | ||
+ | We assume that:<br> | ||
+ | 1. AraC is always in large concentration<br> | ||
+ | 2. AraC binding to arabinose occurs on a faster time scale to transcription | ||
+ | <br> | ||
+ | Therefore it is not necessary to consider individual concentrations of arabinose and AraC. We | ||
+ | need only include the concentration of (Arab:AraC) complex in the model. The transcription | ||
+ | rate constant K on the schematic diagram above is not a simple constant but described as a | ||
+ | Michaelis-Mentin kinetics[Footnote4] equation below. | ||
+ | <br> | ||
+ | <span class="math inline">\(\frac{\mathrm{d}[mRNA]}{\mathrm{d}t}=K_{max}\frac{[Arab:AraC]^n}{[Arab:AraC]^n+K_{half}^n}-\gamma_{mRNA}[mRNA]\)</span> | ||
+ | <br> | ||
+ | Although pBAD system is known for rigidity[Footnote2], we observed leaky expression in the | ||
+ | experiments. So we add a basal translation level to describe leakiness. The basal expression | ||
+ | level should be several orders of magnitude lower than the maximal level. | ||
+ | <br> | ||
+ | <span class="math inline">\(\frac{\mathrm{d}[mRNA]}{\mathrm{d}t}=K_{basal}+K_{max}\frac{[Arab:AraC]^n}{[Arab:AraC]^n+K_{half}^n}-\gamma_{mRNA}[mRNA]\)</span> | ||
+ | <br> | ||
+ | The mRNA degradation and dilution rates are combined into a single parameter. Protein | ||
+ | (recombinase) synthesis and degradation can be modeled with the equation below. | ||
+ | <br> | ||
+ | <span class="math inline">\(\frac{\mathrm{d}[P]}{\mathrm{d}t}=\alpha_{Int}[mRNA]-\gamma_{Int}[P]\)</span> | ||
+ | <br> | ||
+ | Again, we combine dilution and degradation rate of recombinase into a single parameter. | ||
+ | <br> | ||
+ | [FIGURE: recombinase_expression.png] | ||
+ | <br> | ||
+ | <h3 id="recombinase-functioning">Recombinase Functioning</h3> | ||
+ | <h4 id="assumptions">Assumptions</h4> | ||
+ | Here we make the following assumptions about the reporter vector: | ||
+ | <br>1. The total concentration of | ||
+ | the reporter vector in cell remains constant during the whole reaction. This is based on the | ||
+ | fact that reporter fluorescent proteins are on the stringent pSC101 vector, whose copy number is | ||
+ | strictly controlled to around 5~10 nM[3]. We assume this concentration to be 5nM. | ||
+ | <br>2.The reporter sequences are divided into 4 states: B0F, B1F, B2F and R. Here F and R denote | ||
+ | the | ||
+ | direction of the constitutive promoter. F stands for “Forward”, corresponding to the initial | ||
+ | direction (GFP direction) before flipping. R stands for “Reverse”, corresponding to the | ||
+ | direction after the flipping (RFP direction). B0, B1 and B2 denote the number of recombination | ||
+ | sites bound by the recombinase dimer.<br> | ||
+ | 3. After 12h of reaction, the RFP expressed by the | ||
+ | promoter in reverse direction have all matured. The efficiency of the recombinase flipping | ||
+ | mechanism is defined as the cell population fraction expressing RFP in FACS data. See Experiment | ||
+ | Part for detailed calculation.<br> (链接到实验部分Flip-flop) | ||
+ | <br> | ||
+ | <br> | ||
+ | The following diagram summarizes the mechanism and key reactions in the recombinase flipping | ||
+ | mechanism. | ||
+ | <br> | ||
+ | [FIGURE:recombinase_flipping.png] | ||
+ | <br><br> | ||
+ | The table below are reactions and parameters involved in this model. | ||
+ | <br> | ||
+ | [TABLE:recombinase parameters] | ||
+ | <br> | ||
+ | <br> | ||
+ | These reactions, together with kinetic laws and reaction rate parameters, can be transformed | ||
+ | into an ordinary differential equation (ODE) system. | ||
+ | <br> | ||
+ | <h3 id="recombinase-rdf-fusion-protein-functioning">Recombinase-RDF Fusion Protein | ||
+ | Functioning</h3> | ||
+ | Similar mechanisms can describe recombinase-RDF fusion protein functioning. The recombinases | ||
+ | themselves cannot flip back the sequence flanked between attL and attR sites. They have to work | ||
+ | together with recombination directionality factors (RDFs) to accomplish this task.[5] | ||
+ | Several proposals on genetically engineered RDF expression were suggested by Bonnet and | ||
+ | colleagues. [6] In our experiments, we implemented the fusion protein construction, | ||
+ | which was also reported to be functional in other work. [7] | ||
+ | <br> | ||
+ | <h4 id="assumptions">Assumptions</h4> | ||
+ | We make the following assumptions about recombinase-RDF fusion proteins:<br> | ||
+ | 1. the recombinase-RDF fusion proteins have the similar transcription and translation behavior as the recombinases<br> | ||
+ | 2. the binding and unbinding of recombinase-RDF fusion proteins to attL and attR sites are reversible reactions<br> | ||
+ | 3. flipping of the target sequence is also reversible<br> | ||
+ | <br> | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
</div> | </div> | ||
</div> | </div> | ||
Line 401: | Line 469: | ||
<h2 id="simulation-results">Simulation Results</h2> <br> | <h2 id="simulation-results">Simulation Results</h2> <br> | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
<br><br> | <br><br> | ||
Line 437: | Line 485: | ||
<h2 id="simulation-results">Simulation Results</h2> <br> | <h2 id="simulation-results">Simulation Results</h2> <br> | ||
− | We conducted our simulation using Tellurium[ | + | We conducted our simulation using Tellurium[3], a Python-based simulation platform. |
We first adjusted the parameters protein half life and mRNA half life (equivalently | We first adjusted the parameters protein half life and mRNA half life (equivalently | ||
transcription and translation of repressor proteins) to adjust oscillation period as the one | transcription and translation of repressor proteins) to adjust oscillation period as the one | ||
Line 509: | Line 557: | ||
through continuous imaging of entire cell cultures. This is a theory-experiment interaction | through continuous imaging of entire cell cultures. This is a theory-experiment interaction | ||
procedure. The reason for stochastic modeling is self-evident. Cellular biological processes, | procedure. The reason for stochastic modeling is self-evident. Cellular biological processes, | ||
− | especially gene expression, are innately noisy and stochastic.[ | + | especially gene expression, are innately noisy and stochastic.[1] Previous work has |
demonstrated the effectiveness of the integration of population-level dynamics and stochastic | demonstrated the effectiveness of the integration of population-level dynamics and stochastic | ||
− | cellular responses to inputs.[ | + | cellular responses to inputs.[2]It is important to explain how stochastic single-cell |
responses affect overall population-level distributions and behavior.<br><br> | responses affect overall population-level distributions and behavior.<br><br> | ||
This section describes a Markov model of recombinase-based DNA flipping and a Gillespie | This section describes a Markov model of recombinase-based DNA flipping and a Gillespie | ||
− | stochastic simulation method[ | + | stochastic simulation method[3] to observe theoretical sequence state evolution. We |
will focus on our demonstration design, which can be generalized to other designs.<br><br> | will focus on our demonstration design, which can be generalized to other designs.<br><br> | ||
Line 589: | Line 637: | ||
plasmid. Thus we assume that each cell only has one copy of the control unit. | plasmid. Thus we assume that each cell only has one copy of the control unit. | ||
Uni-directional terminators are screened by the criterion that the forward terminator | Uni-directional terminators are screened by the criterion that the forward terminator | ||
− | strength[ | + | strength[6] exceeds the reverse one by a few orders of magnitude. This allows us to |
assume that when the terminator is in the forward direction, it can completely block RNA | assume that when the terminator is in the forward direction, it can completely block RNA | ||
polymerase, while in the reverse direction, it cannot function.(此处有链接到terminator实验页面) | polymerase, while in the reverse direction, it cannot function.(此处有链接到terminator实验页面) | ||
Line 597: | Line 645: | ||
Serine integrases are produced as monomers that form dimers, search for specific attB and attP | Serine integrases are produced as monomers that form dimers, search for specific attB and attP | ||
sequences, and, once both attB and attP sites are occupied, form a tetramer (dimer of dimers) | sequences, and, once both attB and attP sites are occupied, form a tetramer (dimer of dimers) | ||
− | that digests, flips and re-ligates the DNA[ | + | that digests, flips and re-ligates the DNA[7]. |
In this model we also consider leaky expression of the recombinases, because of observation of | In this model we also consider leaky expression of the recombinases, because of observation of | ||
such phenomena in recombinase characterization experiments. Furthermore, induction of | such phenomena in recombinase characterization experiments. Furthermore, induction of | ||
Line 827: | Line 875: | ||
The | The | ||
repressilator is a synthetic genetic regulatory network consisting of a ring-oscillator with | repressilator is a synthetic genetic regulatory network consisting of a ring-oscillator with | ||
− | three genes, each expressing a protein that represses the next gene in the loop.[ | + | three genes, each expressing a protein that represses the next gene in the loop.[1] |
This network was designed from scratch to exhibit a stable oscillation which is reported via | This network was designed from scratch to exhibit a stable oscillation which is reported via | ||
the expression of green fluorescent protein, and hence acts like an electrical oscillator | the expression of green fluorescent protein, and hence acts like an electrical oscillator | ||
Line 904: | Line 952: | ||
<h2 id="simulation-results">Simulation Results</h2> <br> | <h2 id="simulation-results">Simulation Results</h2> <br> | ||
− | We conducted our simulation using Tellurium[ | + | We conducted our simulation using Tellurium[3], a Python-based simulation platform. |
We first adjusted the parameters protein half life and mRNA half life (equivalently | We first adjusted the parameters protein half life and mRNA half life (equivalently | ||
transcription and translation of repressor proteins) to adjust oscillation period as the one | transcription and translation of repressor proteins) to adjust oscillation period as the one |
Revision as of 13:07, 1 November 2017
Overview
Background
In our design, after having designed the flip flop, the device can remember the information of its state, the next step is to transform the state into an actual function. To achieve this transformation, we first needed a “reader” to read out the current state. At this point the control unit comes into play. A control unit is a DNA sequence with recombinase sites whose expression is controlled by recom-binase and RDF by reversing or deleting a promoter and/or a terminator. To make the control unit reliable and predictable, we first need to be able to predict the behaviors of its “building blocks” (or “elements” in electrical engineering), from which we weave our engineer’s perspective into the biolog-ical system. However, we need to make some adaptations and adjustments to these “elements” to make them usable.Flip-flop
Intention of the Model
According to our design of biological flip-flops, a recombinase-based device must be constructed, characterized and standardized. Recombinases can operate on sequences with recombination sites thus altering the direction of transcription. In this section we propose a model that characterize the process of gene recombination with ordinary differential equations (ODEs). Each reaction in the process is described with Hill or mas action kinetics. The aims of this model are listed as follows:
1. Estimate parameters in the model from experimental data and fine-tune the model.
2. Simulate the sequence recombination process via recombinases. Show the changing of quantities of original and flipped sequences over time. Prove that the device can work as expected.
3. Analyze the effects of varying recombinase production and degradation rates on the overall performance of the device. 4. Assist in designing of the genetic circuit.
Biological Basis and Assumptions
The sequence-flipping device is based on the integrase family of recombinases. On this page we focus only on the Bxb1 serine integrase, but the same dynamics model and analysis applies to two other types TP901 and PhiC31. Mechanism of serine integrases is well recorded in literature.[Footnote1]
SET Process
After transcription and translation of the Bxb1 recombinase gene, the proteins are expressed. Two Bxb1 monomers form a dimer. In dimerized form, the recombinase proteins can bind to attL and attR binding sites around the constitutive promoter J23119 on the reporter sequence. When both sites are occupied, synapse between the two sites can happen. This can enable flipping of the sequence and alter the direction of the promoter. The flipping process changes the direction of the reporter promoter and transforms the attB and attP sites in attL and attR. In its initial direction, the promoter starts GFP transcription; after flipping, it turns to RFP. By measuring cell fluorescence with flow cytometry, we can quantify the ratio of flipped sequences (attL/R ratio). The data-fitting procedure relies on this key indicator.RESET Process with Integrase-RDF Fusion Protein
Another kind of recombinases - excisionases are able to recognize and bind attL and attR sequences. With the help of excisionases, the state transitions become reversible. Recombination Directionality Factors (RDF) are a kind of protein that achieves reverse flipping to recover the sequence flipped by recombinases. By co-expression of the integrase with the corresponding RDF, or expression of the integrase-RDF fusion protein, recombination between the attL and attR sites can be induced. Thus, the flip-flop can restore to the previous state.Model Compositions
Model Composition
The following part describes species and reactions involved in different components of the model.Arabinose-Induced Protein Expression
We have decided to use an arabinose-induced promoter (pBAD) for the expression of recombinases on ColE1 vector. This promoter can be modelled as the following chemical system.\((AraB:AraC) \xrightarrow[]{K_{transcription}} mRNA \xrightarrow[]{K_{translation}} Recombinase\)
\(mRNA \xrightarrow[]{\gamma_{mRNA}} \emptyset\)
\(Recombinase \xrightarrow[]{\gamma_{prot}} \emptyset\)
The promoter pBAD binds to AraC and this represses transcription of mRNA. Arabinose will bind to AraC and form the Arab:AraC compound, allowing transcription to occur.
Assumptions
We assume that:1. AraC is always in large concentration
2. AraC binding to arabinose occurs on a faster time scale to transcription
Therefore it is not necessary to consider individual concentrations of arabinose and AraC. We need only include the concentration of (Arab:AraC) complex in the model. The transcription rate constant K on the schematic diagram above is not a simple constant but described as a Michaelis-Mentin kinetics[Footnote4] equation below.
\(\frac{\mathrm{d}[mRNA]}{\mathrm{d}t}=K_{max}\frac{[Arab:AraC]^n}{[Arab:AraC]^n+K_{half}^n}-\gamma_{mRNA}[mRNA]\)
Although pBAD system is known for rigidity[Footnote2], we observed leaky expression in the experiments. So we add a basal translation level to describe leakiness. The basal expression level should be several orders of magnitude lower than the maximal level.
\(\frac{\mathrm{d}[mRNA]}{\mathrm{d}t}=K_{basal}+K_{max}\frac{[Arab:AraC]^n}{[Arab:AraC]^n+K_{half}^n}-\gamma_{mRNA}[mRNA]\)
The mRNA degradation and dilution rates are combined into a single parameter. Protein (recombinase) synthesis and degradation can be modeled with the equation below.
\(\frac{\mathrm{d}[P]}{\mathrm{d}t}=\alpha_{Int}[mRNA]-\gamma_{Int}[P]\)
Again, we combine dilution and degradation rate of recombinase into a single parameter.
[FIGURE: recombinase_expression.png]
Recombinase Functioning
Assumptions
Here we make the following assumptions about the reporter vector:1. The total concentration of the reporter vector in cell remains constant during the whole reaction. This is based on the fact that reporter fluorescent proteins are on the stringent pSC101 vector, whose copy number is strictly controlled to around 5~10 nM[3]. We assume this concentration to be 5nM.
2.The reporter sequences are divided into 4 states: B0F, B1F, B2F and R. Here F and R denote the direction of the constitutive promoter. F stands for “Forward”, corresponding to the initial direction (GFP direction) before flipping. R stands for “Reverse”, corresponding to the direction after the flipping (RFP direction). B0, B1 and B2 denote the number of recombination sites bound by the recombinase dimer.
3. After 12h of reaction, the RFP expressed by the promoter in reverse direction have all matured. The efficiency of the recombinase flipping mechanism is defined as the cell population fraction expressing RFP in FACS data. See Experiment Part for detailed calculation.
(链接到实验部分Flip-flop)
The following diagram summarizes the mechanism and key reactions in the recombinase flipping mechanism.
[FIGURE:recombinase_flipping.png]
The table below are reactions and parameters involved in this model.
[TABLE:recombinase parameters]
These reactions, together with kinetic laws and reaction rate parameters, can be transformed into an ordinary differential equation (ODE) system.
Recombinase-RDF Fusion Protein Functioning
Similar mechanisms can describe recombinase-RDF fusion protein functioning. The recombinases themselves cannot flip back the sequence flanked between attL and attR sites. They have to work together with recombination directionality factors (RDFs) to accomplish this task.[5] Several proposals on genetically engineered RDF expression were suggested by Bonnet and colleagues. [6] In our experiments, we implemented the fusion protein construction, which was also reported to be functional in other work. [7]Assumptions
We make the following assumptions about recombinase-RDF fusion proteins:1. the recombinase-RDF fusion proteins have the similar transcription and translation behavior as the recombinases
2. the binding and unbinding of recombinase-RDF fusion proteins to attL and attR sites are reversible reactions
3. flipping of the target sequence is also reversible
Simulation Results
In this simulation experiment, the flip-flop lost approximately 10% of its initial state after two trigger signal groups.
Simulation Results
We conducted our simulation using Tellurium[3], a Python-based simulation platform. We first adjusted the parameters protein half life and mRNA half life (equivalently transcription and translation of repressor proteins) to adjust oscillation period as the one observed in the experiment. (此处链接到clock实验)
Fig.1. Repressilator simulation results of tetR and cI protein
Then we incorporate the bio-flip-flop into the model, and we got the following simulation results:
Fig.2. Repressilator-driven frequency divider simulation with two trigger signal groups
X_F and Y_F have the same meaning as those described in Recombinase section. (此处有超链接到第一个model子页面)
In this simulation experiment, the flip-flop lost approximately 10% of its initial state after two trigger signal groups.
Controller
Intention of the Model
In this model, we aim to describe the sequence modification process of the control unit sequence with recombination sites. Different recombinases are expressed in series to alter the direction of terminators between corresponding recombination sites on the control unit circuit. The combination of unidirectional terminators in different directions (reverse or foward) determines the state of RNA polymerase flux, thus controlling expression of the downstream gene.
Though we have done simulation of recombinase dynamics in the previous section, it is necessary to consider two key concepts of biological processes: 1. population behavior 2. stochastic process
The reason for modeling cell population behavior is that deployment of synthetic bacterial devices would certainly involve populations of cells. Furthermore, we demonstrate our design through continuous imaging of entire cell cultures. This is a theory-experiment interaction procedure. The reason for stochastic modeling is self-evident. Cellular biological processes, especially gene expression, are innately noisy and stochastic.[1] Previous work has demonstrated the effectiveness of the integration of population-level dynamics and stochastic cellular responses to inputs.[2]It is important to explain how stochastic single-cell responses affect overall population-level distributions and behavior.
This section describes a Markov model of recombinase-based DNA flipping and a Gillespie stochastic simulation method[3] to observe theoretical sequence state evolution. We will focus on our demonstration design, which can be generalized to other designs.
Biological Basis
The control units are implemented experimentally by transforming a plasmid into an E. coli. This plasmid has similar structure as the reporter plasmid in the previous section. However, in this situation the sequence fragments being inverted are terminators. Using experimentally screened uni-directional terminators, the transcription or polymerase flux4 can be precisely controlled. The figure below illustrates the result of the design software. (超链接到software)
In the seven-segment decoder tube demonstration, if we would like to display “1-2-3” counting, 3 kinds of control units have to be transformed into 3 strains of bacteria. 2 of them has only one pair of recombination sites, and only undergoes recombinase inversion once. The other one has two nested recombination sites. Successive inversion results in the same state as the initial one.
Model Composition and Assumptions
A Markov process is defined upon a specific state space. We define that each state occupies a specific state. Then we can use Markov models to track cell state changes over time. This “state” consists of two parts: the state of recombination sites on the control unit sequence and the concentration or molecule number of two types of recombinases A and B. This definition was first proposed by Victoria Hsiao and colleagues to model a population-based event detector.5 The state of a single cell is defined by the DNA state and the copy numbers of integrases A and B. We denote the state of a single cell by (DNA, IntA, IntB).
Cell States
All of the four possible DNA states are: the initial state (\(S_{init}\)), the IntA inversion only state (\(S_a\)), the IntB inversion only state (\(S_b\)), and the IntA then IntB inversion state (\(S_{ab}\)). We omit the state IntB then IntA inversion state (\(S_{ba}\)) because in our design and experiments, integrase expression is always induced in order. IntB expression can never be triggered before that of IntA. These states and transitions by recombinase inversions are illustrated below.Our implementation requires that the control unit sequence on a stringent, low copy number plasmid. Thus we assume that each cell only has one copy of the control unit. Uni-directional terminators are screened by the criterion that the forward terminator strength[6] exceeds the reverse one by a few orders of magnitude. This allows us to assume that when the terminator is in the forward direction, it can completely block RNA polymerase, while in the reverse direction, it cannot function.(此处有链接到terminator实验页面)
Integrase Production and Tetramerization
Serine integrases are produced as monomers that form dimers, search for specific attB and attP sequences, and, once both attB and attP sites are occupied, form a tetramer (dimer of dimers) that digests, flips and re-ligates the DNA[7]. In this model we also consider leaky expression of the recombinases, because of observation of such phenomena in recombinase characterization experiments. Furthermore, induction of recombinase expression happens in order and we assume that at each time point, only one of the two inducers (a and b) exists. Thus we define the following table of state transitions and parameters:\(x \in \{init, a, b, ab\})\), \(\gamma_A(t)\) and \(\gamma_B(t)\) are defined as follows: \[ \gamma_A(t) = \begin{cases} k_{prodA} + k_{leakA}, & \text{if inducer a exists} \\ k_{leakB}, & \text{otherwise} \end{cases} \] \[ \gamma_B(t) = \begin{cases} k_{prodB} + k_{leakB}, & \text{if inducer a exists} \\ k_{leakB}, & \text{otherwise} \end{cases} \]
The serine integrases need to tetramerize prior to DNA recombination. Each DNA binding site (attB and attP) is bound by two copies of integrase monomers in an independent manner to form a dimer. Once both of the binding sites on both DNA attachment sites are occupied, a dimer of the dimers (tetramer) is formed and recombination can occur. Here we deduce the reaction propensity of the tetramerization process. We only focus on the number of integrase molecules inside the cell, and ignore the detailed process of transcription and translation, and represent the process with a single parameter.
\(S_x:Int_i (i = 1,2,3,4) (x \in \{init, a, b, ab\})\) denote the DNA-integrase complexes where \(i\) denote the number of integrase molecules bound to DNA sequence. DNA recombination occurs when the complex state is at \(S_x:Int_4\), which means the tetramerization of integrases on the DNA. Because of independent binding of DNA sequence with integrase dimers, the transition of states can be modeled by the following equations: \[ S_x \rightleftarrows S_x:Int_1 \rightleftarrows S_x:Int_2 \rightleftarrows S_x:Int_3 \rightleftarrows S_x:Int_4 \]
where the binding rate of DNA recombination sites (attB, attP) and integrase dimers is \(k_{bind}\) (reaction from left to right side) and unbinding rate is \(k_{unbind}\). The dynamics can be modeled by the following ordinary differential equations. \(\mathbb{P}_t(S_x:Int_i, n_*) (* = A, B)\) is the joint probability distribution.
\[
\frac{d}{dt}\mathbb{P}_t(S_x, n_*) = -k_{bind}n_*\mathbb{P}_t(S_x, n_*) + k_{unbind}\mathbb{P}_t(S_x:Int_1, n_*)
\] \[
\frac{d}{dt}\mathbb{P}_t(S_x:Int_1, n_*-1) = -(k_{bind}(n_*-1)+k_{unbind})\mathbb{P}_t(S_x:Int_1, n_*-1) + k_{bind}n_*\mathbb{P}_t(S_x, n_*) + k_{unbind}\mathbb{P}_t(S_x:Int_2,n_*-2)
\] \[
\frac{d}{dt}\mathbb{P}_t(S_x:Int_2, n_*-2) = -(k_{bind}(n_*-2)+k_{unbind})\mathbb{P}_t(S_x:Int_2, n_*-2) + k_{bind}(n_*-1)\mathbb{P}_t(S_x:Int_1, n_*-1) + k_{unbind}\mathbb{P}_t(S_x:Int_3,n_*-3)
\] \[
\frac{d}{dt}\mathbb{P}_t(S_x:Int_1, n_*-3) = -(k_{bind}(n_*-3)+k_{unbind})\mathbb{P}_t(S_x:Int_3, n_*-3) + k_{bind}(n_*-2)\mathbb{P}_t(S_x:Int_2, n_*-2) + k_{unbind}\mathbb{P}_t(S_x:Int_4,n_*-4)
\] \[
\frac{d}{dt}\mathbb{P}_t(S_x:Int_4, n_*-4) = -k_{unbind}n_*\mathbb{P}_t(S_x:Int_4, n_*-4) + k_{bind}(n_*-3)\mathbb{P}_t(S_x:Int_3, n_*-3)
\]
We assume that binding and unbinding of integrase molecules to DNA equilibrate fast enough compared to the dynamics of DNA recombination, and the production and degradation of integrases (See Parameter Supplemental) and hence the above equations converge to an equilibrium while \(n_*\) remain constant (equilibrium approximation). Let the above equations all equal to zero and solve in terms of \(\frac{d}{dt}\mathbb{P}_t(S_x:Int_4, n_*-4)\), we can write:
\[ \frac{d}{dt}\mathbb{P}_t(S_x:Int_4, n_*-4) = \frac{n_* (n_*-1)(n_*-2) (n_*-3)}{K_{d*}^4 + K_{d*}^3 n_* + K_{d*}^2 n_* (n_*-1) + K_{d*}n_*(n_*-1)(n_*-2) + n_* (n_*-1)(n_*-2)(n_*-3)} \] (\(K_{d*} = k_{unbind}/k_{bind}\) )
Continuous-time Markov Process
Rather than accounting fro all individual DNA-integrase interactions, we have created a minimal model of stochastic transitions. Here only the final DNA states \((S_o, S_a, S_b, S_{ab})\) and the number of integrase monomer molecules (IntA, IntB) are tracked and all integrase activity is encompassed in the \(k_{flip*} (*=A,B)\) term. Because no cooperativity has been observed in Bxb1 or TP901-1 DNA binding, e.g. occupation of attB does not increase the probability of attP binding, we assume that the flipping activity is zero unless at least four molecules are present. Then, the propensity function for state transitions is a function of integrase concentration (equivalent to protein molecule number in cell, if cell volume is assumed to be constant):\[ \alpha_i(Int_*) = k_{flip*}(\frac{n_* (n_*-1)(n_*-2) (n_*-3)}{K_{d*}^4 + K_{d*}^3 n_* + K_{d*}^2 n_* (n_*-1) + K_{d*}n_*(n_*-1)(n_*-2) + n_* (n_*-1)(n_*-2)(n_*-3)}) \] \(i = 1, 2, 3; * = A, B\)
Then the table of state transition and intensities can be completed. We consider three kinds of state transitions as described previously: \(S_{init}\) to \(S_a\), \(S_{init}\) to \(S_b\), and \(S_a\) to \(S_{ab}\). \(S_{init}\) to \(S_b\) is due to missing of integrase A flipping. This transition denotes the system’s ineffective fraction.
Numbers of cells occupying each state are calculated by the summing equation below: \[ S_{i} = \sum_{n_A=0}^{\infty}\sum_{n_B=0}^{\infty} S(i,n_A,n_B), i \in \{S_{init},S_a,S_b\} \] because we only concentrate on the cell numbers, not the specific protein copy numbers in cells.
Simulation Results
Simulation Setting
In the simulation in this section, we consider induction with step inputs, where the inducers are either present or not present. Concentrations of the inducers when they are ‘’on’’ will be held constant. Also, it is important to note that inducer \(a\) is still present during and after time \(\Delta t\) when inducer \(b\) is introduced. We predict qualitative circuit behavior with biologically plausible parameters simplified to a minimal set: integrase production, degradation, tetramerization and DNA flipping. The integrase monomer dissociation constant \(K_{d*} (*=intA, intB)\) was estimated from Singh’s work. For more details of parameter setting for simulation please see our supplement page.Whole Process and Analysis: Guidance for experimentalists
Qualitative prediction of the population behavior of 3000 or 1000 cells was simulated. We first limit the simulator to the situation of only one pair of recombination sites on the circuit. From the results below we can see that after approximately 10h the recombinase intA complete the flipping and the reaction is rather efficient.Fig.1. Simulation result of only one pair of recombination sites on control unit circuit
The results for two pairs of recombination sites are shown below. We did two simulations. The first one (shown in the left figure), with two recombinases induced with no time separation, the system acts in an unsatisfactory way and approximately half of the cells are in the undesired state \(S_b\). In the second one, inducing of intA and intB are separated with 5 hours. Then the system performs rather stably, with approximately 90% cells in desired state. This provides guidance for team members working in the lab.
Fig.2. Simulation result of two pairs of recombination sites
The left figure shows state evolution with no time separation in inducing events A and B. The right figure shows state evolution with time separation of \(\Delta t = 5h\).
Finally we show specific cell transitions in Gillespie simulation. 100 cells are sampled from the population. Each vertical line denotes a cell state transition event (from \(S_{init}\) to \(S_a\), \(S_{init}\) to \(S_b\), from \(S_a\) to \(S_{ab}\)). The last two subplots show the simulation results of protein expression in single cell.
Fig.3. Specification of state transitions in Gillespie simulation using 100 sampled cells.
Clock
Intention of the Model
We attempts to develop a framework of biological sequential circuits that are programmable. The envisioned circuit is capable of both storing states in DNA and automatically running a series of instructions in a specific order. According to our design, our bio-flip-flop has two promoters, each triggered by an inducing signal. Therefore, if we would like to build an automatic triggering system for the bio-flip-flop, we need at least two signals in our clock. It is then self-evident to incorporate the classic structure in synthetic biology: the repressilator.
This model serves as a proof of concept for such implementation. We first constructed an ordinary differential equation (ODE) system to describe the repressilator, then we used two repressor proteins expressed by the repressilator to regulate the two promoters. Finally we adjusted parameters and gave a simulation result as a proof of the possibility of repressilator-driven bio-flip-flop.
Biological Basis and Assumptions
The repressilator is a synthetic genetic regulatory network consisting of a ring-oscillator with three genes, each expressing a protein that represses the next gene in the loop.[1] This network was designed from scratch to exhibit a stable oscillation which is reported via the expression of green fluorescent protein, and hence acts like an electrical oscillator system with fixed time periods. The repressilator consists of three genes connected in a feedback loop, such that each gene represses the next gene in the loop, and is repressed by the previous gene.
Fig.1. The genetic structure of the repressilator network
We decided to implement our repressilator-triggered bio-flip-flop with two repressor proteins regulating the two promoters’ transcription. The basis of ODE system construction is illustrated as follows:
Fig.2. Schematic drawing of the repressilator-driven bio-flip-flop
The upper promoter (X) is \(P_{LtetO}\) and is repressed by tetR protein. The lower promoter (Y) is \(P_{R}\) and is repressed by cI protein.Simulation of bio-flip-flop starts at a later time than repressilator because the expression levels of the repressor proteins at the beginning is not high enough.
Equations and Parameters
Repressilator
The following table lists the reactions in the repressilator network. X, Y, Z represent LacI, TetR and cI mRNAs, and PX, PY, PZ represent the corresponding proteins.Calculation of degradation rates are described as follows: \[ k_{dmRNA} = \frac{ln(2)}{\tau_{mRNA}} \] \[ k_{dprot} = \frac{ln(2)}{\tau_{protein}} \] \(\tau_{mRNA}\) and \(\tau_{protein}\) are half lives of mRNA and protein. Calculation of maximum transcription rate: \[ a_{tr} = 60(ps_a - ps_0) \] \(ps_a\) is “tps active” and \(ps_0\) is “tps_repressive”. Calculation of translation rate: \[ k_{tl} = \frac{eff}{t_{ave}} \] \(eff\) is translation efficiency and \(t_{ave}\) is average mRNA life time. \(t_{ave} = \frac{\tau_{mRNA}}{ln(2)}\)
Transcription in bio-flip-flop
The transcription kinetics in the bio-flip-flop were modified to be repressed by repressors. These can be described by the following equations:Simulation Results
We conducted our simulation using Tellurium[3], a Python-based simulation platform. We first adjusted the parameters protein half life and mRNA half life (equivalently transcription and translation of repressor proteins) to adjust oscillation period as the one observed in the experiment. (此处链接到clock实验)
Fig.1. Repressilator simulation results of tetR and cI protein
Then we incorporate the bio-flip-flop into the model, and we got the following simulation results:
Fig.2. Repressilator-driven frequency divider simulation with two trigger signal groups
X_F and Y_F have the same meaning as those described in Recombinase section. (此处有超链接到第一个model子页面)
In this simulation experiment, the flip-flop lost approximately 10% of its initial state after two trigger signal groups.