Due to tautomerisation of isoG and hydrolysis of isoCm there is a need for a system, to preserve the unnatural base pair (UBP) on the plasmid. In 2017, Zhang et al. successfully deployed a CRISPR (clustered regularly interspaced short palindromic repeat)-Cas9 system for retention of a UBP. We adapted this conservation system to our UBP and thus used the bacterial immune response to eliminate all plasmid DNA that had lost the UBP.
The nuclease Cas9 is part of the adaptive immune system of Streptococcus pyogenes, where it induces double strand breaks in the genomic DNA. This enzyme is recruited by a CRISPR RNA (crRNA). A crRNA consists of direct repeats interspaced by variable sequences called protospacer. Those protospacers are derived from foreign DNA and encode the Cas9 guiding sequence (guide RNA). An auxiliary transactivating crRNA (tracrRNA) helps processing the precursor crRNA array into an active crRNA that contains the 20 nucleotide guide RNA. The guide RNA binds to the complementary genomic DNA sequence via Watson‑Crick base pairing. For this binding, the genomic DNA sequence needs to be located upstream of a CRISPR type II specific 5’ NGG protospacer adjacent motif (PAM). To combine crRNA and tracrRNA a chimeric single stranded guide RNA (sgRNA) was designed. Therefore, only the 20 nucleotide guiding sequence needs to be exchanged for targeting any genomic sequence followed by a PAM sequence (Ran et al., 2013 a, b). A double strand break introduced by Cas9 leads to DNA degradation by exonucleases in prokaryotic cells (Simmon and Lederberg, 1972).
The UBP isoG and isoCm is an orthogonal system. UBP inside the target DNA causes a mismatch to the sgRNA generally reducing the cleavage activity of Cas9 (Zhang et al., 2017). Accordingly, Cas9 can be programmed by sgRNAs to cleave all plasmids, which had lost the UBP due to point mutations (Figure 1). Consequently, this leads to a retention of the UBP in the plasmids.
For this conservation system, we integrated the cas9 gene into the genome of E. coli BL21(DE3). This new strain is called X1. To be able to strictly regulate the expression of cas9 we used the optimized IPTG‑inducible promoter Plac‑tight. The genomic knock in was done according to the protocol by Cobb et al., 2014 using the pCRISPomyces plasmid system. A sgRNA was designed searching for a reverse sequence with the constraint N(16)R(4)NGG (Cobb et al., 2004) inside the coding sequence of the arsB gene (Zhang et al., 2017) of the E. coli genome. arsB is coding for an arsenic efflux pump membrane protein. As a result, 5‘‑TATTGTTCATAATAGAAGAG‑3‘ turned out to be the suggested guiding sequence with the highest on‑target activity score with being unique within the complete genome. Furthermore, a repair template is necessary for the knock in. Therefore, a composite BioBrick was assembled using 1 kb long flanking sequences, two times the terminator BBa_B0015 flanking the Plac‑tight and the cas9 coding sequence (Figure 2).
This composite BioBrick from Figure 2 needs to be restricted with NotI to separate the linear repair template from the pSB1C3 backbone. Together with the target plasmid containing the sgRNA and the pCRISPomyces plasmid containing the cas9 all three elements were co‑transfected into E. coli BL21(DE3). The genomic integration was verified by sequencing.
We designed a lac operon for a tight repression called Plac‑tight to achieve a low transcription rate. The induction of the wild type lac operon increases the level of β‑galactosidase 1000‑fold (Müller et al., 1996). As part of the lac operon the lac repressor was the first repressor isolated and sequenced in 1966 by Gilbert and Müller‑Hill. The wild type lac operon consists of the genes lacZ, lacY and lacA that are transcribed from the lac promoter Plac into a polycistronic mRNA (Figure 4). These genes code for the proteins β‑galactosidase, Lac permease and Lac transacetylase (Oehler et al., 1994).
The transcription is constitutively activated by the CAP protein. lacI codes for the tetrameric Lac repressor and is expressed by the promoter Pi. The Lac repressor can bind to the lac operators O1, O2 or O3. O3 is located 92 bp upstream of O1 and O2 401 bp downstream of O1. The inter‑operator distances are counted from the center of O1 to the center of the distal operator. By binding simultaniously to O1 and O2 or to O1 and O3, the Lac repressor forms a DNA loop that negatively controlls the expression of Plac (Oehler et al., 1994).
In 1994, Oehler et al. showed that an inactivated O2 in its natural position does not decrease the repression by low amounts of tetrameric Lac repressor. Based on this result, we designed our Plac‑tight without the lac operator O2. It was also shown that two weak operators result in a tighter repression than a single strong operator. This can be explained by the thermodynamic concept that a second operator increases the local concentration of the Lac repressor for the neighboring operator. As a consequence there is a higher probability of occupation for two operators by the Lac repressor leading to a tighter repression (Oehler et al., 1996). In 1983, Sadler et al. proposed a ideal lac operator Oid that binds the Lac repressor 10‑fold tighter than the natural strong lac operator O1. Oid is a inverted repeat of the left half of O1 (Figure 5).
The operator‑DNA‑operator complex requires energy for the bending process in order to form a DNA loop. Additional energy for a torsion is required when the two lac operators lay on opposite sites of the helical DNA surface. Therefore, the DNA loop formation is energetically favoured for lac operators in phase (Müller et al., 1996). In 1996, Mueller et al. investigated the strength of repression for an inter‑operator distance of Oid and O1 from 57.5 bp up to 1493.5 bp. The repression values were compared to the repression by a single O1 at its natural position. A shorter spacing than 57.5 bp could not be examined due to the 35 box of the promoter. Phase dependency for the repression was observed for a spacing around 200 bp. That leads to the observation of periodically maxima for repression values (Figure 6).
The distance of 70.5 bp showed the strongest repression value which is 50‑fold higher than the natural repression. Repression drops sharply to 15‑fold at a 150.5 bp spacing and to threefold at around 600 bp. All inter‑operator distances beyond 600 bp kept a twofold increased repression value (Müller et al., 1996).
According to these results, our Plac‑tight consists of the auxiliary operator Oid with a 70.5 bp spacing to O1 at its natural position. The residuray sequence like the P lac was kept as the natural lac operon taken from E. coli BL21(DE3) (Figure 7).