Short Summary
Generating the library
Figure 1: Library plasmid pSB1C3 containing the tyrosyl tRNA synthetase.
Library plasmid based on pSB1C3, containing the tyrosyl tRNA synthetase under controle of a glnS promotor, a pMB1 origin of replication and chloramphenicole resistance.
If a marker sequence is inserted in the position which should be randomized, the insertion of this certain sequence can be easily screened. We used a mRFP (BBa_J04450), under control of a lac promoter, lac operator and rrnB T1 terminator as an optical control. The primers 17vi and 17vj were designed with overlaps, homologous to the sequence around the binding pocket region synthetase sequence, allowing an optimal binding into the TyrRS. The position of the TyrRS chosen to be randomized, are Asp158, Ile159 and Leu162, the positions of the center of the binding pocket. The mRFP is inserted into the TyrRS in place of this binding side to function as an optical control. If the randomized DNA double strand is incorporated into the synthetase, the colonies color changes due to the absence of the mRFP, so the E. coli containing the randomized library plasmids can be easily picked for further positive and negative selection.
Figure 3: Generating a synthetase library by using oligo dimers and mRFP as an optical controle.
Two primers, one with a randomized position, are designed to form a dimer (1), which is completed to dsDNA by the Klenow fragment. The region of the TyrRS meant to be modified by randomization is replaced by mRFP as an optical control (2). In the case oft he incorporation oft he randomized dsDNA, the mRFP is replaced and thus the incorporation is visible directly.
Figure 4: Tyrosyl-tRNA/synthetase library on LB-plate with chloramphenicol.
The library was generated by using two primers, one with a randomized position, which are designed to form a dimer. This dimer is completed to a dsDNA by the Klenow fragment. As optical control, a mRFP is incorporated in this certain position to be ranomized, which is then replaced by the dsDNA. On this basis, we could see that xxx% of the cells incorporated the template plasmid, but most cells contained the plasmid with the incorporated randomized dsDNA.
Anaylzing the tyrosyl tRNA/aminoacyl-synthetase library
Sequencing by Sanger
E(T)= n*Hn (1)
Figure 3: Chromatograms of the Tyr-RS Library by sanger sequencing.
Depicting the chromatograms of four TyrRS library replicates being sequenced forward and reverse by sanger sequencing. The positions 158, 159, 162 of the TyrRS are randomized by NNK scheme.
Regarding the chromatograms, depicted in Figure 4, the maximal fluorescence intensity of the thymidine is approximately 75 % up to 90 % lower than the maximal fluorescence intensity of the guanine. In comparison, the maximal fluorescence signal of the guanosine shows up to 97 % of the approximate maximal fluorescence signal of the cytosine.
Comparing these tendencies with the sequence results of the modified positions (NNK), lead to the following assumption: when generating a library using the NNK scheme, the rates for the incorporation of the different nucleotides is not evenly distributed. Originally, we expected an equal distribution of guanosine and thymidine on the K position, but the fluorescence signal of the thymidine is approximately higher than the fluorescence signal of the guanosine. Despite the given data of the maximal fluorescence intensity of the four nucleotides, at this position, the fluorescence signal of the thymidine is higher than the guanosines signal. This implies a higher incorporation rate of the thymidine if the sequence is randomized with a K in this position. Analogue to this experience, the distribution of the incorporation of the four nucleotides, resulting from an N randomization on this certain position, is not equal either. In relation to the other three incorporated nucleotides, there is an approximately higher cytosine signal on this position, also implying a higher incorporation rate of cytosine when using the N randomization.
Illumina Sequencing
Figure 4: Starting the Illumina MiSeq sequencing.
We generated oligonucleotides containing a certain adapter sequence and a unique indice to separate our sequences from other libraries. The amplified region of our library had a maximal length of 500 bp, so the DNA fragments do not get entangled while bridge amplification, which would lead to an overlap of the different clusters. After amplification with the certain oligonucleotides, the PCR product was purified from a 1 % agarose gel. The quality of the library amplificate was controlled for the NGS by using the Agilent BioAnalyzer with High-Sensitivity DNA chip. This technology uses capillary electrophoresis for a sensitive quantification and sizing of DNA fragments to test if our library preparation matches the specifications of the Illumina MiSeq technology. The electropherogram of the Agilent BioAnalyzer High Sensitivity DNA Assay shows our amplified library fragment as the largest peak of 2,874.13 pg/µL and a molarity of 7,735.3 pmol/L with a length of 563 bp and flanked by the two markers (35 bp and 10,380 bp). The image of the gel, depicted in figure 5, shows a thick band at 550-600 bp, fading out up to the band of 700 bp, matching the slightly uneven peak of the electropherogram. It is important, that there are as less as possible larger fragments, forcing a possible overload and therefore the abruption of sequencing.
Figure 5: Electropherogram of the tyrosyl- synthetase library.
The Agilent BioAnalyzer High Sensitivity DNA Assay is used for the measurement. The library fragments are depicted as the peak in the center (563 bp), flanked by markers.
Figure 6: Gel image of the Agilent BioAnalyzer High Sensitivity DNA Assay of the tyrosyl- synthetase library.
We identified, that reads with a coverage of minimal two result in 8,787 different peptides. 8,464 different peptides can be translated of the sequence variants with a coverage higher than two, and 8,135 with a coverage higher than three. Based on this data, our 27,672 variants composed tyrosyl tRNA/synthetase library codes for more than 8,000 different peptides. Considering that we continued the generation of the library after sequencing, nearly doubling the number of clones, we assume the tyrosyl-tRNA/synthetase library to be larger than the analyzed 27,672 different sequences and 8,000 peptides. We were not allowed to submit the complete library.
Therefore, we submitted two versions of the basis library Plasmid ( BBa_K2201400 , BBa_K2201411 ) for the generation of a own library. In addition to that, the complete library is available to all future iGEM teams, after request.
Selection
The positive selection plasmid (BBa_K2201900) contains the Methanococcus jannaschii based tRNA (CUA) with an anticodon for the amber codon under the constitutive promoter proK. The essential part for the selection is the kanamycin resistance with two amber codons behind the translation start. If the tRNA/aminoacyl-synthetase mutant (encoded on a cotransformed library plasmid is able to charge the tRNA (CUA) with any amino acid) the cell could express the kanamycin resistance. Thus, these cells survive when plated out at LB agar plates with the ncAA and kanamycin.
Figure 7: Positive selection plasmid.
Figure 8: Negative selection plasmid
Our goal is to generate a tRNA/synthetase which is able to incorporate 2-Nitro-L-phnylalanine, used for the photocleaving of the polypeptide backbone.
For the first round of selection, we cotransformed the library plasmid BBa_K2201400 with the (BBa_K2201900) and cultivated the cells on LB-plates with kanamycin and 2-nitrophenylalanine (2-NPA). Due to the amber stop codon, integrated in the kanamycine resistance on the positive selection plasmid, only the cells owning a functional aaRS survive. That is due to the amber stop codon on the kanamycin resistance. The resistance is only expressed, when a non canonical or endogenous amino acid is incorporated as response to the amber stop codon. Thus, the aaRS are selected for its function. To avoid an additional pressure on the cells, we did not used tetracycline or chloramphenicol for the cultivation, due to the dependency of the kanamycin expression on the library plasmid and the positive selection plasmid.
After the positive selection, we received approximately 800 colonies, showing that many of our generated TyrRS variants are able to bind a non canonical or endogenous amino acid despite the modifications. We washed these colonies off the plates, isolated the plasmids and cotransformed them with the negative selection plasmid (BBa_K2201901) and cultivated the cells on LB-plates with tetracycline and chloramphenicol to be certain to attain both plasmids.
Figure 9: Remaining colonies while the positive selection for 2-NPA, containing the positive selection plasmid and the library plasmid.
The remaining cells own an aaRS able to bind a non canonical or endogenous amino acid.
Figure 10: Remaining colonies while the negative selection and after the positive selection for 2-NPA, containing the neg selection plasmid and the library plasmid.
The remaining cells own an aaRS, specific to not bind an endogenous amino acid. Red colonies own a positive selection plasmid as result of the plasmid isolation and can be separated for further selection rounds easily.
We combined the positive selection plasmid with a strengthening system (BBa_K2201373) , containing a T3 RNA-Polymerase with a reversed mRFP under T3 RNA-polymerase. With this system, the mRFP is expressed, resulting in a red colour of the colonies, still owning this positive selection plasmid. Thereby, it was possible to easily identify the clones owning the positive and not the negative selection plasmid while the negative selction. As it can be seen in Figure 10, the transformation efficiency of the positive selection plasmids, in contrast to the library plasmids, is low, resulting in one single false colony owning the positive selection plasmid.
References
Pfeufer V., Schulze M. , (2015). Laser fluorescence powers sequencing advances. BioOptica World Beese,L.S, Derbyshire V, Steitz T.A. (1993). Structure of DNA Polymerase I Kienow Fragment Bound to Duplex DNAMiddendorf L.R., Humpfrey P.G., Narayanan N., Roemer S.C.. (2008) Chapter8 Sequencing Technology. WILEY