Our Key Goals
The aim of the USYD iGEM 2017 team was to address the problem of insulin inaccessibility. The design of our insulin, and its means of expression, needed to look at five key characteristics:
Stability
For our project to work effectively, we must have a supply chain that’s not a cold chain, so that costs can be reduced. This will ultimately mean that the cost of these cold chains will not be passed onto the consumer. To achieve this, we hope to design an insulin that will not lose efficacy after being exposed to room temperature for long periods of time.
Single Chained
As a result of the difficult purification methods, Single Chain Insulins, or SCIs for short, have been developed with a small, C-peptide chain linker. This linker connects the A and B chains in such a way that the di-sulfide bonds form more favorably. We aim to develop our own single chain insulin to compare it’s simplicity.
Ease of Purification and Affordaility
We must also consider the impact of a difficult, costly manufacturing process on small scale manufacturing companies. This impact is too great to impose on this grass-roots organisations, so we have pursued to find a cheap, simple purification method which is able to produce high yields from a recombinant system.
Intellectual Property Issues
As a result of the way drugs are currently developed, all new inventions for therapies are protected by Intellectual Property Law through patents. These patents surrounding all currently prescribed and newly invented insulins has inspired our team to pursue a completely open source project.
Safety and Efficacy
Our insulin products must be of certifiable medical grade such that it can be approved for human use after stage IV clinical trials, or biosimilar clinical trials. Furthermore, it must also be at least as effective as the other insulins on the market.
Our Insulin Analogue
We began working on expressing proinsulin as it is a reliable, unpatented, and well characterised therapeutic. Proinsulin itself is inactive, and to become active it requires additional processing to form insulin. This processing is performed by the protease trypsin, which cleaves the C chain. from proinsulin. After trypsin activity, the A and B chains are left over to form active insulin. Proinsulin processing is demonstrated in Figure 1. with the cleavage of the C chain
Figure 1. Cleavage of proinsulin results in active insulin
Due to the additional processing steps required to produce purified active insulin from proinsulin, increasing the cost of insulin synthesis, we looked for way to diminish this cost by reducing the steps required to obtain active insulin.
Our team decided to design a single-chain insulin, as single-chain insulins have been demonstrated to have higher stability and activity than human proinsulin. Stability in particular was important to our design due to the need to synthesise an insulin that could be transported over long distances without requiring cold temperatures. Crucially, single-chain insulins do not need to be cleaved to be active, requiring less processing than proinsulin.
We based the linker peptide of Winsulin on sequences and principles tested in Rajpal et al., 2009. This paper found that peptides with a length of 5 to 12 amino acids had the highest receptor binding activity. Importantly, the composition of the sequence is identified as integral in determining whether the single chain construct has activity. The presence of two adjacent basic (dibasic) residues in the linker peptide have also been found to be crucial for insulin to bind to the insulin receptor.
With the above considerations in mind, we designed the linker peptide sequence for Winsulin with the following properties:
- 12 amino acid length - this was as close as we could get to a 5-12 amino acid length as outlined by Rajpal et al., while not infringing upon the (2005) patent
- Contains dibasic residues (lysine and arginine) - this ensures Winsulin’s receptor binding activity, and ensures that our Winsulin sequence falls outside of the claims of the (2005) patent.
We also modified the linker sequence in Rajpal et al.'s paper by including a GGGSGGG sequence. This is a standard sequence known to have high flexibility, enhancing the folding of our Winsulin. We also used it to increase the length of our linker peptide sequence, as we do not believe that this sequence will interfere with insulin activity in any way.
Additionally, the C-terminal residue of proinsulin is an asparagine. In Winsulin we substituted this for a glycine to increase the pI of Winsulin to approximately 7.8. This pI is somewhat high, but it leads to the aggregation of Winsulin monomers to form hexamers. This substitution is present in many long-acting insulins currently on the market. Yet, interestingly, our modelling indicates our Winsulin is still a rapid-acting insulin, and dissociates from hexamers more readily than human insulin.
Our Constructs
We designed our expression constructs in order to test winsulin and proinsulin with multiple expression systems. Click on each element of the construct to learn more about why we chose it:
BB prefix
RBS
YNCM Tag
His Tag
TEV
Winsulin
BB suffix
BB prefix
RBS
YNCM Tag
His Tag
R
Proinsulin
BB suffix
BB prefix
RBS
Ecotin Tag
His Tag
TEV
Winsulin
BB suffix
BB prefix
RBS
Ecotin Tag
His Tag
R
Proinsulin
BB suffix
BB prefix
RBS
His Tag
TEV
Winsulin
BB suffix
BB prefix
RBS
His Tag
R
Proinsulin
BB suffix
iGEM BioBrick Prefix
Contains the restriction sites that are necessary for BioBrick compatibility including EcoRI, NotI & XbaI.
E. coli Extended Ribosome Binding Site
A derivative of the RBS found in gene 10 of the T7 bacteriophage, this 23 base pair sequence rich in A’s & T’s enhances ribosome binding to boost expression.
YncM Tag
The YNCM tag is a 12 amino acid sequence whose presence on the N-terminus of the protein targets it for secretion out of the cell into the surrounding media via the Sec pathway in Bacillus subtilis. YNCM was chosen because it was recently shown to be massively successful in targeting recombinant protein for secretion compared to a library of other signal peptides. Additionally, this was shown in B. subtilis strain WB600, which is the bacteria that our WB800 strain was derived from. So we expect that it should give us similar success in secretion of our constructs. (Guan et. al. 2016)
His Tag
We have included a tag comprised of 6 sequential histidines that form a vital aspect of our purification technique using affinity chromatography. Histidine’s high attraction to metal ions will cause the entire protein, insulin and all, to bind to a nickel column and separate it from the other proteins of the cell.
TEV Protease Cleavage Site
TEV is a sequence-specific cysteine protease derived from Tobacco Etch Virus. Because of its high specificity, it is commonly used for deliberate protein cleavage. In our project, we will use it to exclusively detach Winsulin from the nickel column, leaving the his tag and Ecotin/YNCM tags behind. This should provide us with a pure elution of Winsulin.
“R” Arginine Cleavage Site
Arginine acts as a recognition site for Trypsin Protease which we will use to specifically remove Proinsulin from the his tag and YNCM/Ecotin tag in a similar way to TEV. We have chosen to use Trypsin in these constructs because it allows us to further simplify the processing of proinsulin. Trypsin naturally cleaves the C-peptide from proinsulin which, following disulfide bond formation, leaves the active form of insulin. This is the way it works in our body, so we are confident that it will work here too.
iGEM BioBrick Suffix
Contains the restriction sites that are necessary for BioBrick compatibility including SpeI, NotI & PstI. We have also added an additional BamHI site at the terminus of our E. coli expressed constructs for ligation into pET-15b.
Ecotin Tag
Ecotin acts as a signal sequence to target the translated protein to the periplasm of the cell. There are a number of advantages that make it a good choice over other tags.
- Relatively low metabolic burden due to its small size
- No interaction with other proteins within the periplasm
- Is native in E. coli and contains a disulfide bond meaning it undergoes through an oxidative compartment that may assist in the formation of the disulfides in Proinsulin and Winsulin.
- It has already been shown to successfully target proinsulin to the periplasm (Malik et. al. 2007)
Proinsulin
See our section above!
Winsulin
See our section above!
Our Vectors
We have chosen to use the pET-15b expression system paired with BL21 strain E. coli to express our periplasmic and cytoplasmic constructs. It is a well characterised vector that uses the T7 induction system to prevent expression without the presence of IPTG, providing a convenient level of control over production.
pUS258 is an integrative expression vector which undergoes homologous recombination to insert into the amyE gene of the Bacillus subtilis chromosome. It’s a relatively simple plasmid using the same IPTG induction system that we use across all of our expression systems.
Our Expression Systems
In order to maximise the yield of our insulin, while also reducing the post-expression methods currently undertaken by manufacturers, we tested both of these constructs in three different expression systems. Two of these expression systems were in BL21 E. coli, and the other was in Bacillus subtilis.
Cytoplasmic Expression
Cytoplasmic expression is our recombinant protein production in its raw and simple form. With no tags attached to the fusion protein, both Winsulin and Proinsulin will accumulate within the cytoplasm to form aggregated inclusion bodies. Extraction will involve complete lysis of the cell and purification will require separating them from DNA, membranes and other proteins.
The reason to express in the cytoplasm is mainly to compare our other expression systems to where we will be able to see whether they are a viable means of production.
Periplasmic Expression
Potentially the biggest hurdle to overcome in efficiently producing recombinant insulin is having the three disulfide bonds in proinsulin and active insulin form correctly. This is because they require an oxidative environment that isn’t usually found in the cytoplasm of common protein factories like E. coli where proteins are naturally expressed.
Fortunately, the periplasmic compartment of gram negative bacteria, including E. coli is more oxidative than the cytoplasm and will therefore improve disulfide bond formation. So we are using the Ecotin tag to target proinsulin and Winsulin to the periplasm in the hope that they will fold correctly and form the correct disulfides.
This method also has the additional advantage that the periplasmic fraction can be extracted without lysing the entire cell, separating our insulin from cells DNA and the rest of the cytoplasmic “junk”.
Secretory Expression
Current methods of insulin production grow bacteria in huge vats. But bacteria like E. coli that only express the protein in the cytoplasm need to be lysed to extract it. Which means they need to drain hundreds of thousands of liters of culture to separate the cells from the media before lysing the cells and then purifying insulin from the DNA and other cellular proteins. Although we will be testing cytoplasmic E. coli expression similar to these methods, we wanted to come up with a way to efficiently separate our insulin from the cells and decided the best way would be have it secreted directly into the media. This is where Bacillus subtilis comes in! Bacillus has been a common tool for recombinant protein production for years and has been proven again and again that they are highly efficient secretion factories. The hope is that having the insulin separated from the cells in the media would mean that scaled up production could use a system where the media is constantly cycled out and the cells can continuously produce our insulin. As can be seen in Figure 3, expression using B. Subtilis can be very simple contributing to our goal to create an easily purifiable product.
Our Purification Process
Our insulins were designed with a 6x His sequence at the N-terminus of the insulin. This sequence, comprised of 6 histidine residues, binds to nickel. When cell lysates or media containing insulin are added to nickel beads, all of the cellular components should wash off from the beads. This separates them from insulin which remains bound to the nickel beads. As such, the 6x His tag sequence's affinity to nickel can be used to purify insulin from cells or media.
We considered two methods for the purification of our insulins: nickel columns, and nickel agarose resin. The nickel column method is simpler, as insulin can be purified in a single step, yet it is more expensive. On the other hand, the nickel agarose resin method involves more steps but is more cost-effective.
An additional factor in the purification of our insulins is the need to remove the 6x His sequence and the expression tags from the insulin. We designed our Winsulin constructs so that both could be cleaved off in a single step by TEV protease. Our Proinsulin constructs take this further - in a single step using the protease trypsin, the His-tag, expression tag and C chain are removed, simultaneously removing the tags and allowing the proinsulin to fold into active insulin.
In the nickel column method, the protease step also releases the insulin from the nickel column for collection. However, it also reduces the amount of insulin released from the nickel column. Taking this into consideration, along with the increased expense of the nickel column method, our team decided to use the nickel agarose resin method for insulin purification.
Assays
As an initial test to determine protein folding and accurately quantify expression yield, we will be using an ELISA. This test uses antibodies that bind to the folded structure of insulin with extremely high specificity. In this regard, it gives an initial indication as to whether the protein is in a conformation that could potentially bind to the insulin receptor. Additionally, the antibodies are tagged with fluorescent protein, so the relative fluorescence will correlate to the amount of insulin in the sample, also with great accuracy.
Following the ELISA, we will test the function of our expressed recombinant Winsulin, and Proinsulin (with trypsin removal of the C-peptide) using an in vitro glucose uptake assay. Here, we measure the changes in glycogen synthesis and glucose oxidation in response to activation of the insulin receptor in cell cultures. C14 radioalabelled glucose will be taken up into the cell at a faster rate upon stimulation and result in incorporation of the carbon into glycogen and CO2, which is measured by a scintillation counter. This assay is a great alternative to an in vivo assay that uses involves assessing the convulsions of mice in response to injections of increasing concentrations of insulin.
Additional Design Considerations
WB800 Bacillus Strain
One of the challenges that we found in the literature of producing recombinant insulin was its susceptibility to proteolytic decay. So to continue with the theme of maximising expression, it would make sense to reduce the effect of proteolytic decay.
Our idea: knockout proteases from an existing strain such as B. subtilis 168.
Our actual solution: find one that already exists.
And hence we discovered B. subtilis WB800. It is one of the more recent of a long line of protease deficient strains of Bacillus subtilis that have been optimised for recombinant protein secretion (Figure 5).
WB800 specifically has 8 proteases knocked that include both intracellular and membrane bound. The strain was kindly provided to us for experimental use by Professor Sui-Lam Wong of the University of Calgary, Canada.
pUS270
Although pUS258 appeared to work quite well as an expression vector, we noticed that it wasn't the most streamline process of cloning in our constructs nor transforming into Bacillus subtilis because of an MCS that was incompatible with the BioBrick cloning system, and the extra steps needed to ensure the correct integration of the plasmid into the chromosome.
So we decided to design a new compact, highly efficient replicative expression vector that would serve not just our own future experiments, but the whole iGEM community. Thus, pUS270 was born!
While we were too short on time to construct it ourselves, we are holding onto it for a future project that can be tested and eventually submitted to the registry.
Click on each feature for more information
Lac Operon Repressor
Codes for the production of the repressor protein that binds to the Lac Operator to prevent expression until induction of the media with IPTG. We have designed pUS270 to have the LacI under the pVeg promoter for strong constitutive expression in Bacillus.
pIP501 Rep
This gene codes for the Replicator protein that pairs with the Bacillus OriV and is essential to promote the independent replication of pUS270 in Bacillus.
E. coli OriV
Taken from pSB1C3, having this origin of replication will make it easier to construct the plasmid using E. coli as well as improve cloning efficiency downstream. Chosen because of the high copy number of pSB1C3.
Multiple Cloning Site
The multiple cloning site of pUS258 and other expression plasmids that we considered using either had too few restriction sites or sites that were incompatible for the efficient cloning with the BB Prefix and Suffix. So we have designed two highly diverse MCS’ that have 27 unique sites amongst them on either side of SF AmylCP-6 to promote simple cloning of parts from the Bio Brick Registry.
Antibiotic Resistance
Codes for Neomycin Phosphotransferase II that gives transformants resistance to both Neomycin and Kanamycin. Allows selective screening of transformants
Antibiotic Resistance
This region allows independent replication of the vector within Bacillus at a high copy number to provide more copies of the recombinant gene to maximise expression.
Lac Operator
Consists of the target sequence for the lac repressor protein to contribute to the IPTG induction system.
SuperFold AmilCP-6
SuperFold AmilCP-6 is a variant of amilCP which was used by the 2016 Sydney IGEM team and originally submitted to the registry by Team Uppsala Sweden in 2011. This variant has undergone site directed mutagenesis to increase the folding rate of the chromoprotein and enhance the blue colour. We are taking advantage of this protein to enable blue-white screening without the need for adding X-gal to the plates. Cloning with the two multiple cloning sites that flank SF AmilCP-6 essentially swaps it with the insert to leave the recombinant bacteria white rather than blue.