Modelling
Overview
The goal of the modelling component of our project was to find optimal pathway for maximizing production of PHB in E. coli. Furthermore, the model will be used to simulate the reactions that occur in the bacteria after it is transformed with our Constructs. After a detailed analysis and discussion with experts, the team decided to pursue flux balance analysis (FBA) and kinetic modelling.
Analysis of various modelling methods
The modelling subgroup carefully considered a number of possible modelling topics for our project. We looked into the pros and cons of each model. We also analyzed the usage of the models by previous iGEM teams and in literature. In order to decide a topic, we graded each of the models based on their impact on our project, integration with lab experiments, risks, and availability of resources. The following is a list of our analysis of each of the modelling topics considered:
Modelling Method: Flux balance analysis (FBA)
Questions addressed by model
- Can use maximization of PHB production as an objective function and determine the optimal steady-state flux distribution, then compare to what we observe in the lab.
- Can potentially compare PHB production between unmodified E. coli and E. coli after genetic modifications
- Explore the capabilities and limitations of biochemical networks
- Analysis of metabolic network robustness (Raman et al., 2009)
- Analysis of Genome-scale metabolic models
Pros
- Does not require kinetic parameters and can be computed fairly quickly for large networks (Orth et al., 2009).
- Proof of concept that our genetic modifications maximize PHB production from human feces
- Can compare model predictions (maximum theoretical yield) to experimental data and potentially troubleshoot the system.
Cons
- The accuracy of the FBA prediction largely depends on the accurate definition of the metabolic network, the various constraints and definition of biologically relevant objective functions.
- May not always be accurate because does not account for regulatory effects (activation of enzymes by protein kinases or regulation of gene expression) (Orth et al., 2009).
Modelling Method: Reaction kinetics of pathways for synthesis of PHB
Questions addressed by model
- Determine the rate limiting step in the pathway (and implement solutions to improve the rate of that step)
- Can potentially provide insight into the impact of gene order
Pros
- Relatively easy to translate metabolic network to mathematical terms (Schallau et al., 2010)
- Can be integrated with the project’s experimental data and used to reiterate model and lab experiments.
Cons
- Difficult to find parameters for many reactions in the pathway.
- When analyzing multiple pathways that interact with each other, we may have to assume constant reaction rates for simplicity.
Modelling Method: Steady state mass balance to predict amount of PHB
Questions addressed by model
- Helpful for cost analysis of PHB production
- Identify the least efficient parts of the process (in terms of conversion)
- Can be combined with cost analysis to figure out the areas of the Process, which can be modified to improve PHB production while adding the least cost.
- Evaluate how much PHB will be produced depending on plant capacity (also combined with cost analysis to suggest the optimal scale at which PHB production is most feasible)
Pros
- Can be integrated with the cost analysis
- Can be integrated with Process development.
Cons
- Cannot be integrated with experimental data.
Modelling Method: Translational kinetics to predict effect of codon optimization
Different bacteria have different tRNAs that are abundant in their cells. When genes from other species of bacteria are expressed in E. coli, they may have a very inefficient translation efficiency if the available tRNA pools are very different. The impact of the difference can be calculated using tAI (tRNA adaptation index) (Han et al., 2010). The tAI for a gene is a measure of the availability of the tRNAs that serve each codon in the gene (Han et al., 2010). Synonymous codon substitutions can affect translational kinetics, which subsequently affects the final protein structure and function (RegulonDB).
Pros
- Determine increase in translation efficiency after codon optimization.
- Determine effect of synonymous codon substitutions on protein folding (synthesis enzymes or membrane proteins involved in secretion).
- Further optimize sequence to take into account secondary binding structures etc to optimize mRNA translational efficiency.
Cons
- Cannot be integrated with experimental data.
- Molecular Dynamics Simulations can be used to model protein folding.
- Many unknown parameters involved.
Flux Balance Analysis
FBA is a mathematical method for simulating metabolism by calculating steady-state metabolic fluxes for large metabolic networks. The modelling subgroup deemed that flux balance analysis will help us find an optimal pathway for maximizing production of PHB in E. coli BL21 (DE3) after inserting our genetic construct. MATLAB has Cobra toolbox that is used for flux balance analysis and visualization of metabolic networks. Thus, the modelling subgroup installed the toolbox and searched for models that contained E. coli’s reactions/metabolic networks. We found that the E. coli core model contained pathways and reactions of our interest. We decided to use this model as a proof of concept because it was not possible to visualize the E. coli BL21 (DE3) model and was computationally expensive due to the presence of large number of reactions (edges) and metabolites (nodes). We loaded the core model and and visualized it using Paint4Net functions. The reaction nodes are represented as rectangle, metabolite nodes as ellipses, zero rate fluxes as grey arrows, green edges represent positive-rate (forward) fluxes, and blue edges represent negative-rate (backward) fluxes.
After reading the model containing metabolic networks, the toolbox has ‘optimizeCbModel’ function that calculates steady-state flux for the metabolic network. The objective of the function can be changed to a different metabolite. In our case, the objective was changed to PHB because we wanted to find the optimal pathway for PHB synthesis after inserting our construct.
Results
The first step for FBA analysis was to insert the genes that utilize products of glycolysis pathway (phaC, phaB, and phaA) and beta-oxidation pathway (phaC1, phaJ4) into the E. coli core model. The team was able to add the phaCBA genes into thecore model. The model was visualized to look for any changes in the pathway.
We noticed that our model showed that the new reactions as a result of adding phaCBA were ‘dead ends’. The result of optimizeCbModel was 0, which means there was no steady-state flux for the objective selected i.e. optimal pathway for PHB. The product (PHB) was shown as a dead-end metabolite. In order to solve this problem, we found that we needed to add exchange or transport reactions for the removal or secretion of PHB from the cell. This will allow the cell to secrete and synthesize PHB at steady-state fluxes.
Future Directions
In order to get conclusive results for finding optimal pathway(s) for the synthesis of PHB at steady-state fluxes, it is necessary to introduce a secretion system into the model. Thus, the solution will be to use fastGapFill, which is used to find missing reactions and metabolites from the universal biochemical reaction database (e.g. Kyoto Encyclopedia of Genes and Genomes) for a given (compartmentalized) metabolic reconstruction. After addition of the transport reactions, the model will have to be validated before it can be used to inform lab experiments.
Kinetic Modelling
We began developing a kinetic model for PHB synthesis via beta-oxidation. We hoped to identify the rate-limiting steps in our reaction pathways and thereby identify whether upregulation of fadD and fade genes would increase the rate of reaction for the PHB-synthesis pathway via β-oxidation. We used Micheles-Menten enzyme kinetics to model the reactions of the beta-oxidation pathway. However, we ran into a number of challenges including lack of reliable data for β-oxidation of fatty acids in E.coli as well as how complex the model was becoming.
Methods
We used the SymBiology toolbox in MATLAB to build the model and used kinetic parameters obtained from literature or online databases.
Results
Our online search from literature, databases, and pre-existing models revealed that there was not much information on the kinetics of many enzymes and pathways for PHB-synthesis via β-oxidation pathway. Therefore, we were unable to obtain informative results even when different organisms' kinetic parameters were used to substitute values for key enzyme reactions. While building the model, we also learned that the fadD and fadE genes were most active for long chain fatty acids and therefore we built our model based on using oleic acid as the substrate to test the activity of these genes for metabolism of long-chain fatty acids in human fecal waste. However, due to time restraints and lack of accurate kinetic parameters in literature and databases we decided not to pursue kinetic modelling; the information generated would not be applicable for informing our experiment given current circumstances.
Future Directions
In the future we hope to test our phaJC, fadE, and fadD constructs in the lab to obtain data of enzyme kinetics which we can use to adjust our model's kinetic parameters to obtain accurate results.
Works Cited
Raman, K., and N. Chandra. "Flux Balance Analysis Of Biological Systems: Applications And Challenges". Briefings in Bioinformatics 10.4 (2009): 435-449. Web.
Orth, Jeffrey D, Ines Thiele, and Bernhard Ø Palsson. "What Is Flux Balance Analysis?". Nature Biotechnology 28.3 (2010): 245-248. Web.
Schallau, K., Junker, B. “Simulating Plant Metabolic Pathways with Enzyme-Kinetic Models”. Plant Physiology Vol.152 (2010): 1763-1771. Web.
Han N. Lim1, Yeong Lee, and Razika Hussein. “Fundamental relationship between operon organization and gene expression”. Proceedings of the National Academy of Sciences of the United States Vol. 108 (2011): 10626-10631. Web.
RegulonDB version 9.0: high-level integration of gene regulation, coexpression, motif clustering and beyond. Gama-Castro S, Salgado H, Santos-Zavaleta A, Ledezma-Tejeida D, Muñiz-Rascado L, García-Sotelo JS, Alquicira-Hernández K, Martínez-Flores I, Pannier L, Castro-Mondragón JA, Medina-Rivera A, Solano-Lira H, Bonavides-Martínez C, Pérez-Rueda E, Alquicira-Hernández S, Porrón-Sotelo L, López-Fuentes A, Hernández-Koutoucheva A, Moral-Chávez VD, Rinaldi F, Collado-Vides J. Nucleic Acids Res. 2016 Jan 4;44(D1):D133-43. doi: 10.1093/nar/gkv1156. Epub 2015 Nov 2.