Team:Virginia/Model




Modeling



Metabolic Flux Modeling

Flux Balance Analysis

Recent advances in computational biology led to emergence of databases with whole-genome metabolic networks, which for simplicity we will call models. They describe the flow of metabolites within a particular organism. Applying constraint-based methods (see this figure) to a particular model allows one to make quantitative predictions about cell phenotype while eliminating many complex parameters. Of particular interest to our project is flux balance analysis (FBA)[1], which allows us to predict optimal steady-state biomass yield, which is tightly correlated with cell growth rate and is the most likely observed phenotype[1,2]. One major advantage of FBA is that it does not require knowledge of enzymatic parameters. This figure[3] elucidates the mathematics of FBA.

One of the primary questions for the project is the following. Will the synthetic P. denitrificans strain containing the device grow better than a wild type strain in the presence of ammonia? It is important to know the answer to this question because the To answer this question, we performed comparative analysis of the two Paracoccus strains using FBA on the corresponding whole-genome metabolic model. The analysis pipeline involved a slew of open-source computational tools, which we will describe below.

Image HTML map generator

First, we reconstructed a metabolic model of Paracoccus denitrificans strain DSM 413 on a complete medium using ModelSEED[4]. A complete medium is such that any nutrient, including ammonia, is available for uptake. Thus, the set of reactions included in the model is the largest of all possible sets. Although the largest, this set is incomplete because certain essential reactions that result in cell growth may be missing. In the next step, the model is gapfilled with the reactions necessary for measurable cell growth.

To quantify the difference between wild-type and synthetic strains, we must include several new reactions, metabolites and genes (e.g. oxygenation of ammonia by the AMO enzyme complex) into the model generated by ModelSEED. Such functionality is not available in ModelSEED. To implement this, we turned to COBRApy: Constraint-Based Reconstruction and Analysis[5] package written in Python. However, COBRApy does not natively work with ModelSEED models. To overcome this compatibility issue, we used Mackinac package[6] to convert the ModelSEED model into COBRApy-compatible format. Using COBRApy, we then added the new reactions into the model and performed FBA to compare the biomass yields, and hence the growth rates, of the two Paracoccus strains. The script is available here.

First, the FBA was run on the gapfilled unmodified (wild-type) model, which initially contained 1550 reactions and 1556 metabolites. The optimal biomass yield was found to be \( 224.3248 (\text{g dry weight}\cdot\text{h})^{-1} \). Next, we added the nitrification reactions, whose list is given below. \(\ce{Q}\) and \(\ce{QH_2}\) represent ubiquinone and ubiquinol, respectively. \( \text{UqO} \) is the ubiquinone oxidoreductase enzyme which catalyzes the last reaction, and is already present in the proteome of P. denitrificans. \[ \ce{NH_3 + QH_2 + O_2 ->[\text{AMO}] H_2O + Q + NH_2OH} \] \[ \ce{NH_3 + NAD + H_2O ->[\text{AMO}] 2H^+ + NADH + NH_2OH} \] \[ \ce{NH_2OH + O_2 ->[\text{HAO}] NO_2^- + H^+ + H_2O} \] \[ \ce{NH_2OH + 2Q + H_2O ->[\text{HAO}] NO_2^- + 2QH_2} \] \[ \ce{QH_2 ->[\text{UqO}] 2H^+ + Q} \] With the new model containing 1555 reactions and 1559 metabolites (with metabolites hydroxylamine, Q and QH2 added), the optimal biomass yield of the modified model was found to be \( \boxed{228.6980~(\text{g dry weight}\cdot\text{h})^{-1}} \).

Exchange Fluxes and Flux Variability Analysis

To gain a deeper understanding of how inclusion of new genes affects the metabolism of the cell, it is important to consider the difference in exchange fluxes between the wild type and synthetic strain. Exchange fluxes determine the intake and expulsion of certain extracellular metabolites, such as glucose, H+, or certain dipeptides. The set of exchange reactions also determines the composition of the growth medium. The set of fluxes that gives rise to the optimal biomass, however, is not unique. This includes exchange fluxes. The system of stoichiometric equations for FBA is usually underdetermined, which means that there are actually infinitely many solutions to the problem, all of which are optimal up to certain degree of significance. In other words, to investigate the significance of all exchange fluxes with respect to the optimum, we need to move away from analyzing specific numbers determined by FBA and instead consider ranges, or variations of fluxes that allow for the optimal solution.

To obtain the allowed range of fluxes, we employ a technique called Flux Variability Analysis (FVA). The main parameter of FVA is the so called fraction of optimum, which specifies the allowed deviation from the optimal biomass. In essence, during FVA we perturb each specified exchange flux in both directions under the constraint that the solution must within the fraction of optimum, thereby obtaining the allowed lower and upper bounds on the flux. If the fraction equals 1 (i.e. no deviation from the optimal biomass yield is allowed, in contrast to 0.95, which corresponds to 95% of the optimum), then any fluxes within the obtained range will reproduce the optimal solution. COBRApy package contains a module of functions which performs FVA on an existing model (see script for details).

From the data obtained during FVA, one can see that the addition of nitrification circuit does, in fact, enhance the metabolism of the entire cell. However, the data are insufficient to determine how exactly the additional reactions and metabolites allow for such improvement.

Nitrite Exchange Knock-out and New Project Directions

To visualize metabolic pathways in our model, we used escher. The two figures below are based on the data from COBRApy simulations.

Fig. 1: An escher map of the synthetic strain metabolism without nitrite transport block (optimal yield 228.6980 [gdw.h]-1)

Fig. 2: An escher map of the synthetic strain metabolism with nitrite transport knocked out (optimal yield 227.8975 [gdw.h]-1)

The first figure represents the nitrification-denitrification metabolic pathway of the strain with the device. As one can see in the first figure, all the nitrite produced is expelled from the cell through nitrite transport. This most likely happens due to cytotoxicity of nitrite. However, with nitrite transport genes knocked out, we can see in the second figure that all nitrite flows directly into the denitrification pathway. This offers a potential idea for a side project: how much will denitrification efficiency be improved due by knocking out nitrite exchange in Paracoccus denitrificans? This idea is worthwhile to explore in particular because knocking out nitrite exchange does not severely decrease the biomass yield (see figure captions).

Results, Summary and Discussion

Analysis of the data obtained from FBA revealed that the biomass yield of the P. denitrificans strain containing our device is greater than the wild-type yield by nearly 2%, which means that our bacterium will use the inserted nitrification pathway in order to enhance its metabolism. As such, our device potentially confers fitness advantage. We predict that our synthetic strain will out-compete the native wild-type strain if both strains are growing in the same environment, including sewage sludge. If our device is used in wastewater treatment facilities, we predict that there will be no need to artificially sustain the new culture.

As was stated earlier, the expected cell phenotype is the one with the largest biomass yield. This is only true under the assumption that the cell lives in an ideal or near-ideal environment with no over-population. Several studies have shown that under different growth environments or culture densities, cells can exhibit non-optimal yield metabolism[7,8], which means that pathways different from those obtained through FBA will be employed by the cell. We do not know whether wastewater causes a similar shift in metabolism of Paracoccus denitrificans. However, seeing as it is able to thrive in such environment gives enough reason to believe that the growth rate will not lose correlation with the biomass. Furthermore, it should be noted that this model is oblivious to protein structure and enzyme kinetics, and as such is unable to capture the full scale of effects caused by inserting the nitrification construct into the model.


(De)nitrification Kinetics

Description

In this section, we present a kinetic model of bioreactors which are able to perform both nitrification and denitrification. In particular, sequential batch reactors and bioreactors containing our synthetic strain of P. denitrificans are described by the model. Literature searching led to the discovery of a denitrification model called Activated Sludge Model with Indirect Coupling of Electrons (ASM-ICE), proposed by Pan et al. in 2013. Mathematica notebook is available here.


In essence, the model is given by a system of coupled nonlinear time-dependent differential equations that produces for the following intermediate concentrations: \(S_{NO3}\) - nitrate, \(S_{NO2}\) - nitrite, \(S_{NO}\) - nitric oxide, \(S_{N2O}\) - nitrous oxide, \(S_{N2}\) - nitrogen gas. The rate of change of each of these variables is coupled to \(X\) - the heterotrophic biomass, \(S_S\) - carbon source concentration, and one of \(S_{Mox}\) or \(S_{Mred}\) - concentrations of oxidized (reduced) electron carriers. The relationships are summarized by a stoichiometric matrix below.

Table of constants for ASM-ICE model. Adapted from Pan et al.

Bla


Software

References

[1] Oberhardt M., Chavali A., Papin J. (2009) Flux Balance Analysis: Interrogating Genome-Scale Metabolic Networks. In: Maly I. (eds) Systems Biology. Methods in Molecular Biology (Methods and Protocols), vol 500. Humana Press
[2] Feist, Adam M., and Bernhard O. Palsson. “The Biomass Objective Function.” Current opinion in microbiology 13.3 (2010): 344–349. PMC. Web. 28 July 2017.
[3] Cuevas, Daniel A. et al. “From DNA to FBA: How to Build Your Own Genome-Scale Metabolic Model.” Frontiers in Microbiology 7 (2016): 907. PMC. Web. 27 July 2017.
[4] Henry, C.S., DeJongh, M., Best, A.B., Frybarger, P.M., Linsay, B., and R.L. Stevens. High-throughput Generation and Optimization of Genome-scale Metabolic Models. Nature Biotechnology, (2010).
[5] Ebrahim, Ali et al. “COBRApy: COnstraints-Based Reconstruction and Analysis for Python.” BMC Systems Biology 7 (2013): 74. PMC. Web. 17 Aug. 2017.
[6] Mundy, Michael, Helena Mendes-Soares, and Nicholas Chia. "Mackinac: a bridge between ModelSEED and COBRApy to generate and analyze genome-scale metabolic models." Bioinformatics (2017): btx185.
[7] MLA Adadi, Roi et al. “Prediction of Microbial Growth Rate versus Biomass Yield by a Metabolic Network with Kinetic Parameters.” Ed. Nathan D. Price. PLoS Computational Biology 8.7 (2012): e1002575. PMC. Web. 31 July 2017.
[8] Molenaar, Douwe et al. “Shifts in Growth Strategies Reflect Tradeoffs in Cellular Economics.” Molecular Systems Biology 5 (2009): 323. PMC. Web. 31 July 2017.