Placement of the Downstream box for amilCP
Overview:
Introduction
In this project we want to produce a bacterium expressing amilCP only at low temperature.
The majority of the 5’ regulatory sequence, the cspA 5’UTR, is upstream of the start codon.
However, a small but important sequence motif, called the DS box, is present after the start codon, and is translated as part of the target protein. Therefore, it is important to consider the structure and function of the target protein, particularly the N-terminal region, in order to correctly place the DS box insertion so that it does not alter the structure or function. However, there is no experimental structure for amilCP. Therefore, we use a computational approach, homology modeling, to create a model of amilCP and identify a safe insertion site in the sequence that would not alter the structure.
Our version of the DS box RNA sequence is 15 nucleotides long (as opposed to the 14 nucleotide version reported in YAMANAKA1); we added an extra nucleotide at the 3’ end of this motif so as to not introduce a frame shift error in amilCP.
Materials and Methods
We used the Phyre2 homology modeling server2 to find templates, generate alignments, and create 3D models. Phyre2 use the hidden Markov model. This model is trustworthy up to 15% identity between the protein to model and the chosen template.
Modeling parameters
We submitted the amilCP sequence (Part:BBa_K592009) to the Phyre2 server for all steps of homology modeling, from template search to model generation.
MSVIAKQMTYKVYMSGTVNGHYFEVEGDGKGKPYEGEQTVKLTVTKGGPLPFAWDILSPQCQYGSIPFTKYPEDIPDYVKQSFPEGYTWERIMNFEDGAVCTVSNDSSIQGNCFIYHVKFSGLNFPPNGPVMQKKTQGWEPNTERLFARDGMLLGNNFMALKLEGGGHYLCEFKTTYKAKKPVKMPGYHYVDRKLDVTNHNKDYTSVEQCEISIARKPVVA
Results
Homology modelling of wild-type AmilCP by Phyre2
Phyre2 choose as the best template PDB entry 1mou (See Protein Data Bank) which is a blue chromoprotein from the GFP family found in Montipora efflorescens.
We have made available all the complete Phyre 2 results: Download here
Phyre2’s best template match was PDB entry 1mou (a coral pocilloporin pigment), calculated to have a sequence identity of 95% to amilCP. This implies that the structure of amilCP will be largely identical.
Phyre2’s resulting model is indeed a GFP-like beta barrel with modified chromophore residue in the middle.
Model evaluation
The very high sequence ID gave us high confidence in the resulting model, even without further optimization or comparative analysis. We were specifically interested in the position of the N-terminus relative to the structure of the rest of the protein, as this would be the general insertion site of the DS Box sequence. In order to evaluate the model quality and obtain a precise geometrical defintion of secondary structure at the residue level we analyzed the resulting model’s Ramachandran plot.
Analysis of the DS box’s placement within amilCP protein
All the following visualisations have been performed using the BioVia Discovery Studio Visualizer developed by Dassault Systemes.
Here we highlight the position of the N-terminus on the 3D structure. AmilCP has a barrel Beta sheet conformation. Based on the Ramachandran plot, we see that the fourth amino acid (isoleucine) is involved in the beta sheet. To avoid the disturbing the protein structure, we must place the DS box before this fourth amino acid.
Solvent accessibility
Next, we calculated the solvent accessibility of the protein structure, paying particular attention to the N-terminus. This allowed us to assess it’s interaction with the rest of the protein structure. The blue color means that a residue is solvent accessible; green indicates that it is buried in the protein structure. The amilCP model is represented in ribbon on the left and as a space filling model on the right.
From the Ramachandran plot we know that this region is not locked into a beta sheet conformation, and from the solvent accessible surface calculation it appears that it makes little contact with the rest of the protein structure. After consideration we decided to place the DS box after the first two residues.
Placement of the DS box
Once the DS box insertion site was chosen, Phyre2 was used to create a 3D model of amilCP with the DS box sequence added after the second residue. AmilCp + DS box sequence (DS box in red):
MSMTGIVVIAKQMTYKVYMSGTVNGHYFEVEGDGKGKPYEGEQTVKLTVTKGGPLPFAWDILSPQCQYGFTKYPEDIPDYVKQSFPEGYTWERIMNFEDGAVCTVSNDSSIQGNCFIYHVKFSGLNFPPNGPVMQKKTQGWEPNTERLFARDGMLLGNNFMALKLEGGGHYLCEFKTTYKAKKPVKMPGYHYVDRKLDVTNHNKDYTSVEQCEISIARKPVVA
Phyre 2 returned this 3D model. The DS box is highlighted in yellow. The beginning of the protein strand, is clearly not involved in the beta sheets barrel.
We have made available all the complete Phyre 2 results: Download here
To confirm the viability of this model, a residue interaction analysis was done with the “Bioviva Discovery Studio Visualizer” software, developed by DASSAULT SYSTEMES.
The residues having a possible interaction with the DS box sequence are highlighted in yellow on the 3D model and in black on the sequence.
This shows only two residues in the protein seems to have a probability to interact with the DS box which is less than the wild type beginning of the protein.
Conclusion
Based on the very high (95%) sequence identity between target and template, we have confidence in Phyre2’s homology model. We extended the DS box sequence from 14 to 15 nucleotides in order to avoid a frame shift error and then placed this insert into the coding sequence at a position which is predicted to not interefere with amilCP’s chromogenic structure. This will be verified experimentally by activity assays of our rationally designed construct.
References
- Yamanaka K, Mitta M, Inouye M. Mutation Analysis of the 5′ Untranslated Region of the Cold Shock cspA mRNA of Escherichia coli. Journal of Bacteriology. 1999;181(20):6284-6291.
- Kelley LA et al. The Phyre2 web portal for protein modeling, prediction and analysis. Nature Protocols 10, 845-858 (2015)
Igem ionis
Is an association created by Sup’Biotech student in 2015. Since this first participation, two teams (2015 and 2016) won the gold medal and several nominations: « Best presentation », « Best applied design », and « Best environmental project ». The strength of the IGEM IONIS comes from its multidisciplinarity and its complementarity.
This year we are 20 members from different schools:18 students from Sup’Biotech1 student from e-artsup1 student from EpitaRead more …
keep IN TOUCH
Location: 66 Rue Guy Môquet94800 Villejuif, France
Email: igem.ionis@gmail.com