Rstclair2012 (Talk | contribs) |
Rstclair2012 (Talk | contribs) |
||
Line 52: | Line 52: | ||
<h4>Artemisinin Consensus Sequence</h4> | <h4>Artemisinin Consensus Sequence</h4> | ||
<p style="font-size: 18px"> | <p style="font-size: 18px"> | ||
− | + | Sequence length of 150, among others, performed rather well. Validation accuracy of around 90 percent (FigureD).The loss/validation graph (Figure E) shows the value of the error of function within the validation set (lower number = more accurate prediction). The model was also able to determine the composition of sequence that is most probable to be associated with binding (Figure F). Since binding to Artemisinin is controlled by multiple unknown mechanisms, we proposed a new dataset to show the model could show a consensus sequence if one was present in a known control experiment (Homeobox Consensus Sequence). | |
</p> | </p> | ||
<h5> Figure D </h5> | <h5> Figure D </h5> | ||
− | <img style="max-width:95%;border:3px solid darkred;" src=" "> | + | <img style="max-width:95%;border:3px solid darkred;" src="https://static.igem.org/mediawiki/2017/5/55/T--Florida_Atlantic--ArtemisininAccuracyValidation.png" width= 200; height= 400px;> |
<h5> Figure E </h5> | <h5> Figure E </h5> | ||
− | <img style="max-width:95%;border:3px solid darkred;" src=" "> | + | <img style="max-width:95%;border:3px solid darkred;" src="https://static.igem.org/mediawiki/2017/f/fc/T--Florida_Atlantic--ArtemisininValidationLoss.png" width= 200; height= 400px;> |
<h5> Figure F </h5> | <h5> Figure F </h5> | ||
− | <img style="max-width:95%;border:3px solid darkred;" src=" "> | + | <img style="max-width:95%;border:3px solid darkred;" src="https://static.igem.org/mediawiki/2017/0/0f/T--Florida_Atlantic--ArtemisininTheoreticalConsensus.png" width= 200; height= 400px;> |
+ | |||
</br> | </br> | ||
<h4>Homeobox Consensus Sequence</h4> | <h4>Homeobox Consensus Sequence</h4> | ||
<p style="font-size: 18px"> | <p style="font-size: 18px"> | ||
− | Sequence length of 100 showed best and consistent results on our model. This result was predicted because the homeo-domain protein in around 60 amino acids in length. A sequence length of 100 allows for the machine to cut the sub-sequence (the parts of the sequence it views) more often, allowing for it to get the entire 60 amino acid long sequence in view (instead of the first half or later half if only viewing 60 amino acid long sequence length). The accuracy graph (Figure | + | Sequence length of 100 showed best and consistent results on our model. This result was predicted because the homeo-domain protein in around 60 amino acids in length. A sequence length of 100 allows for the machine to cut the sub-sequence (the parts of the sequence it views) more often, allowing for it to get the entire 60 amino acid long sequence in view (instead of the first half or later half if only viewing 60 amino acid long sequence length). The model also showed a predicted a sequence composition very close to the theoretical accepted homeo-domain consensus sequence (Figure H). The accuracy graph (Figure I) showed the highest score of around 80%, which is what you would expect to find for a variably conserved sequence (100% would mean the sequence is exactly the same). The loss/validation graph (Figure G) shows the value of the error of function within the validation set (lower number = more accurate prediction). The accuracy is an average of all the proteins in the data set that the model was tested on in predicting if the sequence it was looking at had the theoretical homeo-domain sequence or not. This model was used as the control to compare to the previous artemisinin binding set. Once the model was trained, we ran the theoretical consensus sequence of homeo-domain through the model, which detected the sequence was present with a probability of 94% (see software page). |
</p> | </p> | ||
+ | <h5> Figure H </h5> | ||
+ | <img style="max-width:95%;border:3px solid darkred;" src="https://static.igem.org/mediawiki/2017/d/df/T--Florida_Atlantic--HomeoPredicted.jpeg" width= 200; height= 400px;> | ||
+ | <center><h5> Figure I _________________________________________________________________________________________________________ Figure G</h5></center> | ||
+ | <img style="max-width:95%;border:3px solid darkred;" src="https://static.igem.org/mediawiki/2017/5/5b/T--Florida_Atlantic--HomeoModelResults.png" width= 200; height= 400px;> | ||
− | |||
− | |||
− | |||
− | |||
Revision as of 23:53, 1 November 2017
Florida_Atlantic