Line 97: | Line 97: | ||
CascAID is a potentially universal tool for nucleic acid detection. | CascAID is a potentially universal tool for nucleic acid detection. | ||
Fast adaptation of our platform to new targets requires <i>in silico</i> verification of the crRNA design. | Fast adaptation of our platform to new targets requires <i>in silico</i> verification of the crRNA design. | ||
− | Crucial factors for the development of these crRNA designs are the binding of the crRNA to Cas13a | + | Crucial factors for the development of these crRNA designs are the binding of the crRNA to Cas13a, which is |
− | mainly determined by its secondary structure and the uniqueness of the targeting sequence in the transcriptome | + | mainly determined by its secondary structure, and the uniqueness of the targeting sequence in the transcriptome |
to rule out false positive results. To ensure the integrity of the Cas13a-crRNA complex, we developed | to rule out false positive results. To ensure the integrity of the Cas13a-crRNA complex, we developed | ||
− | a python script that uses the established program packages for secondary structures NUPACK and Mfold. | + | a python script that uses the established program packages for secondary structures, NUPACK and Mfold. |
In order to verify the specificity of the targeting sequence, we used the BLASTN-short program to | In order to verify the specificity of the targeting sequence, we used the BLASTN-short program to | ||
check for similar structures in a transcriptome databank. Additionally, we created a database of crRNA designs | check for similar structures in a transcriptome databank. Additionally, we created a database of crRNA designs | ||
Line 106: | Line 106: | ||
as extensive as possible given the limited time, checking for collaboration with other teams working with Cas13a, | as extensive as possible given the limited time, checking for collaboration with other teams working with Cas13a, | ||
mainly TU Delft. | mainly TU Delft. | ||
− | The second branch of software | + | The second branch of software we developed is needed for hardware control in our project. |
They allow user's devices such as computers and smartphones to control | They allow user's devices such as computers and smartphones to control | ||
− | our hardware | + | our hardware, Heatbringer and Lightbringer. |
The repository to our software can be found <a class="myLink" href="https://github.com/igemsoftware2017/igem_munich_2017">here</a>. | The repository to our software can be found <a class="myLink" href="https://github.com/igemsoftware2017/igem_munich_2017">here</a>. | ||
</p> | </p> | ||
Line 124: | Line 124: | ||
There are two main problems regarding the design of crRNA for a diagnostic test. | There are two main problems regarding the design of crRNA for a diagnostic test. | ||
First, the secondary structure of the crRNA needed for Cas13a activity needs to be verified. | First, the secondary structure of the crRNA needed for Cas13a activity needs to be verified. | ||
− | + | Secondly, the sequence targeted by the crRNA has to be specific, i.e. there must be no identical sequence in the | |
− | reference transcriptome of an healthy patient. Otherwise off-target effects will lead to | + | reference transcriptome of an healthy patient. Otherwise, off-target effects will lead to |
false positive results since Cas13a is activated even though the pathogen is not present. | false positive results since Cas13a is activated even though the pathogen is not present. | ||
To address these issues, we developed a software relying on bioinformatic principles such as | To address these issues, we developed a software relying on bioinformatic principles such as | ||
− | secondary structure prediction and Basic Local Alignment | + | secondary structure prediction and Basic Local Alignment Search Tool (BLAST). |
</p> | </p> | ||
</td> | </td> | ||
Line 147: | Line 147: | ||
crystallography data of crRNA in complex with Cas13a, or from structure prediction data of experimentally | crystallography data of crRNA in complex with Cas13a, or from structure prediction data of experimentally | ||
tested crRNAs. Using secondary structure verification we were able to rule out misfolding crRNA | tested crRNAs. Using secondary structure verification we were able to rule out misfolding crRNA | ||
− | designs prior to experiment. We developed a script for the end user | + | designs prior to experiment. We developed a script for the end user automating this procedure. |
</p> | </p> | ||
</td> | </td> | ||
Line 156: | Line 156: | ||
<br> | <br> | ||
<p> | <p> | ||
− | NUPACK is a RNA Secondary Structure Prediction | + | NUPACK is a RNA Secondary Structure Prediction package developed |
by several contributors under the guidance of Prof. Niles A. Pierce at the California Insitute of Technology (Caltech). | by several contributors under the guidance of Prof. Niles A. Pierce at the California Insitute of Technology (Caltech). | ||
The source-code is available free-of-charge for academic usage. | The source-code is available free-of-charge for academic usage. | ||
NUPACK allows the analysis of the partition function, the minimum free energy and the equillibrium base-pairing | NUPACK allows the analysis of the partition function, the minimum free energy and the equillibrium base-pairing | ||
probabilities of a RNA sequence. | probabilities of a RNA sequence. | ||
− | For offline usage we implemented NUPACK locally. We proceeded to implement Mfold as a webserver request. | + | For offline usage, we implemented NUPACK locally. We proceeded to implement Mfold as a webserver request. |
This decision was made because we experienced that in certain cases, only one of the program packages | This decision was made because we experienced that in certain cases, only one of the program packages | ||
was able to predict the secondary structure of crRNA as described in previous papers, predominantly the paper of Liu et al. published in <i>Cell</i> in 2017 | was able to predict the secondary structure of crRNA as described in previous papers, predominantly the paper of Liu et al. published in <i>Cell</i> in 2017 | ||
Line 228: | Line 228: | ||
<p> | <p> | ||
Mfold is a webserver for RNA secondary structure prediction developed by Michael Zuker based on his paper | Mfold is a webserver for RNA secondary structure prediction developed by Michael Zuker based on his paper | ||
− | "Mfold web server for nucleic acid folding and hybridization prediction" that published in <i>Nucleic Acids Research</i> | + | "Mfold web server for nucleic acid folding and hybridization prediction" that was published in <i>Nucleic Acids Research</i> |
in 2003. Since Mfold is not available as a locally buildable binary for every operating system, we developed a | in 2003. Since Mfold is not available as a locally buildable binary for every operating system, we developed a | ||
script that automatically requests a standardised RNA Fold job from the server, therefore making it available | script that automatically requests a standardised RNA Fold job from the server, therefore making it available | ||
throughout all operating systems. Using the result obtained from this request, the secondary structure is | throughout all operating systems. Using the result obtained from this request, the secondary structure is | ||
checked via a string comparison in so-called "Vienna" notation. This notation gives base pairing as a string | checked via a string comparison in so-called "Vienna" notation. This notation gives base pairing as a string | ||
− | of dots and brackets where a dot represents a non-bonded base and brackets | + | of dots and brackets where a dot represents a non-bonded base and brackets represents paired bases, clarified by |
− | a opening bracket "(" at the 5'-end | + | a opening bracket "(" at the 5'-end and a closing bracket ")" at the 3'-end of each paired sequence. An example for the output |
of the program is given below: | of the program is given below: | ||
<pre style="text-align: left;"> | <pre style="text-align: left;"> | ||
Line 289: | Line 289: | ||
</p> | </p> | ||
<p> | <p> | ||
− | This is also a good example to show that | + | This is also a good example to show that it might happen that one program recognizes the |
crRNA secondary structure while the other does not. In this case, NUPACK has predicted the structure | crRNA secondary structure while the other does not. In this case, NUPACK has predicted the structure | ||
while Mfold is not able to predict the structure. Even though this is an experimental construct | while Mfold is not able to predict the structure. Even though this is an experimental construct | ||
Line 308: | Line 308: | ||
In order to rule out off-target effects for the designed crRNA in diagnostic applications, | In order to rule out off-target effects for the designed crRNA in diagnostic applications, | ||
we developed a script that is able to BLAST the sequence either against whole databases | we developed a script that is able to BLAST the sequence either against whole databases | ||
− | online or a | + | online or a custom database for human and bacterial transcriptomes containing data from human |
as well as from bacteria common in the human nasal tract and modell organisms used in our project including: | as well as from bacteria common in the human nasal tract and modell organisms used in our project including: | ||
<ol style="list-style-type:disc; list-style-position:left; text-align: left;"> | <ol style="list-style-type:disc; list-style-position:left; text-align: left;"> | ||
Line 321: | Line 321: | ||
</p> | </p> | ||
<p> | <p> | ||
− | Transcriptomes that | + | Transcriptomes that are common in the nasal tract but were not available are, |
− | + | among others: | |
</p> | </p> | ||
<ol style="list-style-type:disc; list-style-position:left; text-align: left;"> | <ol style="list-style-type:disc; list-style-position:left; text-align: left;"> |
Revision as of 21:21, 1 November 2017
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|