Difference between revisions of "Team:Hong Kong-CUHK/Software"

Line 26: Line 26:
 
</a>
 
</a>
  
<center><h3>Overview of Software</h3></center>
+
<center><h1>Overview of Software</h1></center>
 
<img src="https://static.igem.org/mediawiki/2017/3/38/CUHK_specification.jpg" width="20%" height="auto" style= "float: right">
 
<img src="https://static.igem.org/mediawiki/2017/3/38/CUHK_specification.jpg" width="20%" height="auto" style= "float: right">
 
To reduce manpower, we wrote a program to automatically generate toehold switch sequences from target RNA input. We developed it to a website for the convenience of other iGEMers who are interested in toehold switch application. Although this website can execute many functions to reduce repetitive works for user, careful examination of thermodynamic parameters and RNA secondary structures is still encouraged to get a promising switch sequence. To help iGEMers get started, we would like to do a brief introduction on our website here.
 
To reduce manpower, we wrote a program to automatically generate toehold switch sequences from target RNA input. We developed it to a website for the convenience of other iGEMers who are interested in toehold switch application. Although this website can execute many functions to reduce repetitive works for user, careful examination of thermodynamic parameters and RNA secondary structures is still encouraged to get a promising switch sequence. To help iGEMers get started, we would like to do a brief introduction on our website here.
Line 41: Line 41:
  
  
<h1> Instructions </h1>
+
<h3> Instructions </h3>
 
<h3>Create Your Account</h3>
 
<h3>Create Your Account</h3>
<img src="https://static.igem.org/mediawiki/2017/d/de/CUHK_register.png" width="20%" height="auto">
+
<img src="https://static.igem.org/mediawiki/2017/d/de/CUHK_register.png" width="20%" height="auto" style="float: left">
 
<p style="font-family: roboto;font-size:115%;">
 
<p style="font-family: roboto;font-size:115%;">
  
Line 49: Line 49:
 
<br>
 
<br>
 
<br>
 
<br>
<h3>Input target RNA sequence</h3>
+
<br>
 +
<br>
 +
<br>
 +
<br>
 +
<br>
 +
<h3>Design your toehold switch</h3>
 
<br><p style="font-family: roboto;font-size:115%;">
 
<br><p style="font-family: roboto;font-size:115%;">
On the “Design toehold switch” page, user can input target RNA in plain text or FASTA file with numbers, space, newline, uppercase, lowercase, T or U.  
+
Input target RNA sequence
 +
<br>
 +
User can input target RNA in plain text or FASTA file with numbers, space, newline, uppercase, lowercase, T or U.  
 
Target RNA sequences will be transformed to uppercased DNA sequence. All downstream process will use this format. Sequences less than 30 bp will be rejected. After procession of input, a page with all user inputs will prompt for user to check and confirm. Rejected sequences will also be shown.
 
Target RNA sequences will be transformed to uppercased DNA sequence. All downstream process will use this format. Sequences less than 30 bp will be rejected. After procession of input, a page with all user inputs will prompt for user to check and confirm. Rejected sequences will also be shown.
 
The program will set the first nucleotide of each target RNA as the start point, generate possible switch sequences for each 30, 31, 32, 33, 34 or 35 nucleotides (nt), which is called a window sequence.   
 
The program will set the first nucleotide of each target RNA as the start point, generate possible switch sequences for each 30, 31, 32, 33, 34 or 35 nucleotides (nt), which is called a window sequence.   
Line 65: Line 72:
 
<br>
 
<br>
 
<br>
 
<br>
<h3>Ribosomal binding site</h3>
+
Input parameters:
<br><p style="font-family: roboto;font-size:115%;">
+
Then “AUA” or “UAU”, ribosomal binding site (RBS), “UAU” or “AUA” and the second half of the hairpin will be added after window sequences.
+
 
<br>
 
<br>
<h3>Rare codon count and RFC standard check</h3>
+
• Choose your cloning method (optional)
Rare codon count and RFC standard check are specifically added for iGEMers.
+
 
<br>
 
<br>
 +
User can choose to add RE site or use our toehold switch cloning tool.
 
<br>
 
<br>
<h3>MFE calculation</h3><p style="font-family: roboto;font-size:115%;">
+
• Choose Promoter (optional)
MFE and MFE structure of this switch, switch dimer, window-switch dimer and RBS-linker will be calculated according to user input. The base pair condition of the toehold domain of switch sequence will be counted, and switches with excessive domain pair will be discarded.
+
 
<br>
 
<br>
 +
User can input a custom promoter or select a promoter from iGEM registry.
 
<br>
 
<br>
<h3>Trigger length</h3><p style="font-family: roboto;font-size:115%;">
+
• Input linker sequence:
The downstream 120 nt (or user specified length) from the start site will be copied to be trigger sequence. Trigger sequence will co-express with corresponding switch under the control of T7 promoter to validate the functionality of the switch.
+
Then MFE and MFE structure of trigger, trigger dimer, trigger-switch dimer will be calculated as user input.
+
 
<br>
 
<br>
 +
The linker is used to separate the coding sequence in the toehold switch and the reporter to prevent interference of protein folding. A default flexible linker (AACCUGGCGGCAGCGCAAAAG) is provided.
 
<br>
 
<br>
<h3>Plasmid construction</h3><p style="font-family: roboto;font-size:115%;">
+
• Choose ribosomal binding site
For plasmid construction, three options are provided. The first option is to use our standard backbone, which contains T7 promoter, EcoR31 recognition site and antibiotics resistance gene. A plasmid for specific switch and trigger pair can be produced by simply digesting the switch standard backbone and trigger standard backbone and ligating them with switch and trigger separately. The second option is simple output switch and trigger sequence. The third option is generate overlapping PCR primers for user.
+
 
<br>
 
<br>
 +
User can input a custom RBS or select a RBS from iGEM registry.
 
<br>
 
<br>
<h3>BLASTn</h3><p style="font-family: roboto;font-size:115%;">
+
• Input trigger length
After all target RNA sequences are processed, BLASTn will be done for every legal window sequences if user ticked this option. BLASTn result will be saved in txt file with switch number as file name.
+
 
<br>
 
<br>
 +
The downstream 120nt (or user specified length) from the start site will be copied to be trigger sequence.
 +
<br>
 +
• Check RFC
 +
RFC standard check are specifically added for iGEMers. The program can help you to eliminate switches with illegal sites.
 +
<br>
 +
<br>
 +
<br>
 +
Output option
 +
<br>
 +
• Counting rare codon
 +
<br>
 +
User can choose to count the occurrence of rare codon at the start of the CDS. Rare codon at the beginning of CDS may significantly hinder translation.
 +
<br>
 +
• Minimal free energy (MFE) and MFE structure calculation
 +
<br>
 +
User can choose to output the MFE and MFE structure of this switch, switch dimer, window-switch dimer and RBS-linker. We used embedded function of “Vienna RNA” (?) to do the calculation.
 +
<br>
 +
• Base pair condition
 +
<br>
 +
The base pair condition of the toehold domain of switch sequence and the RBS domain in trigger-switch dimer will be counted, and switches with excessive domain pair will be discarded. A low number of base pairing at the toehold domain and RBS is preferred.
 +
<br>
 +
• BLASTn
 +
<br>
 +
After all target RNA sequences are processed, BLASTn will be done for every legal window sequences if user ticked this option. BLASTn result will be saved in txt file with switch number as file name.
 
<br>
 
<br>
<h3>Output excel file</h3><p style="font-family: roboto;font-size:115%;">
 
The output result will be represented in a excel file. </p>
 
  
<img src="https://static.igem.org/mediawiki/2017/f/fa/CUHK_Excel.png" width="100%" height="auto">
 
  
  
Line 99: Line 123:
  
  
<p style="font-family: roboto;font-size:115%;">
 
Below list out the definition of terms used in the output excel. </p>
 
<img src="
 
https://static.igem.org/mediawiki/2017/8/87/CUHK_swicthdescription.png" width="70%" height="auto">
 
  
  
  
  
<h3>Email address</h3> <p style="font-family: roboto;font-size:115%;">
+
 
 +
<h3>Output</h3><p style="font-family: roboto;font-size:115%;">
 +
• Excel output
 +
<br>
 +
The output result will be represented in a excel file. </p>
 +
 
 +
<img src="https://static.igem.org/mediawiki/2017/f/fa/CUHK_Excel.png" width="100%" height="auto">
 +
 
 +
 
 +
<p style="font-family: roboto;font-size:115%;">
 +
Below list out the definition of terms used in the output excel. </p>
 +
<img src="
 +
https://static.igem.org/mediawiki/2017/8/87/CUHK_swicthdescription.png" width="70%" height="auto">
 +
<br>
 +
<br>
 +
• Email output
 +
<br>
 
User will receive result files by email. If the input is only one target RNA sequence without BLASTn function, the email with contain an excel file named by user specified name. If the input is multiple sequences or BLASTn is required, the email will contain a folder named by user specified name. Within the folder, both excel files for each target RNAs and BLASTn files will be named in digit numbers.
 
User will receive result files by email. If the input is only one target RNA sequence without BLASTn function, the email with contain an excel file named by user specified name. If the input is multiple sequences or BLASTn is required, the email will contain a folder named by user specified name. Within the folder, both excel files for each target RNAs and BLASTn files will be named in digit numbers.
 
Result files can also be retrieved in user account. The upper limit of fiel number is fifty, including excel files and BLASTn files.  
 
Result files can also be retrieved in user account. The upper limit of fiel number is fifty, including excel files and BLASTn files.  

Revision as of 11:52, 29 October 2017

Overview of Software

To reduce manpower, we wrote a program to automatically generate toehold switch sequences from target RNA input. We developed it to a website for the convenience of other iGEMers who are interested in toehold switch application. Although this website can execute many functions to reduce repetitive works for user, careful examination of thermodynamic parameters and RNA secondary structures is still encouraged to get a promising switch sequence. To help iGEMers get started, we would like to do a brief introduction on our website here.

Our web tool takes as input the trigger RNA sequence and various design parameters, such as the promoter and RBS sequences to be inserted into the toehold switch, length of the recognition part of the trigger RNA and experiment temperature. Users can also choose to produce some optional outputs, such as rare codon count, minimum free energy and numbers of paired and unpaired bases, which could be useful for ranking the resulting list of candidate toehold switches.

Upon receiving the inputs, our web tool carries out a number of steps to design the toehold switches (rightfigure). It first checks the format and validity of the inputs. After that, it uses a sliding window to consider every x consecutive bases of the trigger sequence as the potential recognition part, where x is the length of this part specified by the user. A toehold switch is constructed based on the subsequence in this window and the other input parameters. The toehold switch sequence is then subject to a sequence of tests, including the free of stop codons between the start codon and the downstream gene, and the lack of consecutive bases of the same type. If the switch can pass all these tests, free energy calculations will be performed next for the switch and trigger monomers and their interacting duplex, if the user chooses to output such information. Finally, optionally the sub-sequence in the window can be used to search the sequence database to check for highly similar off-target sequences.

Instructions

Create Your Account

An account is required for every user to execute the main program, save the result and retrieve result later. A total of fifty results could be saved in one account.






Design your toehold switch


Input target RNA sequence
User can input target RNA in plain text or FASTA file with numbers, space, newline, uppercase, lowercase, T or U. Target RNA sequences will be transformed to uppercased DNA sequence. All downstream process will use this format. Sequences less than 30 bp will be rejected. After procession of input, a page with all user inputs will prompt for user to check and confirm. Rejected sequences will also be shown. The program will set the first nucleotide of each target RNA as the start point, generate possible switch sequences for each 30, 31, 32, 33, 34 or 35 nucleotides (nt), which is called a window sequence. Since the whole switch sequence could be predicted by the initial window sequence, window sequence with following features will be rejected to ensure hairpin stability,:
• Have no C or G at the neck of hairpin
• Have in frame stop codon after AUG
• Have more than four consecutive A, G, C or T
Start point will move to right for one nucleotide if the previous window sequence is rejected. Switch design process will continue if the window sequence passed the first examination.

Input parameters:
• Choose your cloning method (optional)
User can choose to add RE site or use our toehold switch cloning tool.
• Choose Promoter (optional)
User can input a custom promoter or select a promoter from iGEM registry.
• Input linker sequence:
The linker is used to separate the coding sequence in the toehold switch and the reporter to prevent interference of protein folding. A default flexible linker (AACCUGGCGGCAGCGCAAAAG) is provided.
• Choose ribosomal binding site
User can input a custom RBS or select a RBS from iGEM registry.
• Input trigger length
The downstream 120nt (or user specified length) from the start site will be copied to be trigger sequence.
• Check RFC RFC standard check are specifically added for iGEMers. The program can help you to eliminate switches with illegal sites.


Output option
• Counting rare codon
User can choose to count the occurrence of rare codon at the start of the CDS. Rare codon at the beginning of CDS may significantly hinder translation.
• Minimal free energy (MFE) and MFE structure calculation
User can choose to output the MFE and MFE structure of this switch, switch dimer, window-switch dimer and RBS-linker. We used embedded function of “Vienna RNA” (?) to do the calculation.
• Base pair condition
The base pair condition of the toehold domain of switch sequence and the RBS domain in trigger-switch dimer will be counted, and switches with excessive domain pair will be discarded. A low number of base pairing at the toehold domain and RBS is preferred.
• BLASTn
After all target RNA sequences are processed, BLASTn will be done for every legal window sequences if user ticked this option. BLASTn result will be saved in txt file with switch number as file name.

Output

• Excel output
The output result will be represented in a excel file.

Below list out the definition of terms used in the output excel.



• Email output
User will receive result files by email. If the input is only one target RNA sequence without BLASTn function, the email with contain an excel file named by user specified name. If the input is multiple sequences or BLASTn is required, the email will contain a folder named by user specified name. Within the folder, both excel files for each target RNAs and BLASTn files will be named in digit numbers. Result files can also be retrieved in user account. The upper limit of fiel number is fifty, including excel files and BLASTn files.