Team:EpiphanyNYC/Model

Model (Computational data model)

 

Summary:

To begin computational research, Huntington mutants and their percentage of occurrence were located on various genetic databases, mainly NCBI. Our goal was to achieve toehold strand displacement of mutated HTT with a corrected strand. Sequences were run through protein-folding softwares to select viable candidates for the project. By aligning proteins, hairpin loops could be identified and targeted. Candidates had to be created for chaperone and promoter sequences, with approximately 40 CAG repeats within the sequence. The computational team was split up to test for feasible sequences. When attempting to order strands, the sequence was not practical, so it was revised.

 

Revised what was on the website a little bit:

Our goal was to attempt a toehold strand displacement of mutated HTT with a corrected strand. We used mFold software packages to model RNA sequence folds in order to find a tractable hairpin within the 5’ UTR. However, using mFold did not provide enough information on any full sequences the size of HTT. We switched to the Vienna package as it provided a much better model of the data. Model folding calculations/visualizations allowed prediction of the position of a usable a hairpin loop for strand displacement.

 

 

After running our sequence through Vienna, it was apparent mRNA molecule was still too large. One software prediction was not enough. Many online sites with protein folding capabilities proved difficult to use or were not being maintained. Genstrip and RNAI designer were two programs provided multiples sequences for us to target. UGENE was used to view and align these sequences, allowing us to target the optimal hairpin loop and figure out exactly where to begin targeting.

 

Computational Lab Notes:

  • Looked for Huntington’s mRNA in particular in NCBI
  • NM numbers, other prefixes gave types of mRNA
  • HW: Find huntington’s-related sequences in NCBI database (assumed at least 5-10 sequences)
  • We used data we received from [ncbi.com] and used it to compare with the wild type and the infected types.
  • HW Results: Found 2 accession numbers- NM_002111.8 (mRNA) and NP_002102.4 (protein); however, only one was mRNA and none were disease form
  • We then collected the data in our shared folder
  • Looked through other databases for other mutants; found:
  • Study with “20 Huntington’s Disease and 49 neurologically normal control samples from post-mortem human subjects”
  • http://trace.ddbj.nig.ac.jp/DRASearch/study?acc=SRP051844
  • Results for HTT on DNA Data Bank of Japan (includes above):
  • http://trace.ddbj.nig.ac.jp/DRASearch/query?keyword=htt&show=20&fq_rep_name=Homo%20sapiens
  • HW: Run wild-type mRNA on mfold (Create a text document with the NM_00211.8 RNA and run mFold on it http://unafold.rna.albany.edu/?q=mfold)
  • We then collected the data in our shared folder
  • HW Results: Nobody able to run mfold successfully locally or on web