Difference between revisions of "Fabry:Sequence-based mutation analysis/Journal"

From Bioinformatikpedia
(SNAP)
(Multiple sequence alignment)
 
(One intermediate revision by one other user not shown)
Line 6: Line 6:
 
IN:
 
IN:
 
[https://dl.dropbox.com/u/13796643/fabry/seqbased/data/aa_properties.txt aa_properties.txt]<ref>Wikipedia, Amino Acid (June 11th, 2012), [http://en.wikipedia.org/wiki/Amino_acid#Table_of_standard_amino_acid_abbreviations_and_properties http://en.wikipedia.org/wiki/Amino_acid#Table_of_standard_amino_acid_abbreviations_and_properties]. June 11th, 2012</ref>, [https://dl.dropbox.com/u/13796643/fabry/seqbased/data/resMass.txt resMass.txt]<ref>ExPASy. The amino acid masses [http://education.expasy.org/student_projects/isotopident/htdocs/aa-list.html http://education.expasy.org/student_projects/isotopident/htdocs/aa-list.html]. June 12th, 2012</ref>,
 
[https://dl.dropbox.com/u/13796643/fabry/seqbased/data/aa_properties.txt aa_properties.txt]<ref>Wikipedia, Amino Acid (June 11th, 2012), [http://en.wikipedia.org/wiki/Amino_acid#Table_of_standard_amino_acid_abbreviations_and_properties http://en.wikipedia.org/wiki/Amino_acid#Table_of_standard_amino_acid_abbreviations_and_properties]. June 11th, 2012</ref>, [https://dl.dropbox.com/u/13796643/fabry/seqbased/data/resMass.txt resMass.txt]<ref>ExPASy. The amino acid masses [http://education.expasy.org/student_projects/isotopident/htdocs/aa-list.html http://education.expasy.org/student_projects/isotopident/htdocs/aa-list.html]. June 12th, 2012</ref>,
[https://dl.dropbox.com/u/13796643/fabry/seqbased/data/ip.txt ip.txt]<ref>Wikipedia, Proteinogenic amino acid (May 20th, 2012‎) [http://en.wikipedia.org/wiki/Proteinogenic_amino_acid#Chemical_properties http://en.wikipedia.org/wiki/Proteinogenic_amino_acid#Chemical_properties]. June 12th, 2012</ref>,
+
[https://dl.dropbox.com/u/13796643/fabry/seqbased/data/pI.txt pI.txt]<ref>Wikipedia, Proteinogenic amino acid (May 20th, 2012‎) [http://en.wikipedia.org/wiki/Proteinogenic_amino_acid#Chemical_properties http://en.wikipedia.org/wiki/Proteinogenic_amino_acid#Chemical_properties]. June 12th, 2012</ref>,
 
[https://dl.dropbox.com/u/13796643/fabry/seqbased/data/pickedSNPsNOINFO.txt pickedSNPsNOINFO.txt]
 
[https://dl.dropbox.com/u/13796643/fabry/seqbased/data/pickedSNPsNOINFO.txt pickedSNPsNOINFO.txt]
   
Line 36: Line 36:
   
 
== Multiple sequence alignment ==
 
== Multiple sequence alignment ==
  +
[http://www.uniprot.org/blast/uniprot/2012061770B0OSJCOA Blast result]
  +
 
== Scoring methods ==
 
== Scoring methods ==
 
=== SIFT ===
 
=== SIFT ===

Latest revision as of 22:24, 20 June 2012

Fabry Disease » Sequence-based mutation analysis » Journal


Amino acid properties

IN: aa_properties.txt<ref>Wikipedia, Amino Acid (June 11th, 2012), http://en.wikipedia.org/wiki/Amino_acid#Table_of_standard_amino_acid_abbreviations_and_properties. June 11th, 2012</ref>, resMass.txt<ref>ExPASy. The amino acid masses http://education.expasy.org/student_projects/isotopident/htdocs/aa-list.html. June 12th, 2012</ref>, pI.txt<ref>Wikipedia, Proteinogenic amino acid (May 20th, 2012‎) http://en.wikipedia.org/wiki/Proteinogenic_amino_acid#Chemical_properties. June 12th, 2012</ref>, pickedSNPsNOINFO.txt

OUT: SNP_aaProps.txt, SNP_aaProps.wiki

perl read_AAProp.pl

Secondary Structure

IN: pickedSNPsNOINFO.txt, P06280.dssp.ss, P06280.reprof.ss, P06280.psipred.ss (from Task 3)
OUT: SNP_secStruc.txt, SNP_secStruc.wiki

perl compare_seq_struc.pl

Substitution matrices

IN: pickedSNPsNOINFO.txt, BLOSUM62.csv<ref>Koders BLOSUM 62 [1]. June 12th. 2012</ref>, PAM250.csv<ref>Koders PAM250 [2]. June 12th. 2012</ref>, PAM1.csv (thank you Canavan Disease Group)
OUT: SNP_substMatr.txt, SNP_substMatr.wiki

perl subst.pl

PSSM

bash run_psi_blast.sh

Multiple sequence alignment

Blast result

Scoring methods

SIFT

IN: Prediction.txt
OUT: SNP_Sift.txt, SNP_Sift.wiki

 perl readSift.pl

Polyphen2

IN: pickedSNPsNOINFO.txt
OUT: Polyphen.batch

 perl prepBatchPolyphen2.pl

IN: pph2-full.txt
OUT: SNP_pph2.txt, SNP_pph2.wiki

 perl readPolyphen.pl

SNAP

In order to gain predictions for every (non-silent) mutation at the positions of the selected SNPs, a new mutation file had to be generated. Since we are only interested in ten positions, SNAPs all keyword, was not an option. The mutations file was generated by the script snap_generate_mutations.sh.

bash snap_generate_mutations.sh

We used the following command to obtain the SNAP2 predictions (see run_snap2.sh).

snap2 --tolerate -i P06280.fasta -m snap_mutations.txt -o P06280_snap2_result.txt |& tee snap2.log

Since snap2 exited with an error without the tolerate option, we had to add it to the command.

--tolerate
    Tolerate failures from external programs. Failures will trigger snap2 to
    switch into fallback mode (predictions will have lower accuracy)

Results and Conclusion

IN: SNP_aaProps_final.wiki, SNP_secStruc.wiki, SNP_substMatr.wiki, SNP_Sift.wiki, SNP_pph2.wiki

OUT: overview.wiki, overview.txt

perl createOverviewTable.pl

References

<references/>