Difference between revisions of "Task 3 (MSUD)"
(→Result) |
|||
Line 3: | Line 3: | ||
=== Result === |
=== Result === |
||
+ | |||
+ | The results for ReProf and PsiPred predictions and the DSSP assignments are in the following folders: |
||
+ | |||
+ | <code> |
||
+ | /mnt/home/student/schillerl/MasterPractical/task3/reprof/ |
||
+ | |||
+ | /mnt/home/student/schillerl/MasterPractical/task3/psipred/ |
||
+ | |||
+ | /mnt/home/student/schillerl/MasterPractical/task3/dssp/ |
||
+ | </code> |
||
+ | |||
+ | |||
+ | For P10775, ReProf was run with the protein sequence fasta file and the PSSMs (see <code>/mnt/home/student/schillerl/MasterPractical/task3/pssm/</code>) derived from big_80 and SwissProt as input. The following tables show the comparison of the prediction results to the secondary structure assignment of DSSP. |
||
+ | |||
+ | |||
+ | {| class="wikitable" border="1" style="width:500px" |
||
+ | |+ Comparison of ReProf prediction (fasta input) to DSSP assignment |
||
+ | |- |
||
+ | !secondary structure element !! recall !! precision !! f-measure |
||
+ | |- |
||
+ | |H || 0.719 || 0.585 || 0.645 |
||
+ | |- |
||
+ | |E || 0.211 || 0.500 || 0.296 |
||
+ | |- |
||
+ | |L || 0.616 || 0.654 || 0.635 |
||
+ | |} |
||
+ | |||
+ | |||
+ | {| class="wikitable" border="1" style="width:500px" |
||
+ | |+ Comparison of ReProf prediction (big_80 PSSM input) to DSSP assignment |
||
+ | |- |
||
+ | !secondary structure element !! recall !! precision !! f-measure |
||
+ | |- |
||
+ | |H || 0.944 || 0.889 || 0.916 |
||
+ | |- |
||
+ | |E || 0.649 || 0.685 || 0.667 |
||
+ | |- |
||
+ | |L || 0.826 || 0.866 || 0.846 |
||
+ | |} |
||
+ | |||
+ | |||
+ | {| class="wikitable" border="1" style="width:500px" |
||
+ | |+ Comparison of ReProf prediction (SwissProt PSSM input) to DSSP assignment |
||
+ | |- |
||
+ | !secondary structure element !! recall !! precision !! f-measure |
||
+ | |- |
||
+ | |H || 0.923 || 0.914 || 0.919 |
||
+ | |- |
||
+ | |E || 0.807 || 0.523 || 0.634 |
||
+ | |- |
||
+ | |L || 0.719 || 0.859 || 0.782 |
||
+ | |} |
||
+ | |||
+ | |||
+ | The percentages of correctly identified secondary structure (H, E or L) for the three methods are 61 %, 86 % and 82 %. So for the remaining sequences, the method with the best performance (usage of PSSM derived from big_80 as input for ReProf) is used. |
||
=== Discussion === |
=== Discussion === |
Revision as of 19:22, 15 May 2013
Contents
Secondary structure
Result
The results for ReProf and PsiPred predictions and the DSSP assignments are in the following folders:
/mnt/home/student/schillerl/MasterPractical/task3/reprof/
/mnt/home/student/schillerl/MasterPractical/task3/psipred/
/mnt/home/student/schillerl/MasterPractical/task3/dssp/
For P10775, ReProf was run with the protein sequence fasta file and the PSSMs (see /mnt/home/student/schillerl/MasterPractical/task3/pssm/
) derived from big_80 and SwissProt as input. The following tables show the comparison of the prediction results to the secondary structure assignment of DSSP.
secondary structure element | recall | precision | f-measure |
---|---|---|---|
H | 0.719 | 0.585 | 0.645 |
E | 0.211 | 0.500 | 0.296 |
L | 0.616 | 0.654 | 0.635 |
secondary structure element | recall | precision | f-measure |
---|---|---|---|
H | 0.944 | 0.889 | 0.916 |
E | 0.649 | 0.685 | 0.667 |
L | 0.826 | 0.866 | 0.846 |
secondary structure element | recall | precision | f-measure |
---|---|---|---|
H | 0.923 | 0.914 | 0.919 |
E | 0.807 | 0.523 | 0.634 |
L | 0.719 | 0.859 | 0.782 |
The percentages of correctly identified secondary structure (H, E or L) for the three methods are 61 %, 86 % and 82 %. So for the remaining sequences, the method with the best performance (usage of PSSM derived from big_80 as input for ReProf) is used.