Difference between revisions of "Sequence-based analyses Gaucher Disease"
(→Disorder) |
(→Secondary structure) |
||
Line 1: | Line 1: | ||
== Secondary structure == |
== Secondary structure == |
||
+ | Knowing the secondary structure of a protein can shed light on its function since structure implies function. If the structure of a protein is known, secondary elements (helix, sheet, coiled) can be ''assigned'' to residues depending on their affinity to form hydrogen bonds. DSSP is the most common method to perform such secondary structure assignments. If the structure of a protein is unknown, secondary structure elements be be ''predicted'' by tools like PSIPRED or Reprof. The aim of this task was to analyse the secondary structure of different proteins and the compare the secondary structure predictions of PSIPRED and Reprof with the DSSP secondary structure assignments. Following sequences were taken into account: |
||
+ | {|style="text-align:left" |
||
+ | ! NAME !! UniProtKB !! PDB |
||
+ | |- |
||
+ | | Glucosylceramidase || [http://www.uniprot.org/uniprot/P04062 P04062] || [http://www.ebi.ac.uk/pdbe-srv/view/entry/1OGS 1OGS] |
||
+ | |- |
||
+ | | Ribonuclease inhibitor || [http://www.uniprot.org/uniprot/P10775 P10775] || [http://www.ebi.ac.uk/pdbe-srv/view/entry/1DFJ 1DFJ] |
||
+ | |- |
||
+ | | Divalent-cation tolerance protein CutA || [http://www.uniprot.org/uniprot/Q9X0E6 Q9X0E6] || [http://www.ebi.ac.uk/pdbe-srv/view/entry/1KR4 1KR4] |
||
+ | |- |
||
+ | | Serine/threonine-protein phosphatase || [http://www.uniprot.org/uniprot/Q08209 Q08209] || [http://www.ebi.ac.uk/pdbe-srv/view/entry/1AUI 1AUI] |
||
+ | |} |
||
+ | Information about program calls and implementation details can be found in our [[protocol]]. |
||
=== Predictions === |
=== Predictions === |
||
− | P04062 |
+ | ==== P04062 ==== |
<pre> |
<pre> |
||
40 |
40 |
||
Line 46: | Line 59: | ||
</pre> |
</pre> |
||
− | P10775 |
+ | ==== P10775 ==== |
<pre> |
<pre> |
||
1 |
1 |
||
Line 100: | Line 113: | ||
</pre> |
</pre> |
||
− | Q08209 |
+ | ==== Q08209 ==== |
<pre> |
<pre> |
||
14 |
14 |
Revision as of 13:57, 17 May 2012
Contents
Secondary structure
Knowing the secondary structure of a protein can shed light on its function since structure implies function. If the structure of a protein is known, secondary elements (helix, sheet, coiled) can be assigned to residues depending on their affinity to form hydrogen bonds. DSSP is the most common method to perform such secondary structure assignments. If the structure of a protein is unknown, secondary structure elements be be predicted by tools like PSIPRED or Reprof. The aim of this task was to analyse the secondary structure of different proteins and the compare the secondary structure predictions of PSIPRED and Reprof with the DSSP secondary structure assignments. Following sequences were taken into account:
NAME | UniProtKB | PDB |
---|---|---|
Glucosylceramidase | P04062 | 1OGS |
Ribonuclease inhibitor | P10775 | 1DFJ |
Divalent-cation tolerance protein CutA | Q9X0E6 | 1KR4 |
Serine/threonine-protein phosphatase | Q08209 | 1AUI |
Information about program calls and implementation details can be found in our protocol.
Predictions
P04062
40 ARPCIPKSFGYSSVVCVCNATYCDSFDPPTFPALGTFSRYESTRSGRRMELSMGPIQANHTGTGLLLTLQPEQKFQKVKG Reprof CCCCCCCCCCCEEEEEEECCEECCCCCCCCCCCCCEEEEEEECCCCCEEEEECCCEECCCCCCEEEEEECCCCEEEEEEC PSIPRED CCCCCCCCCCCCCEEEEECCHHCCCCCCCCCCCCCEEEEEEECCCCCCHHCCCCCCCCCCCCCCCEEEECCCCCCEEEEE DSSP CECCCEEECCCCCEEEEEECCCCCECCCCCCCCCCEEEEEEEECCCCCCEEEEEECECCCCCCCEEEEEEEEEEEEECCE 120 FGGAMTDAAALNILALSPPAQNLLLKSYFSEEGIGYNIIRVPMASCDFSIRTYTYADTPDDFQLHNFSLPEEDTKLKIPL Reprof CCCCCCHHHHHEEEEECCCCCCEEEEEEECCCCCEEEEEEECCCCCCEEEEEEEECCCCCCCEEEEECCCCCCCCCEEEE PSIPRED EEEEHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHH DSSP EEEECCHHHHHHHCCCCHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCHHHH 200 IHRALQLAQRPVSLLASPWTSPTWLKTNGAVNGKGSLKGQPGDIYHQTWARYFVKFLDAYAEHKLQFWAVTAENEPSAGL Reprof EEHHHHHCCCCCEEEECCCCCCCEEEECCCECCCEECCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEEEEEECCCCCCCE PSIPRED HHHHHHHHCCCEEEEEEECCCCHHEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCC DSSP HHHHHHHCCCCCEEEEEECCCCHHHECCCCCCCCCEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCEEECCCCCCHHH 280 LSGYPFQCLGFTPEHQRDFIARDLGPTLANSTHHNVRLLMLDDQRLLLPHWAKVVLTDPEAAKYVHGIAVHWYLDFLAPA Reprof ECCCCEEEECCCCCCCCCEEEECCCCCCCCCCCCEEEEEEECCCCEECCCEEEEEECCCCCCEEEEEEEEEEEEEECCCC PSIPRED CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHHHCCHHHHHHCCEEEEECCCCCCCCH DSSP CCCCCCCCCECCHHHHHHHHHHCHHHHHHCCCCCCCEEEEEEEEHHHCCHHHHHHHCCHHHHCCCCEEEEEEECCCCCCH 360 KATLGETHRLFPNTMLFASEACVGSKFWEQSVRLGSWDRGMQYSHSIITNLLYHVVGWTDWNLALNPEGGPNWVRNFVDS Reprof CCECCCCCECCCCCEEEECCCCCCCEEEEEEEEECCCCCCCEEECEHHHHEEEEEEECCCCEEEECCCCCCEEEEECCCC PSIPRED HHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHEEEEEEEEECCCCCCCCCCCCCCC DSSP HHHHHHHHHHCCCCEEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCEEEEEEEECCECCCCCCCCCCCCCCC 440 PIIVDITKDTFYKQPMFYHLGHFSKFIPEGSQRVGLVASQKNDLDAVALMHPDGSAVVVVLNRSSKDVPLTIKDPAVGFL Reprof CEEEEECCCCCCCCCEEEECCCEEEECCCCCEEEEEEEECCCCCEEEEEECCCCCEEEEEEECCCCCCEEEECCCCEEEE PSIPRED CEEEECCCCEEEECHHHHHHHHHHHHCCCCCEEEEEECCCCCCEEEEEEECCCCCEEEEEEECCCCCEEEEEEECCCEEE DSSP CEEEEHHHCEEEECHHHHHHHHHHCCCCCCCEEEEEEECCCCCEEEEEEECCCCCEEEEEEECCCCCEEEEEEECCCEEE 520 ETISPGYSIHTYLWRRQ Reprof EEECCCCEEEEEEEECC PSIPRED EEECCCCEEEEEEEECC DSSP EEEECCCEEEEEEECCC
P10775
1 MNLDIHCEQLSDARWTELLPLLQQYEVVRLDDCGLTEEHCKDIGSALRANPSLTELCLRTNELGDAGVHLVLQGLQSPTC Reprof CCCCECHHCCCCCHHHHHHHHHHHCCEEEECCCCCCHHHHHHHHHHHHCCCCHHHHHHHHCCCCCCCHEEEHCCCCCCCC PSIPRED CEEECCCCCCCHHHHHHHHHHHCCCCEEECCCCCCCHHHHHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHHHCCC DSSP CECCCECCCCCHHHHHHHHHHHCCCCEEECECCCCCHHHHHHHHHHHHCCCCCCEEECCCCCCHHHHHHHHHHHHCCCCC 81 KIQKLSLQNCSLTEAGCGVLPSTLRSLPTLRELHLSDNPLGDAGLRLLCEGLLDPQCHLEKLQLEYCRLTAASCEPLASV Reprof EEEEECCCCCCCCHCCCCCCHHHHCHCHHHHHHCCCCCCCCHHHHHHHHHCCCCCHCCHHHHHHHHHHCCCCCCHHHHHH PSIPRED CCCEEECCCCCCCHHHHHHHHHHHHCCCCCCEEECCCCCCCHHHHHHHHHHHHCCCCCCCEEECCCCCCCHHHHHHHHHH DSSP CCCEEECCCCCCCCCHHHHHHHHHHHCCCCCEEECCCCCCHHHHHHHHHHHHHCCCCCCCEEECCCCCCCHHHHHHHHHH 161 LRATRALKELTVSNNDIGEAGARVLGQGLADSACQLETLRLENCGLTPANCKDLCGIVASQASLRELDLGSNGLGDAGIA Reprof HHHHHHHHHHCCCCCCHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHCHCCHHHCCCCCCCCCHHHHH PSIPRED HHHCCCCCEEECCCCCCCHHHHHHHHHHHHCCCCCCCEEECCCCCCCHHHHHHHHHHHHCCCCCCEEECCCCCCCHHHHH DSSP HHHCCCCCEEECCCCCCHHHHHHHHHHHHHCCCCCCCEEECCCCCCCHHHHHHHHHHHHHCCCCCEEECCCCCCHHHHHH 241 ELCPGLLSPASRLKTLWLWECDITASGCRDLCRVLQAKETLKELSLAGNKLGDEGARLLCESLLQPGCQLESLWVKSCSL Reprof HHCCCCCCCHHHHCHHEEEHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCHHHHHHHHCHH PSIPRED HHHHHHHHCCCCCCEEECCCCCCCHHHHHHHHHHHHCCCCCCEEECCCCCCCHHHHHHHHHHHHCCCCCCCEEECCCCCC DSSP HHHHHHCCCCCCCCEEECCCCCCCHHHHHHHHHHHHHCCCCCEEECCCCCCHHHHHHHHHHHHHCCCCCCCEEECCCCCC 321 TAACCQHVSLMLTQNKHLLELQLSSNKLGDSGIQELCQALSQPGTTLRVLCLGDCEVTNSGCSSLASLLLANRSLRELDL Reprof HHHHHHHHHHHHHCCHHHHHHHCCCCCCCCHHHHHHHHHHCCCCCEEEEEEECCCCCCCCCHHHHHHHHHHHCCHHHHCC PSIPRED CHHHHHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHCCCCCCCCEEECCCCCCCHHHHHHHHHHHHHCCCCCEEEC DSSP EHHHHHHHHHHHHHCCCCCEEECCCCECHHHHHHHHHHHHHCCCCCCCEEECCCCCCEHHHHHHHHHHHHHCCCCCEEEC 401 SNNCVGDPGVLQLLGSLEQPGCALEQLVLYDTYWTEEVEDRLQALEGSKPGLRVIS Reprof CCCCCCCHHHHHHHCCCCCCCCHHHHHHHCCCCCCHHHHHHHHHHHCCCCCCCECC PSIPRED CCCCCCHHHHHHHHHHHHCCCCCCCEEECCCCCCCHHHHHHHHHHHHHCCCCEECC DSSP CCCECCHHHHHHHHHHHCCCCCCCCEEECCCCCCCHHHHHHHHHHHHHCCCCEEEC
Q9X0E6:
2 ILVYSTFPNEEKALEIGRKLLEKRLIACFNAFEIRSGYWWKGEIVQDKEWAAIFKTTEEKEKELYEELRKLHPYETPAIF Reprof EEEEECCCCHHHHHHHHHHHHHHHHHHHHCHCHHHCCCEEECEEECCHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCHHE PSIPRED EEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEEEEEEEEECCCEEEEEEEEEEECCCHHHHHHHHHHHHHHCCCCCCEEE DSSP EEEEEEECCHHHHHHHHHHHHHCCCCCEEEEEEEEEEEEECCEEEEEEEEEEEEEEEHHHHHHHHHHHHHHCCCCCCCEE 82 TLKVENVLTEYMNWLRESVL Reprof HHHHHHHHHHHHHHHHHHCC PSIPRED EEECCCCCHHHHHHHHHHCC DSSP EECCCCEEHHHHHHHHHHCC
Q08209
14 TDRVVKAVPFPPSHRLTAKEVFDNDGKPRVDILKAHLMKEGRLEESVALRIITEGASILRQEKNLLDIDAPVTVCGDIHG Reprof CCCEEEEECCCCCCCEEEEEEECCCCCCEEEEEHHHECCCCCCCEEEEEEEEECCCCEEECCCCCCCCCCCEEEEECCCC PSIPRED CCCCCCCCCCCCCCCCCHHHCCCCCCCCCHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCEEEECCCEEEECCCCC DSSP CCCCCCCCCCCCCCCECHHHHECCCCCECHHHHHHHHHCCCCECHHHHHHHHHHHHHHHHCCCCEEEECCCEEEECCCCC 94 QFFDLMKLFEVGGSPANTRYLFLGDYVDRGYFSIECVLYLWALKILYPKTLFLLRGNHECRHLTEYFTFKQECKIKYSER Reprof HHHHHHEEEEECCCCCCCEEEEEEEECCCCEEEEEEEHHHHHHHCCCCCEEEEEECCCCCCEEEEEEEEEEEEEEEEECH PSIPRED HHHHHHHHHHHCCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCCCHHHHHHCCCCHHHHHHHHCCHH DSSP CHHHHHHHHHHHCCCCCCCEEECCCCCCCCCCHHHHHHHHHHHHHHCCCCEEECCCCCCCHHHHHHCCHHHHHHHHCCHH 174 VYDACMDAFDCLPLAALMNQQFLCVHGGLSPEINTLDDIRKLDRFKEPPAYGPMCDILWSDPLEDFGNEKTQEHFTHNTV Reprof HHHHHHHHCCCCCHHHHHCCCEEEEECCCCCCCCCHHHHHHHHCCCCCCCCCCCEEEEECCCCCCCCCCCCCCECCCCCE PSIPRED HHHHHHHHHCCCHHHHHCCCCEEEECCCCCCCCCCHHHHHHCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCCCC DSSP HHHHHHHHHCCCCCEEEECCCEEEECCCCCCCCCCHHHHHHCCCCCCCCCCCHHHHHHHCEECCCCCCCCCCCCEEECCC 254 RGCSYFYSYPAVCEFLQHNNLLSILRAHEAQDAGYRMYRKSQTTGFPSLITIFSAPNYLDVYNNKAAVLKYENNVMNIRQ Reprof CCEEEEECCCCEEEEHCCCCHHHHEHHHCCCCCCEEEEEECCCCCCCEEEEEEECCCEEEEECCCEEEEEECCCEEEEEE PSIPRED CCCCCCCCHHHHHHHHHHCCCCEEEEHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCCCCEEEEEEECCCCCEEEE DSSP CCCCEEECHHHHHHHHHHCCCCEEEECCCCCCCCEEECCECCCCCCECEEEECCCCCHHHCCCCCEEEEEEECCEEEEEE 334 FNCSPHPYWLPNFMDVFTWSLPFVGEKVTEMLVNVLNICSSFEEAKGLDRINERMPPR Reprof ECCCCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHEECCEHHHHCCCCHHCCCCCCC PSIPRED ECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCC DSSP ECCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCC
Prediction accuracy/precision
<figtable id="ss_acc">
Method | Q3 | Precision H | Precision E | Precision C |
---|---|---|---|---|
P04062 | ||||
PSIPRED | 0.831 | 0.830 | 0.872 | 0.810 |
Reprof | 0.553 | 1.000 | 0.455 | 0.592 |
P10775 | ||||
PSIPRED | 0.941 | 0.959 | 0.960 | 0.919 |
Reprof | 0.603 | 0.589 | 0.417 | 0.644 |
Q9X0E6 | ||||
PSIPRED | 0.890 | 1.000 | 0.895 | 0.720 |
Reprof | 0.580 | 0.562 | 0.917 | 0.458 |
Q08209 | ||||
PSIPRED | 0.833 | 0.842 | 0.902 | 0.812 |
Reprof | 0.579 | 0.762 | 0.293 | 0.743 |
</figtable>
Disorder
P04062
<figtable id="disorder_P04062">
Method | Disorder regions | |
---|---|---|
IUPRED | 2, 3, 6, 90-93, 229-231, 235, 236 | |
DisProt | ||
Precision: 0% | Sensitivity: undef | Specificy: 98% |
</figtable>
P10775
Homolog: DP00554 <figure id="fig:disorder_P10775">
</figure>
<figtable id="disorder_P10775">
Method | Disorder regions | |
---|---|---|
IUPRED | ||
DisProt | 31-50 | |
Precision: undef | Sensitivity: 0% | Specificy: 100% |
</figtable>
Q9X0E6
Q08209
<figure id="fig:disorder_Q08209">
</figure> DP00092 <figtable id="disorder_Q08209">
Method | Disorder regions | |
---|---|---|
IUPRED | 1-6, 8, 383,384,424,425,432,434-439,443,445,448,449,455,458,463-521 | |
DisProt | 1-13,390-414,374-468,469-486,487-521 | |
Precision: 100% | Sensitivity: 52% | Specificy: 100% |
</figtable>