Difference between revisions of "Task 5: Homology Modeling"
(→Swiss-Model) |
(→Swiss-Model) |
||
Line 146: | Line 146: | ||
! Pymol visualisation |
! Pymol visualisation |
||
| [[File:swiss_1qvo.png|center | thumb|300px| Visualisation of the target (green), the template 1QVO_A and the model (purple).]] || [[File:swiss_1s7x.png|center | thumb|300px| Visualisation of the target (green), the template 1S7X-A and the model (purple).]] || align="center" | [[File:swiss_1cd1.png|center | thumb|300px| Visualisation of the target (green), the template 1CD1_A and the model (purple).]] |
| [[File:swiss_1qvo.png|center | thumb|300px| Visualisation of the target (green), the template 1QVO_A and the model (purple).]] || [[File:swiss_1s7x.png|center | thumb|300px| Visualisation of the target (green), the template 1S7X-A and the model (purple).]] || align="center" | [[File:swiss_1cd1.png|center | thumb|300px| Visualisation of the target (green), the template 1CD1_A and the model (purple).]] |
||
+ | |- |
||
+ | ! Anolea and Gromos energy |
||
+ | | [[File:1qvo_anolea.png|center | thumb|300px| ]] || [[File:1s7x_anolea.png|center | thumb|300px| ]] || [[File:1s7x_anolea.png|center | thumb|300px| ]] |
||
|+ style="caption-side: bottom; text-align: left" |<font size=1.5>'''Table 4:''' Overview of the Swiss-Model results for the three different templates. |
|+ style="caption-side: bottom; text-align: left" |<font size=1.5>'''Table 4:''' Overview of the Swiss-Model results for the three different templates. |
||
|} |
|} |
Revision as of 23:25, 26 August 2013
1A6Z chain A was used as modeling target for all three methods.
Modeller
We used Modeller to create models based on a single template and multiple templates.
Single template
<css> table.colBasic2 { margin-left: auto; margin-right: auto; border: 2px solid black; border-collapse:collapse; width: 70%; } .colBasic2 th,td { padding: 3px; border: 2px solid black; } .colBasic2 td { text-align:left; } .colBasic2 tr th { background-color:#efefef; color: black;} .colBasic2 tr:first-child th { background-color:#adceff; color:black;} </css>
<figtable id="Modeller single">
Template | Seq. identity | std Alignment | 2d alignment | curated Alignment | ||||||
---|---|---|---|---|---|---|---|---|---|---|
DOPE score | RMSD | GDT score | DOPE score | RMSD | GDT score | DOPE score | RMSD | GDT score | ||
1QVO_A | 39% | -27772 | 3.647 | 0.6241 | -27169 | 4.994 | 0.5653 | |||
1S7X_A | 29% | -19941 | 15.806 | 0.3355 | -18667 | 18.099 | 0.2509 | |||
1CD1_A | 21% | -19034 | 18.066 | 0.3640 | -24213 | 5.640 | 0.4697 |
</figtable>
<xr id="Modeller single"/> lists the selected templates and the Modeller results for the different template structures and alignment methods. In addition to the standard pairwise sequence alignment based on dynamic programming, we also used Modeller's alignment.alig2dn() method to improve the alignment by including secondary structure information and improved the alignments manually. As Modeller quality score, we chose the DOPE score, which is a statistical potential that was optimized for the assessment of model quality. The DOPE score has an arbitrary scale, but scores for structures of the same protein are comparable and can be used to select the best model from a collection of structures. The lower the score, the better the model. In addition to the DOPE score, we also computed the RMSD and GDT score. The RMSD is a a good measure of the average distance between all pairs of corresponding atoms in two structures. Therefore, the lower the RMSD the better. For the GDT score, the average coverage of the target sequence under four defined distance cutoffs is computed. Normally, 1, 2, 4 and 8 Å are used as distance thresholds. The GDT score ranges between 0 and 1, with random superpositions of unrelated structures having a score of 0.1 to 0.2.
Including the secondary structure information did only improve the model of the most distant homolog 1CD1_A.
<figtable id="pymol str. al.">
</figtable>
<figtable id="pymol str. al.">
</figtable>
<figtable id="pymol str. al.">
</figtable>
Multiple templates
We also user more than one template in a modeling step. Therefore, we created three sets of structures, one with close homologes, one with distant homologes and one combined set.
<figtable id="multiple sets">
close homology | distant homology | mixed | |||
---|---|---|---|---|---|
Template | Seq. identity | Template | Seq. identity | Template | Seq. identity |
1QVO_A | 39% | 3HUJ_C | 23% | 1QVO_A | 39% |
1ZAG_A | 36% | 1CD1_A | 21% | 1CD1_A | 21% |
1RJZ_D | 34% | 1VZY_A | 14% |
</figtable>
<xr id="multiple sets"/> specifies the three sets.
<figtable id="multiple sets">
close homology | distant homology | mixed homology | |||
---|---|---|---|---|---|
Template | 1QVO_A, 1ZAG_A | 1QVO_A, 1ZAG_A, 1RJZ_D | 3HUJ_C, 1CD1_A | 3HUJ_C, 1CD1_A, 1VZY_A | 1QVO_A, 1CD1_A |
DOPE score | -28073 | -27460 | -25967 | -20588 | -25894 |
RMSD | 3.432 | 2.431 | 4.130 | 7.741 | 3.974 |
GDT score | 0.6553 | 0.7638 | 0.5607 | 0.3814 | 0.5846 |
Pymol visualisation |
</figtable>
Swiss-Model
We used Swiss-Model to create models using 1QVO_A, 1S7X_A and 1CD1_A as template.
<figtable id="swiss-model">
1QVO_A | 1S7X_A | 1CD1_A | |
---|---|---|---|
Template | 1QVO_A | 1S7X_A | 1CD1_A |
Z-score | -1.977 | -2.005 | -2.707 |
Seq. identity | 39% | 29% | 21% |
RMSD | 2.847 | 2.757 | 3.604 |
GDT score | 0.6774 | 0.7086 | 0.6121 |
Pymol visualisation | |||
Anolea and Gromos energy |
</figtable>
Swiss-Model outputs a raw score and also a Z-score that represents an absolute measure of the model quality. It relates the model's raw score to the scores that high-resolution X-ray structures get and thus gives an estimate of how likely the model has a quality comparable to an experimental structure. A low quality model is indicated by a strong negative Z-score, which means that the raw score is several standard deviations lower as the scores of experimental structures with similar size (see Swiss-Model help).
Swiss-Model also provides plots that help to analyse the local energy of the model. For this, the atomic empirical mean force potential (Anolea) and the Gromos simulation package are used. Both are used calculate the energy of each amino acid in the sequence. The two plots show the protein sequence on the x-axis and the calculated energy of each residue on the y-axis. A low energy corresponds to a favorable energy environment for an amino acid and a positive energy represents an unfavorable energy environment.