Difference between revisions of "Glucocerebrosidase homology modelling"
(→Homologous Structures) |
(→MODELLER) |
||
Line 49: | Line 49: | ||
=== Results === |
=== Results === |
||
+ | |||
+ | The results presented and discussed in this section, where retrieved according to this [[Workflow_homology_modelling_glucocerebrosidase#MODELLER|workflow]]. At first, MODELLER was used to build models based on one template structure. To do this, pairwise alignments of 1OGS with the different template structures (2WNW, 3KL0 and 2E4T) were created and used as input for MODELLER. Figure 1 shows the resulting models aligned to the structure of 1OGS visualized with the tool Pymol. One can see, that the results differ greatly. The models based on the templates 3KL0 and 2E4T vary greatly from the reference structure, whereas the model based on 2WNW seems to be quite good in large parts of the protein. These visual interpretations will be further examined and validated by different measurements, as discribed in the section below. |
||
+ | <br/> |
||
+ | [[File: Modeller_pairwise_alignment_results_glucocerebrosidase.jpg|thumb|500px|center|'''Figure 1:''' Results of MODELLER for pairwise sequence alignments.]] |
||
+ | <br/> |
||
+ | As MODELLER can build models based in multiple sequence alignments as well, it was ivestigated whether this might improve the structure prediction. The templates 2WNW and 2E4T were chosen for this analysis, as the models based on pairwise alignments with them seemed to be better, than the model obtained with the template of 3KL0. The result of the modeling procedure with MODELLER and this multiple sequence alignment is shown below, in Figure 2. The structure of the model consists mostly of loops and only rarely a defined secondary structure is present. Overall the structure consists of 5 small helices and 2 small sheets. In this case, the multiple sequence alignment did not help at all to predict the structure of glucocerebrosidase: the results obtained by pairwise sequence alignments are significantly better. But this may not be true in general. |
||
+ | <br/> |
||
+ | [[File: Modeller_multiple_alignment_results_glucocerebrosidase.jpg|thumb|500px|center|'''Figure 2:''' Results of MODELLER for multiple sequence alignment with 2WNW and 2E4T.]] |
||
+ | <br/> |
||
+ | |||
=== Analysis === |
=== Analysis === |
||
+ | |||
+ | The results of MODELLER, described in the section above, are validated by calculating different measures. |
||
== iTasser == |
== iTasser == |
Revision as of 14:07, 11 June 2011
Contents
Homologous Structures
The 10 best results of the sequence search with HHSearch (as retrieved in Task 1) are listed in the table below.
> 60% sequence identity | ||||
PDB-ID | name | organism | identity | template |
2nt0 | Glucosylceramidase | Homo Sapiens | 99% | |
> 40% sequence identity | ||||
PDB-ID | name | organism | identity | template |
> 0% sequence identity | ||||
PDB-ID | name | organism | identity | template |
2wnw | SrfJ | Salmonella enterica subsp. enterica | 29% | x |
3clw | conserved exported protein | Bacteroides fragilis | 13% | |
3kl0 | Glucuronoxylan Xylanohydrolase | Bacillus subtilis | 18% | x |
1nof | xylanase | Erwinia chrysanthemi | 18% | |
2e4t | Endoglucanase | Clostridium thermocellum | 11% | x |
3ii1 | Cellulase | Uncultured bacterium | 15% | |
1qw9 | Arabinosidase | Geobacillus stearothermophilus | 13% | |
1ik2 | Endoglucanase | Clostridium acetobutylicum | 12 % | |
2c7f | alpha-L-Arabinofuranosidase | Clostridium thermocellum | 16% |
For the different homology searches, the sequence of 1OGS was used, instead of P04062, as it does not contain the signal peptide which is not present in the mature protein.
MODELLER
MODELLER is a method for comparative protein structure modelling, provided by satisfaction of spatial restraints. In the simplest case, the most probable structure for a given sequence can be found based on its alignment with related structures. Additional to model building, MODELLER can perform several other tasks including fold assignment, pairwise/ multiple alignments of protein sequences, calculation of phylogenetic trees, and de novo modeling of loops in protein structures. The method was published by Sali and Blundell in 1993. <ref>A. Sali & T.L. Blundell. Comparative protein modelling by satisfaction of spatial restraints. J. Mol. Biol. 234, 779-815, 1993.</ref>
Usage
- Website with tutorials and download information: http://salilab.org/modeller/
- Description of the steps applied in this analysis: Detailed Workflow
Results
The results presented and discussed in this section, where retrieved according to this workflow. At first, MODELLER was used to build models based on one template structure. To do this, pairwise alignments of 1OGS with the different template structures (2WNW, 3KL0 and 2E4T) were created and used as input for MODELLER. Figure 1 shows the resulting models aligned to the structure of 1OGS visualized with the tool Pymol. One can see, that the results differ greatly. The models based on the templates 3KL0 and 2E4T vary greatly from the reference structure, whereas the model based on 2WNW seems to be quite good in large parts of the protein. These visual interpretations will be further examined and validated by different measurements, as discribed in the section below.
As MODELLER can build models based in multiple sequence alignments as well, it was ivestigated whether this might improve the structure prediction. The templates 2WNW and 2E4T were chosen for this analysis, as the models based on pairwise alignments with them seemed to be better, than the model obtained with the template of 3KL0. The result of the modeling procedure with MODELLER and this multiple sequence alignment is shown below, in Figure 2. The structure of the model consists mostly of loops and only rarely a defined secondary structure is present. Overall the structure consists of 5 small helices and 2 small sheets. In this case, the multiple sequence alignment did not help at all to predict the structure of glucocerebrosidase: the results obtained by pairwise sequence alignments are significantly better. But this may not be true in general.
Analysis
The results of MODELLER, described in the section above, are validated by calculating different measures.
iTasser
Results
Analysis
SWISS-MODEL
SWISS-MODEL workspace was published by Arnold et al. in 2005. <ref> Arnold K., Bordoli L., Kopp J., and Schwede T. (2006). The SWISS-MODEL Workspace: A web-based environment for protein structure homology modelling. Bioinformatics, 22,195-201.</ref>
Results
Using the standard output alignment of ClustalW2, the workunit of Swiss-Model got aborted: too many unfruitful attempts to rebuild a loop were tried. This indicates, that the alignment is not good and that it has to be adjusted.
Analysis
References
<references />