Structure-based mutation analysis ARSA
Preparation
Visualization with Pymol
The following image shows a pymol visualization of Arylsulfatase A, together with all known active and binding sites.
One can see, that the mutations are spread out through the protein. Some lie near functional sites, other are very distant from them. The table below shows again Pymol visualizations of all mutations, but each seperately. With this, we want to try to derive investigate the correlation of location of the mutation with respect to functional sites and its effect on the protein function.
Nr. | mutation | position | Pymol image | Description | Effect on function |
1 | Asp-Asn | 29 | The mutation is located at the position of a metal-binding site. | harmful | |
2 | Pro - Ala | 136 | The mutation is located near the active site and a substrate binding site in sequence as well as in structure. | harmful | |
3 | Gln-His | 153 | The mutation is located near the active site and a substrate binding site in sequence as well as in structure. | harmful | |
4 | Trp-Cys | 193 | The mutation is at moderate distance to all important functional sites of the protein. | neutral | |
5 | Thr-Met | 274 | The mutation is located very distant from the active site and a substrate binding site in sequence as well as in structure. It is located within a beta sheet. |
harmful | |
6 | Phe -Val | 356 | The mutation is at moderate distance to all important functional sites of the protein. | neutral | |
7 | Thr-Ile | 409 | The mutation is not close to important functional sites. | harmful | |
8 | Asn-Ser | 440 | The mutation is very distant from all functional sites. | neutral | |
9 | Cys-Gly | 489 | The mutation is very distant from all functional sites. | harmful | |
10 | Arg-His | 496 | The mutation is very distant from all functional sites. | harmful |
Above visualizations indicate, that mutations near functional important sites of the protein are likely to cause a harmful effect. However, for distant mutations no trend can be observed.
SCRWL
First, we extracted the amino acid sequence from our pdb file and converted it to lower case.
repairPDB arsa_model.pdb -seq > arsa.model.seq
tr '[:upper:]' '[:lower:]' < arsa.model.seq > arsa.model.lower.seq
Next, we included the individual mutations as capital letters in seperate files and executed scwrl with the following command:
scwrl cmd
We also ran SCWRL on the wild type (wt) structure in order to make it comparable to energy predictions by other programs. The minimal energy of the graph for the wt is 415.134.
FoldX
Nr. | mutation | position | Minimal energy | Energy(mutant)/Energy(wt) | |
1 | Asp-Asn | 29 | 496.88 | 0.9725963 | |
2 | Pro - Ala | 136 | 493.81 | 0.966587 | |
3 | Gln-His | 153 | 493.90 | 0.9667632 | |
4 | Trp-Cys | 193 | 496.23 | 0.971324 | |
5 | Thr-Met | 274 | 503.34 | 0.9852412 | |
6 | Phe -Val | 356 | 495.39 | 0.9696798 | |
7 | Thr-Ile | 409 | 495.45 | 0.9697972 | |
8 | Asn-Ser | 440 | 496.79 | 0.9724201 | |
9 | Cys-Gly | 489 | 495.87 | 0.9706193 | |
10 | Arg-His | 496 | 498.75 | 0.9762567 |