Sequence and structure based mutation analysis of GBA

Introduction

In this section we want to combine the results of sequence- and structure-based mutation analysis. Therefore the results of task 6 and task 7 are used. In Figure 1 the mutations are highlighted in the protein structure of 2NT0.

Figure 1: 2NT0 with hilighted mutation positions (red) and active site residues (blue).

Sequence-based mutation analysis

The following table summarizes the results of the sequence-based mutation analysis.

Mutation	Amino-Acid Properties	Substitution Matrices			PSSM	Conservation	Secondary Structure	SNAP	SIFT	PolyPhen-2
Mutation	Amino-Acid Properties	BLOSUM62	PAM1	PAM250	PSSM	Conservation	Secondary Structure	SNAP	SIFT	HumDiv	HumVar
1	non-neutral	neutral	neutral	neutral	non-neutral	non-neutral	non-neutral	neutral	neutral	non-neutral	non-neutral
2	non-neutral	neutral	neutral	neutral	non-neutral	non-neutral	neutral	non-neutral	neutral	non-neutral	non-neutral
3	neutral	neutral	neutral	neutral	non-neutral	neutral	neutral	neutral	neutral	neutral	neutral
4	non-neutral	neutral	neutral	neutral	non-neutral	non-neutral	non-neutral	non-neutral	non-neutral	non-neutral	non-neutral
5	non-neutral	non-neutral	non-neutral	non-neutral	non-neutral	non-neutral	neutral	non-neutral	non-neutral	non-neutral	non-neutral
6	neutral	neutral	neutral	neutral	neutral	non-neutral	neutral	neutral	neutral	neutral	neutral
7	neutral	neutral	neutral	neutral	neutral	non-neutral	non-neutral	non-neutral	non-neutral	non-neutral	neutral
8	non-neutral	neutral	neutral	neutral	non-neutral	non-neutral	non-neutral	non-neutral	non-neutral	non-neutral	non-neutral
9	non-neutral	neutral	neutral	non-neutral	non-neutral	non-neutral	non-neutral	non-neutral	neutral	non-neutral	non-neutral
10	non-neutral	non-neutral	non-neutral	non-neutral	non-neutral	non-neutral	non-neutral	non-neutral	non-neutral	non-neutral	non-neutral

Structure-based mutation analysis

The following table summarizes the results of the structure-based mutation analysis.

Mutation	SCWRL			Minimise			Gromacs			FoldX Energy
Mutation	Polar Interactions	Clashes Holes	Energy	Polar Interactions	Clashes Holes	Energy	Polar Interactions	Clashes Holes	Energy	FoldX Energy
1	neutral	neutral	neutral	neutral	neutral	neutral	neutral	non-neutral	neutral	neutral
2	neutral	neutral	neutral	neutral	neutral	neutral	neutral	neutral	neutral	neutral
3	non-neutral	non-neutral	neutral	non-neutral	non-neutral	neutral	non-neutral	neutral	neutral	neutral
4	non-neutral	neutral	neutral	non-neutral	neutral	neutral	non-neutral	non-neutral	non-neutral	neutral
5	neutral	neutral	non-neutral	neutral	neutral	non-neutral	neutral	neutral	non-neutral	non-neutral
6	non-neutral	non-neutral	neutral	non-neutral	non-neutral	neutral	non-neutral	non-neutral	non-neutral	neutral
7	neutral	neutral	neutral	neutral	neutral	neutral	neutral	neutral	neutral	neutral
8	non-neutral	non-neutral	neutral	non-neutral	neutral	non-neutral	non-neutral	neutral	non-neutral	neutral
9	neutral	non-neutral	neutral	neutral	neutral	non-neutral	neutral	non-neutral	non-neutral	neutral
10	neutral	non-neutral	neutral	neutral	non-neutral	neutral	neutral	neutral	non-neutral	neutral

Predictions of sequence-based and structure-based mutation analysis

The following table shows the predictions made based on either the sequence-based or the structure-based mutation analysis. It is furthermore shown, whether both methods agree and whether the mutation is listed in HGMD and is therefore damaging in reality.

Mutation	Sequence-based mutation analysis	Structure-based mutation analysis	in HGMD?	prediction
1	neutral	neutral	yes	wrong
2	non-neutral	neutral	yes	partly correct
3	neutral	neutral	no	correct
4	non-neutral	non-neutral	yes	correct
5	non-neutral	non-neutral	yes	correct
6	neutral	non-neutral	yes	partly correct
7	non-neutral	neutral	yes	partly correct
8	non-neutral	non-neutral	yes	correct
9	non-neutral	neutral	yes	partly correct
10	non-neutral	neutral	no	partly correct

Discussion

Mutation 1

The first mutation is the only one we predicted totally wrong. With both, sequence- and structure-based analysis, the mutations was predicted as being neutral, but as it is listed in HGMD it is damaging. In sequence-based analysis the amino-acid properties, the PSSM, the conservation, the secondary structure and the prediction of Polyphen-2 indicated that the mutation would be damaging. So it was not easy to decide whether we classify the mutation as neutral or damaging. But as the affected amino acid is located at the exterior of the protein and there were also many results indicating that this mutation is harmless, the mutation was predicted as being neutral. The structure-based mutation analysis almost each result led to the conclusion that the mutation is harmless. Only the structure obtained with Gromacs showed a different surface. It is interesting, that we did not find any significant changes in the structure-based mutation analysis as we had expected after having investigated the results of the sequence-based mutation analysis. All in all there were too little signs for a damaging mutation. For this mutation our prediction was totally wrong. We failed in both methods. It would be interesting to use more methods to see if we would be able to find the reason for the damaging effect.

With both analyses: wrong prediction

Mutation 2

We predicted the second mutation partly correct. It is listed in HGMD and is therefore damaging. The sequence-based mutation analysis led to the same conclusion. In contrast, the structure-based analysis indicated the mutation as being harmless. The steps of the sequence-based analysis produced contradicting results and it was not sure which results are the most important. Mainly because of the change from an acidic to a neutral amino acid we predicted the mutation as damaging. In structure-based analysis all results indicated a neutral substitution. So we classified the mutation as harmless, which was wrong. All together we would classify the mutation as harmless because the sequence-based analysis was not clear and the structure-based analysis tended to a neutral mutation, which is not correct. The effect must be directly at the amino acid with no structural changes. Maybe the binding differs somehow or the loss of the acidic character is damaging.

With both analyses: wrong prediction

Mutation 3

We predicted the third mutation correct with both sequence- and structure-based mutation analysis. Although it was hard to decide in structure-based analysis it was predicted correctly as being neutral.