Difference between revisions of "Rs121907974"

From Bioinformatikpedia
(Gromacs Energy Comparison)
(SNAP Prediction)
 
(12 intermediate revisions by one other user not shown)
Line 18: Line 18:
 
== Sequence-based Mutation Analysis ==
 
== Sequence-based Mutation Analysis ==
   
=== Pysicochemical Properities ===
+
=== Pysicochemical Properties ===
   
 
First of all, we explored the amino acid properties and compared them for the original and the mutated amino acid. Therefore we created the possible effect that the mutation could have on the protein.
 
First of all, we explored the amino acid properties and compared them for the original and the mutated amino acid. Therefore we created the possible effect that the mutation could have on the protein.
Line 37: Line 37:
 
----
 
----
   
=== Visualisation of the Mutation ===
+
=== Visualization of the Mutation ===
   
In the next step, we created the visualization of the muation with PyMol. Therefore we created a picture for the original amino acid, for the new mutated amino acid and finally for both together in one picture whereas the mutation is white colored. The following pictures display that the mutated amino acid Serine looks very different to Phenylalanine. Phenylalanine has a huge aromatical ring. Contrary, Serine is very smaller and differs a little bit in the orientation. This shows that the amino acids have huge structural differences which will probably cause dramtical effects on protein structure and function.
+
In the next step, we created the visualization of the mutation with PyMol. Therefore we created a picture of the original amino acid (Figure 1), of the new mutated amino acid (Figure 2) and finally for both together in one picture whereas the mutation is white colored (Figure 3). The following pictures display that the mutated amino acid Serine looks very different to Phenylalanine. Phenylalanine has a huge aromatic ring. Contrary, Serine is very smaller and differs a little bit in the orientation. This shows that the amino acids have huge structural differences which will probably cause dramatic effects on protein structure and function.
   
 
{| border="1" style="text-align:center; border-spacing:0;"
 
{| border="1" style="text-align:center; border-spacing:0;"
|picture original aa
+
|picture original amino acid
|picture mutated aa
+
|picture mutated amino acid
 
|combined picture
 
|combined picture
 
|-
 
|-
|[[Image:F211.png|thumb|150px|Amino acid Phenylalanine]]
+
|[[Image:F211.png|thumb|150px|Figure 1: Amino acid Phenylalanine]]
|[[Image:211S.png|thumb|150px|Amino acid Serine]]
+
|[[Image:211S.png|thumb|150px|Figure 2: Amino acid Serine]]
|[[Image:F211S.png|thumb|150px|Picture which visualize the mutation]]
+
|[[Image:F211S.png|thumb|150px|Figure 3: Picture which visualize the mutation]]
 
|-
 
|-
 
|}
 
|}
Line 56: Line 56:
 
----
 
----
   
=== Subsitution Matrices Values ===
+
=== Substitution Matrices Values ===
   
Afterwards, we looked at the values of the substitution matrices PAM1, PAM250 and BLOSSUM62. Therefore we looked detailed at the three values: the value for accoding amino acid substitution, the most frequent value for the substitution of the examined amino acid and the rarest substitution.
+
Afterwards, we looked at the values of the substitution matrices PAM1, PAM250 and BLOSSUM62. Therefore we looked detailed at the three values: the value for the according amino acid substitution, the most frequent value for the substitution of the examined amino acid and the rarest substitution.
   
In this case, the substitution of Phenylalanine to Serine has low values that are nearer to the values for the rarest subsitution for all three matrices. Therefore, an exchange at this position is very unlikely and a mutation there will almost certainly cause structural changes which can affect functional changes.
+
In this case, the substitution of Phenylalanine to Serine has low values that are nearer to the values for the rarest substitution for all three matrices. Therefore, an exchange at this position is very unlikely and a mutation will almost certainly cause structural changes which can affect functional changes.
   
 
{| border="1" style="text-align:center; border-spacing:0;"
 
{| border="1" style="text-align:center; border-spacing:0;"
Line 67: Line 67:
 
|colspan="3" | BLOSOUM 62
 
|colspan="3" | BLOSOUM 62
 
|-
 
|-
|value aa
+
|value amino acid
 
|most frequent substitution
 
|most frequent substitution
 
|rarest substitution
 
|rarest substitution
|value aa
+
|value amino acid
 
|most frequent substitution
 
|most frequent substitution
 
|rarest substitution
 
|rarest substitution
|value aa
+
|value amino acid
 
|most frequent substitution
 
|most frequent substitution
 
|rarest substitution
 
|rarest substitution
Line 95: Line 95:
 
=== PSSM Analysis ===
 
=== PSSM Analysis ===
   
Besides, we looked additional at the position specific scoring matrix (PSSM) for ouer sequence. In contrast to PAM and BLOSOUM, the PSSM contains a specific substitution rate for each position in the sequence. Therefore, the PSSM is more position specific than PAM or BLOSOUM. We extracted the substitution value for the underlying mutation, the value for the most frequent substitution and the rarest substitution.
+
Besides, we looked additional at the position specific scoring matrix (PSSM) for our sequence. In contrast to PAM and BLOSOUM, the PSSM contains a specific substitution rate for each position in the sequence. Therefore, the PSSM is more position specific than PAM or BLOSOUM. We extracted the substitution value for the underlying mutation, the value for the most frequent substitution and the rarest substitution.
   
 
In this case the substitution rate for Phenylalanine to Serine at this position is very low and near the value for the rarest substitution. This means this substitution at this position is likely very uncommon which indicates that this substitution has bad effects as a consequence. Therefore, we concluded that this mutation will probably cause protein structure changes as well as functional changes.
 
In this case the substitution rate for Phenylalanine to Serine at this position is very low and near the value for the rarest substitution. This means this substitution at this position is likely very uncommon which indicates that this substitution has bad effects as a consequence. Therefore, we concluded that this mutation will probably cause protein structure changes as well as functional changes.
Line 103: Line 103:
 
|colspan="3" | PSSM
 
|colspan="3" | PSSM
 
|-
 
|-
|value aa
+
|value amino acid
 
|most frequent substitution
 
|most frequent substitution
 
|rarest substitution
 
|rarest substitution
Line 119: Line 119:
 
=== Conservation Analysis with Multiple Alignments ===
 
=== Conservation Analysis with Multiple Alignments ===
   
As a next step we created a multiple alignment which contains the HEXA sequence and 9 other mammalian homologous sequences from uniprot. Afterwards we looked at the position of the different mutations and looked at the conservation level on this position. The regarded mutation is presented by the first colored column. Here we can see, that all the other mammalians havethe amino acid Phenylalanine on this position. Therefore, the mutation on this position is highly conserved and a mutation there will cause probably huge structural and functional changes for the protein.
+
As a next step we created a multiple alignment which contains the HEXA sequence and 9 other mammalian homologous sequences from [[http://www.uniprot.org UniProt]]. Afterwards we looked at the position of the different mutations and looked at the conservation level on this position. The regarded mutation is presented by the first colored column (Figure 4). Here we can see, that all the other mammalians have the amino acid Phenylalanine at this position. Therefore, the mutation at this position is highly conserved and a mutation there will cause probably huge structural and functional changes for the protein.
   
[[Image:mut_4.png|thumb|center|600px|Mutation in the multiple alignment]]
+
[[Image:mut_4.png|thumb|center|600px|Figure 4: Mutation in the multiple alignment]]
   
   
Line 129: Line 129:
 
=== Secondary Structure Mutation Analysis ===
 
=== Secondary Structure Mutation Analysis ===
   
As a next step we compared the different results of the secondary structure prediction tools JPred and PsiPred. Afterwards we can examine in which secondary structure element and where therein the mutation takes place. This can give an overview of how drastical the mutation can be. In this case both tools agree and predict at the position of the mutation a coil. This has as result, that the mutation at this position would not destroy or split a secondary structure element. It will probably only changes the coil between two secondary structure elements, but this can sometimes also cause a change of the the following secondary structure. Furthermore, HEXA_HUMAN does not posses any disordered regions and therefore, a mutation in a coiled region do not change a functional important region. We think that a drastical change of the protein structure and its function is unlikly because the mutation does not affect a secondary struture element. The change of the coil will probably only take places between two secondary structure elements which will probably not change.
+
As a next step we compared the different results of the secondary structure prediction tools JPred and PsiPred. Afterwards we can examine in which secondary structure element and where therein the mutation takes place. This can give an overview of how drastic the mutation can be. In this case both tools agree and predict at the position of the mutation a coil. This has as result, that the mutation at this position would not destroy or split a secondary structure element. It will probably only changes the coil between two secondary structure elements, but this can sometimes also cause a change of the the following secondary structure. Furthermore, HEXA_HUMAN does not posses any disordered regions and therefore, a mutation in a coiled region do not change a functional important region. We think that a drastic change of the protein structure and its function is unlikely because the mutation does not affect a secondary structure element. The change of the coil will probably only take places between two secondary structure elements which will probably not change.
   
 
JPred:
 
JPred:
Line 138: Line 138:
 
''' Comparison with the real Structure:'''
 
''' Comparison with the real Structure:'''
   
Afterwards we also visualize the position of the muation (red) in the real 3D-structure of PDB and compare it with the predicted secondary structure. The visualisation can therefore like above the predicted secondary structure display if the mutation is in a secondary structure element or in some other regions.
+
Afterwards we also visualize the position of the mutation (red) in the real 3D-structure of [[http://www.pdb.org PDB]] and compare it with the predicted secondary structure (Figure 5 and Figure 6). The visualization can display if the mutation is in a secondary structure element or in some other regions.
   
Here in this case the mutation position almost agree with the position of the predicted secondary structure and is within a coil. Like explained above this means a mutation will probably not destroy a secondary structure element which affects no drastical structural change. Otherwise it can cause a change of the position of the two nearest secondary structure element which can has a functional loose as a consequence. We think that a structural change is unlikely, because it is not within a secondary structure element and will therefore not cause extreme changes.
+
Here in this case the mutation position almost agree with the position of the predicted secondary structure and is within a coil. Like explained above this means a mutation will probably not destroy a secondary structure element which affects no drastic structural change. Otherwise it can cause a change at the position of the two nearest secondary structure element which can has a functional loose as a consequence. We think that a structural change is unlikely, because it is not within a secondary structure element and will therefore not cause extreme changes.
   
 
{|
 
{|
| [[Image:211_mut.png|thumb|250px|Mutation at position 211]]
+
| [[Image:211_mut.png|thumb|250px|Figure 5: Mutation at position 211]]
| [[Image:211_mut_detail.png|thumb|250px|Mutation at position 211 - detailed view]]
+
| [[Image:211_mut_detail.png|thumb|250px|Figure 6: Mutation at position 211 - detailed view]]
 
|}
 
|}
   
Line 153: Line 153:
 
=== SNAP Prediction ===
 
=== SNAP Prediction ===
   
Next, we looked at the result of the SNAP prediction. For this prediction we took the amino acid of the certain position and checked every possible amino acid mutation. Afterwards we extract the result for Serine which is the real mutation in this case. SNAP has a result that the exchange from Phenylalanine to Serine at this position is non-neutral with a high accuracy. This means that this certain mutation on this position cause very likely structural and functional changes of the protein.
+
Next, we looked at the result of the SNAP prediction. For this prediction we took the amino acid of the certain position and checked every possible amino acid mutation. Afterwards we extract the result for Serine which is the real mutation in this case. SNAP has as a result that the exchange from Phenylalanine to Serine at this position is non-neutral with a high accuracy. This means that this certain mutation at this position cause very likely structural and functional changes of the protein.
   
 
{| border="1" style="text-align:center; border-spacing:0;"
 
{| border="1" style="text-align:center; border-spacing:0;"
Line 168: Line 168:
 
|}
 
|}
   
A detailed list of all possible substitutions can be found [[http://i12r-studfilesrv.informatik.tu-muenchen.de/wiki/index.php/rs121907974 here]]
+
A detailed list of all possible substitutions can be found [[http://i12r-studfilesrv.informatik.tu-muenchen.de/wiki/index.php/rs121907974_SNAP here]]
   
 
Back to [[http://i12r-studfilesrv.informatik.tu-muenchen.de/wiki/index.php/Sequence-based_mutation_analysis_HEXA Sequence-based mutation analysis]]
 
Back to [[http://i12r-studfilesrv.informatik.tu-muenchen.de/wiki/index.php/Sequence-based_mutation_analysis_HEXA Sequence-based mutation analysis]]
Line 175: Line 175:
 
=== SIFT Prediction ===
 
=== SIFT Prediction ===
   
Next, we used SIFT Prediction which displays if a mutation is neutral or not. Therefore, it first shows a row which contains a score for the particular mutationposition to a certain amino acid. The amino acid which are not tolerated at this position are colored red. Besides, it also constructs a table which lists the amino acids that are predicted as tolerated and not-tolerated.
+
Next, we used SIFT Prediction which displays if a mutation is neutral or not. Therefore, it first shows a row which contains a score for the particular mutation position of a certain amino acid. The amino acid which are not tolerated at this position are colored red. Besides, it also constructs a table which lists the amino acids that are predicted as tolerated and not-tolerated.
   
In this case, the only substitution that is tolerated is the one to Phenylalanine itself. The substitution to Serine is not-tolerated at this position. This means that this mutation at this position is probably not neutral and will cause probably structural and function changes of the protein.
+
In this case, the only substitution that is tolerated is the one to Phenylalanine itself (Figure 8). The substitution to Serine is not-tolerated at this position. This means that this mutation at this position is probably not neutral and will cause probably structural and function changes of the protein.
   
 
SIFT Matrix:<br>
 
SIFT Matrix:<br>
Line 183: Line 183:
   
 
{|
 
{|
| [[Image:sift_legend.png|center]]
+
| [[Image:sift_legend.png|center|thumb|800px|Figure 7: Legend]]
 
|-
 
|-
| [[Image:211_sift.png.png|center]]
+
| [[Image:211_sift.png.png|center|thumb|800px|Figure 8: SIFT Table<br>
  +
Threshold for intolerance is 0.05.<BR>Amino acid color code: non-polar, <font color=green>uncharged polar</font>, <font color=red>basic</font>, <font color=blue>acidic</font>. <BR>Capital letters indicate amino acids appearing in the alignment, lower case letters result from prediction.]]
 
|}
 
|}
   
SIFT Table<br>
 
Threshold for intolerance is 0.05.<BR>Amino acid color code: nonpolar, <font color=green>uncharged polar</font>, <font color=red>basic</font>, <font color=blue>acidic</font>. <BR>Capital letters indicate amino acids appearing in the alignment, lower case letters result from prediction.
 
 
<br>
 
<br>
 
{| class="wikitable centered"
 
{| class="wikitable centered"
Line 203: Line 202:
 
=== PolyPhen2 Prediction ===
 
=== PolyPhen2 Prediction ===
   
Finally, we also regarded the PolyPhen2 prediction for this muation. This prediction visualizes how strongly demaging the mutation probably will be. Therefore it gives the result for two possible cases: HumDiv and HumVar. HumDiv is a prefered model for evaluation rare allels, dense mapping of regions identified by genome-wide assiociation studies and analysis of neutral selection. In contrast, HumVar is a prefered model for diagnostic of Mendelian diseases which require distinguishing mutations with drastic effects from all remaining human variations including abundant mildly deleterious allels. We decided to look at both possible models, which agreed in the most cases.
+
Finally, we also regarded the PolyPhen2 prediction for this mutation. This prediction visualizes how strongly damaging the mutation probably will be. Therefore it gives the result for two possible cases: HumDiv and HumVar. HumDiv is the preferred model for evaluation rare alleles, dense mapping of regions identified by genome-wide association studies and analysis of neutral selection. In contrast, HumVar is the preferred model for diagnostic of Mendelian diseases which require distinguishing mutations with drastic effects from all remaining human variations including abundant mildly deleterious alleles. We decided to look at both possible models (Figure 9 and Figure 10), which agreed in the most cases.
   
 
In this case both models predict that the mutation is probably damaging. This means that the mutation is not neutral and will probably destroy the structure and the function of the protein.
 
In this case both models predict that the mutation is probably damaging. This means that the mutation is not neutral and will probably destroy the structure and the function of the protein.
   
 
{|
 
{|
| [[Image:mut_4_humdiv.png|thumb|450px|HumDiv prediction]]
+
| [[Image:mut_4_humdiv.png|thumb|450px|Figure 9: HumDiv prediction]]
| [[Image:mut_4_humvar.png|thumb|450px|HumVar prediction]]
+
| [[Image:mut_4_humvar.png|thumb|450px|Figure 10: HumVar prediction]]
 
|}
 
|}
   
Line 220: Line 219:
 
=== Mapping onto Crystal Structure ===
 
=== Mapping onto Crystal Structure ===
   
[[Image:mut211_active.png|thumb|center|400px|Visualization of the mutation and important functional sites]]
+
[[Image:mut211_active.png|thumb|center|400px|Figure 11: Visualization of the mutation and important functional sites<br>Color declaration: <br>
  +
* <font color=red>red</font>: position of mutation<br>
  +
* <font color=green>green</font>: position of active side<br>
  +
* <font color=yellow>yellow</font>: position of glycolysation<br>
  +
* <font color=cyan>cyan</font>: position of Cysteine]]
   
  +
First of all, we colored the important residues and also the mutated residue in the crystal structure in Figure 11, to see if the mutation is near of far away from the functional residues. As you can see on the picture, the mutation is located within a loop and far away from the functional residues.
Color declaration:
 
* <font color=red>red</font>: position of mutation
 
* <font color=green>green</font>: position of active side
 
* <font color=yellow>yellow</font>: position of glycolysation
 
* <font color=cyan>cyan</font>: position of Cystein
 
 
First of all, we colored the important residues and also the mutated residue in the crystal structure, to see if the mutation is near of far away from the functional residues. As you can see on the picture, the mutation is located within a loop and far away from the functional residues.
 
 
Therefore, we do not know in which way this mutation affects the global structure of the protein.
 
Therefore, we do not know in which way this mutation affects the global structure of the protein.
   
Line 236: Line 233:
 
=== SCWRL Prediction ===
 
=== SCWRL Prediction ===
   
Because the mapping analysis does not give a good explanation why the mutation causes damages on the protein, we decided to analyse this mutation in more detail. Therefore, we looked for the structure of the original amino acid and the structure of the amino acid after the mutation event and compare them in size and orientation. For this purpose we used SCWRL.
+
Because the mapping analysis does not give a good explanation why the mutation causes damages on the protein, we decided to analyse this mutation in more detail. Therefore, we looked for the structure of the original amino acid and the structure of the amino acid after the mutation event and compared them in size and orientation. For this purpose we used SCWRL.
   
 
{| border="1" style="text-align:center; border-spacing:0;"
 
{| border="1" style="text-align:center; border-spacing:0;"
|picture original aa
+
|picture original amino acid
|picture mutated aa
+
|picture mutated amino acid
 
|combined picture
 
|combined picture
 
|-
 
|-
|[[Image:mut211_org_aa.png|thumb|150px|Amino acid Phenylalanine]]
+
|[[Image:mut211_org_aa.png|thumb|150px|Figure 12: Amino acid Phenylalanine]]
|[[Image:mut11_aa.png|thumb|150px|Amino acid Serine]]
+
|[[Image:mut11_aa.png|thumb|150px|Figure 13: Amino acid Serine]]
|[[Image:mut211_both.png|thumb|150px|Picture which visualize the mutation]]
+
|[[Image:mut211_both.png|thumb|150px|Figure 14: Picture which visualize the mutation]]
  +
|-
  +
|}
  +
  +
As you can see on Figure 12, the original amino acid (Phenylalanine) has a ring structure and is therefore, an aromatic amino acid. The mutated amino acid, visualized in Figure 13, is a Serine, which is smaller than Phenylalanine. Therefore there is no problem with the space for the amino acid. Serine is smaller than Phenylalanine and has enough space which means there should not be any clashes with other amino acid residues or with the backbone (Figure 14).
  +
  +
Back to [[http://i12r-studfilesrv.informatik.tu-muenchen.de/wiki/index.php/Structure-based_mutation_analysis_HEXA Structure-based mutation analysis]]
  +
----
  +
  +
=== FoldX Energy Comparison ===
  +
  +
One important point in the analysis of mutations is to look at the energy of the protein with the original and with the mutated amino acid. Often the energy increases dramatically with the mutated amino acid. This means, that the protein becomes very unstable and therefore, it is often possible that the protein can not bind its ligands any longer. Otherwise, it is also possible, that the protein with the mutated amino acid has a lower energy than the original protein. This means, that the protein is too rigid and loses its flexibility. Than it is also possible, that the protein can not bind the ligands any longer.
  +
  +
Therefore, we compared the energy of our protein with different methods. Here we want to present the result of FoldX.
  +
  +
  +
{| border="1" style="text-align:center; border-spacing:0;"
  +
|Original total energy
  +
|Total energy for the mutated protein
  +
|Strongest energy changes within the mutated protein
  +
|-
  +
| -154.17
  +
| -144.25
  +
| -
  +
|-
  +
|}
  +
  +
In this case, the energy of the mutated protein is higher, than the energy of the original protein. Therefore, this means, that the mutated protein is not that stable than the original protein. So it is possible, that the mutated protein loses its function, because it is too unstable to bind the ligand.
  +
  +
We also will compare the energy values of these two structure with other methods. Because of the different calculation methods, it is not possible to compare the energy values directly. Therefore we decided to calculate the ratio between the energy values of the two structures. Our original mutation has the value 100, with this value we calculate the value of the mutated structure.
  +
  +
  +
{| border="1" style="text-align:center; border-spacing:0;"
  +
|Ratio Original
  +
|Ratio mutated protein
  +
|Difference
  +
|-
  +
|100
  +
|93.66
  +
|6.34
 
|-
 
|-
 
|}
 
|}
   
As you can see on the picture, the original amino acid (Phenylalanine) has a ring structure and is therefore, an aromatic amino acid. The mutated amino acid is a Serine, which is smaller than Phenylalanine. Therefore there is no problem with the space for the amino acid. Serine is smaller than Phenylalanin and has enough space which means there should not be any clashes with other amino acid residues or with the backbone.
 
   
 
Back to [[http://i12r-studfilesrv.informatik.tu-muenchen.de/wiki/index.php/Structure-based_mutation_analysis_HEXA Structure-based mutation analysis]]
 
Back to [[http://i12r-studfilesrv.informatik.tu-muenchen.de/wiki/index.php/Structure-based_mutation_analysis_HEXA Structure-based mutation analysis]]
Line 269: Line 304:
 
|}
 
|}
   
The total energy of the mutated structure is a little bit lower than the energy of the original protein structure. To have the possibility to compare these energy values with the values of the other analysis tools, we calculated a ratio between these energy values.
+
The total energy of the mutated structure is a little bit higher than the energy of the original protein structure. To have the possibility to compare these energy values with the values of the other analysis tools, we calculated a ratio between these energy values.
   
 
{| border="1" style="text-align:center; border-spacing:0;"
 
{| border="1" style="text-align:center; border-spacing:0;"
Line 286: Line 321:
 
''' Comparing Structure: '''
 
''' Comparing Structure: '''
   
This tool also gives as output a pdb file with the position of the original and the mutated amino acid.
+
This tool also gives as output a PDB file with the position of the original and the mutated amino acid.
   
 
{| border="1" style="text-align:center; border-spacing:0;"
 
{| border="1" style="text-align:center; border-spacing:0;"
|picture original aa
+
|picture original amino acid
|picture mutated aa
+
|picture mutated amino acid
 
|combined picture
 
|combined picture
 
|-
 
|-
|[[Image:mmut211_org.png|thumb|150px|Amino acid Phenylalanine]]
+
|[[Image:mmut211_org.png|thumb|150px|Figure 15: Amino acid Phenylalanine]]
|[[Image:mmut211_mut.png|thumb|150px|Amino acid Serine]]
+
|[[Image:mmut211_mut.png|thumb|150px|Figure 16: Amino acid Serine]]
|[[Image:mmut211_both.png|thumb|150px|Picture which visualize the mutation]]
+
|[[Image:mmut211_both.png|thumb|150px|Figure 17: Picture which visualize the mutation]]
 
|-
 
|-
 
|}
 
|}
   
If we have a look at the pictures, we can see that the location of the residue of the two amino acids is very similar.
+
If we have a look at the pictures (Figure 15, Figure 16), we can see that the location of the residue of the two amino acids is very similar (Figure 17).
   
''' Visualization of H-bonds and Clashs: '''
+
''' Visualization of H-bonds and Clashes: '''
   
 
To get more insight in the effects of the mutated amino acid on the structure, we also analysed the H-bonds and clashes of the Serine residue.
 
To get more insight in the effects of the mutated amino acid on the structure, we also analysed the H-bonds and clashes of the Serine residue.
Line 310: Line 345:
 
|Clashes of the mutation
 
|Clashes of the mutation
 
|-
 
|-
|[[Image:orig211.png|thumb|150px|H-bonds of the original amino acid (colored in magenta)]]
+
|[[Image:orig211.png|thumb|150px|Figure 18: H-bonds of the original amino acid (colored in magenta)]]
|[[Image:hbond211.png|thumb|150px|H-bonds of the mutated amino acid (colored in red)]]
+
|[[Image:hbond211.png|thumb|150px|Figure 19: H-bonds of the mutated amino acid (colored in red)]]
|[[Image:clash211.png|thumb|150px|Possible clashes]]
+
|[[Image:clash211.png|thumb|150px|Figure 20: Possible clashes]]
 
|-
 
|-
 
|}
 
|}
   
On the pictures you can see, that neither the original amino acid nor the mutated amino acid has any H-Bonds with other residues or the backbone of the protein. Therefore, it is not possible to explain the damage of the protein by missing H-bonds.
+
On the pictures you can see, that neither the original amino acid (Figure 18) nor the mutated amino acid (Figure 19) has any H-Bonds with other residues or the backbone of the protein. Therefore, it is not possible to explain the damage of the protein by missing H-bonds.
Furthermore, we can see, that there are no clashes between Serine and the rest of the protein, which means that the protein do not have to fold in another way because of clashing residues.
+
Furthermore, we can see, that there are no clashes between Serine and the rest of the protein (Figure 20), which means that the protein do not have to fold in another way because of clashing residues.
   
 
Back to [[http://i12r-studfilesrv.informatik.tu-muenchen.de/wiki/index.php/Structure-based_mutation_analysis_HEXA Structure-based mutation analysis]]
 
Back to [[http://i12r-studfilesrv.informatik.tu-muenchen.de/wiki/index.php/Structure-based_mutation_analysis_HEXA Structure-based mutation analysis]]
Line 325: Line 360:
   
 
''' Comparing Energy: '''
 
''' Comparing Energy: '''
 
   
 
To analyse the energy values calculated by Gromacs, we used the AMBER99SB-ILDN force field.
 
To analyse the energy values calculated by Gromacs, we used the AMBER99SB-ILDN force field.
Line 400: Line 434:
 
|}
 
|}
   
The difference between the energys calculated by Gromacs is much higher than the difference of the energy values calculated by the other tools. But otherwise, Gromacs use a real phisical force field and therefore, it should be the most accurate method to analyse the energy of different structures. In this case the energy of the mutated structure is much lower than the energy of the original structure. Therefore it is possible, that the protein become instable with the mutation and therefore does not work any more.
+
The difference between the energies calculated by Gromacs is much higher than the difference of the energy values calculated by the other tools. But otherwise, Gromacs use a real physical force field and therefore, it should be the most accurate method to analyse the energy of different structures. In this case the energy of the mutated structure is much higher than the energy of the original structure. Therefore it is possible, that the protein become unstable with the mutation and therefore does not work any more.
   
   
Line 408: Line 442:
   
 
{| border="1" style="text-align:center; border-spacing:0;"
 
{| border="1" style="text-align:center; border-spacing:0;"
|picture original aa
+
|picture original amino acid
|picture mutated aa
+
|picture mutated amino acid
 
|combined picture
 
|combined picture
 
|-
 
|-
|[[Image:gro_mut211_org.png|thumb|150px|Amino acid Phenylalanine]]
+
|[[Image:gro_mut211_org.png|thumb|150px|Figure 21: Amino acid Phenylalanine]]
|[[Image:gro_mut211_mut.png|thumb|150px|Amino acid Serine]]
+
|[[Image:gro_mut211_mut.png|thumb|150px|Figure 22: Amino acid Serine]]
|[[Image:gro_mut211_both.png|thumb|150px|Picture which visualize the mutation]]
+
|[[Image:gro_mut211_both.png|thumb|150px|Figure 23: Picture which visualize the mutation]]
 
|-
 
|-
 
|}
 
|}
   
This pictures is very similar to the pictures created by SCWRL. The mutated amino acid is much more smaller than the Phenylalanine and therefore does not need that much space. Otherwise it is possible, that because of the smaller amino acid, there are missing H-Bonds in the protein.
+
These pictures (Figure 21, Figure 22, Figure 23) is very similar to the pictures created by SCWRL (Figure Figure 15, Figure 16, Figure 17). The mutated amino acid is much more smaller than the Phenylalanine and therefore does not need that much space. Otherwise it is possible, that because of the smaller amino acid, there are missing H-Bonds in the protein.
   
''' Visualization of H-bonds and Clashs: '''
+
''' Visualization of H-bonds and Clashes: '''
   
 
To check if this is the case, we analysed the H-Bonds and clashes between the mutated amino acid and the rest of the protein.
 
To check if this is the case, we analysed the H-Bonds and clashes between the mutated amino acid and the rest of the protein.
Line 429: Line 463:
 
|Clashes of the mutation
 
|Clashes of the mutation
 
|-
 
|-
|[[Image:orig211.png|thumb|150px|H-bonds of the original amino acid]]
+
|[[Image:orig211.png|thumb|150px|Figure 24: H-bonds of the original amino acid]]
|[[Image:gro_hbond211.png|thumb|150px|H-bonds of the mutated amino acid]]
+
|[[Image:gro_hbond211.png|thumb|150px|Figure 25: H-bonds of the mutated amino acid]]
|[[Image:gro_clash211.png|thumb|150px|Possible clashes]]
+
|[[Image:gro_clash211.png|thumb|150px|Figure 26: Possible clashes]]
 
|-
 
|-
 
|}
 
|}
  +
  +
Both amino acids do not have any H-Bonds with the rest of the protein (Figure 24, Figure 25). Therefore a missing H-Bond does not cause the damage on the protein.
  +
Furthermore, it is not possible to find any clashes (Figure 26) in between the protein.
   
 
Back to [[http://i12r-studfilesrv.informatik.tu-muenchen.de/wiki/index.php/Structure-based_mutation_analysis_HEXA Structure-based mutation analysis]]
 
Back to [[http://i12r-studfilesrv.informatik.tu-muenchen.de/wiki/index.php/Structure-based_mutation_analysis_HEXA Structure-based mutation analysis]]

Latest revision as of 21:37, 31 August 2011

General Information

SNP-id rs121907974
Codon 211
Mutation Codon Phe -> Ser
Mutation Triplet TTC -> TCC

Sequence-based Mutation Analysis

Pysicochemical Properties

First of all, we explored the amino acid properties and compared them for the original and the mutated amino acid. Therefore we created the possible effect that the mutation could have on the protein.

Phe Ser consequences
polar, tiny, hydrophilic, neutral aliphatic, hydrophobic, neutral Ile is much bigger than Ser and also is branched, because it is an aliphatic amino acid. Therefore the structure of both amino acids is really different and Ile is too big for the position where Ser was. Therefore, there has to be a big change in the 3D structure of the protein and the protein probably will loose its function.


Back to [Sequence-based mutation analysis]


Visualization of the Mutation

In the next step, we created the visualization of the mutation with PyMol. Therefore we created a picture of the original amino acid (Figure 1), of the new mutated amino acid (Figure 2) and finally for both together in one picture whereas the mutation is white colored (Figure 3). The following pictures display that the mutated amino acid Serine looks very different to Phenylalanine. Phenylalanine has a huge aromatic ring. Contrary, Serine is very smaller and differs a little bit in the orientation. This shows that the amino acids have huge structural differences which will probably cause dramatic effects on protein structure and function.

picture original amino acid picture mutated amino acid combined picture
Figure 1: Amino acid Phenylalanine
Figure 2: Amino acid Serine
Figure 3: Picture which visualize the mutation


Back to [Sequence-based mutation analysis]


Substitution Matrices Values

Afterwards, we looked at the values of the substitution matrices PAM1, PAM250 and BLOSSUM62. Therefore we looked detailed at the three values: the value for the according amino acid substitution, the most frequent value for the substitution of the examined amino acid and the rarest substitution.

In this case, the substitution of Phenylalanine to Serine has low values that are nearer to the values for the rarest substitution for all three matrices. Therefore, an exchange at this position is very unlikely and a mutation will almost certainly cause structural changes which can affect functional changes.

PAM 1 Pam 250 BLOSOUM 62
value amino acid most frequent substitution rarest substitution value amino acid most frequent substitution rarest substitution value amino acid most frequent substitution rarest substitution
2 28 (Tyr) 0 (Asp, Cys, Glu, Lys, Pro, Val) 2 20 (Tyr) 1 (Arg, Asp, Cys, Gln, Glu, Gly, Lys, Pro) -2 3 (Tyr) -4 (Pro)


Back to [Sequence-based mutation analysis]


PSSM Analysis

Besides, we looked additional at the position specific scoring matrix (PSSM) for our sequence. In contrast to PAM and BLOSOUM, the PSSM contains a specific substitution rate for each position in the sequence. Therefore, the PSSM is more position specific than PAM or BLOSOUM. We extracted the substitution value for the underlying mutation, the value for the most frequent substitution and the rarest substitution.

In this case the substitution rate for Phenylalanine to Serine at this position is very low and near the value for the rarest substitution. This means this substitution at this position is likely very uncommon which indicates that this substitution has bad effects as a consequence. Therefore, we concluded that this mutation will probably cause protein structure changes as well as functional changes.


PSSM
value amino acid most frequent substitution rarest substitution
-5 11 -7


Back to [Sequence-based mutation analysis]


Conservation Analysis with Multiple Alignments

As a next step we created a multiple alignment which contains the HEXA sequence and 9 other mammalian homologous sequences from [UniProt]. Afterwards we looked at the position of the different mutations and looked at the conservation level on this position. The regarded mutation is presented by the first colored column (Figure 4). Here we can see, that all the other mammalians have the amino acid Phenylalanine at this position. Therefore, the mutation at this position is highly conserved and a mutation there will cause probably huge structural and functional changes for the protein.

Figure 4: Mutation in the multiple alignment


Back to [Sequence-based mutation analysis]


Secondary Structure Mutation Analysis

As a next step we compared the different results of the secondary structure prediction tools JPred and PsiPred. Afterwards we can examine in which secondary structure element and where therein the mutation takes place. This can give an overview of how drastic the mutation can be. In this case both tools agree and predict at the position of the mutation a coil. This has as result, that the mutation at this position would not destroy or split a secondary structure element. It will probably only changes the coil between two secondary structure elements, but this can sometimes also cause a change of the the following secondary structure. Furthermore, HEXA_HUMAN does not posses any disordered regions and therefore, a mutation in a coiled region do not change a functional important region. We think that a drastic change of the protein structure and its function is unlikely because the mutation does not affect a secondary structure element. The change of the coil will probably only take places between two secondary structure elements which will probably not change.

JPred:
...EEEECCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCC...
PsiPred:
...EEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCCCEEC...

Comparison with the real Structure:

Afterwards we also visualize the position of the mutation (red) in the real 3D-structure of [PDB] and compare it with the predicted secondary structure (Figure 5 and Figure 6). The visualization can display if the mutation is in a secondary structure element or in some other regions.

Here in this case the mutation position almost agree with the position of the predicted secondary structure and is within a coil. Like explained above this means a mutation will probably not destroy a secondary structure element which affects no drastic structural change. Otherwise it can cause a change at the position of the two nearest secondary structure element which can has a functional loose as a consequence. We think that a structural change is unlikely, because it is not within a secondary structure element and will therefore not cause extreme changes.

Figure 5: Mutation at position 211
Figure 6: Mutation at position 211 - detailed view


Back to [Sequence-based mutation analysis]


SNAP Prediction

Next, we looked at the result of the SNAP prediction. For this prediction we took the amino acid of the certain position and checked every possible amino acid mutation. Afterwards we extract the result for Serine which is the real mutation in this case. SNAP has as a result that the exchange from Phenylalanine to Serine at this position is non-neutral with a high accuracy. This means that this certain mutation at this position cause very likely structural and functional changes of the protein.

Substitution Prediction Reliability Index Expected Accuracy
S Non-neutral 5 87%

A detailed list of all possible substitutions can be found [here]

Back to [Sequence-based mutation analysis]


SIFT Prediction

Next, we used SIFT Prediction which displays if a mutation is neutral or not. Therefore, it first shows a row which contains a score for the particular mutation position of a certain amino acid. The amino acid which are not tolerated at this position are colored red. Besides, it also constructs a table which lists the amino acids that are predicted as tolerated and not-tolerated.

In this case, the only substitution that is tolerated is the one to Phenylalanine itself (Figure 8). The substitution to Serine is not-tolerated at this position. This means that this mutation at this position is probably not neutral and will cause probably structural and function changes of the protein.

SIFT Matrix:
Each entry contains the score at a particular position (row) for an amino acid substitution (column). Substitutions predicted to be intolerant are highlighted in red.

Figure 7: Legend
Figure 8: SIFT Table
Threshold for intolerance is 0.05.
Amino acid color code: non-polar, uncharged polar, basic, acidic.
Capital letters indicate amino acids appearing in the alignment, lower case letters result from prediction.




Predict Not ToleratedPositionSeq RepPredict Tolerated
ywvtsrqpnmlkihgedca211F1.00F




Back to [Sequence-based mutation analysis]


PolyPhen2 Prediction

Finally, we also regarded the PolyPhen2 prediction for this mutation. This prediction visualizes how strongly damaging the mutation probably will be. Therefore it gives the result for two possible cases: HumDiv and HumVar. HumDiv is the preferred model for evaluation rare alleles, dense mapping of regions identified by genome-wide association studies and analysis of neutral selection. In contrast, HumVar is the preferred model for diagnostic of Mendelian diseases which require distinguishing mutations with drastic effects from all remaining human variations including abundant mildly deleterious alleles. We decided to look at both possible models (Figure 9 and Figure 10), which agreed in the most cases.

In this case both models predict that the mutation is probably damaging. This means that the mutation is not neutral and will probably destroy the structure and the function of the protein.

Figure 9: HumDiv prediction
Figure 10: HumVar prediction


Back to [Sequence-based mutation analysis]


Structure-based Mutation Analysis

Mapping onto Crystal Structure

Figure 11: Visualization of the mutation and important functional sites
Color declaration:
* red: position of mutation
* green: position of active side
* yellow: position of glycolysation
* cyan: position of Cysteine

First of all, we colored the important residues and also the mutated residue in the crystal structure in Figure 11, to see if the mutation is near of far away from the functional residues. As you can see on the picture, the mutation is located within a loop and far away from the functional residues. Therefore, we do not know in which way this mutation affects the global structure of the protein.

Back to [Structure-based mutation analysis]


SCWRL Prediction

Because the mapping analysis does not give a good explanation why the mutation causes damages on the protein, we decided to analyse this mutation in more detail. Therefore, we looked for the structure of the original amino acid and the structure of the amino acid after the mutation event and compared them in size and orientation. For this purpose we used SCWRL.

picture original amino acid picture mutated amino acid combined picture
Figure 12: Amino acid Phenylalanine
Figure 13: Amino acid Serine
Figure 14: Picture which visualize the mutation

As you can see on Figure 12, the original amino acid (Phenylalanine) has a ring structure and is therefore, an aromatic amino acid. The mutated amino acid, visualized in Figure 13, is a Serine, which is smaller than Phenylalanine. Therefore there is no problem with the space for the amino acid. Serine is smaller than Phenylalanine and has enough space which means there should not be any clashes with other amino acid residues or with the backbone (Figure 14).

Back to [Structure-based mutation analysis]


FoldX Energy Comparison

One important point in the analysis of mutations is to look at the energy of the protein with the original and with the mutated amino acid. Often the energy increases dramatically with the mutated amino acid. This means, that the protein becomes very unstable and therefore, it is often possible that the protein can not bind its ligands any longer. Otherwise, it is also possible, that the protein with the mutated amino acid has a lower energy than the original protein. This means, that the protein is too rigid and loses its flexibility. Than it is also possible, that the protein can not bind the ligands any longer.

Therefore, we compared the energy of our protein with different methods. Here we want to present the result of FoldX.


Original total energy Total energy for the mutated protein Strongest energy changes within the mutated protein
-154.17 -144.25 -

In this case, the energy of the mutated protein is higher, than the energy of the original protein. Therefore, this means, that the mutated protein is not that stable than the original protein. So it is possible, that the mutated protein loses its function, because it is too unstable to bind the ligand.

We also will compare the energy values of these two structure with other methods. Because of the different calculation methods, it is not possible to compare the energy values directly. Therefore we decided to calculate the ratio between the energy values of the two structures. Our original mutation has the value 100, with this value we calculate the value of the mutated structure.


Ratio Original Ratio mutated protein Difference
100 93.66 6.34


Back to [Structure-based mutation analysis]


Minimise Energy Comparison

Next we use the minimise energy tool to compare the energy values of the two different structures.

Comparing Energy:

Original total energy Total energy for the mutated protein
-9610.467157 -9594.637506

The total energy of the mutated structure is a little bit higher than the energy of the original protein structure. To have the possibility to compare these energy values with the values of the other analysis tools, we calculated a ratio between these energy values.

Ratio of the original protein Ratio of the mutated protein Difference
100 99.84 0.16

Therefore, we can see, that the mutated structure has only 0.16% less energy than the original structure.

Comparing Structure:

This tool also gives as output a PDB file with the position of the original and the mutated amino acid.

picture original amino acid picture mutated amino acid combined picture
Figure 15: Amino acid Phenylalanine
Figure 16: Amino acid Serine
Figure 17: Picture which visualize the mutation

If we have a look at the pictures (Figure 15, Figure 16), we can see that the location of the residue of the two amino acids is very similar (Figure 17).

Visualization of H-bonds and Clashes:

To get more insight in the effects of the mutated amino acid on the structure, we also analysed the H-bonds and clashes of the Serine residue.

H-bonds of the original amino acid H-bonds of the mutated amino acid Clashes of the mutation
Figure 18: H-bonds of the original amino acid (colored in magenta)
Figure 19: H-bonds of the mutated amino acid (colored in red)
Figure 20: Possible clashes

On the pictures you can see, that neither the original amino acid (Figure 18) nor the mutated amino acid (Figure 19) has any H-Bonds with other residues or the backbone of the protein. Therefore, it is not possible to explain the damage of the protein by missing H-bonds. Furthermore, we can see, that there are no clashes between Serine and the rest of the protein (Figure 20), which means that the protein do not have to fold in another way because of clashing residues.

Back to [Structure-based mutation analysis]


Gromacs Energy Comparison

Comparing Energy:

To analyse the energy values calculated by Gromacs, we used the AMBER99SB-ILDN force field.

Here are the values of the original structure:

Energy Average Err.Est. RMSD Tot-Drift
Bond 1091.57 270 -nan -1622.75
Angle 3326.81 62 -nan 404.076
Potential -61304.1 960 -nan -6402.44

Here you can see the values which gromacs calculated for the structure with the mutated amino acid:

Energy Average Err.Est. RMSD Tot-Drift
Bond 1166.24 390 3425.52 -2317.42
Angle 3275.2 50 185.491 324.728
Potential -46177.4 3600 49139.5 -22414.8

One difference between gromacs and the other tools we used is, that gromacs also calculated the energy for the bonds and the angles. To compare the energies between the different tools we only consider the potential energy in our analysis, because the potential energy is the energy of the complete protein. Therefore, we calculated the ratio between the energies only for the potential energy.

Ratio original amino acid Ratio mutated amino acid Difference
100 75.32 24.68

The difference between the energies calculated by Gromacs is much higher than the difference of the energy values calculated by the other tools. But otherwise, Gromacs use a real physical force field and therefore, it should be the most accurate method to analyse the energy of different structures. In this case the energy of the mutated structure is much higher than the energy of the original structure. Therefore it is possible, that the protein become unstable with the mutation and therefore does not work any more.


Comparing Structure:

Gromacs also offers pictures of the mutated amino acids which can be seen in the following section.

picture original amino acid picture mutated amino acid combined picture
Figure 21: Amino acid Phenylalanine
Figure 22: Amino acid Serine
Figure 23: Picture which visualize the mutation

These pictures (Figure 21, Figure 22, Figure 23) is very similar to the pictures created by SCWRL (Figure Figure 15, Figure 16, Figure 17). The mutated amino acid is much more smaller than the Phenylalanine and therefore does not need that much space. Otherwise it is possible, that because of the smaller amino acid, there are missing H-Bonds in the protein.

Visualization of H-bonds and Clashes:

To check if this is the case, we analysed the H-Bonds and clashes between the mutated amino acid and the rest of the protein.

H-Bonds of the original amino acid H-bonds of the mutated amino acid Clashes of the mutation
Figure 24: H-bonds of the original amino acid
Figure 25: H-bonds of the mutated amino acid
Figure 26: Possible clashes

Both amino acids do not have any H-Bonds with the rest of the protein (Figure 24, Figure 25). Therefore a missing H-Bond does not cause the damage on the protein. Furthermore, it is not possible to find any clashes (Figure 26) in between the protein.

Back to [Structure-based mutation analysis]