Difference between revisions of "Canavan Task 6 - Sequence-based mutation analysis"
(→Discussion) |
(→Discussion) |
||
Line 812: | Line 812: | ||
<td style="border-bottom:solid;border-right:solid">V278M</td> |
<td style="border-bottom:solid;border-right:solid">V278M</td> |
||
<td style="border-bottom:solid;border-right:solid">Location and Structure</td> |
<td style="border-bottom:solid;border-right:solid">Location and Structure</td> |
||
− | <td style="border-bottom:solid;border-right:solid bgcolor="#90EE90 |
+ | <td style="border-bottom:solid;border-right:solid bgcolor="#90EE90">no effect</td> |
− | <td style="border-bottom:solid;border-right:solid bgcolor="#90EE90 |
+ | <td style="border-bottom:solid;border-right:solid bgcolor="#90EE90">little to no effect</td> |
− | <td style="border-bottom:solid;border-right:solid bgcolor="#FFFF00 |
+ | <td style="border-bottom:solid;border-right:solid bgcolor="#FFFF00">possible effect</td> |
<td style="border-bottom:solid;border-right:solid">SIFT</td> |
<td style="border-bottom:solid;border-right:solid">SIFT</td> |
||
<td style="border-bottom:solid;border-right:solid">Polyphen</td> |
<td style="border-bottom:solid;border-right:solid">Polyphen</td> |
Revision as of 18:18, 18 June 2012
Contents
Protocol
Further information can be found in the protocol.
Choosing mutations
The ten chosen mutations are listed below in <xr id="canavan_muts"/>.
<figtable id="canavan_muts"> <xr nolink id="canavan_muts"/> Listed are the 10 chosen mutations used for this task and their database sources. Mutations colored in red are disease causing. Included is also a superposition of the native residue with the mutanted residue.
Mutant | E285A | A305E | G123E | R71H | R71K | K213E | V278M | M82T | E235K | I270T |
Known effect | <= 1% activity left | 0% activity left | <= 25% | Canavan Disease | not disease related | Canavan Disease | not disease related | not disease related | Canavan Disease | not disease related |
structural location | helix - binding pocket | loop C-terminus | beta-sheet | active site | active site | loop region | loop region | outer loop | ||
Sources | dbSNP SNPdbe OMIM HGMD |
dbSNP SNPdbe OMIM HGMD |
SNPdbe HGMD |
dbSNP SNPdbe HGMD |
SNPdbe | HGMD | dbSNP SNPdbe |
dbSNP SNPdbe |
dbSNP SNPdbe |
dbSNP |
</figtable>
Physico-chemical effects
<xr id="mutations_summary"/> is meant to give a first idea of the nature of the mutations. For each of the ten mutations, we included following properties for the amino acids:
- Hydrophobicity scores <ref name="hydrophobicity"> Kyte J, Doolittle RF (May 1982).Journal of Molecular Biology 157 (1): 105–32. </ref> range from -7.5 (Arg) to 3.1 (Ile), where Arginine is most hydrophilic and Isoleucine most hydrophobic
- the volume <ref name="volume">Volume: A.A. Zamyatin, Prog. Biophys. Mol. Biol., 24(1972)107-123</ref>, measured in Å3
- the isoelectric point <ref name="pi">JC Biro, Theor Biol Med Model. 2006; 3: 15</ref>, which is the pH at which the amino acid's overall charge is zero
- the Grantham score <ref name="grantham">Grantham R (1974) Amino acid difference formula to help explain protein evolution. Science 185: 862–864</ref>, which classifies amino acid substitutions into the classes conservative (0-50, coloured green), moderately conservative (51-100, yellow), moderately radical (101-150, orange), or radical (≥151, red)
<figtable id="mutations_summary">
Mutation | Amino acid | Hydrophobicity (and diff.) | Volume [Å3] (and diff.) | Isoelectric point pI(Charge) | Grantham Score | |||
E285A | Glutamic Acid | -3.5 (polar) | 5.3 | 138.4 (bulky) | 49.8 | 3.2 (negative) | 2.8 | 107 |
Alanine | 1.8 (nonpolar) | 88.6 (tiny) | 6.0 (neutral) | |||||
A305E | Alanine | 1.8 (nonpolar) | 5.3 | 88.6 (tiny) | 49.8 | 6.0 (neutral) | 2.8 | 107 |
Glutamic Acid | -3.5 (polar) | 138.4 (bulky) | 3.2 (negative) | |||||
G123E | Glycine | -0.4 (nonpolar) | 3.1 | 60.1 (tiny) | 78.3 | 6.0 (neutral) | 2.8 | 98 |
Glutamic Acid | -3.5 (polar) | 138.4 (bulky) | 3.2 (negative) | |||||
R71H | Arginine | -4.5 (polar) | 1.3 | 173.4 (bulky) | 20.2 | 10.8 (positive) | 3.2 | 29 |
Histidine | -3.2 (polar) | 153.2 (bulky) | 7.6 (neutral) | |||||
R71K | Arginine | -4.5 (polar) | 0.6 | 173.4 (bulky) | 4.8 | 10.8 (positive) | 1.1 | 26 |
Lysine | -3.9 (polar) | 168.6 (bulky) | 9.7 (positive) | |||||
K213E | Lysine | -3.9 (polar) | 0.4 | 168.6 (bulky) | 30.2 | 9.7 (positive) | 6.5 | 26 |
Glutamic Acid | -3.5 (polar) | 138.4 (bulky) | 3.2 (negative) | |||||
V278M | Valine | 4.2 (nonpolar) | 2.3 | 140.0 (small) | 22.9 | 6.0 (neutral) | 0.3 | 21 |
Methionine | 1.9 (nonpolar) | 162.9 (bulky) | 5.7 (neutral) | |||||
M82T | Methionine | 1.9 (nonpolar) | 1.2 | 162.9 (bulky) | 46.8 | 5.7 (neutral) | 0.2 | 81 |
Threonin | -0.7 (polar) | 116.1 (small | 5.9 (neutral) | |||||
E235K | Glutamic Acid | -3.5 (polar) | 0.4 | 138.4 (bulky) | 30.2 | 3.2 (negative) | 6.5 | 56 |
Lysine | -3.9 (polar) | 168.6 (bulky) | 9.7 (positive) | |||||
I270T | Isoleucine | 4.5 (nonpolar) | 3.8 | 166.7 (bulky) | 50.6 | 5.9 (neutral) | 0.0 | 89 |
Threonin | -0.7 (polar) | 116.1 (small) | 5.9 (neutral) |
</figtable>
Substitution matrices
Blosum62
This substitution matrix can be found in the protocol. The lowest score in Blosum62, that represents a really unlikely mutation event is -4 and the highest score, that represents a likely mutation is 11. Blossum substitution matrices are derived from non gapped local alignments of the BLOCKS database. Blossum62 is derived from local alignments of sequences with at most 62% sequence similarity. The scores in the matrix represent the log-odds values of a substitution. Negative scores meaning the substituion is less than randomly expected and positive values meaning the substitution is observed more than randomly expected.
PAM1
This substitution matrix can also be found in the protocol. The worst score in PAM1 is 0 for 0% mutation probability and the highest score is 56 for 0.56% mutation probability. A score in PAM1 expresses how probable a mutation from A to B will be, assuming the two proteins are 99% similar (1% AA mutations in both sequences). As we are just considering variations of the same protein within a family, mutation generally are very unlikely with the most probable mutation being Asp to Glu with 0.56%.
Mutations
E285A
Location
<figure id="e285a_hb" >
</figure>
This amino acid is located at the beginning of helix eight in the binding pocket. Yet is it not involved in substrate binding. DSSP does not assign a state to E285.
As can be seen in <xr id="e285a_hb"/> E285 has several hydrogen bonds with the backbone as well as Y118. In case of the mutation to alanine, these hydrogen bonds can not be established anymore and might destabilize the structure.
physico-chemical properties
Glutamic acid and Alanine have different physico-chemical properties, which is represented in a moderate Grantham Score of 107. Whereas glutamic acid is negatively charged, alanine has no charge or polarity. Furthermore Alanine is much smaller than glutamic acid. From these different properties one would expect an effect of the mutation.
substitution matrices
In Blosum62 the substitution E --> A is scored -1. This means, that a substitution from E to A is very unlikely, since the lowest score in Blosum62 is -4 and negative scores refer to less than randomly expected events.
In PAM1 the substitution E --> A is scored 17 which equals a mutation probability of 0.17%.
Therefore both substitution matrices expect this mutation to be rather unlikely.
PSSM
Besides glutamic acid, proline also has a high score at this position. This is surprising because both amino acids have very different properties. Alanine still has the third highest score, which suggests, that this substitution can be observed relatively often. The high information content at this position also stands for a well conserved position with only few specific substitutions.
A R N D C Q E G H I L K M F P S T W Y V A R N D C Q E G H I L K M F P S T W Y V 285 E 2 0 -1 -1 -6 -2 5 -4 1 -5 -4 -4 -3 -5 5 -2 -2 -6 -6 -3 17 4 3 3 0 1 31 1 4 0 2 1 1 0 26 2 3 0 0 2 0.92 inf
SIFT
SIFT predicts an effect of this mutation with a score of 0.00. Amino acids with probabilities < .05 are predicted to be deleterious. The confidence for this prediction is 0.94 and therefore very high, so that the prediction seems reasonable. Furthermore, SIFT predicts ANY substitution of E at this position to be deleterious. Also the conservation of E at this position is very high with a value of 3.01.
Substitution at pos 285 from E to A is predicted to AFFECT PROTEIN FUNCTION with a score of 0.00. Median sequence conservation: 3.01 Sequences represented at this position:17
pos A C D E F G H I K L M N P Q R S T V W Y 285E 0.94 0.00 0.00 0.00 1.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
Polyphen2
This mutation is predicted to be probably damaging with a score of 1.000 (sensitivity: 0.00; specificity: 1.00)
Polyphen used 64 sequences of a multiple sequence alignment at position 285.
Evaluation
Both SNP effect prediction methods give a clear result and predict this SNP to cause a mutation. One also would expect this result when taking into account the physico chemical properties of the involved amino acids. They differ quite a lot. For example, alanine would not be able to establish the four hydrogen bonds, that E285 is involved in. Furthermore the close proximity to the active side lets assume that the mutation could influence the activity of the enzyme.
This mutation is the most common Canavan Disease causing mutation among the Ashkenazi Jews. It causes an activity loss of 99% of the enzyme. In this case, the prediction methods predicted correctly.
A305E
location
<figure id="a305e_crowded">
</figure>
A305 is located at the end of the 13th beta sheet at the C-terminus of the protein (protein length: 313 AA). DSSP also assignes state E to this residue as the last residue in the respective sheet.
As can be seen in <xr id="a305e_crowded"/> the space at this position is rather crowded, so that alanine as a small residue fits this very well in this position. Glutamic acid instead, hardly finds space and overlaps with neighboring residues. Therefore the mutation A305E probably interferes with the local secondary structure as this residue needs more space.
physico-chemical properties
As for E285A, the physico-chemical properties for Alanine and Glutamic acid are different, which is represented in a moderate Grantham Score of 107. Whereas glutamic acid is negatively charged, alanine has no charge or polarity. Furthermore Alanine is much smaller than glutamic acid. From these different properties one would expect an effect of the mutation.
substitution matrices
As Blossum is a symmetrical substitution matrix, the score for A305E is the same as for E285A: -1. The lowest score in Blosum64, that represents a really unlikely mutation event is -4 and the highest score, that represents a likely mutation is 11. This means, that a substitution from E to A is very unlikely.
In PAM1 the substitution A --> E is scored 10 which equals a mutation probability of 0.1%.
Therefore both substitution matrices expect this mutation to be rather unlikely.
PSSM
A substitution to leucine seems to be accepted as well and also Isoleucine has a higher score. This is not surprising, since these three amino acids have similar physico chemical properties. A mutation to glutamic acid is uncommon, but not less often observed than other substitutions. The low information content at this position (0.37) lets suggest, that this position is not well conserved.
A R N D C Q E G H I L K M F P S T W Y V A R N D C Q E G H I L K M F P S T W Y V 305 A 2 -4 -4 -4 0 -1 -2 -4 -4 2 2 -3 1 0 -1 -2 -2 -4 -3 3 23 0 0 0 2 3 3 0 0 13 23 0 3 5 2 1 1 0 0 21 0.37 inf
SIFT
SIFT predicts an effect of this mutation with a score of 0.02. Amino acids with probabilities < .05 are predicted to be deleterious. The confidence for this prediction is 0.89 and therefore still very reliable. Furthermore, 13 mutations are predicted to be deleterious at sequence position 305.
Substitution at pos 305 from A to E is predicted to AFFECT PROTEIN FUNCTION with a score of 0.02. Median sequence conservation: 3.03 Sequences represented at this position:16 pos A C D E F G H I K L M N P Q R S T V W Y 305A 0.89 1.00 0.03 0.01 0.02 0.01 0.05 0.01 0.10 0.02 0.08 0.02 0.01 0.04 0.02 0.01 0.06 0.08 0.72 0.00 0.01
Polyphen 2 This mutation is predicted to be probably damaging with a score of 1.000 (sensitivity: 0.00; specificity: 1.00)
Polyphen used 44 sequences of a multiple sequence alignment at position 305.
Evaluation
Again, both SNP effect prediction methods give a clear result and predict this SNP to cause a mutation.The substitution matrices give further confidence, that this mutation is unfavored and might induce some effects. The physico-chemical properties suggest the same conclusion, as both involved amino acids possess differing properties.
Only looking at the location of the mutation almost at the C-terminus one would not expect structural or functional implications, since the mutant residue is far away from the known active site or the dimer interface. Yet, when taking into account the information about the restricted space for an amino acid at this location, one would expect changes in the local structure: neighboring sidechains might move to allow glutamic acid to have enough space. These movements in turn might also influence other parts of the enzyme structure.
This mutation is the most common Canavan Disease causing mutation among the non-Ashkenazi Population. It goes along with a complete loss of funtion of the protein. This severe effect is not expected from the location of the mutation. But both prediction methods have been correct.
G123E
Location
<figure id="g123e_space">
</figure>
G123 is located at the beginning of the fourth beta strand in Aspartoacylase. DSSP also assignes and E to G123, considering it to be in a sheet. This strand is not buried and is solvent accessible. As can be seen from <xr id="g123e_space"/>, one would not expect any effect of this muation. Eventhough glutamic acid is much larger than glycin, there is enough space and no clashes occur. Furthermore, since Glycin is not involved in any interactions, there are thus no interactions that are be lost due to the mutaion.
physico-chemical properties
The physico-chemical protperties for glutamic acid and glycine are as much different as for alanine and glutamic acid. Whereas glutamic acid is very bulky and has a negative charge, glycine is the smalles amino acid and has no charge. These different properties are represented by a still moderate Grantham Score of 98. From these different properties one would expect an effect of the mutation.
substitution matrices
The Blosum62 score for the substitution of glycine with glutamic acid is -2. This value represents a likelihood of the mutation to occur less often than random. The lowest score in Blosum64, that represents a really unlikely mutation event is -4 and the highest score, that represents a likely mutation is 11.
In PAM1 the substitution --> GE is scored 4 which equals a mutation probability of 0.04%. The highest probability of a mutation in PAM1 is 0.56%.
Therefore both substitution matrices imply that this mutation is unlikely.
PSSM
A substitution to glutamic acid is the most often occuring mutation at this position. The low information content at this position implies that glycin is not very conserved and has no important function in the protein. This goes along with the insights gained from the analysis of the location of the mutation in the structure.
A R N D C Q E G H I L K M F P S T W Y V A R N D C Q E G H I L K M F P S T W Y V 123 G -1 0 2 1 -3 -1 2 2 1 -2 -2 -1 -1 -1 1 -1 -1 -4 0 -2 6 5 12 6 1 2 16 14 4 2 5 3 2 2 8 2 3 0 3 4 0.20 inf
SIFT
SIFT predicts this mutation to have an effect. The confidence for this result is very high (1.00). Furthermore, SIFT predict any substitution except for glutamate to be deleterious. It is somewhat surprising, that glutamate is not predicted to be deleterious, whereas the structural very similar glutamic acid is not. This lets suggest, that the effect is due to the negative charge of glutamic acid.
Substitution at pos 123 from G to E is predicted to AFFECT PROTEIN FUNCTION with a score of 0.02. Median sequence conservation: 2.99 Sequences represented at this position:18 pos A C D E F G H I K L M N P Q R S T V W Y 123G 1.00 0.04 0.01 0.03 0.02 0.00 1.00 0.01 0.00 0.02 0.01 0.00 0.03 0.02 0.10 0.01 0.03 0.01 0.01 0.00 0.00
Polyphen This mutation is predicted to be probably damaging with a score of 0.994 (sensitivity: 0.46; specificity: 0.96) Polyphen used 75 sequences for the MSA.
Evaluation
Both SNP effect prediction methods predict this mutation to be deleterious. The same impact can be deduced from the difference in physico-chemical properties between the both residues. Blosum62 and PAM1 state that the substitution from glycine to glutamic acid is very unlikely.
Other conclusions can be made from the analysis of the structural location of the mutant. No effect is expected from a change from G to E at this solvent accessible position in the structure. Additionaly the PSSM suggests, that the substitution of G to E is very likely in related sequences.
As has been found in "Identification and expression of eight novel mutations among non-Jewish patients with Canavan disease.", Kaul R et al, Am. J. Hum. Genet. 59:95-102(1996), the mutation G123E leads to a 25% decrease in enzyme activity and leads to in Canavan Disease. In contrast to the first two disease causing mutations, where the prediction results were correct and in agreement, this time different conclusions could be made. However for the first two mutations, the effect were very dramatic resulting in a complete loss of function. This mutations leaves a functionality of 25% and maybe that is why the results are not clear without ambiguity.
R71H
Location
<figure id="r71h_hbonds">
</figure>
R71 is located at the end of the fourth helix in the active site of the enzyme. It is involved in substrate binding via one Hbond. It also forms other Hbonds with an active water molecule and D68. DSSP assigns this position the state 'G', which stands for a 3-10 helix.
Due to its positioning in the active site and the several Hbonds R71 is involved in, it seems very likely that any mutation is deleterious. The interactions of R71 and the respective positioning of H71 is shown in <xr id="r71h_hbonds"/>.
physico-chemical properties
Arginine and histidine have almost similar physico chemical properties. Both are polar and bulky amino acids, with arginine having a stretched shape whereas histidine has a round shape. Another small difference is the positive charge of arginine versus no charge of histidine (only at pH below 6 histidine bears a positive charge). Therefore only from the chemical properties one would not expect huge impact of the mutation.
substitution matrices
The Blosum62 score for the substitution of arginine with histidine acid is 0. This means, histidine substitution for arginine occur as often as would be expected to happen by chance.
In PAM1 the substitution R--> H is scored 8 which equals a mutation probability of 0.08%. The highest probability of a mutation in PAM1 is 0.56%.
Therefore from both substitution matrices one can imply that this mutation is not very likely.
PSSM
From the information content of 2.8 at this position one can conclude the big importance of arginine at this position. This goes along with the analysis of the location of arginine in the binding site of the enzyme. Obviously no other amino acid is favoured at this postition in MSAs of related sequences. Therefore, one would expect a huge effect from the mutation R71H.
A R N D C Q E G H I L K M F P S T W Y V A R N D C Q E G H I L K M F P S T W Y V 71 R -4 9 -5 -5 -8 -3 -6 -7 -3 -7 -7 -3 -1 -7 -7 -5 -4 -3 -4 -7 2 88 1 1 0 1 0 0 1 0 0 1 2 0 0 1 1 0 1 0 2.80 inf
SIFT
SIFT predicts an effect for the mutation R71H. The conservation of R at position 71 is very high with 2.99. Besides Methionine and Lysine, all other substitutions are predicted to be deleterious. This goes along with the results from the PSSM.
Substitution at pos 71 from R to H is predicted to AFFECT PROTEIN FUNCTION with a score of 0.01. Median sequence conservation: 2.99 Sequences represented at this position:18 pos A C D E F G H I K L M N P Q R S T V W Y 71R 1.00 0.02 0.00 0.00 0.01 0.00 0.01 0.01 0.01 0.06 0.02 0.09 0.01 0.01 0.02 1.00 0.01 0.01 0.01 0.00 0.01
Polyphen2
This mutation is predicted to be probably damaging with a score of 0.909 (sensitivity: 0.69; specificity: 0.90). Polyphen used 75 sequences in the MSA.
Evaluation
Taken all information into account, one would expect this mutation to be deleterious. Both SNP effect prediction methods come to this conclusion. The substitution matrices and PSSM give only low likelihoods and also from analysis of the location of R71 in the binding site one would expect, that any substitution has an impact on the binding of substrate and therefore activity of the enzyme.
These predictions agree with the annotation in HGMD, which associates R71H with Canavan Disease.
R71K
Location
<figure id="r71k_hbonds">
</figure>
Again, R71 is located at the end of the fourth helix in the active site of the enzyme. It is involved in substrate binding via one Hbond. It also forms other Hbonds with an active water molecule and D68. DSSP assigns this position the state 'G', which stands for a 3-10 helix.
As can be seen in <xr id="r71k_hbonds"/>, K71 could be oriented in the same way as R71. Therefore is is possible for lysine to interact with the substrate or with D68. In contrast to R71H, the effect of this mutation is not expected to have such a big influence, since some of the interactions might be kept.
physico-chemical properties
Both residues have similar physico-chemical properties. Both residues are positively charged and therefore polar. Both are bulky residues that have the same overal shape. One difference is the amount of amino groups in the side chain: whereas arginine has two amino groups to form Hbonds, lysine has only one. From these similar properties, one would not expect a big impact of this muation.
substitution matrices
The substitution of R to K gets a score of 2 in the Blosum62 matrix. This means, this substitution is favoured and expected to occur slightly more often than random.
In PAM1 this substitution is scored 37, which is one of the higher scores in this matrix with the highest value being 57. It means, that the probability for this mutation to happen equals 0.37%.
Therefore, from the substitution matrices one can conclude that this mutation is almost likely.
PSSM
As already mentioned for the mutation R71H, this position in sequence is highly conserved, and allows hardyl no other amino acids. Even for lysine, with its similar physico chemical properties, hardly any substitutions are observed in related sequences.
A R N D C Q E G H I L K M F P S T W Y V A R N D C Q E G H I L K M F P S T W Y V 71 R -4 9 -5 -5 -8 -3 -6 -7 -3 -7 -7 -3 -1 -7 -7 -5 -4 -3 -4 -7 2 88 1 1 0 1 0 0 1 0 0 1 2 0 0 1 1 0 1 0 2.80 inf
SIFT
SIFT predicts the mutation R71K to be tolerated with a score of 0.06. This score is just above the threshold of 0.5, below which substitutions are classified as deleterious. Arginine is very highly conserved at this position (2.99) and only lysine and methione have values above 0.5 and are thus tolerated.
Substitution at pos 71 from R to K is predicted to be TOLERATED with a score of 0.06. Median sequence conservation: 2.99 Sequences represented at this position:18 pos A C D E F G H I K L M N P Q R S T V W Y 71R 1.00 0.02 0.00 0.00 0.01 0.00 0.01 0.01 0.01 0.06 0.02 0.09 0.01 0.01 0.02 1.00 0.01 0.01 0.01 0.00 0.01
Polyphen2
This mutation is predicted to be benign with a score of 0.421 (sensitivity: 0.84; specificity: 0.79). Polyphen used 75 sequences in the MSA.
Evaluation
From the prediction methods as well as from the other analysis one can expect this mutation to be tolerated. Also arginine is involved in substrate binding and is highly conserved among related proteins, the similar physico chemical properties of arginine and lysine might allow this substitution.
This goes along with the annotation for this mutation, that is not classifies as disease causing. Yet there is one source of information <ref name="r71k">Le Coq J., Pavlovsky A., Malik R., Sanishvili R., Xu C., Viola R.E, Examination of the mechanism of human brain aspartoacylase through the binding of an intermediate analogue, Biochemistry 47:3484-3492(2008)</ref> about mutation experiments, from which is conducted that this mutation reduces activity of the enzyme by 99%!
K213E
Location
Residue 213 is located on the outside of the protein, on an outer loop region, and it is nowhere close to the dimer interaction site, either. It does not form any backbone-backbone interactions. //picture
Physico-chemical properties
Lysine and Glutamic Acid are fairly similar amino acids: they are both polar and bulky, and only their charge differs (Lysine is positively charged, Glutamic Acid negatively). The Grantham score lies at 26, so the substitution is considered conservative.
Substitution Matrices
Blosum62 score: 1, i.e, it is expected to occur slightly more often than random.
PAM1 score: 4, so, it is only expected to occur with 4% probability, where 37% is the highest possible score.
-> The position-independent substitution matrices consider this substitution as not very likely, but possible.
PSSM
Information content on position 213 is only 0.09, which fits well to the position on an outer loop. A substitution to Glutamic Acid seems not exactly likely, but should not have very large impact either.
A R N D C Q E G H I L K M F P S T W Y V A R N D C Q E G H I L K M F P S T W Y V 213 K 1 0 -1 0 -2 0 0 -1 1 -1 -1 1 0 -1 2 0 0 -2 -1 1 10 5 2 5 1 4 7 4 3 4 6 7 2 3 12 5 7 1 2 10 0.09 inf
PolyPhen2
This mutation is predicted to be benign with a score of 0.003 (sensitivity: 0.98; specificity: 0.44).
Evaluation
Grantham 26
When looking at the position in the protein and the physico-chemical properties, the mutation could be expected to be tolerated. Both substitution matrices, however, suggest that it is not very likely to occur, and the PSSM value
V278M
Location
Physico-chemical properties
Valine and Methionine are both hydrophobic (Valine slightly more so), and both are neutrally charged. Valine is smaller and rounder than Methionine. Methionine contains sulfur, but is not able to form disulfide bonds, so this fact does not make a big difference. This is reflected in a conservative Grantham score of 21.
Substitution Matrices
Blosum62 score: 1, i.e., it is expected to occur slightly more often than random.
PAM1 score: 17, i.e., 17% probability to occur (where 57% is the highest probability here).
-> The substitution matrices do not consider the mutation likely, but not impossible.
PSSM
The information content of this position is 41, so it is definitely not unconserved. A substitution to Methionine results in a very low to low score compared to other amino acids.
A R N D C Q E G H I L K M F P S T W Y V A R N D C Q E G H I L K M F P S T W Y V 278 V 0 -4 -5 -3 0 -3 2 2 -3 1 0 -4 -3 2 -1 -3 -2 -2 -3 4 6 1 0 1 2 1 14 14 1 7 10 0 0 9 3 2 2 1 0 26 0.41 inf
PolyPhen2
This mutation is predicted to be probably damaging with a score of 1.000 (sensitivity: 0.00; specificity: 1.00).
Evaluation
M82T
Location
Physico-chemical properties
Methionine is nonpolar, whereas Threonine is slighly polar. Methionine is large and longish, while Threonine is much smaller (difference of approx 40 Angstroms) and more round. Threonine has an additional hydroxyl group, while Methionine contains sulfur. Both are neutrally charged. Still, the other differences are large enough to result in a Grantham score of 81 (moderately conservative).
Substitution Matrices
Blosum62 score: -1, i.e., this mutation is expected slightly less often than random.
PAM1 score: 2, i.e., 2% and not very likely (8 being the highest score, so substitutions seem unlikely in general).
-> The matrices consider this substitution as slightly unlikely, PAM1 somewhat less likely than Blosum62.
PSSM
A R N D C Q E G H I L K M F P S T W Y V A R N D C Q E G H I L K M F P S T W Y V 82 M 1 -1 0 1 -1 0 0 1 0 -1 -1 0 -1 -2 1 1 1 0 -1 -1 11 3 3 9 1 4 6 10 2 3 5 4 2 2 7 10 10 1 2 5 0.08 inf
PolyPhen2
This mutation is predicted to be benign with a score of 0.001 (sensitivity: 0.99; specificity: 0.15)
Evaluation
E235K
Location
Physico-chemical properties
Glutamic acid is (obviously) acidic, while Lysine is basic. Both are polar, and both are charged (Glu negatively, Lys positively). Both are bulky (but Lysine is bigger). Their Grantham score is still moderately conservative at 56.
Substitution Matrices
Blosum62 score: 1 -> slightly more often than random
PAM1 score: 4, so it is fairly unlikely (56 highest)
-> Both matrices consider the substitution as not very likely, but not impossible.
PSSM
A R N D C Q E G H I L K M F P S T W Y V A R N D C Q E G H I L K M F P S T W Y V 235 E 0 -1 -2 3 -3 1 3 -1 2 -3 -3 0 -4 -5 3 1 -1 -6 -4 -3 9 3 2 17 1 5 17 4 4 1 2 5 0 0 12 9 4 0 1 2 0.41 inf
PolyPhen2
This mutation is predicted to be benign with a score of 0.000 (sensitivity: 1.00; specificity: 0.00).
Evaluation
I270T
Location
Physico-chemical properties
Isoleucine is very unpolar and bulky, while Threonine is slightly polar and small (volume difference of 50Angstrom^3). Both are neutrally charged. Threonine has an additional hydroxyl group. This results in a still moderately conservative Grantham score of 89.
Substitution Matrices
Blosum62 score: -1, i.e., slightly less often than random.
PAM1 score: 7, so fairly unlikely (12 being the highest probability).
-> Both matrices consider the substitution as unlikely.
PSSM
A R N D C Q E G H I L K M F P S T W Y V A R N D C Q E G H I L K M F P S T W Y V 270 I -3 2 -4 -5 -3 -3 -4 -5 -2 5 2 -2 -1 -3 -2 -4 -4 -4 -1 4 2 11 1 1 0 1 1 1 1 29 17 3 1 1 3 2 1 0 2 22 0.74 inf
PolyPhen2
This mutation is predicted to be probably damaging with a score of 0.999 (sensitivity: 0.14; specificity: 0.99).
Evaluation
Discussion
<figtable id="summary_table">
Mutation | Location and Structure | Phys-Chem Properties(Grantham) | Substitution Matrices | PSSM | SIFT | Polyphen | SNAP | annotated effect |
E285A | possible effect | deleterious 107 |
possible effect | no effect | deleterious effect 0.00 |
damaging effect 1.00 |
SNAP | Canavan Disease <= 1% activity |
A305E | structural effect | deleterious effect 107 |
possible effect | possible effect | deleterious effect 0.02 |
damaging effect 1.00 |
SNAP | Canavan Disease 0% activity |
G123E | no effect | deleterious effect 98 |
deleterious effect | no effect | deleterious effect 0.02 |
damaging effect 0.994 |
SNAP | Canavan Disease ~ 25% activity |
R71H | deleterious effect | no effect 29 |
possible effect | deleterious effect | deleterious effect 0.02 | damaging effect 0.909 | SNAP | Canavan Disease |
R71K | no effect | no effect 26 |
no effect | deleterious effect | tolerated mutation 0.06 |
benign 0.421 |
SNAP | not disease causing mutation analysis: 99% red. activity |
K213E | no effect | no effect 26 |
little to no effect | little to no effect | SIFT | benign | SNAP | Canavan disease |
V278M | Location and Structure | no effect | little to no effect | possible effect | SIFT | Polyphen | SNAP | annotated effect |
M82T | Location and Structure | Phys/Chem Properties | Substitution Matrices | PSSM | SIFT | Polyphen | SNAP | annotated effect |
E235K | Location and Structure | Phys/Chem Properties | Substitution Matrices | PSSM | SIFT | Polyphen | SNAP | Canavan Disease |
I270T | Location and Structure | Phys/Chem Properties | Substitution Matrices | PSSM | SIFT | Polyphen | SNAP | annotated effect |
</figtable>
References
<references/>
Henikoff, S. & Henikoff, J.G. (1992) "Amino acid substitution matrices from protein blocks." Proc. Natl. Acad. Sci. USA 89:10915-10919.