Difference between revisions of "Sequence-based mutation analysis TSD"

From Bioinformatikpedia
(Chemical properties)
(Chemical properties)
Line 17: Line 17:
 
== Chemical properties ==
 
== Chemical properties ==
   
  +
The biochemical properties of the wildtype and mutant amino acids of the chosen SNPs are listed in <xr id="tab:biochem"/>. Shown are the hydrophobicity as the hydropathy index with its category, the volume with its size class, the charge and the Grantham score.<br>
  +
The Grantham score estimates how disruptive a substitution between two amino acids is from their chemical properties, including polarity and molecular volume. It sorts codon replacements into classes of increasing chemical dissimilarity and ranges from 5 to 215<ref name="grantham">Grantham R. Amino acid difference formula to help explain protein evolution. Science 1974; 185: 862-864</ref>.
   
  +
<figtable id="tab:biochem">
The simplest approach is to look at the differences in the WT (wild-type) and mutant amino acids. Please write for each of the 10 mutations a short summary about the physicochemical properties and changes.
 
 
<figtable id="tab:gopetgo">
 
 
{| class="wikitable", style="width:950px; border-collapse: collapse; border-style: solid; border-width:0px; border-color: #000"
 
{| class="wikitable", style="width:950px; border-collapse: collapse; border-style: solid; border-width:0px; border-color: #000"
  +
|+ Table 1: Biochemical properties
 
|- align="center"
 
|- align="center"
 
! style="border-style: solid; border-width: 0 0 1px 0" | Mutation
 
! style="border-style: solid; border-width: 0 0 1px 0" | Mutation
 
! style="border-style: solid; border-width: 0 0 1px 0" colspan="4" | Wildtype
 
! style="border-style: solid; border-width: 0 0 1px 0" colspan="4" | Wildtype
 
! style="border-style: solid; border-width: 0 0 1px 0" colspan="4" |Mutant
 
! style="border-style: solid; border-width: 0 0 1px 0" colspan="4" |Mutant
  +
! style="border-style: solid; border-width: 0 0 1px 0" | Grantham score
 
|- align="center"
 
|- align="center"
 
! style="border-style: solid; border-width: 0 0 2px 0" |
 
! style="border-style: solid; border-width: 0 0 2px 0" |
Line 36: Line 38:
 
! style="border-style: solid; border-width: 0 0 2px 0" |Charge
 
! style="border-style: solid; border-width: 0 0 2px 0" |Charge
 
! style="border-style: solid; border-width: 0 0 2px 0" |Conservation
 
! style="border-style: solid; border-width: 0 0 2px 0" |Conservation
  +
! style="border-style: solid; border-width: 0 0 2px 0" |
 
|- align="center"
 
|- align="center"
 
| style="border-style: solid; border-width: 0 0 0 0" | M1V
 
| style="border-style: solid; border-width: 0 0 0 0" | M1V
Line 46: Line 49:
 
| style="border-style: solid; border-width: 0 0 0 0" | neutral
 
| style="border-style: solid; border-width: 0 0 0 0" | neutral
 
| style="border-style: solid; border-width: 0 0 0 0" | 0
 
| style="border-style: solid; border-width: 0 0 0 0" | 0
  +
| style="border-style: solid; border-width: 0 0 0 0" | 21
 
|-align="center"
 
|-align="center"
 
| style="border-style: solid; border-width: 0 0 0 0" | L39R
 
| style="border-style: solid; border-width: 0 0 0 0" | L39R
Line 56: Line 60:
 
| style="border-style: solid; border-width: 0 0 0 0" | positive
 
| style="border-style: solid; border-width: 0 0 0 0" | positive
 
| style="border-style: solid; border-width: 0 0 0 0" | 1
 
| style="border-style: solid; border-width: 0 0 0 0" | 1
  +
| style="border-style: solid; border-width: 0 0 0 0" | 102
  +
|-align="center"
  +
| style="border-style: solid; border-width: 0 0 0 0" | C58Y
  +
| style="border-style: solid; border-width: 0 0 0 0" | 2.5 (polar)
  +
| style="border-style: solid; border-width: 0 0 0 0" | 108.5 (small)
  +
| style="border-style: solid; border-width: 0 0 0 0" | neutral
  +
| style="border-style: solid; border-width: 0 0 0 0" | -
  +
| style="border-style: solid; border-width: 0 0 0 0" | -1.3 (polar)
  +
| style="border-style: solid; border-width: 0 0 0 0" | 193.6 (bulky)
  +
| style="border-style: solid; border-width: 0 0 0 0" | neutral
  +
| style="border-style: solid; border-width: 0 0 0 0" | -
  +
| style="border-style: solid; border-width: 0 0 0 0" | 194
 
|-align="center"
 
|-align="center"
 
| style="border-style: solid; border-width: 0 0 0 0" | L127R
 
| style="border-style: solid; border-width: 0 0 0 0" | L127R
Line 66: Line 82:
 
| style="border-style: solid; border-width: 0 0 0 0" | positive
 
| style="border-style: solid; border-width: 0 0 0 0" | positive
 
| style="border-style: solid; border-width: 0 0 0 0" | 0
 
| style="border-style: solid; border-width: 0 0 0 0" | 0
  +
| style="border-style: solid; border-width: 0 0 0 0" | 102
 
|-align="center"
 
|-align="center"
 
| style="border-style: solid; border-width: 0 0 0 0" | R170W
 
| style="border-style: solid; border-width: 0 0 0 0" | R170W
Line 76: Line 93:
 
| style="border-style: solid; border-width: 0 0 0 0" | neutral
 
| style="border-style: solid; border-width: 0 0 0 0" | neutral
 
| style="border-style: solid; border-width: 0 0 0 0" | 0
 
| style="border-style: solid; border-width: 0 0 0 0" | 0
  +
| style="border-style: solid; border-width: 0 0 0 0" | 101
 
|-align="center"
 
|-align="center"
 
| style="border-style: solid; border-width: 0 0 0 0" | R178H
 
| style="border-style: solid; border-width: 0 0 0 0" | R178H
Line 86: Line 104:
 
| style="border-style: solid; border-width: 0 0 0 0" | neutral
 
| style="border-style: solid; border-width: 0 0 0 0" | neutral
 
| style="border-style: solid; border-width: 0 0 0 0" | 0
 
| style="border-style: solid; border-width: 0 0 0 0" | 0
  +
| style="border-style: solid; border-width: 0 0 0 0" | 29
 
|-align="center"
 
|-align="center"
 
| style="border-style: solid; border-width: 0 0 0 0" | S210F
 
| style="border-style: solid; border-width: 0 0 0 0" | S210F
Line 96: Line 115:
 
| style="border-style: solid; border-width: 0 0 0 0" | neutral
 
| style="border-style: solid; border-width: 0 0 0 0" | neutral
 
| style="border-style: solid; border-width: 0 0 0 0" | 1
 
| style="border-style: solid; border-width: 0 0 0 0" | 1
  +
| style="border-style: solid; border-width: 0 0 0 0" | 155
 
|-align="center"
 
|-align="center"
 
| style="border-style: solid; border-width: 0 0 0 0" | D258H
 
| style="border-style: solid; border-width: 0 0 0 0" | D258H
Line 106: Line 126:
 
| style="border-style: solid; border-width: 0 0 0 0" | neutral
 
| style="border-style: solid; border-width: 0 0 0 0" | neutral
 
| style="border-style: solid; border-width: 0 0 0 0" | 4
 
| style="border-style: solid; border-width: 0 0 0 0" | 4
  +
| style="border-style: solid; border-width: 0 0 0 0" | 81
 
|-align="center"
 
|-align="center"
 
| style="border-style: solid; border-width: 0 0 0 0" | L451V
 
| style="border-style: solid; border-width: 0 0 0 0" | L451V
Line 116: Line 137:
 
| style="border-style: solid; border-width: 0 0 0 0" | neutral
 
| style="border-style: solid; border-width: 0 0 0 0" | neutral
 
| style="border-style: solid; border-width: 0 0 0 0" | 2
 
| style="border-style: solid; border-width: 0 0 0 0" | 2
  +
| style="border-style: solid; border-width: 0 0 0 0" | 32
 
|-align="center"
 
|-align="center"
 
| style="border-style: solid; border-width: 0 0 0 0" | E482K
 
| style="border-style: solid; border-width: 0 0 0 0" | E482K
Line 126: Line 148:
 
| style="border-style: solid; border-width: 0 0 0 0" | positive
 
| style="border-style: solid; border-width: 0 0 0 0" | positive
 
| style="border-style: solid; border-width: 0 0 0 0" | 0
 
| style="border-style: solid; border-width: 0 0 0 0" | 0
  +
| style="border-style: solid; border-width: 0 0 0 0" | 56
 
|-align="center"
 
|-align="center"
| style="border-style: solid; border-width: 0 0 0 0" | C58Y
 
| style="border-style: solid; border-width: 0 0 0 0" | 2.5 (polar)
 
| style="border-style: solid; border-width: 0 0 0 0" | 108.5 (small)
 
| style="border-style: solid; border-width: 0 0 0 0" | neutral
 
| style="border-style: solid; border-width: 0 0 0 0" | -
 
| style="border-style: solid; border-width: 0 0 0 0" | -1.3 (polar)
 
| style="border-style: solid; border-width: 0 0 0 0" | 193.6 (bulky)
 
| style="border-style: solid; border-width: 0 0 0 0" | neutral
 
| style="border-style: solid; border-width: 0 0 0 0" | -
 
|-
 
 
|}
 
|}
 
</figtable>
 
</figtable>
 
   
 
== Structural observations ==
 
== Structural observations ==

Revision as of 12:34, 13 June 2012

There was only one catch and that was Catch-22, which specified that a concern for one's own safety in the face of dangers that were real and immediate was the process of a rational mind. Orr was crazy and could be grounded. All he had to do was ask; and as soon as he did, he would no longer be crazy and would have to fly more missions. Orr would be crazy to fly more missions and sane if he didn't, but if he was sane, he had to fly them. If he flew them, he was crazy and didn't have to; but if he didn't want to, he was sane and had to. Yossarian was moved very deeply by the absolute simplicity of this clause of Catch-22 and let out a respectful whistle.

"That's some catch, that Catch-22," he observed.

"It's the best there is," Doc Daneeka agreed.

-Catch 22

The journal for this task can be found here.

Mutations

Dataset

The following SNPs, selected by an unbiased source, will be analysed: M1V, L39R, C58Y, L127R, R170W, R178H, S210F, D258H, L451V and E482K.


   Pick 10 mutations (SNPs) from your dataset: some from the HGMD (missense mutations) and some found only in dbSNP (a change in the amino acid sequence that is not listed in the HGMD). Shuffle them and PLEASE do not try to memorize whether they cause the disease! The goal is to pretend that we do NOT know what is going on. It would be great if the most common disease-causing mutations were included, too.

Chemical properties

The biochemical properties of the wildtype and mutant amino acids of the chosen SNPs are listed in <xr id="tab:biochem"/>. Shown are the hydrophobicity as the hydropathy index with its category, the volume with its size class, the charge and the Grantham score.
The Grantham score estimates how disruptive a substitution between two amino acids is from their chemical properties, including polarity and molecular volume. It sorts codon replacements into classes of increasing chemical dissimilarity and ranges from 5 to 215<ref name="grantham">Grantham R. Amino acid difference formula to help explain protein evolution. Science 1974; 185: 862-864</ref>.

<figtable id="tab:biochem">

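{| class="wikitable"
|+ Table 1: Biochemical properties
|- align="center"
! rowspan="2" | Mutation
! colspan="4" | Wildtype
! colspan="4" | Mutant
! rowspan="2" | Grantham score
|- align="center"
! Hydrophobicity !! Volume !! Charge !! Conservation !! Hydrophobicity !! Volume !! Charge !! Conservation
|- align="center"
| M1V || 1.9 (nonpolar) || 162.9 (bulky) || neutral || 66 || 4.2 (nonpolar) || 140.0 (small) || neutral || 0 || 21
|- align="center"
| L39R || 3.8 (nonpolar) || 166.7 (bulky) || neutral || 41 || -4.5 (polar) || 173.4 (bulky) || positive || 1 || 102
|- align="center"
| C58Y || 2.5 (polar) || 108.5 (small) || neutral || - || -1.3 (polar) || 193.6 (bulky) || neutral || - || 194
|- align="center"
| L127R || 3.8 (nonpolar) || 166.7 (bulky) || neutral || 25 || -4.5 (polar) || 173.4 (bulky) || positive || 0 || 102
|- align="center"
| R170W || -4.5 (polar) || 173.4 (bulky) || positive || 25 || -0.9 (nonpolar) || 227.8 (bulky) || neutral || 0 || 101
|- align="center"
| R178H || -4.5 (polar) || 173.4 (bulky) || positive || 52 || -3.2 (polar) || 153.2 (bulky) || neutral || 0 || 29
|- align="center"
| S210F || -0.8 (polar) || 89.0 (tiny) || neutral || 19 || 2.8 (nonpolar) || 189.9 (bulky) || neutral || 1 || 155
|- align="center"
| D258H || -3.5 (polar) || 111.1 (small) || negative || 40 || -3.2 (polar) || 153.2 (bulky) || neutral || 4 || 81
|- align="center"
| L451V || 3.8 (nonpolar) || 166.7 (bulky) || neutral || 12 || 4.2 (nonpolar) || 140.0 (small) || neutral || 2 || 32
|- align="center"
| E482K || -3.5 (polar) || 138.4 (bulky) || negative || 52 || -3.9 (polar) || 168.6 (bulky) || positive || 0 || 56
|}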

</figtable>
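
The per-mutation summary of physicochemical changes can be derived mechanically from the values in Table 1. The following is a minimal sketch, not the method used to build the table: the class thresholds in `grantham_class` (50, 100, 150) are an assumption based on a commonly cited convention and are not stated on this page, and the names `Row`, `TABLE`, `grantham_class` and `summarise` are made up for illustration.

```python
from typing import NamedTuple


class Row(NamedTuple):
    mutation: str
    wt_hydropathy: float
    wt_volume: float
    wt_charge: str
    mut_hydropathy: float
    mut_volume: float
    mut_charge: str
    grantham: int


# Values copied from Table 1; the conservation columns are left out here.
TABLE = [
    Row("M1V", 1.9, 162.9, "neutral", 4.2, 140.0, "neutral", 21),
    Row("L39R", 3.8, 166.7, "neutral", -4.5, 173.4, "positive", 102),
    Row("C58Y", 2.5, 108.5, "neutral", -1.3, 193.6, "neutral", 194),
    Row("L127R", 3.8, 166.7, "neutral", -4.5, 173.4, "positive", 102),
    Row("R170W", -4.5, 173.4, "positive", -0.9, 227.8, "neutral", 101),
    Row("R178H", -4.5, 173.4, "positive", -3.2, 153.2, "neutral", 29),
    Row("S210F", -0.8, 89.0, "neutral", 2.8, 189.9, "neutral", 155),
    Row("D258H", -3.5, 111.1, "negative", -3.2, 153.2, "neutral", 81),
    Row("L451V", 3.8, 166.7, "neutral", 4.2, 140.0, "neutral", 32),
    Row("E482K", -3.5, 138.4, "negative", -3.9, 168.6, "positive", 56),
]


def grantham_class(score: int) -> str:
    """Bin a Grantham score; the cut-offs are an assumed convention."""
    if score <= 50:
        return "conservative"
    if score <= 100:
        return "moderately conservative"
    if score <= 150:
        return "moderately radical"
    return "radical"


def summarise(row: Row) -> dict:
    """Return the mutant-minus-wildtype differences for one table row."""
    return {
        "mutation": row.mutation,
        "d_hydropathy": round(row.mut_hydropathy - row.wt_hydropathy, 1),
        "d_volume": round(row.mut_volume - row.wt_volume, 1),
        "charge_change": row.wt_charge != row.mut_charge,
        "class": grantham_class(row.grantham),
    }


if __name__ == "__main__":
    # List the substitutions from most to least chemically dissimilar.
    for row in sorted(TABLE, key=lambda r: r.grantham, reverse=True):
        print(summarise(row))
```

Sorting by Grantham score puts C58Y first, which matches the large volume increase visible in the table.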

Structural observations

   Now take into consideration where in the protein the mutation occurs and document it: create a picture with PyMOL showing the original and the mutated residue in the protein. More thorough structural analyses will be introduced in the next task.
   Using your secondary structure predictions from the previous tasks, investigate whether the mutations lie inside secondary structure elements (helix, strand) or not.
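
Checking whether a mutated position lies inside a helix or strand can be scripted once the prediction is available as a per-residue string. The sketch below assumes such a string with one letter per residue (`H` helix, `E` strand, `C` coil). The example string and the mutation names in it are hypothetical toy input, not the prediction for this protein.

```python
import re


def parse_mutation(mutation: str) -> tuple:
    """Split a mutation such as 'L39R' into (wildtype, position, mutant)."""
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", mutation)
    if match is None:
        raise ValueError(f"not a valid mutation string: {mutation!r}")
    return match.group(1), int(match.group(2)), match.group(3)


def element_at(ss: str, position: int) -> tuple:
    """Return (state, start, end) of the segment containing a 1-based position."""
    if not 1 <= position <= len(ss):
        raise IndexError(f"position {position} is outside the prediction")
    idx = position - 1
    state = ss[idx]
    start = idx
    while start > 0 and ss[start - 1] == state:
        start -= 1
    end = idx
    while end < len(ss) - 1 and ss[end + 1] == state:
        end += 1
    return state, start + 1, end + 1


def locate(mutations: list, ss: str) -> dict:
    """Map each mutation to the secondary structure segment it falls into."""
    return {m: element_at(ss, parse_mutation(m)[1]) for m in mutations}


if __name__ == "__main__":
    toy_ss = "CCCHHHHHHCCEEEECC"  # hypothetical 17-residue prediction
    for name, segment in locate(["A5G", "K12R"], toy_ss).items():
        print(name, segment)
```

Returning the segment boundaries as well as the state makes it easy to see whether a mutation sits in the middle of an element or at its edge.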

Substitution matrices

   Look at the BLOSUM62 and PAM (1/250) matrices. What are the scores for the amino acid substitutions? Is each one the worst possible substitution or not? Can we say anything about the phenotype from this?
   Getting a bit closer to evolution, create a PSSM (position-specific scoring matrix) for your protein sequence using PSI-BLAST (5 iterations). How conserved are the WT residues at your mutant positions? What is the frequency of occurrence (conservation) of the mutant residue type? Anything interesting?
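
The question of whether a substitution is the worst possible one reduces to comparing its score with the rest of the wildtype residue's row in the matrix. The sketch below does this for leucine only. The row `BLOSUM62_L` was transcribed by hand from the standard BLOSUM62 table and should be checked against the matrix file you actually use; `assess` is an illustrative helper name.

```python
# Leucine row of BLOSUM62, transcribed by hand (verify before relying on it).
BLOSUM62_L = {
    "A": -1, "R": -2, "N": -3, "D": -4, "C": -1,
    "Q": -2, "E": -3, "G": -4, "H": -3, "I": 2,
    "L": 4, "K": -2, "M": 2, "F": 0, "P": -3,
    "S": -2, "T": -1, "W": -2, "Y": -1, "V": 1,
}


def assess(row: dict, wildtype: str, mutant: str) -> dict:
    """Compare one substitution with all other substitutions of the wildtype."""
    # Ignore the identity score: it is not a substitution.
    others = {aa: score for aa, score in row.items() if aa != wildtype}
    score = others[mutant]
    worst = min(others.values())
    return {
        "score": score,
        "worst": worst,
        "is_worst": score == worst,
        # Number of alternative residues that score strictly higher.
        "n_better": sum(1 for s in others.values() if s > score),
    }


if __name__ == "__main__":
    for mutant in ("R", "V"):
        print("L ->", mutant, assess(BLOSUM62_L, "L", mutant))
```

For the leucine positions in the dataset this shows that L to R scores below zero without being the lowest entry in the row, while L to V is among the most favourable replacements.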

Multiple sequence alignments

   And another step closer to evolution: identify all mammalian homologous sequences. Create a multiple sequence alignment for them with a method of your choice. Using this you can now calculate conservation for WT and mutant residues again. Compare this to the matrix- and PSSM-derived results.
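
Conservation of the WT and mutant residue can be read off an alignment column once the sequence position has been mapped to that column. The sketch below assumes the alignment is already loaded as a list of equal-length gapped strings with the query protein in the first row. The toy alignment and its mutation are hypothetical and only show the mechanics.

```python
from collections import Counter


def column_for_position(reference_row: str, position: int) -> int:
    """Map a 1-based ungapped sequence position to a 0-based alignment column."""
    seen = 0
    for column, residue in enumerate(reference_row):
        if residue != "-":
            seen += 1
            if seen == position:
                return column
    raise IndexError(f"position {position} is beyond the reference sequence")


def residue_frequencies(alignment: list, column: int) -> dict:
    """Relative residue frequencies in one column, ignoring gaps."""
    residues = [row[column] for row in alignment if row[column] != "-"]
    if not residues:
        return {}
    counts = Counter(residues)
    return {aa: n / len(residues) for aa, n in counts.items()}


def conservation(alignment: list, position: int, wildtype: str, mutant: str) -> tuple:
    """Return (WT frequency, mutant frequency) at a position of the first row."""
    reference = alignment[0]
    column = column_for_position(reference, position)
    if reference[column] != wildtype:
        raise ValueError("wildtype residue does not match the reference sequence")
    freqs = residue_frequencies(alignment, column)
    return freqs.get(wildtype, 0.0), freqs.get(mutant, 0.0)


if __name__ == "__main__":
    toy_alignment = [  # hypothetical data, first row is the query
        "ML-KR",
        "MLAKR",
        "MV-KH",
        "ML-RR",
    ]
    print(conservation(toy_alignment, 2, "L", "V"))
```

The check against the reference residue catches off-by-one errors in the position mapping, which are easy to make when the query row contains gaps.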

Prediction

SIFT

PolyPhen2

SNAP

   Finally, we use three different approaches to score our mutants.
       SIFT
       PolyPhen2
       SNAP is installed on the student cluster and should be used command-line only. You will need to create your own ~/.snapfunrc (unless Tim changes the default one) to point to the correct paths. -- As BLAST is the bottleneck of SNAP, and you are running it anyway, we might as well look at all possible substitutions at the positions of our mutations. This way we can learn much more about the nature of a given mutation: is it problematic because it introduces an unwanted effect, or because the WT residue is essential and mutating it removes that residue?
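
Listing all possible substitutions at the ten mutated positions is simple to script. The sketch below only builds mutation strings in the `L39R` notation used on this page. The input format SNAP expects is not described here, so writing the strings to a file in that format is left open, and `all_substitutions` is an illustrative name.

```python
import re

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# The ten SNPs analysed on this page.
MUTATIONS = ["M1V", "L39R", "C58Y", "L127R", "R170W",
             "R178H", "S210F", "D258H", "L451V", "E482K"]


def all_substitutions(mutation: str) -> list:
    """Return the 19 possible substitutions at the position of a mutation."""
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", mutation)
    if match is None:
        raise ValueError(f"not a valid mutation string: {mutation!r}")
    wildtype, position = match.group(1), match.group(2)
    return [f"{wildtype}{position}{aa}" for aa in AMINO_ACIDS if aa != wildtype]


if __name__ == "__main__":
    for mutation in MUTATIONS:
        print(" ".join(all_substitutions(mutation)))
```

With ten distinct positions this gives 190 substitutions in total, including the ten observed ones.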

Consensus

   Compare ALL results and create an overview table.
   Try to come up with a consensus between all the findings requested above.
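
One simple way to form a consensus is a majority vote over the individual calls. The sketch below assumes each method's output has already been reduced to one of two labels, `effect` or `neutral`. That reduction, the label names and the placeholder entries in the example are assumptions for illustration and do not represent results for this protein.

```python
from collections import Counter


def consensus(calls: dict) -> str:
    """Majority vote over per-method calls; ties are reported as 'undecided'."""
    if not calls:
        raise ValueError("no predictions given")
    ranked = Counter(calls.values()).most_common()
    if len(ranked) > 1 and ranked[0][1] == ranked[1][1]:
        return "undecided"
    return ranked[0][0]


if __name__ == "__main__":
    # Placeholder input, not real predictions.
    example = {
        "mutation_a": {"SIFT": "effect", "PolyPhen2": "effect", "SNAP": "neutral"},
        "mutation_b": {"SIFT": "neutral", "PolyPhen2": "neutral", "SNAP": "neutral"},
    }
    for name, calls in example.items():
        print(name, consensus(calls))
```

A plain vote weights all methods equally. If the matrix, PSSM and alignment findings are added as further voters, an explicit `undecided` outcome keeps ties visible instead of hiding them.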

Evaluation

   Check whether you are right in the HGMD – were you able to predict a change? 

For this task it is very important to us that you properly interpret and discuss your results. The production of the data should not take that long – so you have more time to do real science!