Difference between revisions of "Molecular Dynamics Simulations Analysis Hemochromatosis"
(→Protein based) |
(→C-Alpha based) |
||
Line 311: | Line 311: | ||
|} |
|} |
||
</figtable> |
</figtable> |
||
+ | |||
+ | In general the RMS fluctuations of all three models look similar. The most differing graph is the one of R224W which shows a major peak at residues 220-235 as well as only a small peat at around residue 20. |
||
<br style="clear:both;"> |
<br style="clear:both;"> |
||
Revision as of 12:32, 31 August 2012
Hemochromatosis>>Task 10: Molecular dynamics simulations analysis
Contents
- 1 Short task description
- 2 Protocol
- 3 Dummy
- 3.1 Calculation statistics
- 3.2 Energies
- 3.3 Minimum distance between periodic boundary cells
- 3.4 RMSF for protein and C-alpha
- 3.5 Pymol analysis of average and bfactor
- 3.6 Radius of gyration
- 3.7 solvent accessible surface area
- 3.8 hydrogen-bonds between protein and protein / protein and water
- 3.9 Ramachandran plots
- 3.10 RMSD matrix
- 3.11 cluster analysis
- 3.12 internal RMSD
- 4 References
Short task description
Detailed description: Molecular dynamics simulations analysis
Protocol
A protocol with a description of the data acquisition and other scripts used for this task is available here.
Dummy
Note: All pictures/graphs shown here are from the first run (in case of 1a6zC[wildtype]-pictures) or second run (in case of R224W- or C282S-mutation). The reason for this is depicted under LINKTOMINDISTTODO.
Calculation statistics
<figtable id="tab:simulation_stats"> Statistics of the MD simulations
Input | Calc. time | Calc. speed | time to reach 1 s |
---|---|---|---|
Wildtype | 13h31:15 | 17.750 ns/day | 154350,8 years |
C282S | 13h35:05 | 17.667 ns/day | 155075,9 years |
R224W | 13h35:02 | 17.668 ns/day | 155067,1 years |
</figtable>
GMXcheck revealed for all calculations that all 2001 frames were calculated, resulting in a 10ns model.
Energies
Pressure
<figtable id="tab:pressure">
</figtable>
The plots in <xr id="tab:pressure"/> show the pressures of the calculated systems over time. These show that, although the pressures differ greatly in some cases, the average is still at about 0 (with minor fluctuations).
Temperature
<figtable id="tab:temperature">
</figtable>
The next thing we calculated were the temperatures. For all three models they can be seen in <xr id="tab:temperature"/>. The maximal deviation from the average is about 4 degrees for all models.
Potential
<figtable id="tab:potential">
</figtable>
With gromacs we could also extract the potentials, as can be seen in <xr id="tab:potential"/>. As in the plots before the average fluctuates around the same value for all three models themselves. However, these points differ slightly:
The average potential of the wildtype and the C282S mutation tend to be around the same (~-9.195e+05) whereas the R224W mutation potential is slightly higher (~-9.19e+05).
Total energy
<figtable id="tab:total_energy">
</figtable>
In <xr id="tab:total_energy"/> the values of the total energies are denoted over the different states in time. Again we get an average with minor fluctuation at around the same value for each model.
All these plots show the same behavior with one exception: average around the same value and look different between the three models. This can be expected as minor changes can introduce or eradicate bindings, therefore changing the overall energies which then influence all further steps. The exception is the potential of the R224W mutation which is slightly higher than the other two models' potentials.
Minimum distance between periodic boundary cells
<figtable id="comparison">
</figtable>
The first calculations for the mutations resulted in the minimum distances of TABLETODO. As there should be at least 2 nm distance in between at all time one can see that the mutations show the opposite. Therefore it might be possible that the protein affects itself which is not desired. To see if this states were calculated just by chance (random fluctuations that built up over time into an undesired direction) we repeated the calculations for all three models.
<figtable id="comparison">
</figtable>
The resulting minimal distances can be seen in TABLETODO. We therefore decided to use the model of the first calculation for the wildtype, and the models of the second calculation for the mutation types.
RMSF for protein and C-alpha
Protein based
<figtable id="tab:rmsf_prot">
</figtable>
In general the RMS fluctuations of all three models look similar. The most differing graph is the one of R224W which shows a major peak at residues 220-235 as well as only a small peat at around residue 20.
C-Alpha based
<figtable id="tab:rmsf_ca">
</figtable>
In general the RMS fluctuations of all three models look similar. The most differing graph is the one of R224W which shows a major peak at residues 220-235 as well as only a small peat at around residue 20.
Statistical values
With the GIVENSCRIPTTODO we calculated the values for the t-Test:
1a6zC to R224W : 8.079405e-57**
1a6zC to C282S : 2.87453e-56**
R224W to C282S : 5.45069e-25**
Pymol analysis of average and bfactor
<figtable id="tab:avg_bfactor">
</figtable>
In these part we evaluate the model averages and b-factors of each position. Because it is an average over all timesteps this averaged structure can be impossible in nature.
As one can see there is a big change of b-factors when comparing the wildtype and both mutations we calculated. From both mutations the R224W one shows a bigger difference to the wildtype. As expected the positions "at the edges" and those not in a secondary structure tend to have higher fluctuations/higher b-factors. It is worth noting, that even the beta sheets on the right side of the R224W picture (position 180 and higher) have (compared to wildtype and C282S mutation) pretty high b-factors. Also one can see that a little helix is inserted into the average structure of the R224W average model (right part of the picture, high b-factor [red], position ~220-225).
Radius of gyration
<figtable id="tab:gyration_ca">
</figtable>
<figtable id="tab:gyration_prot">
</figtable>
solvent accessible surface area
<figtable id="tab:sas">
</figtable>
<figtable id="tab:sas_res">
</figtable>
hydrogen-bonds between protein and protein / protein and water
Protein-Protein
<figtable id="tab:hbonds_pp">
</figtable>
The calculations show that in each of the three cases the number of bonds within the protein as well as their distances tend to stay the same.
Protein-Water
<figtable id="tab:hbonds_pw">
</figtable>
Although the inner-protein bonds seem to stay the same (see PREVIOUSSECTIONTODO), the bonds formed with hydrogen show a different behavior.
In case of the wildtype, both the number of hydrogen bonds as well as the number of pairs within 0.35nm are (compared to the rest) fairly low at first, rising in the first ~600ps. This may be an indication that at first a dense protein state is existent. Also from ~8000-8600ps there is a drop in the number of hydrogen bonds, whereas the number of pairs within 0.35nm does not show an equal behavior. Overall the numbers tend to be at the same level each after the first 600ps rise.
The R224W model shows a different behavior over time:
The first steps show a similar "both numbers low and rising till 600ps" behavior. But instead of having a fluctuation around one constant value there seems to be a slight decrease over time as well as bigger fluctuations. This affects both, the number of bonds as well as pairs within 0.35nm.
The calculated model of the C282S mutation shows a different behavior at start than both preceding described, but a similar behavior to the R224W mutation:
For the number of hydrogen bonds there are very high fluctuations at the first 1000ps (rising till 100ps, drop to 300ps, rise till 1000ps) with a slight overall decrease afterwards like for the R224W mutation. The number of pairs within 0.35nm seem to show a similar behavior, however because of the fluctuations this is not very clear.
Ramachandran plots
<figtable id="tab:ramachandran">
</figtable>
RMSD matrix
<figtable id="tab:rmsd_matrix_prot">
</figtable>
<figtable id="tab:rmsd_matrix_mcb">
</figtable>
cluster analysis
whole protein based
<figtable id="tab:cluster_size_prot">
</figtable>
C-alpha based
<figtable id="cluster_size_ca">
</figtable>
internal RMSD
against starting structure
<figtable id="tab:rmsd_vs_start">
</figtable>
against average structure
<figtable id="tab:rmsd_vs_avg">
</figtable>
References
<references/>