Difference between revisions of "Molecular Dynamics Simulations Analysis (PKU)"

From Bioinformatikpedia

Revision as of 17:24, 12 July 2012


Short Introduction

We analyze our completed molecular dynamics simulations, following the task description and the tutorial of the Utrecht University Molecular Modeling Practical. We have completed one run each for the wildtype protein and for the mutations ALA322GLY and ARG408TRP; a second run of the wildtype is pending. This second run might be necessary because the trajectory of the wildtype differs significantly from those of both mutants. The commands used to generate the plots, images, etc. can be found in our journal.

Initial Checks

All three simulations ran for the intended 10 ns; each trajectory contains 2000 frames saved at 5 ps intervals. The wildtype simulation took considerably longer because we used only 16 cores for the wildtype but 32 for the mutants. Almost half of the calculation time, 44.2% in each run, is spent on calculating the Coulomb interactions and the Lennard-Jones potential of the solvent molecules. A few key statistics can be found in <xr id="tab:simulation_stats"/>.

<figtable id="tab:simulation_stats"> Statistics of the MD simulations

{| class="wikitable"
! Simulation !! Wall-clock time (h:min) !! Simulation speed (ns/day) !! Time to reach 1 s (years)
|-
| Wildtype || 11:32 || 20.8 || 131,621
|-
| ALA322GLY || 4:20 || 55.3 || 49,543
|-
| ARG408TRP || 4:26 || 54.1 || 50,685
|}

</figtable>
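
The extrapolation in the last column of the table can be reproduced from the simulation speed alone. The following is a minimal sketch, assuming a year of 365 days; the function names are hypothetical and not part of any tool we used. Because the speeds in the table are rounded to one decimal, the results can differ slightly from the tabulated values.

```python
def wall_clock_hours(sim_length_ns, speed_ns_per_day):
    """Wall-clock hours needed to simulate sim_length_ns at the given speed."""
    return 24.0 * sim_length_ns / speed_ns_per_day


def years_to_simulate(target_seconds, speed_ns_per_day, days_per_year=365.0):
    """Wall-clock years needed to simulate target_seconds of physical time.

    days_per_year=365 is an assumption made for this sketch.
    """
    target_ns = target_seconds * 1e9          # 1 s = 1e9 ns
    days = target_ns / speed_ns_per_day
    return days / days_per_year


# Speeds taken from the statistics table (ns/day).
for name, speed in [("Wildtype", 20.8), ("ALA322GLY", 55.3), ("ARG408TRP", 54.1)]:
    hours = wall_clock_hours(10, speed)
    years = years_to_simulate(1.0, speed)
    print(f"{name}: {hours:.2f} h for 10 ns, {years:,.0f} years for 1 s")
```

The numbers illustrate why simulations on the time scale of seconds are out of reach with this setup.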

Wildtype analysis

<figure id="fig:1J8U_overlay">

Overlay of all frames of the 10 ns simulation of the wildtype phenylalanine hydroxylase structure 1J8U.

</figure>

Quality Assurance

Convergence of Energy Terms

<figure id="fig:1J8U_temperature">

Plot of the system temperature during the 10 ns simulation of the wildtype phenylalanine hydroxylase structure 1J8U. A running average in a window of length 100 ps is indicated in red.

</figure>

<figure id="fig:1J8U_pressure">

Plot of the system pressure during the 10 ns simulation of the wildtype phenylalanine hydroxylase structure 1J8U. A running average in a window of length 100 ps is indicated in red.

</figure>

<figure id="fig:1J8U_volume">

Plot of the system volume during the 10 ns simulation of the wildtype phenylalanine hydroxylase structure 1J8U. A running average in a window of length 100 ps is indicated in red.

</figure>

<figure id="fig:1J8U_density">

Plot of the system density during the 10 ns simulation of the wildtype phenylalanine hydroxylase structure 1J8U. A running average in a window of length 100 ps is indicated in red.

</figure>

<figure id="fig:1J8U_temperature">

Plot of the system extension in three dimensions during the 10 ns simulation of the wildtype phenylalanine hydroxylase structure 1J8U. The X and Y dimensions overlap and cannot be distinguished in the plot.

</figure>

<figure id="fig:1J8U_coulomb">

Plot of the Coulomb interactions during the 10 ns simulation of the wildtype phenylalanine hydroxylase structure 1J8U.

</figure>

<figure id="fig:1J8Uvdw">

Plot of the van-der-Waals interactions during the 10 ns simulation of the wildtype phenylalanine hydroxylase structure 1J8U.

</figure>

  • What is the average temperature and what is the heat capacity of the system? ( T )
  • What are the terms plotted in the files energy.xvg and box.xvg?
  • Estimate the plateau values for the pressure, the volume and the density. ( T )
  • What are the terms plotted in the files coulomb-inter.xvg and vanderwaals-inter.xvg?
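
The running averages shown in red in the energy plots can be sketched as follows. This is an illustration only, not the procedure we actually used: it assumes the .xvg files are plain text in which lines starting with `#` or `@` are comments or plot directives, and the function names and the sample data are made up for the example.

```python
def parse_xvg(text):
    """Return the numeric rows of XVG text, skipping '#' and '@' lines."""
    rows = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line[0] in "#@":
            continue
        rows.append([float(field) for field in line.split()])
    return rows


def running_average(times, values, window_ps):
    """Centered running average over a time window of window_ps.

    times must be sorted in ascending order. Near the start and end of the
    series the window is truncated to the samples that exist.
    """
    half = window_ps / 2.0
    averaged = []
    lo = hi = 0
    total = 0.0
    for t in times:
        # Grow the window on the right up to t + half.
        while hi < len(times) and times[hi] <= t + half:
            total += values[hi]
            hi += 1
        # Shrink the window on the left below t - half.
        while times[lo] < t - half:
            total -= values[lo]
            lo += 1
        averaged.append(total / (hi - lo))
    return averaged


# Made-up sample: time (ps) and temperature (K).
sample = """# hypothetical energy output
@ title "Temperature"
0 297.0
50 299.0
100 298.0
150 300.0
"""
rows = parse_xvg(sample)
times = [row[0] for row in rows]
temps = [row[1] for row in rows]
print(running_average(times, temps, window_ps=100))
```

A plateau value can then be read off the smoothed curve once it stops drifting.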


Minimum Distance Between Periodic Images

<figure id="fig:1J8U_mindist_c_alpha">

Plot of the minimum distance between periodic images, computed for the C-alpha atoms of the backbone, during the 10 ns simulation of the wildtype phenylalanine hydroxylase structure 1J8U. The distances for the three dimensions overlap and cannot be distinguished in the plot.

</figure>


  • What was the minimal distance between periodic images and at what time did that occur?
  • What happens if the minimal distance becomes shorter than the cut-off distance used for electrostatic interactions? Is this the case in your simulations? (It also matters whether the small distance occurs transiently or persistently. If it is persistent, it is likely to affect the protein dynamics; if it occurs only transiently, it will hardly, if at all, influence them.)
  • Now run g_mindist on the C-alpha group. Does it change the results? What does this mean for your system? (Ideally, the minimal distance should not be less than two nanometers.)
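
The questions above ask for the smallest distance, the time at which it occurs, and whether a violation of the cut-off is transient or persistent. A minimal sketch of such a check is given below. The function name is hypothetical, the input is assumed to be a time series already read from the distance output, and the cut-off of 1.0 nm and the data in the example are placeholders, not the values of our simulations.

```python
def summarize_min_distance(times_ps, distances_nm, cutoff_nm):
    """Summarize a minimum-image distance time series against a cut-off."""
    i_min = min(range(len(distances_nm)), key=distances_nm.__getitem__)

    # Count frames below the cut-off and the longest consecutive run of them;
    # a long run indicates a persistent rather than a transient violation.
    frames_below = 0
    longest_run = run = 0
    for distance in distances_nm:
        if distance < cutoff_nm:
            frames_below += 1
            run += 1
            longest_run = max(longest_run, run)
        else:
            run = 0

    return {
        "min_distance_nm": distances_nm[i_min],
        "time_of_min_ps": times_ps[i_min],
        "frames_below_cutoff": frames_below,
        "longest_run_below_cutoff": longest_run,
    }


# Made-up example data; the 1.0 nm cut-off is an assumption.
times = [0, 5, 10, 15, 20]
distances = [2.5, 1.9, 0.8, 0.9, 2.1]
print(summarize_min_distance(times, distances, cutoff_nm=1.0))
```

Multiplying the longest run by the frame spacing gives the duration of the longest violation.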


Root Mean Square Fluctuations

<figure id="fig:1J8U_rmsf">

Plot of the RMSF of all residues of the protein relative to their average positions during the 10 ns simulation of the wildtype phenylalanine hydroxylase structure 1J8U.

</figure>

<figure id="fig:1J8U_average">

The average structure of the wildtype during the simulation. The structure is not physical as atom positions are averaged over the whole simulation.

</figure>

<figure id="fig:1J8U_bfactor_binding_site">

The B-factors of the binding site in the wildtype. Blue indicates little movement, red indicates high flexibility.

</figure>


<figure id="fig:1J8U_bfactor_down_site">

The B-factors of the wildtype, view of the binding pocket. Blue indicates little movement, red indicates high flexibility.

</figure>

<figure id="fig:1J8U_bfactor_up_site">

The B-factors of the wildtype, view of the upper side. Blue indicates little movement, red indicates high flexibility.

</figure>

  • Indicate the start and end residue for the most flexible regions and the maximum amplitudes. ( T )
  • Compare the results from the different proteins. Are there differences? If yes, which is the most flexible and which least?
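
RMSF values and B-factors describe the same fluctuations; under the usual isotropic approximation they are related by B = (8π²/3)·RMSF². The sketch below applies this relation and lists contiguous residue ranges whose RMSF exceeds a threshold, which is one way to answer the first question above. The function names, the threshold and the example values are hypothetical, and we do not claim that the figures were produced this way.

```python
import math


def rmsf_to_bfactor(rmsf_nm):
    """Convert an RMSF in nm to an isotropic B-factor in square angstroms."""
    rmsf_angstrom = 10.0 * rmsf_nm            # 1 nm = 10 angstroms
    return (8.0 * math.pi ** 2 / 3.0) * rmsf_angstrom ** 2


def flexible_regions(residue_ids, rmsf_nm, threshold_nm):
    """Return (start, end, max RMSF) for each run of residues above threshold."""
    regions = []
    start = last = None
    peak = 0.0
    for residue, value in zip(residue_ids, rmsf_nm):
        if value > threshold_nm:
            if start is None:
                start, peak = residue, value
            else:
                peak = max(peak, value)
            last = residue
        elif start is not None:
            regions.append((start, last, peak))
            start = None
    if start is not None:                     # run reaches the last residue
        regions.append((start, last, peak))
    return regions


# Made-up RMSF values (nm) for six residues; threshold chosen arbitrarily.
residues = [1, 2, 3, 4, 5, 6]
rmsf = [0.05, 0.2, 0.3, 0.05, 0.25, 0.04]
print(flexible_regions(residues, rmsf, threshold_nm=0.1))
print(round(rmsf_to_bfactor(0.1), 2))
```

The threshold is a free parameter; a value relative to the mean RMSF of each protein makes the three simulations easier to compare.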


Convergence of RMSD

<figure id="fig:1J8U_rmds_all-atom-vs-start">

Plot of the RMSD of all atoms of the protein vs. the starting structure during the 10 ns simulation of the wildtype phenylalanine hydroxylase structure 1J8U.

</figure>

<figure id="fig:1J8U_rmds_all-atom-vs-average">

Plot of the RMSD of all atoms of the protein vs. the (theoretical) average structure during the 10 ns simulation of the wildtype phenylalanine hydroxylase structure 1J8U.

</figure>

<figure id="fig:1J8U_rmds_backbone-vs-start">

Plot of the RMSD of the backbone atoms of the protein vs. the starting structure during the 10 ns simulation of the wildtype phenylalanine hydroxylase structure 1J8U.

</figure>

<figure id="fig:1J8U_rmds_backbone-vs-average">

Plot of the RMSD of the backbone atoms of the protein vs. the (theoretical) average structure during the 10 ns simulation of the wildtype phenylalanine hydroxylase structure 1J8U.

</figure>

  • If observed, at what time and value does the RMSD reach a plateau?
  • Briefly discuss differences between the graphs against the starting structure and against the average structure. Which is a better measure for convergence?
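
To make the two reference choices concrete, the following sketch computes the RMSD of a frame against the starting structure and against the average structure. It is a simplified illustration: the coordinates are assumed to be already superimposed, no mass weighting is applied, and the function names and the two-atom example are hypothetical.

```python
import math


def rmsd(coords_a, coords_b):
    """Root mean square deviation between two equally sized coordinate sets."""
    if len(coords_a) != len(coords_b):
        raise ValueError("coordinate sets must have the same number of atoms")
    squared = sum(
        (a - b) ** 2
        for atom_a, atom_b in zip(coords_a, coords_b)
        for a, b in zip(atom_a, atom_b)
    )
    return math.sqrt(squared / len(coords_a))


def average_structure(frames):
    """Per-atom mean position over all frames (not a physical structure)."""
    n_frames = len(frames)
    n_atoms = len(frames[0])
    return [
        tuple(sum(frame[i][k] for frame in frames) / n_frames for k in range(3))
        for i in range(n_atoms)
    ]


# Made-up trajectory: two atoms, two frames, coordinates in nm.
frames = [
    [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
    [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0)],
]
start = frames[0]
mean = average_structure(frames)
for index, frame in enumerate(frames):
    print(index, rmsd(frame, start), rmsd(frame, mean))
```

By construction the RMSD against the starting structure begins at zero, whereas the RMSD against the average structure does not, which is worth keeping in mind when comparing the two sets of plots.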


Convergence of Radius of Gyration

<figure id="fig:1J8U_radius_gyration">

Plot of the radius of gyration during the 10 ns simulation of the wildtype phenylalanine hydroxylase structure 1J8U.

</figure>

<figure id="fig:1J8U_inertia">

Plot of the moment of inertia during the 10 ns simulation of the wildtype phenylalanine hydroxylase structure 1J8U.

</figure>


  • Have a look at the radius of gyration and the individual components and note how each of these progresses to an equilibrium value.
  • At what time and value does the radius of gyration converge? ( T )
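
The radius of gyration is the mass-weighted root mean square distance of the atoms from the center of mass. The sketch below computes it for a single frame, together with components about the x, y and z axes. We assume here that the individual components mentioned above are radii about the coordinate axes; the function names and the two-atom example are hypothetical.

```python
import math


def center_of_mass(coords, masses):
    """Mass-weighted mean position."""
    total = sum(masses)
    return tuple(
        sum(m * p[k] for p, m in zip(coords, masses)) / total for k in range(3)
    )


def radius_of_gyration(coords, masses):
    """Mass-weighted RMS distance of the atoms from the center of mass."""
    com = center_of_mass(coords, masses)
    weighted = sum(
        m * sum((p[k] - com[k]) ** 2 for k in range(3))
        for p, m in zip(coords, masses)
    )
    return math.sqrt(weighted / sum(masses))


def axis_components(coords, masses):
    """Radii of gyration about the x, y and z axes through the center of mass."""
    com = center_of_mass(coords, masses)
    total = sum(masses)
    components = []
    for axis in range(3):
        # Distance from an axis uses only the two perpendicular coordinates.
        others = [k for k in range(3) if k != axis]
        weighted = sum(
            m * sum((p[k] - com[k]) ** 2 for k in others)
            for p, m in zip(coords, masses)
        )
        components.append(math.sqrt(weighted / total))
    return tuple(components)


# Made-up example: two atoms of equal mass, 2 nm apart along x.
coords = [(0.0, 0.0, 0.0), (2.0, 0.0, 0.0)]
masses = [1.0, 1.0]
print(radius_of_gyration(coords, masses))
print(axis_components(coords, masses))
```

Evaluating these functions for every frame yields the time series whose convergence is asked for above.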


Structural Analysis: Properties Derived from Configurations


Solvent accessible surface

  • Which residues are the most accessible to the solvent?


Hydrogen Bonds

  • Discuss the relation between the number of hydrogen bonds for both cases and the fluctuations in each plot.


Salt Bridges


Secondary Structure

  • Discuss some of the changes in the secondary structure, if any.


Ramachandran Plots

  • What can you say about the conformation of the residues, based on the Ramachandran plots?


Analysis of Dynamics and Time-averaged Properties


Root Mean Square Deviations

  • What is interesting about choosing the group "Mainchain+Cb" for this analysis?
  • How many transitions do you see?
  • What can you conclude from this analysis? Could you have expected such a result? Justify your answer.


Cluster Analysis

  • How many clusters were found and what were the sizes of the largest two?
  • Are there notable differences between the two structures?


Distance RMSD

  • At what time and value does the dRMSD converge and how does this graph compare to the standard RMSD?
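
In contrast to the standard RMSD, the distance RMSD compares the internal distance matrices of two structures and therefore needs no prior superposition. A minimal sketch, with hypothetical function names and made-up coordinates:

```python
import math
from itertools import combinations


def drmsd(coords, reference):
    """Distance RMSD: RMS difference of all pairwise atom distances."""
    if len(coords) != len(reference):
        raise ValueError("structures must have the same number of atoms")
    pairs = list(combinations(range(len(coords)), 2))
    squared = sum(
        (math.dist(coords[i], coords[j]) - math.dist(reference[i], reference[j])) ** 2
        for i, j in pairs
    )
    return math.sqrt(squared / len(pairs))


# Made-up example: a structure and a copy rotated by 90 degrees about z.
reference = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0)]
rotated = [(0.0, 0.0, 0.0), (0.0, 1.0, 0.0), (-1.0, 0.0, 0.0)]
print(drmsd(rotated, reference))   # rotation leaves internal distances unchanged

# Stretching the distance between two atoms from 1 nm to 3 nm.
print(drmsd([(0.0, 0.0, 0.0), (3.0, 0.0, 0.0)], [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]))
```

Because rigid-body motion does not change internal distances, the dRMSD is insensitive to the fitting step that the standard RMSD depends on.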


Ala322Gly analysis

<figure id="fig:mut322_overlay">

Overlay of all frames of the 10 ns simulation of the Ala322Gly mutation of phenylalanine hydroxylase structure 1J8U.

</figure>

Quality Assurance

Convergence of Energy Terms

<figure id="fig:Mut322_temperature">

Plot of the system temperature during the 10 ns simulation of the Ala322Gly mutation. A running average in a window of length 100 ps is indicated in red.

</figure>

<figure id="fig:Mut322_pressure">

Plot of the system pressure during the 10 ns simulation of the Ala322Gly mutation. A running average in a window of length 100 ps is indicated in red.

</figure>

<figure id="fig:Mut322_volume">

Plot of the system volume during the 10 ns simulation of the Ala322Gly mutation. A running average in a window of length 100 ps is indicated in red.

</figure>

<figure id="fig:Mut322_density">

Plot of the system density during the 10 ns simulation of the Ala322Gly mutation. A running average in a window of length 100 ps is indicated in red.

</figure>

<figure id="fig:Mut322_temperature">

Plot of the system extension in three dimensions during the 10 ns simulation of the Ala322Gly mutation. The X and Y dimensions overlap and cannot be distinguished in the plot.

</figure>

<figure id="fig:Mut322_coulomb">

Plot of the Coulomb interactions during the 10 ns simulation of the Ala322Gly mutation. A running average in a window of length 100 ps is indicated in red.

</figure>

<figure id="fig:1J8Uvdw">

Plot of the van-der-Waals interactions during the 10 ns simulation of the Ala322Gly mutation. A running average in a window of length 100 ps is indicated in red.

</figure>

  • What is the average temperature and what is the heat capacity of the system? ( T )
  • What are the terms plotted in the files energy.xvg and box.xvg?
  • Estimate the plateau values for the pressure, the volume and the density. ( T )
  • What are the terms plotted in the files coulomb-inter.xvg and vanderwaals-inter.xvg?


Minimum Distance Between Periodic Images

<figure id="fig:Mut322_mindist">

Plot of the minimum distance between periodic images, computed for all atoms of the protein, during the 10 ns simulation of the Ala322Gly mutation. The distances for the three dimensions overlap and cannot be distinguished in the plot.

</figure>

<figure id="fig:Mut322_mindist_c_alpha">

Plot of the minimum distance between periodic images, computed for the C-alpha atoms of the backbone, during the 10 ns simulation of the Ala322Gly mutation. The distances for the three dimensions overlap and cannot be distinguished in the plot.

</figure>


  • What was the minimal distance between periodic images and at what time did that occur?
  • What happens if the minimal distance becomes shorter than the cut-off distance used for electrostatic interactions? Is this the case in your simulations? (It also matters whether the small distance occurs transiently or persistently. If it is persistent, it is likely to affect the protein dynamics; if it occurs only transiently, it will hardly, if at all, influence them.)
  • Now run g_mindist on the C-alpha group. Does it change the results? What does this mean for your system? (Ideally, the minimal distance should not be less than two nanometers.)


Root Mean Square Fluctuations

<figure id="fig:Mut322_rmsf-per-residue.png">

Plot of the RMSF of all residues of the protein relative to their average positions during the 10 ns simulation of the Ala322Gly mutation.

</figure>


  • Indicate the start and end residue for the most flexible regions and the maximum amplitudes. ( T )
  • Compare the results from the different proteins. Are there differences? If yes, which is the most flexible and which least?


Convergence of RMSD

<figure id="fig:Mut322_rmds_all-atom-vs-start">

Plot of the RMSD of all atoms of the protein vs. the starting structure during the 10 ns simulation of the Ala322Gly mutation.

</figure>

<figure id="fig:Mut322_rmds-all-atom-vs-average">

Plot of the RMSD of all atoms of the protein vs. the (theoretical) average structure during the 10 ns simulation of the Ala322Gly mutation.

</figure>

<figure id="fig:Mut322_rmds-backbone-vs-start">

Plot of the RMSD of the backbone atoms of the protein vs. the starting structure during the 10 ns simulation of the Ala322Gly mutation.

</figure>

<figure id="fig:Mut322_rmds_backbone-vs-average">

Plot of the RMSD of the backbone atoms of the protein vs. the (theoretical) average structure during the 10 ns simulation of the Ala322Gly mutation.

</figure>

  • If observed, at what time and value does the RMSD reach a plateau?
  • Briefly discuss differences between the graphs against the starting structure and against the average structure. Which is a better measure for convergence?


Convergence of Radius of Gyration

<figure id="fig:Mut322_radius-of-gyration">

Plot of the radius of gyration during the 10 ns simulation of the Ala322Gly mutation.

</figure>

<figure id="fig:Mut322_inertia">

Plot of the moment of inertia during the 10 ns simulation of the Ala322Gly mutation.

</figure>


  • Have a look at the radius of gyration and the individual components and note how each of these progresses to an equilibrium value.
  • At what time and value does the radius of gyration converge? ( T )


Structural Analysis: Properties Derived from Configurations


Solvent accessible surface

  • Which residues are the most accessible to the solvent?


Hydrogen Bonds

  • Discuss the relation between the number of hydrogen bonds for both cases and the fluctuations in each plot.


Salt Bridges


Secondary Structure

  • Discuss some of the changes in the secondary structure, if any.


Ramachandran Plots

  • What can you say about the conformation of the residues, based on the Ramachandran plots?


Analysis of Dynamics and Time-averaged Properties


Root Mean Square Deviations

  • What is interesting about choosing the group "Mainchain+Cb" for this analysis?
  • How many transitions do you see?
  • What can you conclude from this analysis? Could you have expected such a result? Justify your answer.


Cluster Analysis

  • How many clusters were found and what were the sizes of the largest two?
  • Are there notable differences between the two structures?


Distance RMSD

  • At what time and value does the dRMSD converge and how does this graph compare to the standard RMSD?


Arg408Trp analysis

<figure id="fig:mut408_overlay">

Overlay of all frames of the 10 ns simulation of the Arg408Trp mutation of phenylalanine hydroxylase structure 1J8U.

</figure>

Quality Assurance

Convergence of Energy Terms

<figure id="fig:Mut408_temperature">

Plot of the system temperature during the 10 ns simulation of the Arg408Trp mutation. A running average in a window of length 100 ps is indicated in red.

</figure>

<figure id="fig:Mut408_pressure">

Plot of the system pressure during the 10 ns simulation of the Arg408Trp mutation. A running average in a window of length 100 ps is indicated in red.

</figure>

<figure id="fig:Mut408_volume">

Plot of the system volume during the 10 ns simulation of the Arg408Trp mutation. A running average in a window of length 100 ps is indicated in red.

</figure>

<figure id="fig:Mut408_density">

Plot of the system density during the 10 ns simulation of the Arg408Trp mutation. A running average in a window of length 100 ps is indicated in red.

</figure>

<figure id="fig:Mut408_temperature">

Plot of the system extension in three dimensions during the 10 ns simulation of the Arg408Trp mutation. The X and Y dimensions overlap and cannot be distinguished in the plot.

</figure>

<figure id="fig:Mut408_coulomb">

Plot of the Coulomb interactions during the 10 ns simulation of the Arg408Trp mutation. A running average in a window of length 100 ps is indicated in red.

</figure>

<figure id="fig:1J8Uvdw">

Plot of the van-der-Waals interactions during the 10 ns simulation of the Arg408Trp mutation. A running average in a window of length 100 ps is indicated in red.

</figure>

  • What is the average temperature and what is the heat capacity of the system? ( T )
  • What are the terms plotted in the files energy.xvg and box.xvg?
  • Estimate the plateau values for the pressure, the volume and the density. ( T )
  • What are the terms plotted in the files coulomb-inter.xvg and vanderwaals-inter.xvg?


Minimum Distance Between Periodic Images

<figure id="fig:Mut408_mindist">

Plot of the minimum distance between periodic images, computed for all atoms of the protein, during the 10 ns simulation of the Arg408Trp mutation. The distances for the three dimensions overlap and cannot be distinguished in the plot.

</figure>

<figure id="fig:Mut408_mindist_c_alpha">

Plot of the minimum distance between periodic images, computed for the C-alpha atoms of the backbone, during the 10 ns simulation of the Arg408Trp mutation. The distances for the three dimensions overlap and cannot be distinguished in the plot.

</figure>


  • What was the minimal distance between periodic images and at what time did that occur?
  • What happens if the minimal distance becomes shorter than the cut-off distance used for electrostatic interactions? Is this the case in your simulations? (It also matters whether the small distance occurs transiently or persistently. If it is persistent, it is likely to affect the protein dynamics; if it occurs only transiently, it will hardly, if at all, influence them.)
  • Now run g_mindist on the C-alpha group. Does it change the results? What does this mean for your system? (Ideally, the minimal distance should not be less than two nanometers.)


Root Mean Square Fluctuations

<figure id="fig:Mut408_rmsf-per-residue.png">

Plot of the RMSF of all residues of the protein relative to their average positions during the 10 ns simulation of the Arg408Trp mutation.

</figure>


  • Indicate the start and end residue for the most flexible regions and the maximum amplitudes. ( T )
  • Compare the results from the different proteins. Are there differences? If yes, which is the most flexible and which least?


Convergence of RMSD

<figure id="fig:Mut408_rmds_all-atom-vs-start">

Plot of the RMSD of all atoms of the protein vs. the starting structure during the 10 ns simulation of the Arg408Trp mutation.

</figure>

<figure id="fig:Mut408_rmds-all-atom-vs-average">

Plot of the RMSD of all atoms of the protein vs. the (theoretical) average structure during the 10 ns simulation of the Arg408Trp mutation.

</figure>

<figure id="fig:Mut408_rmds-backbone-vs-start">

Plot of the RMSD of the backbone atoms of the protein vs. the starting structure during the 10 ns simulation of the Arg408Trp mutation.

</figure>

<figure id="fig:Mut408_rmds_backbone-vs-average">

Plot of the RMSD of the backbone atoms of the protein vs. the (theoretical) average structure during the 10 ns simulation of the Arg408Trp mutation.

</figure>

  • If observed, at what time and value does the RMSD reach a plateau?
  • Briefly discuss differences between the graphs against the starting structure and against the average structure. Which is a better measure for convergence?


Convergence of Radius of Gyration

<figure id="fig:Mut408_radius-of-gyration">

Plot of the radius of gyration during the 10 ns simulation of the Arg408Trp mutation.

</figure>

<figure id="fig:Mut408_inertia">

Plot of the moment of inertia during the 10 ns simulation of the Arg408Trp mutation.

</figure>


  • Have a look at the radius of gyration and the individual components and note how each of these progresses to an equilibrium value.
  • At what time and value does the radius of gyration converge? ( T )


Structural Analysis: Properties Derived from Configurations


Solvent accessible surface

  • Which residues are the most accessible to the solvent?


Hydrogen Bonds

  • Discuss the relation between the number of hydrogen bonds for both cases and the fluctuations in each plot.


Salt Bridges


Secondary Structure

  • Discuss some of the changes in the secondary structure, if any.


Ramachandran Plots

  • What can you say about the conformation of the residues, based on the Ramachandran plots?


Analysis of Dynamics and Time-averaged Properties


Root Mean Square Deviations

  • What is interesting about choosing the group "Mainchain+Cb" for this analysis?
  • How many transitions do you see?
  • What can you conclude from this analysis? Could you have expected such a result? Justify your answer.


Cluster Analysis

  • How many clusters were found and what were the sizes of the largest two?
  • Are there notable differences between the two structures?


Distance RMSD

  • At what time and value does the dRMSD converge and how does this graph compare to the standard RMSD?