Difference between revisions of "Normal Mode Analysis BCKDHA"

From Bioinformatikpedia
(Results)
(Comparison of the lowest-frequency normal modes with the MD simulation)
 
(60 intermediate revisions by 2 users not shown)
Line 1: Line 1:
  +
== Introduction ==
  +
  +
Normal Mode Analysis is a powerful tool to examine large global motions of proteins. A normal mode of an oscillating system is a pattern of motion in
  +
which all parts of the system move with the same frequency and in phase. Proteins can be modeled as harmonic oscillating systems. NMA methods calculate low-frequency modes for a protein which correspond to collective motions of the complete protein.
  +
In this task we applied several NMA methods to our protein BCKDHA and performed an all-atom NMA on a small protein (2LYZ).
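
As a brief reminder of the harmonic approximation behind these methods (general textbook background, not a description of any particular server used here): the potential energy is expanded to second order around the equilibrium structure <math>\mathbf{x}_0</math>, and the normal modes are the eigenvectors of the mass-weighted Hessian. The symbols <math>H</math>, <math>M</math>, <math>\mathbf{q}_k</math> and <math>\omega_k</math> are introduced only for this illustration.

<math>V(\mathbf{x}) \approx \tfrac{1}{2}\,\Delta\mathbf{x}^{T} H\,\Delta\mathbf{x}, \qquad H_{ij} = \left.\frac{\partial^{2} V}{\partial x_i\,\partial x_j}\right|_{\mathbf{x}_0}</math>

<math>M^{-1/2}\, H\, M^{-1/2}\,\mathbf{q}_k = \omega_k^{2}\,\mathbf{q}_k</math>

For a free molecule in three dimensions, six eigenvalues are zero (three rigid-body translations and three rotations), which is why the first non-trivial mode is usually numbered 7.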
  +
 
== WEBnm@ ==
 
== WEBnm@ ==
   
Line 7: Line 13:
 
<b>Single Analysis:</b>
 
<b>Single Analysis:</b>
   
The Single Analysis calculates the lowest frequency normal modes of the given protein and offers different types of calculations to analyse the modes that were calculated. The force field used for the Normal Modes Calculations is the C-alpha force field It uses only the Calpha atoms of the protein which are assigned the masses of the whole residue they represent. <br>The different types of calculation are:
+
The Single Analysis calculates the lowest-frequency normal modes of the given protein and offers different types of calculations to analyse the calculated modes. The force field used for the normal mode calculations is the C-alpha force field. It uses only the C-alpha atoms of the protein, which are assigned the masses of the whole residue they represent. <br>The different types of calculation are:
 
*deformation energies of each mode
 
*deformation energies of each mode
 
*calculation of normalized squared atomic displacements (results are provided for each low frequency mode, either as raw data or as plots with displacement vs. residue number)
 
*calculation of normalized squared atomic displacements (results are provided for each low frequency mode, either as raw data or as plots with displacement vs. residue number)
Line 61: Line 67:
   
   
<b>WEBnm@ visualised the normalized squared atomic displacements for the first five modes (modes 7 to 11)</b>. Figures 1-5 display the first five normal modes of our protein.
+
<b>WEBnm@ visualised the normalized squared atomic displacements for the first five modes (modes 7 to 11)</b>. <br>Figures 1-5 display the first five normal modes of our protein.
 
Figures 6-10 show the square of the displacement of each C-alpha atom, normalized so that the sum over all residues equals 100. The highest values correspond to the most displaced regions. Clusters of peaks identify significantly large regions. Isolated peaks reflect local flexibility and are not relevant.

Figures 6-10 show the square of the displacement of each C-alpha atom, normalized so that the sum over all residues equals 100. The highest values correspond to the most displaced regions. Clusters of peaks identify significantly large regions. Isolated peaks reflect local flexibility and are not relevant.
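
The normalization described above can be sketched in a few lines of Python. This is only an illustration of the arithmetic, not WEBnm@'s own code; the function name and the toy displacement vectors are made up for the example.

```python
def normalized_squared_displacements(mode):
    """Return the squared displacement per C-alpha atom, scaled to sum to 100.

    mode: list of (dx, dy, dz) displacement vectors, one per C-alpha atom.
    """
    squared = [dx * dx + dy * dy + dz * dz for dx, dy, dz in mode]
    total = sum(squared)
    if total == 0:
        raise ValueError("mode contains no displacement")
    return [100.0 * s / total for s in squared]


# Toy mode with three residues (hypothetical values).
example_mode = [(3.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 0.0)]
profile = normalized_squared_displacements(example_mode)
```

In a profile computed this way, a cluster of neighbouring residues with high values corresponds to a region that moves as a whole in that mode.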
  +
   
 
{| border="1"
 
{| border="1"
Line 106: Line 113:
   
 
<b>Output</b>
 
<b>Output</b>
*properties of the first 100 lowest frequency modes (frequency, collectivity of atom movement, overlap of each mode with the observed conformational change (if two conformations are available) and its corresponding amplitude)
+
*Properties of the first 100 lowest frequency modes (frequency, collectivity of atom movement, overlap of each mode with the observed conformational change (if two conformations are available) and its corresponding amplitude)
 
*3D animations from three orthogonal viewpoints in large and small sizes
 
*3D animations from three orthogonal viewpoints in large and small sizes
 
*Comparison of a normal mode perturbed structure and a second conformation in terms of RMSD and number of residues that are closer than 3Å can be done
 
*Comparison of a normal mode perturbed structure and a second conformation in terms of RMSD and number of residues that are closer than 3Å can be done
*cross plot where the analysis of distance fluctuations between all CA atoms is shown. Red (decreasing) and blue dots (increasing) indicate the residues for which the distance changes significantly in movement. (The upper left corner indicates the first residue. Grey lines are drawn every 10 residues, yellow lines are drawn every 100 residues.)
+
*Cross plot where the analysis of distance fluctuations between all CA atoms is shown. Red (decreasing) and blue dots (increasing) indicate the residues for which the distance changes significantly in movement. (The upper left corner indicates the first residue. Grey lines are drawn every 10 residues, yellow lines are drawn every 100 residues.)
   
 
====References ====
 
====References ====
Line 117: Line 124:
 
=== Results ===
 
=== Results ===
   
<b>CA distance fluctuations for the six modes</b><br>
+
<b>CA distance fluctuations for the five modes</b><br>
 
{|border="1"
 
{|border="1"
 
!mode 7
 
!mode 7
Line 148: Line 155:
 
||[[File:Mod7_3_BCKDHA.gif|thumb|130px|Figure 16c: view 3 of mode 7]]
 
||[[File:Mod7_3_BCKDHA.gif|thumb|130px|Figure 16c: view 3 of mode 7]]
 
|}
 
|}
The mode displayed in figure 16 agrees with the distance fluctuation seen in figure 11. The very beginning of the peptide chain moves away from the rest of the protein. It looks like a hinge-movement.
+
The mode shown in figure 16 agrees with the distance fluctuation seen in figure 11. The very beginning of the peptide chain moves away from the rest of the protein. It looks like a hinge-movement.
   
 
<b>Mode 8: </b>
 
<b>Mode 8: </b>
Line 156: Line 163:
 
||[[File:Mod8_3_BCKDHA.gif|thumb|130px|Figure 17c: view 3 of mode 8]]
 
||[[File:Mod8_3_BCKDHA.gif|thumb|130px|Figure 17c: view 3 of mode 8]]
 
|}
 
|}
The mode shown in figure 17 shows that the beginning of the peptide sequence moves towards the protein. This observation can be confirmed when looking at the cross plot given in figure 12, where the decreasing distance for the first residues is given by blue dots. This mode shows another hinge-movement.
+
The mode displayed in figure 17 shows that the beginning of the peptide sequence moves towards the protein. This observation can be confirmed when looking at the cross plot given in figure 12, where the decreasing distance for the first residues is given by blue dots. This mode shows another hinge-movement.
   
 
<b>Mode 9: </b>
 
<b>Mode 9: </b>
Line 164: Line 171:
 
||[[File:Mod9_3_BCKDHA.gif|thumb|130px|Figure 18c: view 3 of mode 9]]
 
||[[File:Mod9_3_BCKDHA.gif|thumb|130px|Figure 18c: view 3 of mode 9]]
 
|}
 
|}
As seen at the distance fluctations plot (figure 13), the distances for the first residues in the peptide chain vary, some are decreasing and some are increasing. This can be explained by a twisting peptide sequence, where some residues come closer to the protein core and other move apart.
+
As seen in the distance fluctuation plot (figure 13), the distances for the first residues in the peptide chain vary: some decrease and some increase. This can be explained by a twisting peptide sequence, where some residues come closer to the protein core and others move apart.
   
 
<b>Mode 10: </b>
 
<b>Mode 10: </b>
Line 201: Line 208:
 
<b>Input:</b>
 
<b>Input:</b>
 
* PDB id or PDB file
 
* PDB id or PDB file
* chain id
+
* Chain id
* model (for multi-model files such as from NMR)
+
* Model (for multi-model files such as from NMR)
* cutoff for interaction between Cα atoms in Å (set to 15Å)
+
* Cutoff for interaction between Cα atoms in Å (set to 15Å)
* distance weight for interaction between Cα atoms (set to 3.0)
+
* Distance weight for interaction between Cα atoms (set to 3.0)
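
To make the role of the cutoff and the distance weight more concrete, the following Python sketch builds an anisotropic network Hessian for a handful of Cα positions. It is an assumption-laden illustration: in particular, the weighting of each spring by <code>dist ** -distance_weight</code> is one common convention and may differ from what the ANM webserver actually does, and the coordinates are invented.

```python
import math


def anm_hessian(coords, cutoff=15.0, distance_weight=3.0, gamma=1.0):
    """Build a 3N x 3N elastic network Hessian from C-alpha coordinates.

    Pairs farther apart than `cutoff` (in Angstrom) are not connected.
    Assumption: the spring constant of a pair decays as dist ** -distance_weight.
    """
    n = len(coords)
    hessian = [[0.0] * (3 * n) for _ in range(3 * n)]
    for i in range(n):
        for j in range(i + 1, n):
            diff = [coords[j][k] - coords[i][k] for k in range(3)]
            dist = math.sqrt(sum(d * d for d in diff))
            if dist == 0.0 or dist > cutoff:
                continue
            k_ij = gamma / dist ** distance_weight
            for a in range(3):
                for b in range(3):
                    block = -k_ij * diff[a] * diff[b] / dist ** 2
                    # Off-diagonal 3x3 super-elements.
                    hessian[3 * i + a][3 * j + b] += block
                    hessian[3 * j + a][3 * i + b] += block
                    # Diagonal super-elements are minus the sum of the off-diagonal ones.
                    hessian[3 * i + a][3 * i + b] -= block
                    hessian[3 * j + a][3 * j + b] -= block
    return hessian


# Four invented C-alpha positions (Angstrom), for illustration only.
example_coords = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (3.8, 3.8, 0.0), (0.0, 0.0, 3.8)]
example_hessian = anm_hessian(example_coords, cutoff=15.0, distance_weight=3.0)
```

Diagonalizing such a matrix (for example with a linear algebra library) gives the modes; the six eigenvalues that are numerically zero correspond to rigid-body motion.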
   
 
<b> Output:</b>
 
<b> Output:</b>
 
The ANM Webserver offers a broad range of output files to analyse the computed normal modes more precisely.
 
The ANM Webserver offers a broad range of output files to analyse the computed normal modes more precisely.
On the main page you can visualize the 20 first modes calculated. It is possible to scale the amplitude and frequency of motion, to display vectors and the protein in different ways and colors.
+
On the main page you can visualize the first 20 calculated modes. It is possible to scale the amplitude and frequency of motion and to display vectors and the protein in different ways and colors.
 
Furthermore the following options are available
 
Furthermore the following options are available
*download files
+
*Download files
*create PDB (motion)
+
*Create PDB (motion)
*create PyMol script
+
*Create PyMol script
*get anisotrpic temp. factors
+
*Get anisotropic temp. factors
 
*B-factors/mode fluctuations
 
*B-factors/mode fluctuations
 
*Eigenvalues
 
*Eigenvalues
Line 222: Line 229:
   
 
=== Results ===
 
=== Results ===
As the first 5 modes could not be displayed via pymol, we will analyse modes 6-11 in the following section.
+
As the first 5 modes could not be displayed via PyMOL, we will analyse modes 6-10 in the following section.
   
 
<!-- Energy modes 1-5
 
<!-- Energy modes 1-5
Line 241: Line 248:
 
Overall energy: 108.50491155
 
Overall energy: 108.50491155
   
Mode 6 shows two centers of movement: First, the beginning of the peptide sequence is twisting and turning as shown in Figure 21a. This fluctuations can also be seen in the fluctuation of individual residues according to experimental b-factors (Figure 21b). The distance matrix (Figure 21c) shows that most of the residues in the protein are correlated (red), only the residues at the beginning of the peptide chain are anti-correlated (blue). The white zones indicate weak correlations. So besides the very anti-correlated beginning of the peptide chain, there is a part in the end of the protein that seems to be correlated weakly. This can also be ovserved when looking at Figure 21a, where another hinge-movement cacn be detected at the right.
+
Mode 6 shows two centers of movement. First, the beginning of the peptide sequence is twisting and turning as shown in Figure 21a. These fluctuations can also be seen in the fluctuation of individual residues according to experimental B-factors (Figure 21b). The distance matrix (Figure 21c) shows that most of the residues in the protein are correlated (red); only the residues at the beginning of the peptide chain are anti-correlated (blue). The white zones indicate weak correlations. So besides the strongly anti-correlated beginning of the peptide chain, there is a part at the end of the protein that seems to be weakly correlated. This can also be observed in Figure 21a, where another hinge movement can be detected on the right.
   
 
''' Mode 7'''
 
''' Mode 7'''
Line 251: Line 258:
 
|}
 
|}
 
Overall energy: 105.33659403
 
Overall energy: 105.33659403
  +
 
Mode 7 shows twisting movements at both ends of the protein (see Figure 22a). The distribution of b-factors (Figure 22b) and the correlation matrix (Figure 22c) agree with this observation.
 
Mode 7 shows twisting movements at both ends of the protein (see Figure 22a). The distribution of b-factors (Figure 22b) and the correlation matrix (Figure 22c) agree with this observation.
   
Line 261: Line 269:
 
|}
 
|}
 
Overall energy: 114.7349691
 
Overall energy: 114.7349691
  +
The movements given in mode 8 (Figure 23a) seem to be very similar to the ANM mode 7. But looking at the errors in Figure 23a one can see that the movement goes in exactly the opposite direction. The fluctuations per residue for mode 8 (Figure 23b) show also a much higher amplidude than for mode 7.
 
  +
The movements given in mode 8 (Figure 23a) seem to be very similar to those of ANM mode 7. But looking at the arrows in Figure 23a, one can see that the movement goes in exactly the opposite direction. The fluctuations per residue for mode 8 (Figure 23b) also show a much higher amplitude than for mode 7.
   
 
''' Mode 9'''
 
''' Mode 9'''
Line 271: Line 280:
 
|}
 
|}
 
Overall energy: 187.72640385
 
Overall energy: 187.72640385
  +
 
Mode 9 displays a small turning movement of the beginning of the peptide sequence (see Figure 24a). This observation can be confirmed when looking at the fluctuations of single residues (Figure 24b), where a peak exists only for the residues 6-30, and at the correlation matrix (Figure 24c), which shows a highly anti-correlated region for the beginning of the peptide sequence. Again, as in mode 6, the end part of the protein shows some evidence of movement, too, as the correlation here is very weak.
 
Mode 9 displays a small turning movement of the beginning of the peptide sequence (see Figure 24a). This observation can be confirmed when looking at the fluctuations of single residues (Figure 24b), where a peak exists only for the residues 6-30, and at the correlation matrix (Figure 24c), which shows a highly anti-correlated region for the beginning of the peptide sequence. Again, as in mode 6, the end part of the protein shows some evidence of movement, too, as the correlation here is very weak.
   
Line 281: Line 291:
 
|}
 
|}
 
Overall energy: 183.87695037
 
Overall energy: 183.87695037
  +
 
The ANM mode 10 shows the most movement of the protein. The whole protein seems to be turning and both ends are moving like hinges (see Figure 25a). These strong movements can also be detected when looking at the many small peaks in the B-factors (Figure 25b) and the small zones of weak and no correlation in the distance matrix (Figure 25c).

The ANM mode 10 shows the most movement of the protein. The whole protein seems to be turning and both ends are moving like hinges (see Figure 25a). These strong movements can also be detected when looking at the many small peaks in the B-factors (Figure 25b) and the small zones of weak and no correlation in the distance matrix (Figure 25c).
   
Line 293: Line 304:
 
*PDB id or PDB file
 
*PDB id or PDB file
 
* No. of nodes to represent a nucleotide (1 or 3)
 
* No. of nodes to represent a nucleotide (1 or 3)
* Cutoff for for amino acid pairs
+
* Cutoff for amino acid pairs
 
* Cutoff for nucleotide pairs
 
* Cutoff for nucleotide pairs
 
* Preferred visualization engine (Jmol or Chime)
 
* Preferred visualization engine (Jmol or Chime)
Line 300: Line 311:
 
The oGNM Webserver provides a comprehensive overview of the first 20 calculated normal modes. It is possible to display the slow modes, slow eigenvectors, slow average, slow av1-3 and RMSD of two modes side-by-side.

The oGNM Webserver provides a comprehensive overview of the first 20 calculated normal modes. It is possible to display the slow modes, slow eigenvectors, slow average, slow av1-3 and RMSD of two modes side-by-side.
 
The output includes:
 
The output includes:
*the mobility profiles of residues corresponding to the 20 slowest modes of motion predicted by the GNM
+
*The mobility profiles of residues corresponding to the 20 slowest modes of motion predicted by the GNM
*the average profile reuslting from the first 2 slowest modes
+
*The average profile resulting from the first 2 slowest modes
*the associated eigenvalues (21 of them, including the zero eigenvalue)
+
*The associated eigenvalues (21 of them, including the zero eigenvalue)
*the predicted and experimental B-factors, and the correlation coefficient between the two sets of B-factors
+
*The predicted and experimental B-factors, and the correlation coefficient between the two sets of B-factors
*the spring constant (g) in units of kcal/mol.Å2
+
*The spring constant (g) in units of kcal/(mol·Å²)
*the cross-correlation between residue fluctuations, plotted as a correlation map (for structures containing less than 2000 nodes)
+
*The cross-correlation between residue fluctuations, plotted as a correlation map (for structures containing less than 2000 nodes)
*the nodes included in the GNM analysis, summarized in the .ca file
+
*The nodes included in the GNM analysis, summarized in the .ca file
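
The connectivity (Kirchhoff) matrix on which a Gaussian network model is based can be sketched as follows. This is a generic illustration and not oGNM's implementation; the cutoff default and the toy coordinates are assumptions chosen for the example.

```python
import math


def kirchhoff_matrix(coords, cutoff=10.0):
    """Build the GNM connectivity matrix for a list of node coordinates.

    Off-diagonal entries are -1 for node pairs within `cutoff` (Angstrom),
    and each diagonal entry counts the contacts of that node.
    """
    n = len(coords)
    gamma = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            if math.dist(coords[i], coords[j]) <= cutoff:
                gamma[i][j] = -1.0
                gamma[j][i] = -1.0
                gamma[i][i] += 1.0
                gamma[j][j] += 1.0
    return gamma


# Four nodes on a line, 3.8 Angstrom apart (invented values).
chain = [(3.8 * k, 0.0, 0.0) for k in range(4)]
gamma = kirchhoff_matrix(chain, cutoff=7.0)
```

Every row of this matrix sums to zero, so a uniform shift of all nodes is an eigenvector with eigenvalue zero. This is the zero eigenvalue that is reported together with the 20 slowest modes.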
   
 
===Results ===
 
===Results ===
The results can also be found [http://ignm.ccbb.pitt.edu/ognm/23173998/temp/index.htm].
 
   
In the following section we are going to discuss the 5 lowest frequency modes calculated by oGNM. The following figures show the mobility of the protein for each computed normal mode, colored from <font color =blue> blue</font> to <font color=red>red</font> in the order of increasing mobilities, as well as the fluctuations per residue.
+
In the following section we are going to discuss the 5 lowest frequency modes (after the first six zero modes) calculated by oGNM. The following figures show the mobility of the protein for each computed normal mode, colored from <font color =blue> blue</font> to <font color=red>red</font> in the order of increasing mobilities, as well as the fluctuations per residue.
   
'''Mode 1'''
+
'''Mode 7'''
 
{|
 
{|
|[[File: BCKDHA_oGNM_mode1.PNG|thumb|150px|Figure 26a: oGNM mode 1]]
+
|[[File: OGNM_mode7_visualization_BCKDHA.png|thumb|150px|Figure 26a: oGNM mode 7]]
|[[File: BCKDHA_tempSlowmodes_mode1.PNG|thumb|150px|Figure 26b: the cross-correlation between residue fluctuations for mode 1]]
+
|[[File: Mode7_fluctuation_BCKDHA.png|thumb|150px|Figure 26b: Residue fluctuations for mode 7]]
 
|}
 
|}
The oGNM mode 1 shows a mobile part at the one end of the protein (Figure 26a). This mobility is also displayed in the fluctuations per residue (Figure 26b), where a peak for residues 1-30 indicates high felxibility, while the rest of the protein seems to be very stable.
+
The oGNM mode 7 shows a mobile part at one end of the protein (Figure 26a). This mobility is also visible in the fluctuations per residue (Figure 26b), where a peak for residues 18-23 indicates high flexibility, while the rest of the protein seems to be very stable.
   
'''Mode 2'''
+
'''Mode 8'''
 
{|
 
{|
|[[File: BCKDHA_oGNM_mode2.PNG|thumb|150px|Figure 27a: oGNM mode 2]]
+
|[[File: OGNM_mode8_visualization_BCKDHA.png|thumb|150px|Figure 27a: oGNM mode 8]]
|[[File: BCKDHA_tempSlowmodes_mode2.PNG|thumb|150px|Figure 27b: the cross-correlation between residue fluctuations for mode 2]]
+
|[[File: Mode8_fluctuation_BCKDHA.png|thumb|150px|Figure 27b: Residue fluctuations for mode 8]]
 
|}
 
|}
  +
As in mode 7, there are peaks at the beginning of the protein sequence in mode 8 (see Figure 27b). But in contrast to mode 7 there are three peaks, indicating three separate centers of movement with stable parts in between. In addition to these flexible parts there are some peaks in the middle of the protein, but they are very low, so these parts seem to be quite stable as well. This is also shown in the visualization of the protein, where the parts in the middle are only light pink (see Figure 27a), which indicates that they are probably stable.
oGNM calculated a very different mobile region of the protein for mode 2. Here only the other end of the protein is flexible, the rest of the protein is more or less stable (compare Figure 27 a and b).
 
   
'''Mode 3'''
+
'''Mode 9'''
 
{|
 
{|
|[[File: BCKDHA_oGNM_mode3.PNG|thumb|150px|Figure 28a: oGNM mode 3]]
+
|[[File: OGNM_mode9_visualization_BCKDHA.png|thumb|150px|Figure 28a: oGNM mode 9]]
|[[File: BCKDHA_tempSlowmodes_mode3.PNG|thumb|150px|Figure 28b: the cross-correlation between residue fluctuations for mode 3]]
+
|[[File: Mode9_fluctuation_BCKDHA.png|thumb|150px|Figure 28b: Residue fluctuations for mode 9]]
 
|}
 
|}
  +
We have the same picture in mode 9 as in the previous modes (see Figure 28a). There are two main peaks between positions 16 and 23 and a very small peak at position 10 (see Figure 28b), suggesting that there are two flexible regions at the beginning. The very low peak at the beginning and the one at position 300 can be neglected, since they are so low that we cannot be sure about the flexibility. The rest of the protein is predicted to be stable.
Mode 3 is similar to mode 1, with the exception, that there are two distinct peaks at the beginning of the protein sequence (see Figure 28b), indicating two separated centers of movement with a stable part in between.
 
   
'''Mode 4'''
+
'''Mode 10'''
 
{|
 
{|
|[[File: BCKDHA_oGNM_mode4.PNG|thumb|150px|Figure 29a: oGNM mode 4]]
+
|[[File: OGNM_mode10_visualization_BCKDHA.png|thumb|150px|Figure 29a: oGNM mode 10]]
|[[File: BCKDHA_tempSlowmodes_mode4.PNG|thumb|150px|Figure 29b: the cross-correlation between residue fluctuations for mode 4]]
+
|[[File: Mode10_fluctuation_BCKDHA.png|thumb|150px|Figure 29b: Residue fluctuations for mode 10]]
 
|}
 
|}
  +
In mode 10 the most flexible part is at the very beginning of the protein at position 10, and the slightly lower peaks are between positions 16 and 23 (see Figure 29b). Although this is the reverse of mode 9, both modes indicate that these regions are flexible. This is visualized by the red coloring of these parts of the protein (see Figure 29a). As in modes 8 and 9 there is a low peak around position 300. Since this peak appears in all three modes we have to take it into account, but it is very low in all three, so it is probably a region with only little flexibility.
The calculated mode 4 is very different form the other oGNM modes. Here almost half of the protein is colored red (FIgure 29a), indicating high mobility. It is noticable, that especially parts, that were moving in the previous modes, are not predicted to move in mode 4. The residue fluctuations (Figure 29b) for mode 4 correlate well with the colored image given in Figure 29a.
 
   
'''Mode 5'''
+
'''Mode 11'''
 
{|
 
{|
|[[File: BCKDHA_oGNM_mode5.PNG|thumb|150px|Figure 30a: oGNM mode 5]]
+
|[[File: OGNM_mode11_visualization_BCKDHA.png|thumb|150px|Figure 30a: oGNM mode 11]]
|[[File: BCKDHA_tempSlowmodes_mode5.PNG|thumb|150px|Figure 30b: the cross-correlation between residue fluctuations for mode 5]]
+
|[[File: Mode11_fluctuation_BCKDHA.png|thumb|150px|Figure 30b: Residue fluctuations for mode 11]]
 
|}
 
|}
Mode 5 is similar to mode 2, where only the last part of the protein seems to be flexible as indicated by the red color in Figure 30a. When taking a closer look at the residue fluctuations for mode 5 (Figure 30b) it is obvious, that there are two separated peaks at the end of the protein, thus there are two centers of movement with a small stable part in between.
+
Mode 11 is more like mode 7 because it lacks a peak in the middle of the protein (see Figure 30b). In this mode there are two peaks at the beginning of the protein, indicating that this is a very flexible region. The rest of the protein is completely stable, as shown by the blue coloring (see Figure 30a) and the fact that there is no peak after position 24.
   
  +
Figure 31 shows the cross correlations between residue fluctuations for modes 1-5.
 
  +
{| align="center"
[[File: BCKDHA_oGNM_cc.jpg|thumb|center|300px|Figure 31: Cross correlation plot for modes 1-5]]
 
  +
|[[File: Mode7_mode11_fluctuation_BCKDHA.png|thumb|350px|Figure 31a: Fluctuations for mode 7-11]]
Fluctuation vectors in the same direction have values of +1 and are colored dark red indicating the motions are fully correlated. Fully anti-correlated motions are displayed in dark blue and are given by values of around -1. Figure 31 shows that the first 20-30 residues are correlated among themselves but are totally anti-correlated with the rest of the protein. The rest of the protein is more or less correlated well, with some parts in the middle that are anti-correlated.
 
  +
||[[File: Correlation_mode7_mode11_BCKDHA.jpg|thumb|300px|Figure 31b: Cross correlation plot for modes 7-11]]
  +
|}
  +
By comparing the fluctuations of all five modes with each other (see Figure 31a) we can see that, although the peaks are not all at exactly the same position, they are all at the beginning of the protein. All modes therefore predict this region to be flexible and the rest of the protein to be more or less stable. Figure 31b shows the cross correlations between residue fluctuations for modes 7-11. Fluctuation vectors in the same direction have values of +1 and are colored dark red, indicating that the motions are fully correlated. Fully anti-correlated motions are displayed in dark blue and have values of around -1. The light blue regions at the left and top border show that these regions are neither correlated nor anti-correlated. This can be explained by the results above, because the peaks at the beginning of the protein are never quite the same but vary a lot. These regions differ completely from the rest of the plot, which is partly dark red and partly dark blue. It is not possible to conclude from this plot which regions are flexible and which are stable, but we can see that the beginning of the protein sequence behaves differently from the rest of the protein.
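
How values between -1 and +1 arise in such a plot can be illustrated with a small Python sketch. It assumes scalar (GNM-style) mode vectors and weights each mode by the inverse of its eigenvalue; whether oGNM computes its map in exactly this way is an assumption, and the mode and eigenvalue below are invented.

```python
import math


def cross_correlation(modes, eigenvalues):
    """Normalized cross-correlation between residue fluctuations.

    modes: list of mode vectors (one scalar per residue).
    eigenvalues: matching non-zero eigenvalues; slower modes weigh more.
    """
    n = len(modes[0])
    cov = [[0.0] * n for _ in range(n)]
    for vector, eigenvalue in zip(modes, eigenvalues):
        for i in range(n):
            for j in range(n):
                cov[i][j] += vector[i] * vector[j] / eigenvalue
    corr = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            norm = math.sqrt(cov[i][i] * cov[j][j])
            # Residues that do not move at all get correlation 0.
            corr[i][j] = cov[i][j] / norm if norm > 0 else 0.0
    return corr


# One invented mode over three residues: the first two move together,
# the third moves in the opposite direction.
corr = cross_correlation([[0.5, 0.5, -1.0]], [2.0])
```

In this toy case the first two residues are fully correlated (+1) and both are fully anti-correlated (-1) with the third. Summing over several modes with different peak positions produces intermediate values, like the light blue border regions described above.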
   
 
=== Discussion ===
 
=== Discussion ===
   
The five first calculated modes from oGNM are very different. They differ in the part of the protein that is flexible as well as in the amount of movement. But as two modes predict a moving peptide sequence at the start of the protein and two modes predict some movement at the end of the protein sequence it is very likely that both the start and the end of the protein sequence are very flexible, while the rest of the protein is quite stable.
+
By comparing all five modes with each other we can see that they are all very similar. In all five cases the region at the beginning of the protein is predicted to be flexible. It is not completely clear which positions are more flexible and which are more stable, but overall the beginning of the protein is flexible and the rest is stable. This is shown very well in the plot where all fluctuations are compared with each other. Additionally, the cross correlation plot indicates that the beginning of the protein behaves differently from the rest of the protein.
   
 
One disadvantage of this server is that it doesn't provide output pdbs which could be used to generate animated gif-pictures. The color code and the residual fluctuation plots however are very clear and straightforward so it isn't hard to identify flexible and stable regions. It is, however, not possible to determine the way of movement from the still image.
 
One disadvantage of this server is that it doesn't provide output pdbs which could be used to generate animated gif-pictures. The color code and the residual fluctuation plots however are very clear and straightforward so it isn't hard to identify flexible and stable regions. It is, however, not possible to determine the way of movement from the still image.
Line 374: Line 387:
 
: For the average RMSD the default value (3.0) was used.
 
: For the average RMSD the default value (3.0) was used.
 
; Method to use
 
; Method to use
  +
* Automatic
 
  +
* Full matrix solver
** Automatic
 
** Full matrix solver
+
* Sparse matrix solver
** Sparse matrix solver
 
 
: Here we used the default option, the automatic mode.
 
: Here we used the default option, the automatic mode.
   
Line 387: Line 399:
   
 
{| border="1" style="text-align:center; border-spacing:0;"
 
{| border="1" style="text-align:center; border-spacing:0;"
|mode 1
+
|mode 7
|mode 2
+
|mode 8
|mode 3
+
|mode 9
|mode 4
+
|mode 10
|mode 5
+
|mode 11
 
|-
 
|-
|[[Image:BCKDHA_NOMAD_mode7.gif|thumb|150px|NOMAD normal mode 1]]
+
|[[Image:BCKDHA_NOMAD_mode7.gif|thumb|150px|Figure 32: NOMAD normal mode 7]]
|[[Image:BCKDHA_NOMAD_mode8.gif|thumb|150px|NOMAD normal mode 2]]
+
|[[Image:BCKDHA_NOMAD_mode8.gif|thumb|150px|Figure 33: NOMAD normal mode 8]]
|[[Image:BCKDHA_NOMAD_mode9.gif|thumb|150px|NOMAD normal mode 3]]
+
|[[Image:BCKDHA_NOMAD_mode9.gif|thumb|150px|Figure 34: NOMAD normal mode 9]]
|[[Image:BCKDHA_NOMAD_mode10.gif|thumb|150px|NOMAD normal 4]]
+
|[[Image:BCKDHA_NOMAD_mode10.gif|thumb|150px|Figure 35: NOMAD normal mode 10]]
|[[Image:BCKDHA_NOMAD_mode11.gif|thumb|150px|NOMAD normal 5]]
+
|[[Image:BCKDHA_NOMAD_mode11.gif|thumb|150px|Figure 36: NOMAD normal mode 11]]
 
|-
 
|-
|[[Image:BCKDHA_mode_7.png|thumb|150px|Amplitude of movement as rmsd per residue for mode 1]]
+
|[[Image:BCKDHA_mode_7.png|thumb|150px|Figure 37: Amplitude of movement as rmsd per residue for mode 7]]
|[[Image:BCKDHA_mode_8.png|thumb|150px|Amplitude of movement as rmsd per residue for mode 2]]
+
|[[Image:BCKDHA_mode_8.png|thumb|150px|Figure 38: Amplitude of movement as rmsd per residue for mode 8]]
|[[Image:BCKDHA_mode_9.png|thumb|150px|Amplitude of movement as rmsd per residue for mode 3]]
+
|[[Image:BCKDHA_mode_9.png|thumb|150px|Figure 39: Amplitude of movement as rmsd per residue for mode 9]]
|[[Image:BCKDHA_mode_10.png|thumb|150px|Amplitude of movement as rmsd per residue for mode 4]]
+
|[[Image:BCKDHA_mode_10.png|thumb|150px|Figure 40: Amplitude of movement as rmsd per residue for mode 10]]
|[[Image:BCKDHA_mode_11.png|thumb|150px|Amplitude of movement as rmsd per residue for mode 5]]
+
|[[Image:BCKDHA_mode_11.png|thumb|150px|Figure 41: Amplitude of movement as rmsd per residue for mode 11]]
 
|-
 
|-
|[[Image:BCKDHA_Mode7_network.png|thumb|150px|Elastic network for mode 1]]
+
|[[Image:BCKDHA_Mode7_network.png|thumb|150px|Figure 42: Elastic network for mode 7]]
|[[Image:BCKDHA_Mode8_network.png|thumb|150px|Elastic network for mode 2]]
+
|[[Image:BCKDHA_Mode8_network.png|thumb|150px|Figure 43: Elastic network for mode 8]]
|[[Image:BCKDHA_Mode9_network.png|thumb|150px|Elastic network for mode 3]]
+
|[[Image:BCKDHA_Mode9_network.png|thumb|150px|Figure 44: Elastic network for mode 9]]
|[[Image:BCKDHA_Mode10_network.png|thumb|150px|Elastic network for mode 4]]
+
|[[Image:BCKDHA_Mode10_network.png|thumb|150px|Figure 45: Elastic network for mode 10]]
|[[Image:BCKDHA_Mode11_network.png|thumb|150px|Elastic network for mode 5]]
+
|[[Image:BCKDHA_Mode11_network.png|thumb|150px|Figure 46: Elastic network for mode 11]]
 
|}
 
|}
   
 
===Discussion ===
 
===Discussion ===
  +
In general we can say that only one part of the protein shows motion. The five modes calculated by NOMAD-Ref all show the same kind of movement for the loop at the beginning of the protein (see Figures 32-36, green loop on the right). Figures 37-41 confirm this observation, as they all show a high amplitude of movement for the first atoms in the protein and only very small peaks for the rest of the protein. The only exception is mode 11, where a small peak at the end of the protein can be detected. This peak corresponds to the hinge movement of the end of the protein sequence (on the left side of the protein in the picture).
   
 
==All-atom NMA using Gromacs on the NOMAD-Ref server==
 
==All-atom NMA using Gromacs on the NOMAD-Ref server==
  +
===Background information ===
 
 
In order to do the all-atom NMA we needed an appropriate small protein containing no more than 2000 atoms. We found it by searching for "all atom nma", which led us to a paper<ref>Hetunandan Kamisetty, Eric P. Xing and Christopher J. Langmead: Free Energy Estimates of All-atom Protein Structures Using Generalized Belief Propagation[[http://www.cs.cmu.edu/~epxing/papers/recomb-hetu.pdf]]</ref> in which the structure of hen egg-white lysozyme was used for an all-atom NMA. We therefore did all calculations for the corresponding PDB entry 2LYZ.

First, we needed to prepare our PDB file. The PDB file for the 2LYZ protein contains 1001 atoms in total. All lines not beginning with "ATOM" were removed from the PDB file.
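The preparation step described above (keeping only the ATOM records) can be sketched as follows. This is a minimal illustration and not necessarily how the file was actually prepared; the function name <code>keep_atom_records</code> is hypothetical and the sample lines are made up for the example, not copied from 2LYZ.

```python
def keep_atom_records(pdb_lines):
    """Return only the lines whose record name is ATOM."""
    # In the PDB format the record name occupies the first six columns,
    # so HETATM, HEADER, END etc. are dropped by this comparison.
    return [line for line in pdb_lines if line[:6].strip() == "ATOM"]


# Made-up sample lines (illustrative only, not real 2LYZ coordinates).
sample = [
    "HEADER    HYDROLASE (O-GLYCOSYL)",
    "ATOM      1  N   LYS A   1       3.287  10.092  10.329  1.00  0.00           N",
    "HETATM 1002  O   HOH A 130       1.000   2.000   3.000  1.00  0.00           O",
    "ATOM      2  CA  LYS A   1       2.445  10.457   9.182  1.00  0.00           C",
    "END",
]
filtered = keep_atom_records(sample)
print(len(filtered))
```

Comparing the full six-column record name rather than using a simple prefix test keeps the intent explicit and makes it easy to add further record types later.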
  +
  +
=== Results ===
   
 
'''600 K'''
 
Line 430: Line 445:
 
|mode 3
 
|mode 3
 
|-
 
|-
|[[Image:2LYZ_600K_mode7.gif|thumb|150px|All atom normal mode 1 at 600K]]
+
|[[Image:2LYZ_600K_mode7.gif|thumb|150px|Figure 46: All atom normal mode 1 at 600K]]
|[[Image:2LYZ_600K_mode8.gif|thumb|150px|All atom normal mode 2 at 600K]]
+
|[[Image:2LYZ_600K_mode8.gif|thumb|150px|Figure 47: All atom normal mode 2 at 600K]]
|[[Image:2LYZ_600K_mode9.gif|thumb|150px|All atom normal mode 3 at 600K]]
+
|[[Image:2LYZ_600K_mode9.gif|thumb|150px|Figure 48: All atom normal mode 3 at 600K]]
 
|}
 
|}
   
Line 444: Line 459:
 
|mode 3
 
|mode 3
 
|-
 
|-
|[[Image:2LYZ_2000K_mode7.gif|thumb|150px|All atom normal mode 1 at 2000K]]
+
|[[Image:2LYZ_2000K_mode7.gif|thumb|150px|Figure 49: All atom normal mode 1 at 2000K]]
|[[Image:2LYZ_2000K_mode8.gif|thumb|150px|All atom normal mode 2 at 2000K]]
+
|[[Image:2LYZ_2000K_mode8.gif|thumb|150px|Figure 50: All atom normal mode 2 at 2000K]]
|[[Image:2LYZ_2000K_mode9.gif|thumb|150px|All atom normal mode 3 at 2000K]]
+
|[[Image:2LYZ_2000K_mode9.gif|thumb|150px|Figure 51: All atom normal mode 3 at 2000K]]
 
|}
 
|}
   
   
 
''' Comparison to an Elastic Network'''
 
  +
<!--
Question: should an elastic network be calculated for each of the modes 7, 8 and 9, or should one elastic network be calculated for the "normal" PDB file and then superimposed on the modes?

For now I have simply superimposed the previously calculated network on each mode (7, 8, 9).
-->
 
 
{| border="1" style="text-align:center; border-spacing:0;"
 
{| border="1" style="text-align:center; border-spacing:0;"
 
|mode 1
 
|mode 1
Line 462: Line 477:
 
|mode 3
 
|mode 3
 
|-
 
|-
|[[Image:2LYZ_mode7_network.png|thumb|150px|Elastic network and mode 1]]
+
|[[Image:2LYZ_mode7_network.png|thumb|150px|Figure 52: Elastic network and mode 1]]
|[[Image:2LYZ_mode8_network.png|thumb|150px|Elastic network and mode 2]]
+
|[[Image:2LYZ_mode8_network.png|thumb|150px|Figure 53: Elastic network and mode 2]]
|[[Image:2LYZ_mode9_network.png|thumb|150px|Elastic network and mode 3]]
+
|[[Image:2LYZ_mode9_network.png|thumb|150px|Figure 54: Elastic network and mode 3]]
 
|}
 
|}
  +
  +
=== Discussion ===
  +
The results of the all-atom NMA for 2LYZ all look very similar. The calculated modes show small motions of 2LYZ that resemble a "breathing" molecule. There is no apparent difference between the modes calculated at one temperature. Furthermore, the motion does not seem to depend on the temperature, as all modes calculated at 600K and 2000K look similar.
  +
  +
== Discussion ==
  +
  +
We applied five different methods to calculate normal modes for BCKDHA. All methods agree in the movements they report. Each method returned a large, hinge-like motion of the C-terminal loop region. Some calculations also show a twisting and wiggling motion of this region, and some other modes show a hinge movement of the N-terminal helix region. As all methods report that only the terminal regions of the protein are flexible, we conclude that these movements may be functionally important. They could be crucial for binding ligands or another protein. Since the branched-chain alpha-keto acid dehydrogenase is an enzyme complex consisting of two alpha-subunits (encoded by BCKDHA) and two beta-subunits (encoded by BCKDHB), one can assume that these movements might be necessary for assembling the protein complex. Figure 55 shows that the flexible regions (especially the C-terminal loop) are intertwined. This arrangement might be reached via the movements that the flexible C-terminal loop can perform, as we have seen before.
  +
  +
[[Image:1u5b_bio_r_500.jpg|thumb|center|300px|Figure 55: Crystal structure of the branched-chain alpha-keto acid dehydrogenase [http://www.pdb.org/pdb/explore/explore.do?structureId=1U5B]]]
   
 
== Advantages and Disadvantages from NMA and MD ==
 
   
  +
The basis of Molecular Dynamics (MD) is the numerical solution of Newton's equations of motion to yield a trajectory of atomic positions. With MD the molecular motion (including small and large structural fluctuations and conformational transitions) can be described realistically. MD can therefore be used to reveal structural changes in a protein at the atomic level. Moreover, the effect of the solvent can be taken into account. MD is limited by the approximate nature of the force fields and by the relatively short time scale (on the order of nanoseconds) that is computationally accessible. Visually, one can see that every atom of the protein is wobbling.
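To make the idea of integrating Newton's equations of motion concrete, here is a minimal sketch using the velocity Verlet scheme on a single one-dimensional harmonic oscillator. It is a toy example under stated assumptions: the function name <code>velocity_verlet</code>, the spring constant, the mass and the step size are all chosen for illustration and have nothing to do with the actual BCKDHA simulation.

```python
import math


def velocity_verlet(x, v, force, mass, dt, n_steps):
    """Integrate m * x'' = force(x) and return the list of positions."""
    trajectory = [x]
    a = force(x) / mass
    for _ in range(n_steps):
        # Position update from the current velocity and acceleration.
        x = x + v * dt + 0.5 * a * dt * dt
        # The new acceleration depends on the new position.
        a_new = force(x) / mass
        # Velocity update with the average of old and new acceleration.
        v = v + 0.5 * (a + a_new) * dt
        a = a_new
        trajectory.append(x)
    return trajectory


# Toy system: unit mass on a unit spring, started at x = 1 with zero velocity.
k = 1.0
mass = 1.0
period = 2 * math.pi * math.sqrt(mass / k)
n_steps = 1000
dt = period / n_steps
traj = velocity_verlet(1.0, 0.0, lambda x: -k * x, mass, dt, n_steps)
print(traj[n_steps // 2], traj[-1])
```

After half a period the particle is close to x = -1 and after a full period it is back close to x = 1. In a real MD simulation the same kind of update is applied to every atom with forces from the force field, which is why the accessible time scale is limited.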
  +
  +
NMA is a very powerful method to gain insight into the large-scale, shape-changing motions of proteins. The motion is modeled as a superposition of a set of independent harmonic oscillations about the equilibrium atomic positions. NMA uses the same force fields as molecular dynamics simulations, but the underlying idea is much simpler. It is assumed that the conformational energy surface at an energy minimum can be approximated by a parabola over the range of thermal fluctuations (an assumption that does not hold strictly at physiological temperatures). An NMA result usually shows large protein motions such as domain motions.
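The harmonic approximation described above can be written out as follows. The symbols are notation introduced here for illustration and do not appear in the original text: V is the potential energy, x_0 the minimum-energy coordinates, H the Hessian at the minimum, M the diagonal mass matrix, and u_k the k-th normal mode with angular frequency omega_k.

```latex
V(\mathbf{x}) \approx V(\mathbf{x}_0)
  + \tfrac{1}{2}\,(\mathbf{x}-\mathbf{x}_0)^{T} H \,(\mathbf{x}-\mathbf{x}_0),
\qquad
H_{ij} = \left.\frac{\partial^{2} V}{\partial x_i\,\partial x_j}\right|_{\mathbf{x}_0}

H\,\mathbf{u}_k = \omega_k^{2}\, M\, \mathbf{u}_k
```

The linear term is absent because the gradient vanishes at the energy minimum. For an isolated molecule six eigenvalues are zero (rigid-body translation and rotation), which is why the first non-trivial mode is usually numbered 7.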
  +
  +
Elastic Network Models are a special form of NMA in which the protein model is drastically simplified. First, all elastic connections are taken to be at their minimum-energy length in the input structure, so there is no need for energy minimization. Second, the number of atoms is reduced, as only the C-alpha atoms are used. This simplification is a major advantage in the diagonalization step of NMA. A disadvantage of some Elastic Network Models such as GNM is that they predict only the size of the fluctuations and not their direction, so the motions can no longer be displayed.
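As a rough sketch of how such a network is set up, the following example builds the connectivity (Kirchhoff) matrix used in GNM-type models from a list of C-alpha coordinates. The coordinates, the 7 Å cutoff and the function name <code>kirchhoff_matrix</code> are assumptions made for the example; the web servers used here may build their networks differently.

```python
import math


def kirchhoff_matrix(coords, cutoff):
    """Connectivity matrix of an elastic network over C-alpha positions.

    Off-diagonal entries are -1 for every pair closer than the cutoff,
    and each diagonal entry counts the springs attached to that node.
    """
    n = len(coords)
    gamma = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            if math.dist(coords[i], coords[j]) <= cutoff:
                gamma[i][j] = gamma[j][i] = -1
                gamma[i][i] += 1
                gamma[j][j] += 1
    return gamma


# Hypothetical C-alpha positions in Angstrom: three chain neighbours
# and one node that is too far away to be connected.
ca = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (30.0, 0.0, 0.0)]
gamma = kirchhoff_matrix(ca, cutoff=7.0)
for row in gamma:
    print(row)
```

Because the input structure itself defines the rest lengths of all springs, no energy minimization is needed before the matrix is diagonalized.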
  +
  +
Compared to the very detailed protein motions that are calculated by Molecular Dynamics simulations, NMA shows only the large-amplitude motions of the protein. A great advantage of NMA is its speed. As only motions of large parts of the protein are calculated, it is much faster than MD. Furthermore, no sampling problem can arise. The advantage of MD, however, is its detailed insight into protein motion.<ref>Pnina Dauber-Osguthorpe, David J. Osguthorpe, Peter S. Stern, and John Moult: Low Frequency Motion in Proteins - Comparison of Normal Mode and Molecular Dynamics of Streptomyces Griseus Protease, Journal of Computational Physics 151, 169–189 (1999) A</ref>,<ref>Steven Hayward and Bert L. de Groot: Normal Modes and Essential Dynamics, in: Methods in Molecular Biology, vol. 443, Molecular Modeling of Proteins, Humana Press, Totowa, NJ</ref>,<ref>http://www.pasteur.fr/recherche/unites/Binfs/embo2008/speakers/hinsen/slides_normal_modes.pdf</ref>
  +
  +
== Comparison of the lowest-frequency normal modes with the MD simulation ==
  +
  +
{|
!WEBnm@
!ElNemo
!ANM
!oGNM
!NOMAD-Ref
|-
|[[File:Mod11_BCKDHA.gif|thumb|200px|Figure 56a: WEBnm@: mode 11]]
||[[File:Mod7_1_BCKDHA.gif|thumb|150px|Figure 56b: ElNemo: mode 7]]
||[[File:BCKDHA_ANM_mode7.gif|thumb|200px|Figure 56c: ANM: mode 7]]
||[[File:OGNM_mode8_visualization_BCKDHA.png|thumb|253px|Figure 56d: oGNM: mode 8]]
||[[Image:BCKDHA_NOMAD_mode7.gif|thumb|200px|Figure 56e: NOMAD-Ref: mode 7]]
|}
<br><br>
{|align="center"
! MD simulation of the movement of BCKDHA
|-
|[[File:BCKDHA_MD_Animation.gif|thumb|center|400px|Figure 57: MD simulation of the movement of BCKDHA]]
|}
  +
  +
By comparing the normal modes (Figures 56a-e) with the MD simulation (Figure 57) we can see that the MD model is much more detailed than the normal modes, as it shows the motion of the side chains and not only that of domains or secondary structure elements. Additionally, the protein simulated by MD moves much more through the whole simulation cell and does not seem to be fixed at one position.<br>
  +
When we compare the MD simulation with mode 7 of WEBnm@, we can observe that both show a very similar motion of the end of the protein (right side in MD and left side in WEBnm@). However, only the general movement is similar, since the MD simulation shows a lot of flexibility inside the structure which is not shown in the WEBnm@ model. There is also motion at the other end of the protein, which is displayed by both tools but as two different kinds of movement. In the MD simulation the red colored end of the protein seems to fold away from the protein and return to it again. In WEBnm@, by contrast, the end always seems to stay a bit away from the rest of the protein and just sways slightly up and down.<br>
  +
Mode 7 of ElNemo has the same problem as WEBnm@. It also shows motion at the end of the protein (left side), but again this motion is displayed in a very simplified way. The end of the protein seems to bob, but as the visualisation of the MD simulation shows, this is just a rough estimate of the movement. The more detailed motion is very flexible and goes in all directions. One very interesting observation in Figure 56b is a small part in the center of the protein which is very flexible and bobs up and down. This is not shown in the MD simulation or in any other normal mode, so we are not sure whether this is really a flexible part of the protein. The next difference between the ElNemo model and the protein of the MD simulation is that the model shows no motion of the end of the protein, while there is a lot of movement in the MD simulation given in Figure 57.<br>
  +
Comparing normal mode 7 produced by ANM with the MD simulation, we again see a bobbing part at the end of the protein, as in the WEBnm@ mode, which differs from the motion at the end of the protein in the MD simulation. Like the end, the beginning of the protein in mode 7 only goes up and down and does not move in any other direction, so this part is much less flexible than in the MD simulation.<br>
  +
The normal modes of oGNM are completely different from the MD simulation. This program returns no visualization of a protein in motion but only shows which parts of the protein are flexible. So we cannot say whether the parts move in the same direction or not, but we can see whether the flexible parts are the same. Both proteins are flexible at one end (right side in both cases), and since the oGNM model shows many red parts there, some of them deep red, this end seems to be very flexible. But this is the only similarity between the two. In the MD simulation there is also motion at the other end of the protein, which is not the case in mode 8 of oGNM. In contrast, the light pink parts in the center of the protein indicate motion in these regions which does not occur in the MD simulation.<br>
  +
The NOMAD-Ref model again shows many differences from the protein of the MD simulation. The whole protein seems to be fixed except for the end of the protein on the right side of Figure 56e. In contrast, the visualisation of the MD simulation shows that the whole protein moves around a bit in the cell and that there is motion at both ends of the protein. Comparing the movement of the two ends which are predicted to be flexible by both programs, we can say that mode 7 of NOMAD-Ref is more similar to the MD simulation than the other NMA tools, since there is more motion and flexibility in the structures themselves. The end does not only go up and down but moves in all directions, which is more like the simulated protein. Of course it does not show as much motion as the protein of the MD simulation, since the moving side chains are hidden. <br>
  +
All in all, the general information given by the tools, namely that there is motion only at the ends of the protein, is mostly the same, apart from some cases where additional movements in the center of the protein are predicted. But the detailed information about the movement of the protein, for example its direction, differs completely between the MD simulation and the NMA.
   
 
== References ==
 

Latest revision as of 09:40, 29 September 2011

Introduction

Normal Mode Analysis (NMA) is a powerful tool to examine large global motions of proteins. A normal mode of an oscillating system is a pattern of motion in which all parts of the system move with the same frequency and in phase. Proteins can be modeled as harmonic oscillating systems. NMA methods calculate low-frequency modes for a protein which correspond to collective motions of the complete protein. In this task we applied several NMA methods to our protein BCKDHA and performed an all-atom NMA with a small protein (2LYZ).

WEBnm@

Background information

WEBnm@<ref>http://apps.cbu.uib.no/webnma/home</ref> provides two different modes:

Single Analysis:

The Single Analysis calculates the lowest frequency normal modes of the given protein and offers different types of calculations to analyse the modes that were calculated. The force field used for the normal mode calculations is the C-alpha force field. It uses only the C-alpha atoms of the protein, which are assigned the masses of the whole residue they represent.
The different types of calculation are:

  • deformation energies of each mode
  • calculation of normalized squared atomic displacements (results are provided for each low frequency mode, either as raw data or as plots with displacement vs. residue number)
  • interactive visualization of the modes using vector field representation or vibrations

Comparative Analysis (beta version):

The Comparative Analysis calculates and compares the normal modes of a set of aligned protein structures. This tool is still under development. It also provides three types of calculations:

  • Deformation Energy profiles
  • Atomic Fluctuation profiles
  • Conformational Overlap Comparison

Input:

  • Single Analysis: structure file in the pdb format
  • Comparative Analysis: a file containing the sequence alignment of the proteins which should be compared and a protein structure file for each of the proteins. The alignment file needs to be written in the Fasta format, and the header line of each sequence should contain the name of the structure file as first field, and the chain in the last field.

Results

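Below are the values of the deformation energy for modes 7 to 20:

{| border="1" style="text-align:center; border-spacing:0;"
!Mode Index
!Deformation Energy
|-
|7 || 292.36
|-
|8 || 401.29
|-
|9 || 603.95
|-
|10 || 757.28
|-
|11 || 848.99
|-
|12 || 989.93
|-
|13 || 1745.19
|-
|14 || 2675.54
|-
|15 || 2999.49
|-
|16 || 3341.82
|-
|17 || 3572.19
|-
|18 || 3685.84
|-
|19 || 4103.34
|-
|20 || 4925.43
|}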


WEBnm@ visualised the normalized squared atomic displacements for the first five modes (modes 7 to 11).
Figures 1-5 display the first five normal modes of our protein. Figures 6-10 show the square of the displacement of each C-alpha atom, normalized so that the sum over all residues equals 100. The highest values correspond to the most displaced regions. Clusters of peaks identify significantly large regions. Isolated peaks reflect local flexibility and are not relevant.
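The normalization described above can be sketched in a few lines. This is only an illustration of the arithmetic, assuming each mode is given as one displacement vector per C-alpha atom; the function name <code>normalized_squared_displacements</code> and the example vectors are hypothetical and not WEBnm@ output.

```python
def normalized_squared_displacements(mode_vectors):
    """Squared displacement per residue, scaled so that the sum is 100."""
    squared = [sum(c * c for c in vec) for vec in mode_vectors]
    total = sum(squared)
    return [100.0 * s / total for s in squared]


# Hypothetical displacement vectors (x, y, z) for three C-alpha atoms.
mode = [(0.3, 0.0, 0.4), (0.0, 0.1, 0.0), (0.0, 0.0, 0.0)]
profile = normalized_squared_displacements(mode)
print(profile)
```

A residue with a high value contributes a large share of the total motion in that mode, which is how the peaks in the displacement plots are read.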


mode 7 mode 8 mode 9 mode 10 mode 11
Figure 1: normalized squared atomic displacement for mode 7
Figure 2: normalized squared atomic displacement for mode 8
Figure 3: normalized squared atomic displacement for mode 9
Figure 4: normalized squared atomic displacement for mode 10
Figure 5: normalized squared atomic displacement for mode 11
Figure 6: normalized squared atomic displacement for mode 7
Figure 7: normalized squared atomic displacement for mode 8
Figure 8: normalized squared atomic displacement for mode 9
Figure 9: normalized squared atomic displacement for mode 10
Figure 10: normalized squared atomic displacement for mode 11

Discussion

The normal modes calculated by WEBnm@ differ in where the amplitude of movement is largest. While modes 7 and 9-11 show the highest peak, and therefore the most movement, for residues 0-25, mode 8 has its highest peak for the last 40 residues in the sequence.

The normal modes calculated by WEBnm@ show that the most displaced regions of the branched-chain alpha-keto acid dehydrogenase complex are the beginning and the end of the protein sequence. The two ends of the protein sequence, which are also the outermost parts of the protein structure, show a kind of hinge movement. The protein motion could be described as an opening and closing of the complex.

ElNemo

Background information

Input

The input for ElNemo is a protein structure in PDB format. From this PDB file only the residues stored in ATOM records are used in the calculations. All other residues are not taken into account. If other residues should be used in the calculations, they have to be stored as ATOM records as well. Additionally, a number of options can be chosen.

Output

  • Properties of the first 100 lowest frequency modes (frequency, collectivity of atom movement, overlap of each mode with the observed conformational change (if two conformations are available) and its corresponding amplitude)
  • 3D animations from three orthogonal viewpoints in large and small sizes
  • Optional comparison of a normal mode perturbed structure with a second conformation in terms of RMSD and the number of residues that are closer than 3Å
  • Cross plot where the analysis of distance fluctuations between all CA atoms is shown. Red (decreasing) and blue dots (increasing) indicate the residues for which the distance changes significantly in movement. (The upper left corner indicates the first residue. Grey lines are drawn every 10 residues, yellow lines are drawn every 100 residues.)

References

  • ElNémo Webserver<ref>http://www.igs.cnrs-mrs.fr/elnemo/start.html</ref>
  • K. Suhre and Y.-H. Sanejouand, ElNémo: a normal mode web server for protein movement analysis and the generation of templates for molecular replacement<ref>Karsten Suhre and Yves-Henri Sanejouand, ElNémo: a normal mode web server for protein movement analysis and the generation of templates for molecular replacement, Nucl. Acids Res, 2004</ref>

Results

CA distance fluctuations for the five modes

mode 7 mode 8 mode 9 mode 10 mode 11
Figure 11: CA distance fluctuations for mode 7
Figure 12: CA distance fluctuations for mode 8
Figure 13: CA distance fluctuations for mode 9
Figure 14: CA distance fluctuations for mode 10
Figure 15: CA distance fluctuations for mode 11

Figures 11-14 show that the greatest distance fluctuations are between the first 10-20 amino acids and the rest of the protein (residues 50-400). While mode 7 shows only distance decreases, mode 8 shows almost only increasing distances between the first ~20 residues and the rest of the protein. The cross plots for modes 9 and 10 (figures 13 and 14) show strong distance fluctuations (decreases for residues 1-10 and increases for residues 10-20) between the first 20 residues and the rest of the protein. Mode 11, displayed in figure 15, shows completely different distance fluctuations. Here the highest distance fluctuations are between the last 40 residues and the rest of the protein, and there are both increasing and decreasing distances. This very different cross plot suggests that mode 11 differs considerably from the other normal modes. It is very likely that here the last part of the protein shows the greatest movement.


ElNemo prepared views from three orthogonal viewpoints with MolScript for each mode.

Mode 7:

Figure 16a: view 1 of mode 7
Figure 16b: view 2 of mode 7
Figure 16c: view 3 of mode 7

The mode shown in figure 16 agrees with the distance fluctuation seen in figure 11. The very beginning of the peptide chain moves away from the rest of the protein. It looks like a hinge-movement.

Mode 8:

Figure 17a: view 1 of mode 8
Figure 17b: view 2 of mode 8
Figure 17c: view 3 of mode 8

The mode displayed in figure 17 shows that the beginning of the peptide sequence moves towards the protein. This observation can be confirmed when looking at the cross plot given in figure 12, where the decreasing distance for the first residues is given by blue dots. This mode shows another hinge-movement.

Mode 9:

Figure 18a: view 1 of mode 9
Figure 18b: view 2 of mode 9
Figure 18c: view 3 of mode 9

As seen in the distance fluctuations plot (figure 13), the distances for the first residues in the peptide chain vary: some are decreasing and some are increasing. This can be explained by a twisting peptide sequence, where some residues come closer to the protein core and others move apart.

Mode 10:

Figure 19a: view 1 of mode 10
Figure 19b: view 2 of mode 10
Figure 19c: view 3 of mode 10

Mode 10 behaves similarly to mode 9, except that the beginning of the protein chain does not seem to be twisting but to be pulled in and out. This observation agrees with the increasing and decreasing distances shown in figure 14.

Mode 11:

Figure 20a: view 1 of mode 11
Figure 20b: view 2 of mode 11
Figure 20c: view 3 of mode 11

Figure 20 shows a hinge movement of the last part of the protein sequence. The helix part shown in red moves towards and away from the protein core, which is also displayed in figure 15.


Discussion

ElNemo calculated very different normal modes. Some of the normal modes show a kind of hinge movement at one end of the protein, while another mode shows movement of the other end. All in all, our protein seems to be flexible only at its outermost parts, while the core of the protein is very stable.

Anisotropic Network Model web server

Background information

The ANM Webserver<ref>http://ignmtest.ccbb.pitt.edu/cgi-bin/anm/anm1.cgi</ref> provides NMA with the anisotropic network model (ANM), which is an elastic network (EN) model.

Input:

  • PDB id or PDB file
  • Chain id
  • Model (for multi-model files such as from NMR)
  • Cutoff for interaction between Cα atoms in Å (set to 15Å)
  • Distance weight for interaction between Cα atoms (set to 3.0)

Output: The ANM Webserver offers a broad range of output files to analyse the computed normal modes more precisely. On the main page you can visualize the first 20 calculated modes. It is possible to scale the amplitude and frequency of motion and to display vectors and the protein in different ways and colors. Furthermore, the following options are available:

  • Download files
  • Create PDB (motion)
  • Create PyMol script
  • Get anisotropic temp. factors
  • B-factors/mode fluctuations
  • Eigenvalues
  • Correlations
  • Distance fluctuations and deformation energy

In the following we show the movements, the distance fluctuations, the deformation energies per position and the B-factors for each mode.

Results

As the first 5 modes could not be displayed via PyMOL, we will analyse modes 6-10 in the following section.


Mode 6

Figure 21a: ANM mode 6
Figure 21b: Distribution of the B-factors for mode 6
Figure 21c: Distance matrix mode 6
Figure 21d: Deformation energy for mode 6

Overall energy: 108.50491155

Mode 6 shows two centers of movement. First, the beginning of the peptide sequence is twisting and turning, as shown in Figure 21a. These fluctuations can also be seen in the fluctuations of individual residues according to the experimental B-factors (Figure 21b). The distance matrix (Figure 21c) shows that most of the residues in the protein are correlated (red); only the residues at the beginning of the peptide chain are anti-correlated (blue). The white zones indicate weak correlations. So, second, besides the strongly anti-correlated beginning of the peptide chain, there is a part at the end of the protein that seems to be only weakly correlated. This can also be observed in Figure 21a, where another hinge movement can be detected on the right.

Mode 7

Figure 22a: ANM mode 7
Figure 22b: Distribution of the B-factors for mode 7
Figure 22c: Distance matrix mode 7
Figure 22d: Deformation energy for mode 7

Overall energy: 105.33659403

Mode 7 shows twisting movements at both ends of the protein (see Figure 22a). The distribution of b-factors (Figure 22b) and the correlation matrix (Figure 22c) agree with this observation.

Mode 8

Figure 23a: ANM mode 8
Figure 23b: Distribution of the B-factors for mode 8
Figure 23c: Distance matrix mode 8
Figure 23d: Deformation energy for mode 8

Overall energy: 114.7349691

The movements in mode 8 (Figure 23a) seem to be very similar to those in ANM mode 7. But looking at the arrows in Figure 23a one can see that the movement goes in exactly the opposite direction. The fluctuations per residue for mode 8 (Figure 23b) also show a much higher amplitude than for mode 7.

Mode 9

Figure 24a: ANM mode 9
Figure 24b: Distribution of the B-factors for mode 9
Figure 24c: Distance matrix mode 9
Figure 24d: Deformation energy for mode 9

Overall energy: 187.72640385

Mode 9 displays a small turning movement of the beginning of the peptide sequence (see Figure 24a). This observation can be confirmed when looking at the fluctuations of single residues (Figure 24b), where a peak exists only for the residues 6-30, and at the correlation matrix (Figure 24c), which shows a highly anti-correlated region for the beginning of the peptide sequence. Again, as in mode 6, the end part of the protein shows some evidence of movement, too, as the correlation here is very weak.

Mode 10

Figure 25a: ANM mode 10
Figure 25b: Distribution of the B-factors for mode 10
Figure 25c: Distance matrix mode 10
Figure 25d: Deformation energy for mode 10

Overall energy: 183.87695037

ANM mode 10 shows the most movement of the protein. The whole protein seems to be turning and both ends are moving like hinges (see Figure 25a). These strong movements can also be detected in the many small peaks in the B-factors (Figure 25b) and in the small zones of weak and no correlation in the distance matrix (Figure 25c).

Discussion

Almost all modes calculated by ANM and discussed above agree on the following point: the flexible regions of our protein are the beginning and the end of the protein sequence. Only mode 10 differs from this observation, as many more motions all over the protein are visible. These motions, however, are still very weak compared to the twisting and wiggling sequence ends.

oGNM – Gaussian network model

Background information

The oGNM Webserver<ref>http://ignm.ccbb.pitt.edu/Online_GNM.htm</ref> calculates the equilibrium dynamics of any structure submitted in PDB format, using the Gaussian Network Model (GNM).

Input:

  • PDB id or PDB file
  • No. of nodes to represent a nucleotide (1 or 3)
  • Cutoff for amino acid pairs
  • Cutoff for nucleotide pairs
  • Preferred visualization engine (Jmol or Chime)

Output: The oGNM Webserver provides a comprehensive overview of the first 20 calculated normal modes. It is possible to display the slow modes, slow eigenvectors, slow average, slow av1-3 and RMSD of two modes side-by-side. The output includes:

  • The mobility profiles of residues corresponding to the 20 slowest modes of motion predicted by the GNM
  • The average profile resulting from the first 2 slowest modes
  • The associated eigenvalues (21 of them, including the zero eigenvalue)
  • The predicted and experimental B-factors, and the correlation coefficient between the two sets of B-factors
  • The spring constant (g) in units of kcal/(mol·Å²)
  • The cross-correlation between residue fluctuations, plotted as a correlation map (for structures containing less than 2000 nodes)
  • The nodes included in the GNM analysis, summarized in the .ca file
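
The correlation coefficient between predicted and experimental B-factors mentioned in the list above is commonly a Pearson correlation. The following sketch shows how such a coefficient can be computed; that oGNM uses exactly this definition is an assumption, and the function name <code>pearson</code> and the B-factor values are made up for the example.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    norm_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    norm_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (norm_x * norm_y)


# Made-up B-factors for four residues; the "experimental" values are
# simply twice the "predicted" ones, so the correlation is perfect.
predicted = [10.0, 12.0, 30.0, 11.0]
experimental = [20.0, 24.0, 60.0, 22.0]
r = pearson(predicted, experimental)
print(r)
```

Because the coefficient ignores the overall scale, it measures only whether the predicted profile has its peaks in the same places as the experimental one.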

Results

In the following section we are going to discuss the 5 lowest frequency modes (after the first six zero modes) calculated by oGNM. The following figures show the mobility of the protein for each computed normal mode, colored from blue to red in the order of increasing mobilities, as well as the fluctuations per residue.

Mode 7

Figure 26a: oGNM mode 7
Figure 26b: Residue fluctuations for mode 7

oGNM mode 7 shows a mobile part at one end of the protein (Figure 26a). This mobility is also visible in the fluctuations per residue (Figure 26b), where a peak for residues 18-23 indicates high flexibility, while the rest of the protein seems to be very stable.

Mode 8

Figure 27a: oGNM mode 8
Figure 27b: Residue fluctuations for mode 8

As in mode 7, there are peaks at the beginning of the protein sequence in mode 8 (see Figure 27b). But in contrast to mode 7 there are three peaks, indicating three separate centers of movement with stable parts in between. In addition to these flexible parts there are some peaks in the middle of the protein, but these are very low, so these parts seem to be quite stable as well. This is also shown in the visualization of the protein, where the parts in the middle are only light pink (see Figure 27a), which indicates that they are probably stable.

Mode 9

Figure 28a: oGNM mode 9
Figure 28b: Residue fluctuations for mode 9

Mode 9 gives the same picture as the modes before (see Figure 28a). There are two main peaks between positions 16 and 23 and a very small peak at position 10 (see Figure 28b), suggesting that there are two flexible regions at the beginning. The very low peak at the beginning and the one at position 300 can be neglected, since they are so low that we cannot be sure about the flexibility. The rest of the protein is predicted to be stable.

Mode 10

Figure 29a: oGNM mode 10
Figure 29b: Residue fluctuations for mode 10

In mode 10 the most flexible part is at the very beginning of the protein at position 10, and the slightly lower peaks are between positions 16 and 23 (see Figure 29b). Although this is exactly the other way around than in mode 9, both modes point out that these regions are flexible. This is visualized by the red coloring of these parts of the protein (see Figure 29a). As in modes 8 and 9 there is a low peak around position 300. Since this peak appears in all three modes we have to take it into account, but in all three modes it is very low, so it is probably a region with only a little flexibility.

Mode 11

Figure 30a: oGNM mode 11
Figure 30b: Residue fluctuations for mode 11

Mode 11 is more like mode 7, as it lacks a peak in the middle of the protein (see Figure 30b). In this mode there are two peaks at the beginning of the protein, indicating that this is a very flexible region. The rest of the protein is completely stable, as shown by the blue coloring (see Figure 30a) and the fact that there is no peak after position 24.


Figure 31a: Fluctuations for mode 7-11
Figure 31b: Cross correlation plot for modes 7-11

By comparing the fluctuations of all five modes with each other (see Figure 31a) we can see that, although the peaks are not all at exactly the same position, they are all at the beginning of the protein. All modes therefore predict this region to be flexible and the rest of the protein to be more or less stable. Figure 31b shows the cross correlations between residue fluctuations for modes 7-11. Fluctuation vectors in the same direction have values of +1 and are colored dark red, indicating that the motions are fully correlated. Fully anti-correlated motions are displayed in dark blue and have values of around -1. The light blue regions on the left and top borders show that these regions are neither correlated nor anti-correlated. This can be explained by the results above, because the peaks at the beginning of the protein are never really the same but vary a lot. These regions differ completely from the rest of the plot, which is partly dark red or dark blue. It is not possible to conclude from this plot which regions are flexible and which are stable, but we can see that the beginning of the protein sequence behaves differently from the rest of the protein.

Discussion

By comparing all five modes with each other we can see that they are all very similar. In all five cases the region at the beginning of the protein is predicted to be flexible. It is not completely clear which positions are more flexible and which are more stable, but overall the beginning of the protein is flexible and the rest is stable. This is shown very clearly in the plot in which all fluctuations are compared with each other. Additionally, the cross correlation plot indicates that the beginning of the protein behaves differently from the rest of the protein.

One disadvantage of this server is that it does not provide output PDB files which could be used to generate animated GIF images. The color code and the residue fluctuation plots, however, are very clear and straightforward, so it is not hard to identify flexible and stable regions. It is, however, not possible to determine the direction of movement from the still image.

NOMAD-Ref

Background information

The NOMAD <ref>[[2]]</ref> server provides a lot of information and options. The interface is quite user friendly, as all available parameter choices are explained in detail. The runtime of an example NMA is also listed, which can be used to estimate the runtime of our own jobs.

Input:

The following parameters can be set:

Number of modes to calculate
As specified in the task description we wanted to obtain 10 modes. NOMAD also returns the six zero-frequency modes, which are just rigid-body translations and rotations. Therefore we set the number of modes to calculate to 16.
Distance weight parameter
This parameter is used to introduce a smoother cutoff than in the original Tirion model. All distances are weighted by exp(-(d_ij/d)^2), where d_ij is the distance between atoms i and j and d is the distance weight parameter. As proposed by NOMAD, a distance weight parameter of 3Å is well suited for CA-only models. As we are not doing an all-atom calculation, the distance weight parameter was set to 3.0Å.
Cutoff to use for mode calculation
The cutoff determines which pairs of atoms are linked by a spring of universal strength according to the Tirion model (Elastic Network Model). The cutoff was set to 15Å.
Average Rmsd in output trajectories
For the average RMSD the default value (3.0) was used.
Method to use
  • Automatic
  • Full matrix solver
  • Sparse matrix solver
Here we used the default option, the automatic mode.
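The parameter choices above can be summarized in a short sketch. It is only an illustration under stated assumptions: the constant and function names are hypothetical, and the way the exponential weight is combined with the hard cutoff is our assumption, not taken from the NOMAD-Ref source code.

```python
import numpy as np

N_ZERO_MODES = 6        # rigid-body translations and rotations
N_WANTED_MODES = 10     # modes requested in the task description
DISTANCE_WEIGHT = 3.0   # d in exp(-(d_ij/d)^2), in Angstrom
CUTOFF = 15.0           # interaction cutoff, in Angstrom

def modes_to_request(n_wanted=N_WANTED_MODES, n_zero=N_ZERO_MODES):
    """Number of modes to ask for so that n_wanted non-zero modes remain."""
    return n_wanted + n_zero

def spring_weights(coords, d=DISTANCE_WEIGHT, cutoff=CUTOFF):
    """Pairwise weights exp(-(d_ij/d)^2), set to zero beyond the cutoff."""
    coords = np.asarray(coords, dtype=float)
    dist = np.linalg.norm(coords[:, None, :] - coords[None, :, :], axis=-1)
    weights = np.exp(-(dist / d) ** 2)
    weights[dist > cutoff] = 0.0      # assumed: no spring beyond the cutoff
    np.fill_diagonal(weights, 0.0)    # no self-interaction
    return weights

# Three toy atoms on a line at 0, 3 and 20 Angstrom (hypothetical data).
example_weights = spring_weights([[0.0, 0.0, 0.0], [3.0, 0.0, 0.0], [20.0, 0.0, 0.0]])
```

With these settings, two atoms that are exactly 3Å apart get a weight of exp(-1), and pairs further apart than 15Å do not interact at all.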

Output:

The output contains one PDB file and one plot per mode. The plot shows the RMSD per residue, which can be interpreted as the amplitude of movement and which is controlled by the average RMSD of the output trajectories (an input parameter).
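The relation between a mode vector and such a per-residue amplitude plot can be sketched as follows. This is an assumption about how the scaling might work, not a description of NOMAD-Ref internals: the function name `per_residue_amplitude` is hypothetical, and we simply rescale one mode so that its overall RMSD equals the chosen average RMSD.

```python
import numpy as np

def per_residue_amplitude(mode, target_rmsd=3.0):
    """Scale one mode to a given overall RMSD and return per-residue displacements."""
    # One 3D displacement vector per residue (CA-only model assumed).
    vec = np.asarray(mode, dtype=float).reshape(-1, 3)
    per_residue = np.linalg.norm(vec, axis=1)
    overall = np.sqrt(np.mean(per_residue ** 2))
    return per_residue * (target_rmsd / overall)

# Toy mode for four residues (hypothetical numbers).
toy_amplitudes = per_residue_amplitude(np.arange(1.0, 13.0))
```

Plotting `toy_amplitudes` against the residue index would give a curve of the same kind as Figures 37-41, with the largest values at the most mobile residues.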

Results

mode 7 mode 8 mode 9 mode 10 mode 11
Figure 32: NOMAD normal mode 7
Figure 33: NOMAD normal mode 8
Figure 34: NOMAD normal mode 9
Figure 35: NOMAD normal mode 10
Figure 36: NOMAD normal mode 11
Figure 37: Amplitude of movement as rmsd per residue for mode 7
Figure 38: Amplitude of movement as rmsd per residue for mode 8
Figure 39: Amplitude of movement as rmsd per residue for mode 9
Figure 40: Amplitude of movement as rmsd per residue for mode 10
Figure 41: Amplitude of movement as rmsd per residue for mode 11
Figure 42: Elastic network for mode 7
Figure 43: Elastic network for mode 8
Figure 44: Elastic network for mode 9
Figure 45: Elastic network for mode 10
Figure 46: Elastic network for mode 11

Discussion

In general we can say that only one part of the protein shows motion. The five modes calculated by NOMAD-Ref all show the same kind of movement for the loop at the beginning of the protein (see Figures 32-36, green loop on the right). Figures 37-41 confirm this observation, as they all show a high amplitude of movement for the first atoms in the protein and only very small peaks for the rest of the protein. The only exception is mode 11, where a small peak at the end of the protein can be detected. This peak corresponds to the hinge movement of the end of the protein sequence (on the left side of the protein in the picture).

All-atom NMA using Gromacs on the NOMAD-Ref server

Background information

In order to do the all-atom NMA we needed an appropriate small molecule that contained no more than 2000 atoms. We found a suitable small protein by searching for "all atom nma", which led us to a paper <ref>Hetunandan Kamisetty, Eric P. Xing and Christopher J. Langmead: Free Energy Estimates of All-atom Protein Structures Using Generalized Belief Propagation[[3]]</ref> in which the structure of hen egg-white lysozyme was used for an all-atom NMA. We therefore did all the calculations for the corresponding PDB entry 2LYZ.

First, we needed to prepare our PDB file. The PDB file for the 2LYZ protein contains 1001 atoms in total. All lines not beginning with "ATOM" were removed from the PDB file.
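This preparation step can be reproduced with a few lines of Python. The sketch below is one possible way to do it, not necessarily how the file was prepared originally; the function names and the file names in the usage note are hypothetical.

```python
def keep_atom_records(lines):
    """Return only the PDB lines that start with the ATOM record name."""
    # "HETATM" lines start with "HETA..." and are therefore dropped as well.
    return [line for line in lines if line.startswith("ATOM")]

def clean_pdb(in_path, out_path):
    """Write a copy of a PDB file that contains only ATOM records."""
    with open(in_path) as handle:
        atom_lines = keep_atom_records(handle)
    with open(out_path, "w") as handle:
        handle.writelines(atom_lines)
    return len(atom_lines)

# Small in-memory example (hypothetical records).
sample = [
    "HEADER    HYDROLASE\n",
    "ATOM      1  N   LYS A   1       3.287  10.092  10.329  1.00  5.89           N\n",
    "HETATM 1002  O   HOH A 130       1.000   2.000   3.000  1.00 20.00           O\n",
    "END\n",
]
kept = keep_atom_records(sample)
```

A call such as `clean_pdb("2lyz.pdb", "2lyz_atoms.pdb")` returns the number of ATOM lines written, which can be compared with the atom count stated above.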

Results

600 K

The following movies show the all-atom NMA for 2LYZ at 600K

mode 1 mode 2 mode 3
Figure 46: All atom normal mode 1 at 600K
Figure 47: All atom normal mode 2 at 600K
Figure 48: All atom normal mode 3 at 600K

2000 K

The following movies show the all-atom NMA for 2LYZ at 2000K

mode 1 mode 2 mode 3
Figure 49: All atom normal mode 1 at 2000K
Figure 50: All atom normal mode 2 at 2000K
Figure 51: All atom normal mode 3 at 2000K


Comparison to an Elastic Network

mode 1 mode 2 mode 3
Figure 52: Elastic network and mode 1
Figure 53: Elastic network and mode 2
Figure 54: Elastic network and mode 3

Discussion

The results of the all-atom NMA for 2LYZ all look very similar. The calculated modes show small motions of 2LYZ that look like a "breathing" molecule. There is no apparent difference between the modes calculated at one temperature. Furthermore, the motion does not seem to depend on the temperature, as all modes calculated at 600K and 2000K look similar.

Discussion

We have applied five different methods to calculate normal modes for BCKDHA. All methods agree in their reported movements. Each method returned a large, hinge-like motion of the C-terminal loop region. Some calculations also show a twisting and wiggling motion of this region. Some other modes show a hinge movement of the N-terminal helix region. As all methods reported that only the terminal regions of the protein are flexible, we conclude that these movements are functionally important. They could possibly be crucial for binding ligands or another protein. Knowing that the branched-chain alpha-keto acid dehydrogenase is an enzyme complex consisting of two alpha-subunits (encoded by BCKDHA) and two beta-subunits (encoded by BCKDHB), one can assume that these movements might be necessary for assembling the protein complex. Figure 55 shows that the flexible regions (especially the C-terminal loop) are intertwined. This arrangement might be reached via the movements that the flexible C-terminal loop can perform, as we have seen before.

Figure 55: Crystal structure of the branched-chain alpha-keto acid dehydrogenase [1]

Advantages and Disadvantages from NMA and MD

The basis of Molecular Dynamics (MD) is the solution of Newton’s equations of motion to yield a trajectory of atomic positions. With MD the molecular motion (including small and large structural fluctuations and conformational transitions) can be described realistically. Therefore MD can be used to reveal structural changes in a protein at the atomic level. Moreover, the effect of the solvent can be taken into account. MD is limited by the approximate nature of the force fields and by the relatively short time scale (on the order of nanoseconds) that is computationally accessible. Visually, one can see that every atom of the protein is wobbling.
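As a minimal illustration of what solving Newton's equations of motion numerically means, the sketch below integrates a single particle on a harmonic spring with the velocity Verlet scheme. It is a toy example under our own assumptions (unit mass, unit force constant, hypothetical function name) and not the integrator setup of the MD simulation discussed here.

```python
import numpy as np

def velocity_verlet(x0, v0, force, mass, dt, n_steps):
    """Integrate Newton's equations of motion and return the trajectory."""
    x = np.array(x0, dtype=float)
    v = np.array(v0, dtype=float)
    f = force(x)
    trajectory = [x.copy()]
    for _ in range(n_steps):
        v += 0.5 * dt * f / mass   # first half-step for the velocities
        x += dt * v                # full step for the positions
        f = force(x)               # forces at the new positions
        v += 0.5 * dt * f / mass   # second half-step for the velocities
        trajectory.append(x.copy())
    return np.array(trajectory), v

# Toy system: one particle on a harmonic spring with force constant k.
k = 1.0
traj, v_end = velocity_verlet([1.0], [0.0], lambda x: -k * x,
                              mass=1.0, dt=0.01, n_steps=1000)
energy_end = 0.5 * k * traj[-1, 0] ** 2 + 0.5 * v_end[0] ** 2
```

The trajectory follows the expected cosine oscillation and the total energy stays close to its initial value of 0.5, which is the kind of behaviour a useful MD integrator has to show over many steps.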

NMA is a very powerful method to gain insight into the large-scale, shape-changing motions in proteins. The motion is modeled as a superposition of a set of independent harmonic oscillations about the equilibrium atomic positions. NMA uses the same force fields as molecular dynamics simulations, but the underlying model is much simpler. It is assumed that the conformational energy surface at an energy minimum can be approximated by a parabola over the range of thermal fluctuations (which is not correct at physiological temperatures). An NMA result usually shows large protein motions such as domain motions.

Elastic Network Models are a special form of NMA in which the protein model is drastically simplified. First, all elastic connections are taken to be at their minimum-energy length, so there is no need for energy minimization. Second, the number of atoms is reduced because only the C-alpha atoms are used. This simplification is a major advantage for the diagonalization step of NMA. A disadvantage of some Elastic Network Models such as GNM is that they only report how strongly each residue fluctuates, so the direction of the motion can no longer be displayed.
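The diagonalization step can be sketched for an anisotropic elastic network as follows. This is a simplified illustration and not the code of any of the servers used above: equal masses, a uniform spring constant `gamma`, the 15Å cutoff and the random toy coordinates are all assumptions.

```python
import numpy as np

def anm_hessian(coords, cutoff=15.0, gamma=1.0):
    """Hessian of an elastic network with identical springs within the cutoff."""
    coords = np.asarray(coords, dtype=float)
    n = len(coords)
    hessian = np.zeros((3 * n, 3 * n))
    for i in range(n):
        for j in range(i + 1, n):
            diff = coords[j] - coords[i]
            dist2 = diff @ diff
            if dist2 > cutoff ** 2:
                continue  # no spring between distant nodes
            block = -gamma * np.outer(diff, diff) / dist2
            hessian[3*i:3*i+3, 3*j:3*j+3] = block
            hessian[3*j:3*j+3, 3*i:3*i+3] = block
            hessian[3*i:3*i+3, 3*i:3*i+3] -= block
            hessian[3*j:3*j+3, 3*j:3*j+3] -= block
    return hessian

def normal_modes(coords, cutoff=15.0):
    """Eigenvalues (ascending) and eigenvectors of the network Hessian."""
    return np.linalg.eigh(anm_hessian(coords, cutoff))

# Toy network (hypothetical data): 12 random nodes in an 8 Angstrom cube,
# so every pair lies within the cutoff.
rng = np.random.default_rng(1)
toy_nodes = rng.uniform(0.0, 8.0, size=(12, 3))
eigvals, eigvecs = normal_modes(toy_nodes)
```

The first six eigenvalues are numerically zero and correspond to rigid-body translations and rotations, so the first mode of interest is the seventh one. No energy minimization is needed because the input structure is by construction the minimum of the network energy.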

Compared to the very detailed protein motions that are calculated by Molecular Dynamics simulations, NMA shows only large-amplitude motions of the protein. A great advantage of NMA is its speed. As only motions of large parts of the protein are calculated, it is much faster than MD. Furthermore, no sampling problem can arise. The advantage of MD, however, is its detailed insight into protein motion.<ref>Pnina Dauber-Osguthorpe, David J. Osguthorpe, Peter S. Stern, and John Moult: Low Frequency Motion in Proteins - Comparison of Normal Mode and Molecular Dynamics of Streptomyces Griseus Protease, Journal of Computational Physics 151, 169–189 (1999) A</ref>,<ref>Steven Hayward and Bert L. de Groot: Normal Modes and Essential Dynamics, in: Methods in Molecular Biology, vol. 443, Molecular Modeling of Proteins, Humana Press, Totowa, NJ</ref>,<ref>http://www.pasteur.fr/recherche/unites/Binfs/embo2008/speakers/hinsen/slides_normal_modes.pdf</ref>

Comparison of the lowest-frequency normal modes with the MD simulation

WEBnm@ ElNemo ANM oGNM NOMAD-Ref
Figure 56a: WEBnm@: mode 11
Figure 56b: ElNemo: mode 7
Figure 56c: ANM: mode 7
Figure 56d: oGNM: mode 8
Figure 56e: NOMAD-Ref: mode 7



MD simulation of the movement of BCKDHA
Figure 57: MD simulation of the movement of BCKDHA

By comparing the normal modes (Figure 56a-e) with the MD simulation (Figure 57) we can see that the MD model is much more detailed than the normal modes, as it is possible to see the motion of the side chains and not only of the domains or secondary structures. Additionally, the protein simulated by MD moves around much more within the simulation cell and does not seem to be fixed in one position.
When we compare the MD simulation with the WEBnm@ mode shown in Figure 56a we can observe that both show motion of the end of the protein (right side in MD and left side in WEBnm@), which is very similar in both cases. However, only the general movement is similar, since there is a lot of flexibility inside the structure in the MD simulation which is not shown in the WEBnm@ model. There is also motion at the other end of the protein, which is displayed by both tools but as two different kinds of movement. In the MD simulation it seems as if the red colored end of the protein folds away from the protein and returns to it again. In contrast, in WEBnm@ the end always seems to be a bit away from the rest of the protein and just sways a little up and down.
Mode 7 of ElNemo has the same problem as WEBnm@. It also shows that there is motion at the end of the protein (left side), but again this motion is displayed in a very simplified way. It seems as if the end of the protein nods, but as we can see in the visualisation of the MD simulation this is just a rough estimate of the movement. The more detailed motion is very flexible and goes in all directions. One very interesting observation in Figure 56b is that there is a small part in the center of the protein which is very flexible and nods up and down. This is not shown in the MD simulation or in any other normal mode, so we are not sure whether this is really a flexible part of the protein. The next difference between the ElNemo model and the protein of the MD simulation is that the model shows no motion of the other end of the protein, while there is a lot of movement in the MD simulation given in Figure 57.
By comparing normal mode 7 produced by ANM with the MD simulation we again find this nodding part at the end of the protein, as in the WEBnm@ mode, which differs from the motion at the end of the protein in the MD simulation. Like the end, the beginning of the protein in mode 7 only goes up and down and does not move in any other direction, so the flexibility of this part is much lower than in the MD simulation.
The normal modes of oGNM are completely different from the MD simulation. This program returns no visualization of a protein in motion but only shows the parts of the protein which are flexible. So we cannot say whether the parts move in the same direction or not, but we can see whether the flexible parts are the same. Both proteins are flexible at one end of the protein (right side in both cases), and since there are many red parts in the oGNM model, some of them deep red, this end seems to be very flexible there as well. But this is the only similarity between the two proteins. In the MD simulation there is also motion at the other end of the protein, which is not the case in mode 8 of oGNM. In contrast, the light pink parts in the center of the protein indicate motion in these regions which does not occur in the MD simulation.
The NOMAD-Ref model again shows many differences from the protein of the MD simulation. It seems that the whole protein is fixed except for the end of the protein on the right side of Figure 56e. In contrast, the visualisation of the MD simulation shows that the whole protein moves around a bit in the simulation cell and that there is motion at both ends of the protein. By comparing the movement of the two ends which are predicted to be flexible by both programs we can say that mode 7 of NOMAD-Ref is more similar to the MD simulation than the modes of the other NMA tools, since there is more motion and flexibility in the structures themselves. The end does not only go up and down but moves in all directions, which is more like the simulated protein. Of course it does not show as much motion as the protein in the MD simulation, since the moving side chains are hidden.
All in all we can say that the general information given by the several tools, namely that there is only motion at the ends of the protein, is mostly the same, except for some cases where additional movements in the center of the protein are predicted. But the detailed information about the movement of the protein, for example its direction, is completely different between the MD simulation and the NMA.

References

<references />

go back to Maple syrup urine disease main page

go back to Task 8: Molecular Dynamics Simulations

go to Task 10: Molecular Dynamics Analysis