Normal Mode Analysis of Glucocerebrosidase
- 1 Introduction
- 2 Elastic and Gaussian Network Models
- 3 Comparison of All-atom NMA to Elastic Network NMA
- 4 Normal Mode Analysis compared to Molecular Dynamics Simulations Analysis
- 5 References
Nomal mode analysis (NMA) is a time-independent method to calculate vibrational modes and protein flexibility and therefore helps to describe slow large-amplitude/ low-frequency motions. These motions describe conformational changes which are essential for the protein function. Each NMA technique uses harmonic approximations and neglect solvent damping. For the analysis atoms are modeled as point masses connected by springs, which represent the interatomic forces. The normal mode describes a pattern of motion in which all parts of the system (all atoms or only Cα atoms) move with the same frequency and in phase. These modes are no actual protein motions, but can give insight in the possible movements and regions of flexibility. <ref> Alexandrov et al., Normal modes for predicting protein motions: A comprehensive database assessment and associated Web tool. 2004. Protein Science, Vol. 14 </ref> Normal modes can either be calculated with all atoms or can be restrained to Cα atoms only.
In this section different normal mode analyses were carried out for glucocerebrosidase (2NT0). Furthermore, the difference between all-atom and elastic network normal mode analysis gets illustrated on a small protein (1BPT).
Additional modes and figures, which are not addressed in this discussion, are located in the supplementary data page.
Elastic and Gaussian Network Models
The Elastic Network Model is a particular type of NMA in which the springs connecting each node to the neighbouring nodes are of equal strength and only the atom pairs within a cutoff distance get considered. In the Gaussian Network Model the proteins are represented by point masses of the Cα atoms and a harmonic approximation is used to model the interactions between point masses.
WEBnm@ is a web-server which provides automated computation and analysis of low-frequency normal modes of proteins. For an input structure file in PDB format, the normal modes are calculated and a series of automated calculations (normalized squared atomic displacements, vector field representation and an animation of the first six vibrational modes). Additional to the single analysis (calculating lowest frequency normal modes for one protein), the server offers a comperative analysis. In this analysis, the normal modes of a set of aligned sequences are calculated and compared. This feature is still under development. <ref>Hollup SM, Sælensminde G, Reuter N. WEBnm@: a web application for normal mode analysis of proteins BMC Bioinformatics. 2005 Mar 11;6(1):52 </ref>
- Webserver: http://apps.cbu.uib.no/webnma/home
- Single Analysis: structure file in PDB format
- Comparative Analysis: structure files in PDB format and sequence files in fasta format
Figure 1 to 5 show the first five vibrational modes (modes 7 to 11) calculated with WEBnm@ for the structure of glucocerebrosidase (2NTO). On the left side of each figure the vectors are shown, whereas the movement is illustrated to the right.
The atomic displacement of the different models is shown in the following pdf file: File:Webnma atomic displacement glucocerebrosidase.pdf. The plots show the normalized square of the displacement of each Calpha atom, whereof the highest values correspond to the most displaced regions.
Having a look at the atomic displacements, one can see that modes 7, 9, 10 and 11 have peaks and high values all over the protein. This indicates, that most parts of glucocerebrosidase are flexible. Mode 8 describes instead local flexibilities at the N- and C- terminus of the protein. The movement describes movements and displacements of the different domains. These movements consist of streching, bending, twisting, attraction or repulsion of the different glucocerebrosidase domains.
Looking at Figure 1, one can see that the upper and the lower part of glucocerebrosidase are attracted and repulsed from each other. The normalized square of displacement plot shows, that almost the complete protein is in motion. The aminoacids around position 80 move the most.
The overall motion of Mode 8 are lower than the movements of Mode 7: the maximal movement is at the same position as in Mode 8, but the rest of the protein shows almost no displacement of Cα atoms at all. Only at the N-terminus of the protein, another region of flexibility is located. The movement can also be described as attraction and repulsion, but in another direction compared to Mode 8.
Mode 9 is another mode with a very high flexibility located all over the protein. The highest atomic displacement is located at the N-terminus of the protein. Once again, the two parts of the protein get attracted to and repulsed from each other (cf. Figure 3).
Mode 10 also describes movements all over the protein, although the regions of highest flexibility are this time located in the middle of the protein. The movement can be described as attraction and repulsion, but also a slight twisting can be observed.
Mode 11 has a high flexibility all over the protein. Peaks of atomar displacement can be found all over the protein as well. The movement consists of repulsion and attraction.
The active sites of Glucocerebrosidase, namely Glu235 and Glu340 are located in the middle of the protein. In Mode 8 these regions are almost not flexible at all, whereas flexibility in the other 4 modes is located all over the protein, including the area around the active sites.
ElNémo is a web-server based on the Elastic Network Model. The tool computes, visualizes and analyses low-frequency normal modes of large macro-molecules. Any size of proteins can be treated as ElNémo uses a 'rotation-translation-block' approximation. <ref>K. Suhre & Y.H. Sanejouand, ElNemo: a normal mode web-server for protein movement analysis and the generation of templates for molecular replacement. Nucleic Acids Research, 32, W610-W614, 2004. </ref>
- Webserver: http://www.igs.cnrs-mrs.fr/elnemo/start.html
- structure file in PDB format
- supplementary options for NMA calculation (number of normal modes to be calculated, minimum and maximum perturbation and stepsize between DQMIN and DQMAX)
- options for computing the eigenmodes (NRBL and cutoff to identify elastic interactions)
Figure 6 to 9 show for each of the five first vibrational normal modes calculated with ElNémo three different perspectives.
The normal modes obtained with ElNémo seem to describe the same movements as the resulting normal modes of Webnm@. Most of the movement seems to be between the different domains. As observed in the results of Webnm@ the movements consists of stretching, bending, attraction, repulsion and twisting.
Mode 7, cf. Figure 6, shows a bending and stretching movement. The active site is not affected by this movement.
Mode 8, shown in Figure 7, moves the same parts as Mode 7, just in another direction. Once again, the part only consisting of beta sheets in bended towards the other part of the protein. The active site is not affected by this movement.
Mode 9, cf. Figure 8, shows a twisting movement. The part only consisting of beta sheets is twisted compared to the remaining part of the protein. The active site is not affected by this movement.
Mode 10, which is displayed in Figure 9, shows repulsing and attracting movements of smaller parts of the protein. But still, the active site does not seem to be affected.
Mode 11, shown in Figure 10, is similar to Mode 9 and shows a twisting movement. The active site is not affected.
Anisotropic Network Model
The ANM Webserver provides NMA Analysis with the anisotropic network model (ANM) which is an elastic network (EN) introduced in 2000. It is a very fast method which predicts the global modes. The ANM adopts a uniform force constant γ to all springs and nodes are refered as the Cα atoms.<ref>Anisotropic network model: systematic evaluation and a new web interface, Eyal E, Yang LW, Bahar I. Bioinformatics. 22, 2619-2627, (2006)</ref>
- Webserver: http://ignmtest.ccbb.pitt.edu/cgi-bin/anm/anm1.cgi
- structure file in PDB format or PDB id and chain
- cutoff for interaction between Cα atoms in Å
- distance weight for interaction between Cα atoms
After submitting the job you get a results page where you can analyze the different normal modes more precisely. You can change the amplitude and the frequency of the motions, you can visualize the vectors and change the appearance of the protein. Furthermore you have several possibilities to choose:
- download files
- create PDB (motion)
- create PyMol script
- get anisotrpic temp. factors
- B-factors/mode fluctuations
- Distance fluctuations and deformation energy
The movement, distance fluctuations, the deformation energy per position and mode fluctuations are shown for five different modes in the following figures. Additional modes can be found here.
At the end you can see the Cα anisotropic temperature factors as ellipsoids calculated in different ways.
|Mode 1||Mode 2||Mode 3||Mode 4||Mode 5|
|Inter Distance Analysis and Deformation Analysis|
|Cα anisotropic temperature factors as ellipsoids|
|purely computational||refined by experimental b-factors|
In each of the five best normal modes obtained, the whole protein seems to be in motion. This is confirmed by the various peaks in the mode fluctuations and the deformation energy plots. Various movements can be observed: twisting, turning, repulsion, attraction and bending. We can also see that the areas, which stand out in the B-factors/mode fluctuations and the Cα anisotropic temperature factors (figures 16 and 17) are the ones, which move the most at the normal modes.
For mode 1 (figures 11a-d) you can see repulsion and attraction. The upper and the lower part are moving very strong. Therefore the core of the protein is also a bit in motion. Looking at the deformation energy you can also see that the protein is in motion. Especially in the B-factors/mode fluctuations diagram you can see where the motion comes from. The distance matrix shows the large inter-residue fluctuations as blue. For this case you can see that residues with a greater distance show larger fluctuations.
In mode 2 (figures 12a-d) only the upper part shows repulsion and attraction. But this influences also the rest of the protein which is in motion. The lower part responds very strong to the movement of the upper part. The distance matrix is very similar to that of mode 1. The reason is that the movement is also very similar. But the deformation energy differs. The maxima are at the end at positions 344 til 393 and at 442. The last one is in the upper part of the protein, 344 til 393 in the middle.
Mode 3 (figures 13a-d) shows also attraction and repulsion but this time with the beta barrels. Therefore also the alpha helices are moving. The distance matrix shows more large inter-residue fluctuations than in mode 1 and 2. Also the deformation energy differs. You can see the maxima at the beginning between position 50 and 99 and at position 344 and 393. The last two ones are positioned at loops which are in very strong motion.
Mode 4 (figures 14a-d) shows many movements. It is like pulsation. The protein is contracting and extending. Only the core is almost without motion. The distance matrix shows many large inter-residue fluctuations, but low inter-residue fluctuations for residues that are very near or very far away. The deformation energy shows one maximum between position 50 and 99 and some little maxima between 246 and 393. This also reflects that there are many different movements.
Mode 5 (figures 15a-d) shows bending and turning and also a bit twisting at the upper part. The beta sheets are the elements that are responsible for the moving. They show the most motion. The rest of the protein just responds. The distance matrix shows that most of the residues have small inter-residue fluctuations. For the deformation energy we can see the maxima at the beginning of the sequence at positions 1 to 50 and also some small maxima at the end.
oGNM is a tool which calculates the equilibrium dynamics of proteins using the Gaussian Network Model. Futhermore it provides a database (iGNM) with precalculated PDB results.
- Webserver: http://ignm.ccbb.pitt.edu/Online_GNM.htm
- structure file in PDB format or PDB id
- number of nodes to represent a nucleotide
- cutoff distances for amino acids and nucleotide pairs
- preferred visualization engine
[The results are also available at http://ignm.ccbb.pitt.edu/ognm/795639148/temp/index.htm]
The figure below show a mobility-color-coded ribbon diagram for modes 1 to 5 where red describes regions of high mobility and blue regions of low mobility.
|Mobility-Color-Coded Ribbon Diagram|
|Fluctuations per residue for modes 1 to 5|
|Cross-correlation plot for mode 1 to 5|
oGNM shows which parts of the protein are able to move and where the protein is most dynamic. In the upper part the Mobility-Color-Coded Ribbon Diagrams, diagrams with the fluctuations per residue and a cross-correlation plot for mode 1 to 5 are shown. In the last one you can see that only for residues that are very near or very far the correlation is high.
In mode 1 (figure 18) you can see that the beta sheets at the exterior of the protein are the most dynamic. The inner part is colored blue and therefore not mobile. Compared to the normal modes of other tools it is what we have seen. Also the diagram with the fluctuations (figure 23) per residue shows that positions 300 to 400 has the fewest mobility which is the middle or the core of the protein.
In mode 2 (figure 19) there are only two small red parts with high mobility, one part of an alpha helix and a beta sheet. Both are at the exterior of the protein. But there are more parts with medium mobility. All the exterior elements are colored light blue. The diagram with the fluctuations (figure 23) also shows that there are more moving parts.
Mode 3 (figure 20) has some little helix parts that are red and several beta sheet and also helix parts that are colores rosy. There are little flexible parts at the left and right side of the protein and one larger part in the middle. This is also shown in the diagram with the fluctuations (figure 24), where the maximum occurs between position 300 and 350.
Mode 4 (figure 21) shows only two little red loops and a few rosy parts. But the bigger part of the protein is colored blue. This seems to be a mode with few motion. Interestingly the diagram with the fluctuations (figure 24) shows a maximum of 0.065 at position 350, which is very high. This is exactly the red colored loop.
Mode 5 (figure 22) has some loops and parts of beta sheets that are colored red. Near them are some rosy parts but again the biggest part of the protein is colored blue. Looking at the diagram with the fluctuations (figure 25) there are peaks at the beginning and the end, which are the exterior parts which are colored red or rosy.
The active site does not seem to be affected by any of the modes. Most motions described by the modes are located at the exterior of the protein, whereas the active site is located in the center of the protein.
NOMAD-Ref is a webserver for normal mode analysis. By simplifying the potential energy between atoms all atoms can be used. This is possible by using the Tirion calculation of normal modes.<ref>Tirion, Monique M. Large Amplitude Elastic Motions in Proteins from a Single-Parameter, Atomic Analysis. Phys Rev Lett, 77, 1905--1908 (1996).</ref> NOMAD-Ref provides pdb-files for pymol and the elastic network of different normal modes, a graph with the RMSD values per residue and a modes.dat file with gives information about the frequency of the modes and the eigenvectors.
- Webserver: http://lorentz.dynstr.pasteur.fr/nma/submission.php
- PDB file
- number of modes to calculate
- distance weight parameter in Å
- cutoff to use for mode calculation in Å
- average RMSD (Angstrom) in output trajectories
Modes 1-6 are rigid body motions (can be found here), and modes 7 to 11 describe the best five vibrational modes. Each mode is represented by two figures:
- animation of the protein motion
- cRMS per residue plot
|Mode 7||Mode 8||Mode 9||Mode 10||Mode 11|
Mode 7 to 9 show a movement all over the protein (see plots for rmsd per residue), whereas Mote 10 and 11 have a high flexibility at the N-terminus of the protein.
Mode 7 shows repulsion and attraction. It seems as the upper and the lower part of the protein are moving to each other and then apart. The plot with the cRMS per residue has the maxima at the beginning and at the end of the sequence. At position 2500 there is also one maximum.
Mode 8 has a twisting motion. The whole protein moves, it looks like bending and turning. The plot with the cRMS per residue shows its maximum at position 3500, which is at the upper part of the protein.
In mode 9 there is again repulsion and attraction. It looks very similar to mode 7. But the plot with the cRMS per residue differs. At position 3500 again there is a huge maximum. The rest seems to be the same.
As already mentioned in mode 10 one can see primarily the N-terminus which is moving. The rest of the protein also moves because of this motion. The plot with the cRMS per residue cannot be interpreted well because there is one maximum at position 2500 which is very high (2000 Angstrom) and so the rest cannot be seen.
Also in mode 11 the N-terminus is the most flexible part of the protein. There is also a twisting motion of the protein. The plot with the cRMS per residue also shows a maximum at position 3500 and a little peak at position 2500.
Overall we can see in all tools similar movements. The modes resemble each other. With that information we want to look at our selected mutations again. If they are positioned at regions with high flexibility they could change something.
Mutation 1: Gly - Ser (Pos. 49/10): In oGNM this position is not part of a flexible region. And as far we can see also the other tools show no special movement for this position. Of course it also is affected of total protein movement, but not in some special case. So it might not affect the flexibility of the protein.
Mutation 2: Asp - Asn (Pos. 63/24): It's the same for the second mutation. There we also can't see that it is part of a highly flexible region.
Mutation 3: His - Arg (Pos. 99/60): Mutation three is at a position of the protein, where it has high flexibility according to some Normal Modes. So it might change the flexibility of the protein. Corresponding to our sequence- and structure-based mutation analysis it is not harmful, so the mutation may have no influence on the flexibility.
Mutation 4: Arg - Gln (Pos. 159/120): This mutation is located in the interior of the protein. The Normal Modes show that this is the most stable region of the protein. There is almost no flexibility. So a change of the amino acid should not change the movement of the protein.
Mutation 5: Pro - Leu (Pos. 221/182): The fifth mutation is part of flexible region. Due to the different structure of the amino acids this may affect the flexibility of the protein. It might be another cause why this mutation is damaging.
Mutation 6: Ser - Asn (Pos. 310/271): There are some Normal Modes where this mutation is in a flexible region but in most cases it is not. So there may be an effect on the flexibility but as there are more Normal Modes where it is not flexible it is not clear. And we know that this mutation is damaging, so it could have an effect.
Mutation 7: Asn - Ser (Pos. 409/370): oGNM shows the position of this mutation in mode 3 as flexible, the other tools also have some modes where it is a little flexible. So there might be also an influence on the flexibility. But again it's not clear.
Mutation 8: His - Arg (Pos. 350/311): This mutation is part of a beta sheet. It is in the interior of the protein and in the middle. There is no high flexibility. There are some modes which show a bit flexibility but in most cases it is rigid. As the mutation is harmless the mutation should have no great effect.
Mutation 9: Met - Val (Pos. 455/416): The ninth mutation is also in a part of the protein which is not very flexible. So there should also be no great effect on the flexibility.
Mutation 10: Leu - Pro (Pos. 509/470): Mutation ten is in most cases part of a flexible region. The majority of the Normal Modes show it as flexible. So there may be an effect on the flexibility. But this mutation is not listed in HGMD. So the effect may be low.
Comparison of All-atom NMA to Elastic Network NMA
All-atom NMA considers each atom for the calculation of the modes, not only the Cα atoms and is therefore quite time conusming. In this section, all-atom normal mode analyses are compared to the normal modes based on elastic networks. Instead of glucocerebrosidase (our main protein of interest) 1BPT was chosen, as it is quite small and therefore enables a quite fast all-atom normal mode analysis.
The all-atom normal mode analysis was carried out two times with two different temperatures (600 and 2000 K). The temperature was modified as low frequencies dynamics of proteins are temperature dependent and it would therefore be interesting to see the differences.
1BPT at 600 K
1BPT at 2000 K
Elastic Network NMA
The following images show the normal modes 7 to 10 for 1BPT created with Nomad-Ref.
The modes caluclated at 2000 K are slightly bigger than the ones calculated with a temperature of 600 K. The directions of the movements are the same with both temperatures. This shows that the motions get more anharmonic with higher temperatures. The visual inspectation of the different modes suggests that modes 2 - 4 of all-atom and elastic network describe the same motion, whereof the elastic network modes have the biggest movement range. Mode 1 of the elastic normal mode analysis seems to be rigid apart from one flexible loop. In the all-atom mode mor parts of the protein are flexible.
Normal Mode Analysis compared to Molecular Dynamics Simulations Analysis
The Normal Modes show a different behaviour compared to the Molecular Dynamics Simulations Analysis. Normal Modes show great motions of huge protein parts whereas the Molecular Dynamics Analysis is more detailed. There we can see sidechains wobbling and an overall motion of the protein. But there is no attraction or repulsion. WEBnm@ is nearest to the Molecular Dynamics Simulation.
We can also compare the results from oGNM with the flexible regions shown in Molecular Dynamics. Here mode 4 and 5 come nearest. Molecular Dynamics analysis shows only low flexibility with some loops with higher flexibility and a part of an alpha helix and a part of a beta sheet with a little higher mobility. But overall the flexibility is low.
For more details look at the Molecular Dynamics Simulations Analysis to compare the different results.