Fabry:Normal mode analysis
- 1 Introduction
- 2 WEBnm@
- 3 ElNémo
- 4 References
One of the first questions in this task is why we use low-frequency normal modes. This is explained in the paper by Marc Delarue and Philippe Dumas<ref>Marc Delarue and Philippe Dumas On the use of low-frequency normal modes to enforce collective movements in refining macromolecular structural models, Proc. Natl. Acad. Sci. (USA), 101, 6957-6962 (2004)</ref>, who claim that "many of the structural transitions (...) can be explained by just a few of the lowest-frequency normal modes". The general motion of a system can be generated by superposing its normal modes. Thus, we could in principle infer from our analysis in this task how the alpha-galactosidase A that we examine hydrolyses the terminal alpha-galactosyl moiety of its substrate<ref>Normal mode http://en.wikipedia.org/wiki/Normal_mode, July 5th, 2012</ref>.
We decided to use the structures 3HG2 and 3HG3, which represent the catalytic mechanism of human α-galactosidase with an empty active site and with bound substrate, respectively (see <xr id="fig:bindSite"/>). From this, we hope to gain insight into the mechanism and to compare the normal modes with the behaviour of the molecule.
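The superposition of normal modes can be written compactly. The notation below is a standard textbook form and is not taken from the cited paper: the symbols $\mathbf{x}^{0}$ (equilibrium coordinates), $\mathbf{u}_k$ (eigenvector of mode $k$), $\omega_k$ (its frequency), $A_k$ (amplitude) and $\varphi_k$ (phase) are assumptions introduced here for illustration. Modes 1 to 6 are the rigid-body translations and rotations and are left out of the sum.

```latex
\mathbf{x}(t) \;=\; \mathbf{x}^{0} \;+\; \sum_{k=7}^{3N} A_k \,\mathbf{u}_k \cos\!\left(\omega_k t + \varphi_k\right)
```

Restricting the sum to the first few values of $k$ gives the low-frequency, collective part of the motion.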
Possible normal modes:
The structure 3HG2 has 781 C-alpha atoms in its PDB file, so 781*3 - 6 = 2337 normal modes could in principle be calculated for this structure without any cutoff by WEBnm@, which uses only the C-alpha atoms. Since the structure has a total of 6765 atoms, 6765*3 - 6 = 20289 normal modes could be calculated by elNémo, which takes all atoms into account.
The structure 3HG3 has 793 C-alpha atoms in its PDB file, so 793*3 - 6 = 2373 normal modes could in principle be calculated for this structure without any cutoff by WEBnm@. Since the structure has a total of 7537 atoms, 7537*3 - 6 = 22605 normal modes could be calculated by elNémo.
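The counts above follow from the 3N - 6 rule for a non-linear system of N particles. A minimal sketch that reproduces them is given below; the function name `count_normal_modes` is hypothetical and the atom counts are the ones quoted in the text.

```python
def count_normal_modes(n_particles: int) -> int:
    """Return 3N - 6, the number of non-trivial normal modes of a
    non-linear system of N particles (6 rigid-body modes removed)."""
    if n_particles < 3:
        raise ValueError("the 3N - 6 rule needs at least 3 particles")
    return 3 * n_particles - 6


# Atom counts as quoted in the text for the two PDB entries.
structures = {
    "3HG2": {"C-alpha": 781, "all atoms": 6765},
    "3HG3": {"C-alpha": 793, "all atoms": 7537},
}

for name, counts in structures.items():
    for model, n_atoms in counts.items():
        print(f"{name} ({model}): {count_normal_modes(n_atoms)} normal modes")
```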
WEBnm@<ref>Hollup SM, Sælensminde G, Reuter N. WEBnm@: a web application for normal mode analysis of proteins BMC Bioinformatics. 2005 Mar 11;6(1):52 </ref> aims to provide simple and automated computation of low-frequency normal modes of proteins, together with analyses that help to decide whether a complete study of the protein in question is worthwhile.
The server calculates normal modes with the help of the MMTK package<ref>Hinsen K, The Molecular Modelling Toolkit: a new approach to molecular simulations, J Comput Chem, 21:79-85, 2000</ref>, an open-source program library for molecular simulation applications. A C-alpha force field<ref>Hinsen K, Petrescu AJ, Dellerue S, Bellissent-Funel MC, Kneller GR, Harmonicity in slow protein dynamics, Chemical Physics, 261:25-37, 2000</ref> is used, so only the C-alpha atoms are considered, each with a weight that corresponds to the mass of the whole residue it represents.
The server provides several analysis tools, and all results can be downloaded. The tools are:
- deformation energies of each mode
- calculation of normalized squared atomic displacements
- calculation of normalized squared fluctuations
- interactive visualization of the modes using vector field representation or vibrations
- correlation matrix
Deformation energies and eigenvalues
In <xr id="fig:eigen3HG2"/> and <xr id="fig:eigen3HG3"/> the eigenvalues of the first 50 modes are plotted for both examined structures. The increase of the values corresponds to a decrease in the amplitude of motion, since there is an inverse relationship between the eigenvalue of a mode and its amplitude. Hence mode 7, the first non-trivial mode (modes 1 to 6 are the rigid-body translations and rotations), has the highest amplitude.
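The inverse relationship can be illustrated with a small sketch. It assumes the usual harmonic picture in which the mean-square amplitude of a mode is proportional to 1/eigenvalue; the eigenvalues below are made-up numbers for illustration, not values read from the server output, and the function name is hypothetical.

```python
def relative_mean_square_amplitudes(eigenvalues):
    """Mean-square amplitude of each mode relative to the softest one,
    assuming amplitude^2 is proportional to 1 / eigenvalue."""
    if any(ev <= 0 for ev in eigenvalues):
        raise ValueError("non-trivial modes must have positive eigenvalues")
    softest = min(eigenvalues)
    return [softest / ev for ev in eigenvalues]


# Hypothetical eigenvalues for modes 7 to 11 (illustration only).
hypothetical_eigenvalues = {7: 0.8, 8: 1.1, 9: 1.6, 10: 2.4, 11: 3.2}

amplitudes = relative_mean_square_amplitudes(list(hypothetical_eigenvalues.values()))
for mode, amplitude in zip(hypothetical_eigenvalues, amplitudes):
    print(f"mode {mode}: relative mean-square amplitude {amplitude:.2f}")
```

With eigenvalues that increase with the mode index, the first non-trivial mode always comes out with the largest relative amplitude.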
Atomic Displacement Analysis
In <xr id="fig:atomDisplacement3HG2"/> and <xr id="fig:atomDisplacement3HG3"/> the squared atomic displacements of the C-alpha atoms of the examined structures are shown. They are normalized such that the sum over all residues equals 100. These plots show which regions are displaced most, i.e. which move the most; these regions have the highest values. It is recommended to look for clustered peaks, which identify significantly large regions. Local flexibility (a single peak) is of less importance.
In both figures, chains A and B are colored differently. This shows that, although the motion of both chains within a mode looks alike in <xr id="tab:webnma_3hg2"/> and <xr id="tab:webnma_3hg3"/>, it is in general not perfectly equal and can even differ a lot. A good example of a significant variation is mode 7 of the structure 3HG2 (see <xr id="fig:atomDisplacement3HG2"/>, upper left). While the first part of both chains (approximately up to position 200) behaves similarly, apart from a different amplitude, the second half differs, with chain B showing much more movement. This can be observed in <xr id="tab:webnma_3hg2"/> only after very close inspection.
The two chains of the dimer seem to move more independently when no substrate is bound, since the atomic displacements of the chains are much more alike in the modes of 3HG3 than in those of 3HG2.
All in all, the substrate binding site itself (residues 203 to 207) seems to be rather rigid, except perhaps in mode 10 of 3HG3, while the ends of both chains (the last 50 residues) are fairly flexible. This is best seen in mode 12 of both structures.
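The normalization and the search for clustered peaks can be sketched as follows. This is an illustration under stated assumptions, not the server's code: a mode is assumed to be a list of (dx, dy, dz) displacement vectors, one per C-alpha atom, and the threshold and minimum cluster length are arbitrary choices. Both function names are hypothetical.

```python
def normalized_squared_displacements(mode):
    """Squared displacement per residue, scaled so that the sum over
    all residues is 100. `mode` is a list of (dx, dy, dz) tuples."""
    squared = [dx * dx + dy * dy + dz * dz for dx, dy, dz in mode]
    total = sum(squared)
    if total == 0:
        raise ValueError("mode contains no displacement")
    return [100.0 * value / total for value in squared]


def clustered_peaks(values, threshold, min_length=3):
    """Return (first, last) index pairs of runs of at least `min_length`
    consecutive residues whose value exceeds `threshold`."""
    clusters, start = [], None
    for index, value in enumerate(values):
        if value > threshold:
            if start is None:
                start = index
        elif start is not None:
            if index - start >= min_length:
                clusters.append((start, index - 1))
            start = None
    if start is not None and len(values) - start >= min_length:
        clusters.append((start, len(values) - 1))
    return clusters


# Toy mode with six residues: residues 2 to 4 move together.
toy_mode = [(0, 0, 0.1), (0, 0, 0.1), (1, 0, 0), (1, 0, 0), (1, 0, 0), (0, 0, 0.1)]
profile = normalized_squared_displacements(toy_mode)
print(clustered_peaks(profile, threshold=10.0))
```

A single isolated peak is ignored by `clustered_peaks`, which mirrors the advice to give local flexibility less weight than clustered peaks.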
The fluctuation of each C-alpha atom is the sum of its atomic displacements over all non-trivial modes, each weighted by the inverse of the corresponding eigenvalue. The fluctuations are normalized such that the sum over all residues equals 100. The fluctuations of the structures 3HG2 and 3HG3 are shown in <xr id="fig:fluctuation_3HG2"/> and <xr id="fig:fluctuation_3HG3"/>, respectively.
The plots support our previous assumption that the chains of the substrate-bound structure 3HG3 behave much more similarly than those of the structure with an empty active site, since the overlap is almost perfect in the left plot of <xr id="fig:fluctuation_3HG3"/>.
Again we can observe that the binding site is a rigid island between two moderately flexible regions, which are probably responsible for opening and closing the binding pocket and for the movement needed to break down the bound sugar.
Towards the end of the chain more motion can be observed, which is needed for the independent movement of both chains.
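The definition above can be sketched as a short calculation. The sketch assumes that each mode is a list of (dx, dy, dz) vectors, one per C-alpha atom, that the squared displacement is the quantity being summed, and that only non-trivial modes are passed in; the function name and the toy data are hypothetical.

```python
def normalized_fluctuations(modes, eigenvalues):
    """Sum the squared displacements of every residue over all given
    modes, weighting each mode by 1 / eigenvalue, then scale the
    result so that the sum over all residues is 100."""
    if len(modes) != len(eigenvalues):
        raise ValueError("need exactly one eigenvalue per mode")
    n_residues = len(modes[0])
    fluctuations = [0.0] * n_residues
    for mode, eigenvalue in zip(modes, eigenvalues):
        for index, (dx, dy, dz) in enumerate(mode):
            fluctuations[index] += (dx * dx + dy * dy + dz * dz) / eigenvalue
    total = sum(fluctuations)
    return [100.0 * value / total for value in fluctuations]


# Toy example: two modes, three residues (numbers are illustrative only).
soft_mode = [(1, 0, 0), (0, 0, 0), (0, 0, 0)]   # eigenvalue 1.0
stiff_mode = [(0, 0, 0), (1, 0, 0), (1, 0, 0)]  # eigenvalue 4.0
toy_fluctuations = normalized_fluctuations([soft_mode, stiff_mode], [1.0, 4.0])
print([round(value, 2) for value in toy_fluctuations])
```

Because of the 1/eigenvalue weight, the residue moved by the soft mode dominates the profile even though each residue is displaced by the same amount in its mode.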
This analysis plots the correlation of motions between all C-alpha atoms in the protein structure.
For our structures the server returned a general error. A trouble ticket was sent on 04.07.2012 at 11:22.
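For reference, the quantity that such a correlation plot shows can be sketched as follows. This is the common covariance-based definition and only an assumption about what the server computes: the covariance of residues i and j is the sum over modes of the dot product of their displacement vectors divided by the eigenvalue, and it is normalized to the range -1 to 1. The function name and the toy mode are hypothetical.

```python
import math


def correlation_matrix(modes, eigenvalues):
    """Normalized correlation of motion between all residue pairs.
    `modes` is a list of modes, each a list of (dx, dy, dz) tuples."""
    n_residues = len(modes[0])
    covariance = [[0.0] * n_residues for _ in range(n_residues)]
    for mode, eigenvalue in zip(modes, eigenvalues):
        for i in range(n_residues):
            for j in range(n_residues):
                dot = sum(a * b for a, b in zip(mode[i], mode[j]))
                covariance[i][j] += dot / eigenvalue
    correlation = []
    for i in range(n_residues):
        row = []
        for j in range(n_residues):
            norm = math.sqrt(covariance[i][i] * covariance[j][j])
            # Residues that do not move at all get a correlation of 0.
            row.append(covariance[i][j] / norm if norm > 0 else 0.0)
        correlation.append(row)
    return correlation


# Toy mode: residues 0 and 1 move in the same direction, residue 2 opposite.
toy_mode = [(1, 0, 0), (2, 0, 0), (-1, 0, 0)]
toy_correlation = correlation_matrix([toy_mode], [1.0])
print(toy_correlation[0][1], toy_correlation[0][2])
```

Values near 1 indicate residues that move together, values near -1 residues that move in opposite directions.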
3HG2: Correlation= 0.651 for 781 C-alpha atoms.
3HG3: Correlation= 0.779 for 793 C-alpha atoms.
This graph displays the distance variation between successive pairs of CA atoms in the two extreme conformations that were computed for this mode (DQMIN/DQMAX). Large distance variations can indicate residue pairs that carry the main strain in that particular normal mode movement. Note that residue pairs across chain breaks or at flexible ends of the protein may also exhibit large CA-CA distance variations. If more than one residue is grouped together into a rigid block (NRBL>1), CA-CA distance variations between CA atoms in the same block will be very low.
This feature is still experimental and will be further developed in the future.
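The distance variation between successive CA atoms can be sketched as below. The sketch assumes that the two extreme conformations are available as lists of (x, y, z) coordinates in the same residue order, and it reports the signed difference (distance at DQMAX minus distance at DQMIN); whether elNémo plots signed or absolute values is not stated in the text. The function name and the coordinates are hypothetical.

```python
import math


def successive_distance_variation(conf_min, conf_max):
    """Change in distance between CA atom i and CA atom i + 1 when going
    from the DQMIN conformation to the DQMAX conformation."""
    if len(conf_min) != len(conf_max):
        raise ValueError("both conformations need the same number of atoms")
    variations = []
    for i in range(len(conf_min) - 1):
        d_min = math.dist(conf_min[i], conf_min[i + 1])
        d_max = math.dist(conf_max[i], conf_max[i + 1])
        variations.append(d_max - d_min)
    return variations


# Toy chain of three CA atoms; only the last atom moves between the extremes.
toy_min = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0)]
toy_max = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (8.6, 0.0, 0.0)]
toy_variation = successive_distance_variation(toy_min, toy_max)
print([round(value, 2) for value in toy_variation])
```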
This matrix displays the maximum distance fluctuations between all pairs of CA atoms and between the two extreme conformations that were computed for this mode (DQMIN/DQMAX). Distance increases are plotted in blue and decreases in red for the strongest 10% of the residue pair distance changes. Every pixel corresponds to a single residue. Grey lines are drawn every 10 residues, yellow lines every 100 residues (counting from the upper left corner).
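The matrix described above can be sketched in a few lines. The sketch assumes the two extreme conformations are given as lists of (x, y, z) coordinates and interprets "strongest 10%" as the top tenth of all residue pairs ranked by absolute distance change; both function names and the toy coordinates are hypothetical. Following the text, an increase would be drawn in blue and a decrease in red.

```python
import math


def distance_change_matrix(conf_min, conf_max):
    """Matrix of distance changes (DQMAX minus DQMIN) for all CA pairs."""
    n = len(conf_min)
    return [
        [
            math.dist(conf_max[i], conf_max[j]) - math.dist(conf_min[i], conf_min[j])
            for j in range(n)
        ]
        for i in range(n)
    ]


def strongest_changes(matrix, fraction=0.10):
    """Return the top `fraction` of residue pairs by absolute distance
    change, labelled as 'increase' (blue) or 'decrease' (red)."""
    n = len(matrix)
    pairs = [(i, j, matrix[i][j]) for i in range(n) for j in range(i + 1, n)]
    pairs.sort(key=lambda pair: abs(pair[2]), reverse=True)
    keep = max(1, int(fraction * len(pairs)))
    return [
        (i, j, "increase" if change > 0 else "decrease")
        for i, j, change in pairs[:keep]
    ]


# Toy chain of five CA atoms on a line; the last atom swings sideways.
toy_min = [(0, 0, 0), (1, 0, 0), (2, 0, 0), (3, 0, 0), (4, 0, 0)]
toy_max = [(0, 0, 0), (1, 0, 0), (2, 0, 0), (3, 0, 0), (4, 3, 0)]
toy_matrix = distance_change_matrix(toy_min, toy_max)
print(strongest_changes(toy_matrix))
```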