Difference between revisions of "Structure based mutation analysis of GBA"
Line 131: | Line 131: | ||
=== Gromacs === |
=== Gromacs === |
||
+ | |||
+ | '''Usage ''' |
||
+ | * '''Step 1 - [http://manual.gromacs.org/current/online/pdb2gmx.html pdb2gmx]''' |
||
+ | ** pdb2gmx adds hydrogens to the molecules and generates a coordinate (.gro) and a topology(.top) file. |
||
+ | ** Commandline: <code>pdb2gmx -f mutant_nw.pdb -o mutant_nw.gro -p mutant_nw.top -water tip3p -ff amber03</code> |
||
+ | * '''Step 2 - [http://manual.gromacs.org/current/online/mdp_opt.html MDP file] ''' |
||
+ | ** Create a MDP file with the following content: |
||
+ | title = PBSA minimization in vacuum |
||
+ | cpp = /usr/bin/cpp |
||
+ | # Preprocessing |
||
+ | # -DFLEXIBLE: include flexible water in stead of rigid water into topology |
||
+ | # -DPOSRES: includes posre.itp into topology, used for position restraints. |
||
+ | define = -DFLEXIBLE -DPOSRES |
||
+ | # Implicit solvent - simulation with implicit solvent using the Generalized Born formalism |
||
+ | implicit_solvent = GBSA |
||
+ | # Run control - steepest descent algorithm for energy minimization |
||
+ | integrator = steep |
||
+ | # Run control - tolerance |
||
+ | emtol = 1.0 |
||
+ | # Run control - maximum number of steps to integrate or minimize |
||
+ | nsteps = 500 |
||
+ | # Output - frequency to write energies to energy file |
||
+ | nstenergy = 1 |
||
+ | # Tables - group(s) to write to energy file |
||
+ | energygrps = System |
||
+ | # Neighbour searching - makes grid in the box and only check atoms in neighboring grid cells when constructing a new neighbor list every nstlist steps |
||
+ | ns_type = grid |
||
+ | # Electrostatics - twin range cut-off's with neighborlist cut-off rlist and Coulomb cut-off rcoulomb |
||
+ | coulombtype = cut-off |
||
+ | rcoulomb = 1.0 |
||
+ | # VDW - distance for the LJ or Buckingham cut-off |
||
+ | rvdw = 1.0 |
||
+ | # Bonds - no constraints except for those defined explicitly in the topology |
||
+ | constraints = none |
||
+ | # Neighbour searching - Use no periodic boundary conditions, ignore the box. |
||
+ | pbc = no |
||
+ | * '''Step 3 - [http://manual.gromacs.org/current/online/grompp.html grompp]''' |
||
+ | ** grompp, the GROMACS preprocessor, prepares the system for gromacs. Therefore it reads a molecular topology file, checks the validity of the file and expands the topology from a molecular description to an atomic description. |
||
+ | ** Commandline: <code>grompp -v -f mutant_nw.mdp -c mutant_nw.gro -p mutant_nw.top -o mutant_nw.tpr</code> |
||
+ | *'''Step 4 - [http://manual.gromacs.org/current/online/mdrun.html mdrun]''' |
||
+ | ** The mdrun program is the main computational chemistry engine within GROMACS and can perform several different things: molecular dynamics simulations, Brownian Dynamics, Langevin Dynamics, Conjugate Gradient, L-BFGS, or Steepest Descents energy minimization. |
||
+ | ** Commandline: <code></code> |
||
+ | * '''Step 5 - [http://manual.gromacs.org/current/online/g_energy.html g_energy]''' |
||
+ | ** g_energy extracts energy components or distance restraint data from an energy file and calculates Average, RMSD and drift from the simulation. |
||
+ | ** Commandline: <code>g_energy -f FILE.edr -o energy_1.xvg</code> |
||
+ | |||
+ | ''' Results ''' |
||
== Discussion == |
== Discussion == |
Revision as of 12:09, 1 July 2011
Contents
Introduction
A detailed list of the ten mutations analyzed in this section, can be found in Task 6.
Structure Selection
To carry out a structure-based analysis of the mutations chosen in Task 7 a crystal structure had to be chosen. According to Uniprot 19 different crystal structures of glucocerebrosidase exist. The table below shows the six different structures with a resolution of or better than 2 Angstrom. 2NT0 is chosen as template for the analysis carried out in this section, as no residues are missing, the R-value is quite low, and it has the best resolution among the structures without missing residues. Only incomplete structures have been resolved near the physiological pH (7.4), therefore a structure resolved at a more acid pH had to be chosen. The structure can either be downloaded from the PDB website or by using the script fetchpdb, which validates the ID and downloads the corresponding structure.
PDB ID | Resolution [Å] | R-factor | Coverage | pH | # Missing Residues (A/B) |
---|---|---|---|---|---|
1OGS | 2.00 | 0.195 | 4.6 | 0 | |
2NT0 | 1.79 | 0.181 | . | 4.5 | 0 |
2V3D | 1.96 | 0.157 | 6.5 | 9/8 | |
2V3E | 2.00 | 0.163 | 7.5 | 7/7 | |
2V3F | 1.95 | 0.154 | 6.5 | 8/14 | |
3GXI | 1.84 | 0.193 | NULL | 0 |
Mutation Mapping
The ten positions at which the mutations analyzed in this task take place, are hilighted in the structure of 2NT0 shown in Figure 1. As already mentioned in Task 5 and 6, one can clearly see that two mutations are next to the active site residues Glu235 and Glu340, namley the positions 120 and 311. The wildtype residues at these positions (Arg120 and His311) are known to form hydrogen bonds with the active sites and should therefore be quite important for function and structure. <ref>Kim et al., Crystal Structure of the Salmonella enterica Serovar Typhimurium Virulence Factor SrfJ, a Glycoside Hydrolase Family Enzyme. Journal of Bacteriology, 2009, p. 6550-6554, Vol. 191, No. 21 </ref> The other eight mutation positions are located all over the protein.
SCWRL
SCWRL is a tool, specifically designed for side-chain conformation predictions, given a fixed backbone from an experimental determined structure. <ref>G. G. Krivov, M. V. Shapovalov, and R. L. Dunbrack, Jr. Improved prediction of protein side-chain conformations with SCWRL4. Proteins (2009). </ref>
Usage
- Create pdb-File only containing chain A of the protein (e.g with pymol):
2NT0_A.pdb
- Extract sequence (e.g with repairPDB) and change all letters of protein sequence to lower case and introduce mutation in capital letters:
mutantSequence.seq
- Note, that only one mutation should be inserted per sequence -> results in 10 mutant sequence files.
- Commandline:
scwrl4 -i 2NT0_A.pdb -s mutantSequence.seq -o 2NT0_mutant.pdb > scwrl.log
Results
SCRWL4 was applied ten times, once for each mutation. The resulting conformations of the mutants are visualized in Figure 3. Figure 3 additionally shows the wildtype amino acids and the mutants created with the mutagenesis method of pymol. The conformations, created with SCWRL4 and pymol vary greatly. Only in mutation 9 they seem to be quite similar. Figure 4 shows a superposition of the wild type protein and the mutated proteins in cartoon representation. This shows that SCWRL did not only change the mutant residues, but also changed some beta sheets at the bottom of the structure (shown in green). These changes are consistent in all ten mutant structures.
Mutation | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
Minimal Energy | 351.329 | 348.659 | 350.017 | 355.416 | 473.454 | 364.148 | 352.615 | 362.604 | 354.98 | 375.976 |
Energy Calculation
FoldX
FoldX is an algorithm which provides an estimation of the importance of the interactions contributing to the stability of proteins. It was developped by the FoldX consortium now centered at the Centre de Regulació Genòmica in Barcelona. <ref>http://foldx.crg.es/about.jsp</ref>
Usage
- Create a runfile according to the example "Energy of the molecule" at the FoldX website:
run.txt
- Do not forget to adjust known information like pH and temperature.
- Create a list file containing all pdb files you want the energy to be calculated of:
list.txt
- Commandline:
sudo ./foldx -runfile run.txt > foldx.output
Results
The total energies calculated with FoldX are shown in the table below. The differences between the wild type and the different mutant structures have been calculated and are listed in the table as well.
Mutation | Total Energy | Difference |
---|---|---|
WT | -372.60 | 0 |
1 | -225.82 | -146.78 |
2 | -228.18 | -144.42 |
3 | -226.97 | -145.63 |
4 | -226.38 | -146.22 |
5 | -196.84 | -175.76 |
6 | -224.01 | -148.59 |
7 | -228.48 | -144.12 |
8 | -217.29 | -155.31 |
9 | -221.71 | -150.89 |
10 | -218.65 | -153.95 |
Minimise
Usage
- Remove hydrogens and waters from the pdb files with repairPDB:
repairPDB mutant.pdb -nosol > mutant_nw.pdb
- Apply Minimise:
minimise mutant_nw.pdb mutant_nw_min.pdb > mutant_nw_min.out
Results
Gromacs
Usage
- Step 1 - pdb2gmx
- pdb2gmx adds hydrogens to the molecules and generates a coordinate (.gro) and a topology(.top) file.
- Commandline:
pdb2gmx -f mutant_nw.pdb -o mutant_nw.gro -p mutant_nw.top -water tip3p -ff amber03
- Step 2 - MDP file
- Create a MDP file with the following content:
title = PBSA minimization in vacuum cpp = /usr/bin/cpp # Preprocessing # -DFLEXIBLE: include flexible water in stead of rigid water into topology # -DPOSRES: includes posre.itp into topology, used for position restraints. define = -DFLEXIBLE -DPOSRES # Implicit solvent - simulation with implicit solvent using the Generalized Born formalism implicit_solvent = GBSA # Run control - steepest descent algorithm for energy minimization integrator = steep # Run control - tolerance emtol = 1.0 # Run control - maximum number of steps to integrate or minimize nsteps = 500 # Output - frequency to write energies to energy file nstenergy = 1 # Tables - group(s) to write to energy file energygrps = System # Neighbour searching - makes grid in the box and only check atoms in neighboring grid cells when constructing a new neighbor list every nstlist steps ns_type = grid # Electrostatics - twin range cut-off's with neighborlist cut-off rlist and Coulomb cut-off rcoulomb coulombtype = cut-off rcoulomb = 1.0 # VDW - distance for the LJ or Buckingham cut-off rvdw = 1.0 # Bonds - no constraints except for those defined explicitly in the topology constraints = none # Neighbour searching - Use no periodic boundary conditions, ignore the box. pbc = no
- Step 3 - grompp
- grompp, the GROMACS preprocessor, prepares the system for gromacs. Therefore it reads a molecular topology file, checks the validity of the file and expands the topology from a molecular description to an atomic description.
- Commandline:
grompp -v -f mutant_nw.mdp -c mutant_nw.gro -p mutant_nw.top -o mutant_nw.tpr
- Step 4 - mdrun
- The mdrun program is the main computational chemistry engine within GROMACS and can perform several different things: molecular dynamics simulations, Brownian Dynamics, Langevin Dynamics, Conjugate Gradient, L-BFGS, or Steepest Descents energy minimization.
- Commandline:
- Step 5 - g_energy
- g_energy extracts energy components or distance restraint data from an energy file and calculates Average, RMSD and drift from the simulation.
- Commandline:
g_energy -f FILE.edr -o energy_1.xvg
Results