Task 8: Molecular Dynamics Simulations
A detailed task description can be found here.
Intro and selection of two mutants
The description given here was applied for our wild type (1J8U). However, the steps done for our two mutants R408W and P281L are the same.
We selected R408 we because it is the most abundant mutation of PAH which is associated with phenylketonuria. It has a total frequency of 6.67% (see ).
The second mutation, P281L, was selected because it is the closed mutation to the binding site (HIS 285) and we hope that we are able to see some interesting things here which give us an explanation why this mutation is disease causing.
Before we can start to generate all necessary files for our MD simulation we have to prepare our structure first. We employed the following steps:
1. Extracting the crystal water below 15 Å
repairPDB 1J8U.pdb -ssw 15 > water_below_15.out
We received the following output:
HETATM 2559 O HOH A1008 2.996 9.738 16.094 1.00 14.91 O HETATM 2561 O HOH A1010 -3.667 12.788 8.538 1.00 14.80 O HETATM 2572 O HOH A1021 3.557 27.911 10.913 1.00 14.69 O HETATM 2879 O HOH A1328 -5.335 24.271 14.490 1.00 14.88 O TER
2. Extract only the protein
repairPDB 1J8U.pdb -nosol > 1J8U_nosol.pdb
3. Extract the amino acid sequence and turn it into lower case letter to give it as a input for SCWRL
repairPDB 1J8U.pdb -seq > 1J8U_seq.txt
vim 1J8U_seq.txt :%s/.*/\L&/g
4. Use SCWRL to complete the side chains of all amino acids
/apps/scwrl4/Scwrl4 -i 1J8U_nosol.pdb -s 1J8U_seq.txt -o 1J8U_nosol_after_SCWRL.pdb | tee scwrl_1J8U_nosol_after_scwrl.out
5. Remove the hydrogen atoms from the SCWRL output
repairPDB 1J8U_nosol_after_SCWRL.pdb -noh > 1J8U_nosol_after_SCWRL_no_h.pdb
6. concatenate the protein and the crystal water into one file
We just added the output of step 1 to the end of the PDB file 1J8U_nosol_after_SCWRL_no_h.pdb and named this file 1J8U_nosol_after_SCWRL_no_h_merged_crystal_water.pdb
Create important GROMACS files
1. We have to create a TOP (topology) and a PAR (parameter) file with the following command:
pdb2gmx -f 1J8U_nosol_after_SCWRL_no_h_merged_crystal_water.pdb -o 1J8U_nosol_after_SCWRL_no_h_merged_crystal_water.gro -p 1J8U_nosol_after_SCWRL_no_h_merged_crystal_water.top -water tip3p -ff amber03 -vsite hydrogens | tee pdb2gmx.out
2. Next we create a box around the protein and fill it with water
Create the box:
editconf -f 1J8U_nosol_after_SCWRL_no_h_merged_crystal_water.gro -o 1J8U_nosol_after_SCWRL_no_h_merged_crystal_water_box.gro -bt dodecahedron -d 1.0 | tee editconf.out
Fill it with water:
genbox -cp 1J8U_nosol_after_SCWRL_no_h_merged_crystal_water_box.gro -cs spc216.gro -p 1J8U_nosol_after_SCWRL_no_h_merged_crystal_water.top -o 1J8U_nosol_after_SCWRL_no_h_merged_crystal_water_water.gro
3. In the next step we have to add the ions. To do so we need to use the grompp command, which is always used to prepare a system for more complicated steps. This command needs an mdp file where we can define several additional parameters.
Create parameter file addions.mdp:
And fill it with the following content:
integrator = steep emtol = 1.0 nsteps = 500 nstenergy = 1 energygrps = System coulombtype = PME rcoulomb = 0.9 rvdw = 0.9 rlist = 0.9 fourierspacing = 0.12 pme_order = 4 ewald_rtol = 1e-5
Now, we execute it with this command:
grompp -v -f addions.mdp -c 1J8U_nosol_after_SCWRL_no_h_merged_crystal_water_water.gro -p 1J8U_nosol_after_SCWRL_no_h_merged_crystal_water.top -o 1J8U_nosol_after_SCWRL_no_h_merged_crystal_water_water.tpr | tee grommp.out
4. Now we add the solvent to the system and neutralize the charge that is part of the system due to charged amino acids. To do so we use genion:
genion -s 1J8U_nosol_after_SCWRL_no_h_merged_crystal_water_water.tpr -o 1J8U_nosol_after_SCWRL_no_h_merged_crystal_water_solv.pdb -conc 0.1 -neutral -pname NA+ -nname CL- | tee genion.out
During the call we were asked for a number we selected 13.
5. Next we have to adjust the SOL value and add the number of NA+ and CL- in our 1J8U_nosol_after_SCWRL_no_h_merged_crystal_water.top
We had 25 NA+ and 21 CL- ions generated from our last step. So we had to reduce the number in the SOL field to 10170 - 46 = 10124 the .top file had in the end the following values:
[ molecules ] ; Compound #mols Protein_chain_A 1 SOL 4 SOL 10124 NA 25 CL 21
6. To make sure our initial crystal water does not overlap with the added water we use repairPDB again.
repairPDB 1J8U_nosol_after_SCWRL_no_h_merged_crystal_water_solv.pdb -cleansol > 1J8U_nosol_after_SCWRL_no_h_merged_crystal_water_solv2.pdb
7. In the last line there is a REMARK tag with the number of removed water molecules. This has to be changed in the TOP file if larger than 0.
REMARK RM 0
Value is 0 so we did not have to change anything.
8. We need to extract restraints from the structures. These restraints are useful to reduce the simulation time by disallowing very fast vibrations such as seen for hydrogen atoms. The command we use is called genrestr.
genrestr -f 1J8U_nosol_after_SCWRL_no_h_merged_crystal_water_solv2.pdb -o 1J8U_nosol_after_SCWRL_no_h_merged_crystal_water_solv2.itp | tee genrestr.out
In the command line menu we choose number 1 for protein.