Difference between revisions of "Lab Journal of Task 5 (MSUD)"

From Bioinformatikpedia
m (Modeller)
(Modeller)
Line 9: Line 9:
 
For single template modelling, we have used the structure 2BFE (99% sequence identity to 1U5B) and 3EXG(24.9% sequence identity to 1U5B).
 
For single template modelling, we have used the structure 2BFE (99% sequence identity to 1U5B) and 3EXG(24.9% sequence identity to 1U5B).
 
Computed models are stored in the directory <tt>/mnt/home/student/weish/master-practical-2013/task05/models</tt> on the student server.
 
Computed models are stored in the directory <tt>/mnt/home/student/weish/master-practical-2013/task05/models</tt> on the student server.
  +
  +
==== Multiple template modelling ====
  +
In order to evaluate the performance of homology modelling, we have also tried to use multiple structures with homologous sequence as templates for structure prediction. The Python program <tt>create_dataset.py</tt>([[Task_2_lab_journal_(MSUD)#Dataset_creation]]) by Laura Schiller was used for searching for homologous PDB structures with high and low sequence identities.
   
 
=== Swissmodel ===
 
=== Swissmodel ===

Revision as of 18:47, 8 June 2013

Calculation of models

Structure models were build with the following programs, using 2BFE and 3EXG as templates.

Modeller

Single template modelling

For single template modelling, we have used the structure 2BFE (99% sequence identity to 1U5B) and 3EXG(24.9% sequence identity to 1U5B). Computed models are stored in the directory /mnt/home/student/weish/master-practical-2013/task05/models on the student server.

Multiple template modelling

In order to evaluate the performance of homology modelling, we have also tried to use multiple structures with homologous sequence as templates for structure prediction. The Python program create_dataset.py(Task_2_lab_journal_(MSUD)#Dataset_creation) by Laura Schiller was used for searching for homologous PDB structures with high and low sequence identities.

Swissmodel

Swissmodel was run on this server. The models are stored in /mnt/home/student/schillerl/MasterPractical/task5/Swissmodel/.

iTasser

This server was used for running iTasser. For creating a model with the low sequence identity template (3EXG), "Option II" was set to exclude templates with more than 30 % sequence identity from the iTasser template library. Models are stored in /mnt/home/student/schillerl/MasterPractical/task5/iTasser/.

Evaluation of models

The calculated models were compared to the reference structure 1U5B. Since the residue numbering in the 1U5B pdb file didn't fit the query protein sequence of BCKDHA, we used the following script to change the numbering in the pdb file.


<source lang="python"> Change residue numbering in pdb file by adding an offset.

Usage: python change_residue_numbering.py <pdb file> <chain> <offset> <output file>

@author: Laura Schiller

import sys

pdb_file = open(sys.argv[1]) out_file = open(sys.argv[4], "w") chain = sys.argv[2] offset = int(sys.argv[3])

line = pdb_file.readline() while(line):

   if (line.startswith("ATOM") or line.startswith("TER")) and line[21:22] == chain:
       number = int(line[23:26]) + offset
       new_line = "%s%3d%s" % (line[0:23], number, line[26:len(line)])
       out_file.writelines(new_line)
   else:
       out_file.writelines(line)
   line = pdb_file.readline()

pdb_file.close() out_file.close() </source>


GDT

For calculating GDT_TS values, the LGA server was used with options -3 -sda -ch1:A -ch2:A.

RMSD

RMSD values were calculated on the SAP server.