Difference between revisions of "Task Structural Alignments"

From Bioinformatikpedia
m (Theoretical background talk)
m (Explore structural alignments)
Line 17: Line 17:
 
== Explore structural alignments ==
 
== Explore structural alignments ==
   
  +
* Assemble a set of 8 to 9 structures related to your protein. These structures should span the range of similarities from almost identical to completely unrelated. You can take structures found in the sequence search and you can go to CATH. E.g.
  +
** one or two structure with identical sequence (ideally once with filled binding site, once unfilled, so you can make one pair with similar binding site status, one with different)
  +
** one similar sequence (>60% seq. identity)
  +
** one rather unrelated sequence (<30% seq. identity)
  +
** one arbitrary structure with a CATH code which is identical to your protein at each of these levels:
  +
*** CAT
  +
*** CA
  +
*** C
  +
** on arbitrary structure from a different CATH class
  +
  +
* Apply different structural alignment methods to these structures:
  +
** use Pymol (will only work on more closely related structures)
  +
**
   
 
== Use structural alignments to evaluate sequence alignments ==
 
== Use structural alignments to evaluate sequence alignments ==

Revision as of 05:36, 27 May 2013

In order to evaluate the similarity between protein structures, the structures have to be superimposed in 3D. A multitude of methods are available to achieve this task. Also, there are many different measures to quantify structural similarity. In this task we will explore different methods and compare different measures to get a feeling for the structural similarity they imply. We will then apply structural alignment to evaluate some sequence-based alignments generated in Task 2 (Run sequence searches on the disease gene product and produce alignments).

Theoretical background talk

The introductory talks should given an overview of

Explore structural alignments

  • Assemble a set of 8 to 9 structures related to your protein. These structures should span the range of similarities from almost identical to completely unrelated. You can take structures found in the sequence search and you can go to CATH. E.g.
    • one or two structure with identical sequence (ideally once with filled binding site, once unfilled, so you can make one pair with similar binding site status, one with different)
    • one similar sequence (>60% seq. identity)
    • one rather unrelated sequence (<30% seq. identity)
    • one arbitrary structure with a CATH code which is identical to your protein at each of these levels:
      • CAT
      • CA
      • C
    • on arbitrary structure from a different CATH class
  • Apply different structural alignment methods to these structures:
    • use Pymol (will only work on more closely related structures)

Use structural alignments to evaluate sequence alignments