Difference between revisions of "Task 2: Multiple Sequence Alignment"

From Bioinformatikpedia
(Blast)
(MSA)
Line 26: Line 26:
 
== Psi-Blast ==
 
== Psi-Blast ==
 
== HHBlits ==
 
== HHBlits ==
== MSA ==
+
= MSA =
  +
 
= clustalW =
 
= clustalW =
 
= T-Coffee =
 
= T-Coffee =

Revision as of 19:56, 7 May 2012

Sorry, were behind scedule, page will be filled with content as soon as possible.


We researched the protein sequence of the branched-chain alpha-keto acid dehydrogenase complex subunit alpha (BCKDHA) with the following original sequence:

  • BCKDHA
>sp|P12694|ODBA_HUMAN 2-oxoisovalerate dehydrogenase subunit alpha, mitochondrial OS=Homo sapiens GN=BCKDHA PE=1 SV=2
MAVAIAAARVWRLNRGLSQAALLLLRQPGARGLARSHPPRQQQQFSSLDDKPQFPGASAE
FIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYKSMTLLNTMDRILY
ESQRQGRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYRDYPLELFMAQCYG
NISDLGKGRQMPVHYGCKERHFVTISSPLATQIPQAVGAAYAAKRANANRVVICYFGEGA
ASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIAARGPGYGIMSIRVDG
NDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRSVDEVNYWDKQDHP
ISRLRHYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNLLFSDVYQEMPAQL
RKQQESLARHLQTYGEHYPLDHFDK


Blast

To calculate the sequence alignments we used the blast and psiblast binaries from NCBI (version 2.2.26+) As the standard blast alignment hits the limit of 250 matches per alignment, and all of them still seemed very significant (Evalue of < 1e-60) we increased the number of max target hits to 2000 and set an Evalue threshold of 0.002. With this method we found about 1550 matching alignments.

As can be seen in the figure to the right

Distibution of sequence similarity with the BCKDHA blast-query against the big80 database.

, the sequence alignments mainly have a similarity between 15% and 40%.

Distribution of evalues in BLAST.

Psi-Blast

HHBlits

MSA

clustalW

T-Coffee