Gaucher Disease: Task 02 - Alignments

From Bioinformatikpedia
Revision as of 22:28, 5 May 2013 by Gerkej (talk | contribs) (Description)

Alignments allow a comparisons of Strings. In the field of bioinformatics, sequence alignments show the relation between two or more sequences.

Theoretical Background

Description

Pairwise or multiple alignments opten contain an aditional line below the proper alignment. This line gives a more accurate description of the aligned residues above. The symbols show if there is match between identical amino acids or if they are only similar.

Symbols for describing sequence alignments
Symbol Example Meaning
* blub identical residues
:
.

Sequence Searches

Sequence searches with our query protein sequence, P04062.fasta, were done with the following programs:

  • BLAST
    • using standard parameters
    • against big80
  • Psi-BLAST
    • with number of shown hits and alignments set to 10000 (-b, -v options)
    • with all combinations of:
      • 2 iterations: 1 iterations against big_80 followed by 1 iteration against big
      • 10 iterations: 9 iterations against big_80 followed by 1 iteration against big
      • default E-value cutoff (0.002)
      • E-value cutoff 10E-10
    • other options leaved default
  • HHblits
    • with number of shown hits and alignments set to 10000 (-b, -v options)
    • with all combinations of:
      • 2 iterations against uniprot_20
      • 10 iterations against uniprot_20
      • default E-value cutoff (0.002)
      • E-value cutoff 10E-10
    • other options leaved default

The script run.pl was written and used for the runs. PSSM files - a3m and hhr for HHblits, chk ("checkpoint") and PSSM for Psi-BLAST were created in order to start the search against another database, from big_80 to big for Psi-BLAST and later against a PDB database for the evaluation.

Comparison

Evaluation

Multiple Sequence Alignment

ClustalW

Muscle

T-Coffee

Alignment Comparison