Lab Journal - Task 4 (PAH)

From Bioinformatikpedia
Revision as of 10:27, 1 June 2013 by Waldraffs (talk | contribs) (Evaluate sequence alignments)

Explore structural alignments

Evaluate sequence alignments

The perl script hhmakemodel can be found in /usr/share/hhsuite/scripts. It needs a input file in form of a result file from hhsearch with hit list and alignments (-i).

-d   <pdbdirs>         directories containing the pdb files (for PDB, SCOP, or DALI
                       sequences) (default=/cluster/databases/pdb/all)
-m   <int> [<int> ...] pick hits with specified indices  (default='-m 1')
-ts  <file.pdb>        write the PDB-formatted models based on *pairwise*
                       alignments into file.pdb

So first a hhr file is crated with the FASTA file of our protein PAH and the pdb database. The output file is called hhsearch_PAH.hhr. Furthermore we set the maximal reported lines in summary and alignments to 10000:
hhsearch -i /mnt/home/student/waldraffs/Masterpraktikum/PAH.fasta -d /mnt/project/pracstrucfunc13/data/hhblits/pdb70_current_hhm_db -o /mnt/home/student/waldraffs/Masterpraktikum/Task4/hhsearch_PAH.hhr -Z 10000 -B 10000

26 entries were found. Two of them are also used in explore structural alignments: 1j8u, 3luy. We choose eight entries thereby trying to get the whole range of e-values and scores:

No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
 1 1phz_A Protein (phenylalanine  100.0  7E-165  3E-169 1181.8   0.0  429    1-429     1-429 (429)
 2 1j8u_A Phenylalanine-4-hydroxy 100.0  3E-135  1E-139  951.9   0.0  325  103-427     1-325 (325)
 6 2v27_A Phenylalanine hydroxyla 100.0 3.6E-74 1.4E-78  528.8   0.0  231  172-406    13-248 (275)
 7 2qmx_A Prephenate dehydratase;  98.2 1.1E-09   4E-14   98.4   0.0   67   33-99    199-266 (283)
 9 3luy_A Probable chorismate mut  98.1 3.3E-09 1.2E-13   97.7   0.0   67   33-99    206-274 (329)
12 1qey_A MNT-C, protein (regulat  54.0     3.4 0.00013   28.3   0.0   12  189-200    18-29  (31)
19 1wyp_A Calponin 1; CH domain,   29.4      15 0.00057   29.7   0.0   68  201-269    47-115 (136)
26 1a6s_A GAG polyprotein; core p  20.6      29  0.0011   28.0   0.0   42  141-196    42-83  (87)