Difference between revisions of "Lab Journal - Task 4 (PAH)"

From Bioinformatikpedia
(Evaluate sequence alignments)
(Evaluate sequence alignments)
Line 11: Line 11:
 
So first a hhr file is crated with the FASTA file of our protein PAH and the pdb database. The output file is called hhsearch_PAH.hhr. Furthermore we set the maximal reported lines in summary and alignments to 10000:<br>
 
So first a hhr file is crated with the FASTA file of our protein PAH and the pdb database. The output file is called hhsearch_PAH.hhr. Furthermore we set the maximal reported lines in summary and alignments to 10000:<br>
 
<code>
 
<code>
hhsearch -i /mnt/home/student/waldraffs/Masterpraktikum/PAH.fasta -d /mnt/project/pracstrucfunc13/data/hhblits/pdb70_current_hhm_db -o hhsearch_PAH.hhr -Z 10000 -B 10000
+
hhsearch -i /mnt/home/student/waldraffs/Masterpraktikum/PAH.fasta -d /mnt/project/pracstrucfunc13/data/hhblits/pdb70_current_hhm_db -o /mnt/home/student/waldraffs/Masterpraktikum/Task4/hhsearch_PAH.hhr -Z 10000 -B 10000
 
</code>
 
</code>
  +
  +
26 entries were found. Two of them are also used in explore structural alignments: 1j8u, 3luy.
  +
We choose eight entries thereby trying to get the whole range of e-values and scores:
  +
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
  +
1 1phz_A Protein (phenylalanine 100.0 7E-165 3E-169 1181.8 0.0 429 1-429 1-429 (429)
  +
2 1j8u_A Phenylalanine-4-hydroxy 100.0 3E-135 1E-139 951.9 0.0 325 103-427 1-325 (325)
  +
6 2v27_A Phenylalanine hydroxyla 100.0 3.6E-74 1.4E-78 528.8 0.0 231 172-406 13-248 (275)
  +
7 2qmx_A Prephenate dehydratase; 98.2 1.1E-09 4E-14 98.4 0.0 67 33-99 199-266 (283)
  +
9 3luy_A Probable chorismate mut 98.1 3.3E-09 1.2E-13 97.7 0.0 67 33-99 206-274 (329)
  +
12 1qey_A MNT-C, protein (regulat 54.0 3.4 0.00013 28.3 0.0 12 189-200 18-29 (31)
  +
19 1wyp_A Calponin 1; CH domain, 29.4 15 0.00057 29.7 0.0 68 201-269 47-115 (136)
  +
26 1a6s_A GAG polyprotein; core p 20.6 29 0.0011 28.0 0.0 42 141-196 42-83 (87)

Revision as of 10:27, 1 June 2013

Explore structural alignments

Evaluate sequence alignments

The perl script hhmakemodel can be found in /usr/share/hhsuite/scripts. It needs a input file in form of a result file from hhsearch with hit list and alignments (-i).

-d   <pdbdirs>         directories containing the pdb files (for PDB, SCOP, or DALI
                       sequences) (default=/cluster/databases/pdb/all)
-m   <int> [<int> ...] pick hits with specified indices  (default='-m 1')
-ts  <file.pdb>        write the PDB-formatted models based on *pairwise*
                       alignments into file.pdb

So first a hhr file is crated with the FASTA file of our protein PAH and the pdb database. The output file is called hhsearch_PAH.hhr. Furthermore we set the maximal reported lines in summary and alignments to 10000:
hhsearch -i /mnt/home/student/waldraffs/Masterpraktikum/PAH.fasta -d /mnt/project/pracstrucfunc13/data/hhblits/pdb70_current_hhm_db -o /mnt/home/student/waldraffs/Masterpraktikum/Task4/hhsearch_PAH.hhr -Z 10000 -B 10000

26 entries were found. Two of them are also used in explore structural alignments: 1j8u, 3luy. We choose eight entries thereby trying to get the whole range of e-values and scores:

No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
 1 1phz_A Protein (phenylalanine  100.0  7E-165  3E-169 1181.8   0.0  429    1-429     1-429 (429)
 2 1j8u_A Phenylalanine-4-hydroxy 100.0  3E-135  1E-139  951.9   0.0  325  103-427     1-325 (325)
 6 2v27_A Phenylalanine hydroxyla 100.0 3.6E-74 1.4E-78  528.8   0.0  231  172-406    13-248 (275)
 7 2qmx_A Prephenate dehydratase;  98.2 1.1E-09   4E-14   98.4   0.0   67   33-99    199-266 (283)
 9 3luy_A Probable chorismate mut  98.1 3.3E-09 1.2E-13   97.7   0.0   67   33-99    206-274 (329)
12 1qey_A MNT-C, protein (regulat  54.0     3.4 0.00013   28.3   0.0   12  189-200    18-29  (31)
19 1wyp_A Calponin 1; CH domain,   29.4      15 0.00057   29.7   0.0   68  201-269    47-115 (136)
26 1a6s_A GAG polyprotein; core p  20.6      29  0.0011   28.0   0.0   42  141-196    42-83  (87)