CompareCath.py

From Bioinformatikpedia
Revision as of 18:07, 6 May 2013 by Betza (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

The script compareCath.py can be found on the biocluster at /mnt/home/student/betza/scripts.

usage: compareCath.py [-h] -i IFILE -q QUERY
optional arguments:
 -h, --help  show this help message and exit
 -i IFILE    with parse_output.pl created results file (default: None)
 -q QUERY    PDB id and chain of query, e.g. 1a6zA (default: None)

The input is the results file from parse_output.pl for (Psi)Blast.

In CATH, each domain of the protein is assigned to a fold class. This means, that one query protein can have several fold classes, one for each domain. This pyhton script computes the number of fold classes that each hit has in common with the specified query. The output is a histogram histogram of the number of same fold classes per protein for all pdb hits.