Difference between revisions of "CompareCath.py"

From Bioinformatikpedia
Line 6: Line 6:
 
-i IFILE with parse_output.pl created results file (default: None)
 
-i IFILE with parse_output.pl created results file (default: None)
 
-q QUERY PDB id and chain of query, e.g. 1a6zA (default: None)
 
-q QUERY PDB id and chain of query, e.g. 1a6zA (default: None)
  +
  +
The input is the results file from parse_output.pl for (Psi)Blast.
   
 
In CATH each domain of the protein is assigned to a fold class. This means, that one query protein can have several fold classes, one for each domain.
 
In CATH each domain of the protein is assigned to a fold class. This means, that one query protein can have several fold classes, one for each domain.

Revision as of 20:34, 5 May 2013

The script compareCath.py can be found on the biocluster at /home/alexander/studium/master-prac/scripts.

usage: compareCath.py [-h] -i IFILE -q QUERY
optional arguments:
 -h, --help  show this help message and exit
 -i IFILE    with parse_output.pl created results file (default: None)
 -q QUERY    PDB id and chain of query, e.g. 1a6zA (default: None)

The input is the results file from parse_output.pl for (Psi)Blast.

In CATH each domain of the protein is assigned to a fold class. This means, that one query protein can have several fold classes, one for each domain. This pyhton script computes the number of fold classes that each hit has in common with the specified query. The output is a histogram histogram of the number of same fold classes per protein for all pdb hits.