Difference between revisions of "Gaucher Task06 Protocol"

From Bioinformatikpedia
(SIFT)
(SIFT)
Line 24: Line 24:
   
 
We used the [http://sift.jcvi.org/www/SIFT_seq_submit2.html online server of SIFT]. It took a little bit long (10-15 min) because they have to search for the related sequences in database.<br/>
 
We used the [http://sift.jcvi.org/www/SIFT_seq_submit2.html online server of SIFT]. It took a little bit long (10-15 min) because they have to search for the related sequences in database.<br/>
  +
 
Input: the protein sequence [http://www.uniprot.org/uniprot/P04062.fasta P04062], the list of Mutations. Other setting default.
 
Input: the protein sequence [http://www.uniprot.org/uniprot/P04062.fasta P04062], the list of Mutations. Other setting default.
   
 
Alternatively, the [http://sift.jcvi.org/www/SIFT_BLink_submit.html online server of SIFT Blink] was used. The predictions there are based on pre-computed BLAST searches, therefore are returned almost immediately. For SIFT Blink, we should provide the corresponding NCBI GI number ([http://www.ncbi.nlm.nih.gov/protein/CAI95090.1 66347912]) for our protein (UniProt id: [http://www.uniprot.org/uniprot/P04062 P04062]).<br/>
 
Alternatively, the [http://sift.jcvi.org/www/SIFT_BLink_submit.html online server of SIFT Blink] was used. The predictions there are based on pre-computed BLAST searches, therefore are returned almost immediately. For SIFT Blink, we should provide the corresponding NCBI GI number ([http://www.ncbi.nlm.nih.gov/protein/CAI95090.1 66347912]) for our protein (UniProt id: [http://www.uniprot.org/uniprot/P04062 P04062]).<br/>
  +
 
Input: the corresponding NCBI GI number ([http://www.ncbi.nlm.nih.gov/protein/CAI95090.1 66347912]), the list of Mutations. Other setting default.
 
Input: the corresponding NCBI GI number ([http://www.ncbi.nlm.nih.gov/protein/CAI95090.1 66347912]), the list of Mutations. Other setting default.
   

Revision as of 20:31, 16 June 2012

Sources

You can checkout the git repository containing all relevant data an scripts by:

git clone /mnt/home/student/angermue/mp/tasks/task06

PSSM

We created the PSSM as follows:

blastpgp -i data/P04062.seq -d $NR -j 5 -h 1e-3 -b 1000 -o pssm/all/P04062.bla -Q pssm/all/P04062.pssm

We used the script alignhits.pl from the HHsuite for filtering out the most similar hits from the PSI-BLAST result file:

alignhits.pl -Q data/P04062.seq -qsc 1.5 pssm/all/P04062.bla pssm/best/P04062.psi

The PSSM for the resulting PSI-BLAST alignment was computed as follows:

blastpgp -i data/P04062.seq -B pssm/best/P04062.psi -d $DUMMY -j 0 -Q pssm/best/P04062.pssm

SIFT

We used the online server of SIFT. It took a little bit long (10-15 min) because they have to search for the related sequences in database.

Input: the protein sequence P04062, the list of Mutations. Other setting default.

Alternatively, the online server of SIFT Blink was used. The predictions there are based on pre-computed BLAST searches, therefore are returned almost immediately. For SIFT Blink, we should provide the corresponding NCBI GI number (66347912) for our protein (UniProt id: P04062).

Input: the corresponding NCBI GI number (66347912), the list of Mutations. Other setting default.

PlyPhen2

We used the online server of PolyPhen-2.

SNAP