Difference between revisions of "Gaucher Task06 Protocol"

From Bioinformatikpedia

Revision as of 20:23, 16 June 2012

Sources

You can check out the git repository containing all relevant data and scripts with:

git clone /mnt/home/student/angermue/mp/tasks/task06

PSSM

We created the PSSM as follows:

blastpgp -i data/P04062.seq -d $NR -j 5 -h 1e-3 -b 1000 -o pssm/all/P04062.bla -Q pssm/all/P04062.pssm

We used the script alignhits.pl from the HHsuite to select the hits most similar to the query from the PSI-BLAST result file:

alignhits.pl -Q data/P04062.seq -qsc 1.5 pssm/all/P04062.bla pssm/best/P04062.psi

The PSSM for the resulting PSI-BLAST alignment was computed as follows:

blastpgp -i data/P04062.seq -B pssm/best/P04062.psi -d $DUMMY -j 0 -Q pssm/best/P04062.pssm
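The three commands above can be chained in a small wrapper script. The following is only a sketch under stated assumptions: the names run, build_pssm and DRY_RUN, the mkdir step, and the fallback values for $NR and $DUMMY are introduced here for illustration and are not part of the original protocol. By default the script only prints the commands, so it can be inspected without BLAST being installed.

```shell
#!/bin/sh
# Sketch only: chains the three PSSM commands from this protocol.
# run, build_pssm and DRY_RUN are hypothetical names introduced for this example.
set -eu

# Placeholder database locations; the protocol only refers to them as $NR and $DUMMY.
NR=${NR:-nr}
DUMMY=${DUMMY:-dummy}
# Safe default: print the commands. Set DRY_RUN=0 to actually execute them.
DRY_RUN=${DRY_RUN:-1}

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = 1 ]; then
        echo "$*"
    else
        "$@"
    fi
}

build_pssm() {
    query=data/P04062.seq
    # Assumption: the output directories may not exist yet.
    run mkdir -p pssm/all pssm/best
    # Step 1: PSI-BLAST search against $NR, writing the hits and a first PSSM.
    run blastpgp -i "$query" -d "$NR" -j 5 -h 1e-3 -b 1000 \
        -o pssm/all/P04062.bla -Q pssm/all/P04062.pssm
    # Step 2: reduce the PSI-BLAST hits with alignhits.pl from the HHsuite.
    run alignhits.pl -Q "$query" -qsc 1.5 pssm/all/P04062.bla pssm/best/P04062.psi
    # Step 3: recompute the PSSM from the reduced alignment.
    run blastpgp -i "$query" -B pssm/best/P04062.psi -d "$DUMMY" -j 0 \
        -Q pssm/best/P04062.pssm
}

build_pssm
```

To execute the commands instead of printing them, set DRY_RUN=0 and point NR and DUMMY at the actual databases before running the script.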

SIFT

We used the online server of SIFT (http://sift.jcvi.org/www/SIFT_seq_submit2.html). A run took 10-15 min because the server first has to search a database for related sequences. As an alternative, we also used the online server of SIFT Blink (http://sift.jcvi.org/www/SIFT_BLink_submit.html). Its predictions are based on pre-computed BLAST searches and are therefore returned almost immediately. SIFT Blink requires the NCBI GI number of the protein; for our protein (UniProt id: P04062, http://www.uniprot.org/uniprot/P04062) this is 66347912 (http://www.ncbi.nlm.nih.gov/protein/CAI95090.1).

PolyPhen2

SNAP