Canavan Disease: Task 03 - Journal
From Bioinformatikpedia
Link back to Task 03: Sequence-based Predictions
Contents
Task 3 Working Log
Secondary structure prediction
- creation of pssm files via psi-blast
blastpgp -i /mnt/home/student/.../data/P45381.fasta -o /mnt/home/student/.../aspa_big80.out -d /mnt/project/pracstrucfunc13/data/big/big_80 -C /mnt/home/student/.../aspa_big80.chk -Q /mnt/home/student/.../aspa_big80.pssm -h 10e-10 -j 3) where: -i input file -o outfile -d database to search against -C checkfile -Q pssm file -h eVaule cutoff -j number of iterations
- pssm from big 80
- pssm from swissprot
- without pssm
- DSSP taken as "truth" as DSSP assigns secondary structure (does not predict) from the atomic coordinates
- the precision for each prediction method was calculated
- the results show that PSI-Pred shows the best precision
- how ever within ReProf ReProf with sequence profile from Big_80 shows the best result
- Reprof with Big_80 PSSM used for further predictions, however for comparison PSI-PRED runs are made as well
Disorder
- creation of the IUPred predictions via run_iupred.sh
- creation of analysis via disorder_statistics.py
- applicable for both IUPred and Metadisorder
- finding the right match for P10775 in disprot a sequence search had to be initiated (swiss-Waterman and PSI-Pred on the disprot-website)
TMH prediction
- Polyphobius
- blastget index file creation
/mnt/project/pracstrucfunc13/polyphobius/blastget -ix swiss_p.idx -create /mnt/project/pracstrucfunc13/data/swissprot/uniprot_sprot.fasta
- blast get for the single files
/mnt/project/pracstrucfunc13/polyphobius/blastget -ix swiss_p.idx -db /mnt/project/pracstrucfunc13/data/swissprot/uniprot_sprot -ix swiss_p.idx ../data/query_seqs/TMH/P45381.fasta >> P45381.blastget.out
/mnt/opt/T-Coffee/bin/kalign -i P45381.blastget.out -o P45381.kalign.out
/mnt/project/pracstrucfunc13/polyphobius/jphobius -poly P45381.kalign.out >> P45381.polyph.out
- Your job is in the queue under the name: P45381 with the job ID: cc3e3788-c45a-11e2-add6-00163e110593
- Your job is in the queue under the name: P35462 with the job ID: 4cf4b7b0-c3c7-11e2-840f-00163e110593
- Your job is in the queue under the name: P47863 with the job ID: 6799d884-c3c7-11e2-8b61-00163e110593
- Your job is in the queue under the name: Q9YDF8 with the job ID: 81e7ddda-c3c7-11e2-8b61-00163e110593
- http://bioinf.cs.ucl.ac.uk/psipred/result/81e7ddda-c3c7-11e2-8b61-00163e110593
- Difficulty to find the right protein to compare Q9YDF8 to (in OMP/PDBTM) -> how that was achieved is explained in the wiki
SignalP
- for creation of the signalP outfiles SginalP version 4.1 was used (the web server)
GOterms:
- GoPet see xml file
- Protfun used via the webserver
- PFam see: http://pfam.sanger.ac.uk/protein/P45381