Homology based structure predictions BCKDHA

1.Calculation of models

To find similar structures to BCKDHA we ran HHsearch:
hhsearch -i query -d database -o output

It found the following 10 hits in the pdb70 database.

No	Hit	Prob	E-value	P-value	Score	Cols	Query HMM	Template HMM	Identity
1	2bfd_A 2-oxoisovalerate dehydr	1.0	1	1	791.3	400	1-400	1-400 (400)	99%
2	1qs0_A 2-oxoisovalerate dehydr	1.0	1	1	571.5	349	32-382	52-407 (407)	39%
3	1w85_A Pyruvate dehydrogenase	1.0	1	1	530.8	356	8-382	6-362 (368)	34%
4	1umd_A E1-alpha, 2-OXO acid de	1.0	1	1	521.8	351	34-386	16-367 (367)	37%
5	2ozl_A PDHE1-A type I, pyruvat	1.0	1	1	482.7	331	46-380	25-356 (365)	27%
6	3l84_A Transketolase; TKT, str	1.0	1	1	85.4	133	161-297	113-252 (632)	21%
7	2r8o_A Transketolase 1, TK 1;	1.0	1	1	74.5	121	161-285	113-245 (669)	33%
8	2o1x_A 1-deoxy-D-xylulose-5-ph	1.0	1	1	74.2	127	161-287	122-254 (629)	18%
9	1gpu_A Transketolase; transfer	1.0	1	1	74.2	140	161-302	115-265 (680)	22%
10	3m49_A Transketolase; alpha-be	1.0	1	1	68.8	121	161-285	139-271 (690)	31%

> 60% sequence identity:
-2bfd_A
> 40% sequence identity:
< 40% sequence identity (ideally go towards 20%) :
-1qs0_A, 1umd_A, 1w85_A, 2r8o_A, 3m49_A, 2ozl_A, 1gpu_A, 3l84_A, 2o1x_A, -1w85_A

HHSearch has only hits with an identity higher than 60% or lower than 40%.

These are the templates we will work with:
> 60% sequence identity:
-2bfd_A
< 40% sequence identity (ideally go towards 20%) :
-2r8o_A

Modeller

MODELLER is used for homology or comparative modeling of protein three-dimensional structures.It calculates a model containing all non-hydrogen atoms. There are also many other tasks provided by MODELLER like de novo modeling of loops in protein structures, optimization of various models of protein structure with respect to a flexibly defined objective function, multiple alignment of protein sequences and/or structures, clustering, searching of sequence databases, comparison of protein structures, etc.[1]

A tutorial is provided on [2] and on [3]

To run modeller with more than one target we use the targets:
-1dtw:A 95%
-2bfe:A 94%
-2bfb:A 99%
-2bfd:A 99%
-1gpu:A 22%
-2o1x:A 18%
-2r8o:A 33%

Protocol Modeller

SWISS-MODEL

SWISS-MODEL server page

To find protein structure homology models SWISS-MODEL can be used. As input it needs a protein sequence or a UniProt AC Code. Optional the template PDB-Id and the chain or a template file can be assigned. SWISS-MODEL is a fully automated protein structure homology-modeling server. It is accessible via the ExPASy web server, or from the program DeepView (Swiss Pdb-Viewer).
SWISS-MODEL

SWISS-MODEL server:

ID	link
2bfd_A	2bfd_A
2r8o_A	2r8o_A

Prediction for 2bfd_A

TARGET    51      KPQFPGAS AEFIDKLEFI QPNVISGIPI YRVMDRQGQI INPSEDPHLP
2bfdA     6       kpqfpgas aefidklefi qpnvisgipi yrvmdrqgqi inpsedphlp
                                                                     
TARGET                                             sss   ss s         
2bfdA                                              sss   ss s         


TARGET    99    KEKVLKLYKS MTLLNTMDRI LYESQRQGRI SFYMTNYGEE GTHVGSAAAL
2bfdA     54    kekvlklyks mtllntmdri lyesqrqgri sfymtnygee gthvgsaaal
                                                                     
TARGET          hhhhhhhhhh hhhhhhhhhh hhhhhhh             h hhhhhhhh  
2bfdA           hhhhhhhhhh hhhhhhhhhh hhhhhhh             h hhhhhhhh  


TARGET    149   DNTDLVFGQY REAGVLMYRD YPLELFMAQC YGNISDLGKG RQMPVHYGCK
2bfdA     104   dntdlvfgqa reagvlmyrd yplelfmaqc ygnisdlgkg rqmpvhygck
                                                                     
TARGET              sss       hhhhh     hhhhhhhh h                    
2bfdA               sss       hhhhhh    hhhhhhhh h                    


TARGET    199   ERHFVTISSP LATQIPQAVG AAYAAKRANA NRVVICYFGE GAASEGDAHA
2bfdA     154   erhfvtissp latqipqavg aayaakrana nrvvicyfge gaasegdaha
                                                                     
TARGET                        hhhhhhh hhhhhhhh     ssssssss hhh   hhhh
2bfdA                         hhhhhhh hhhhhhhh     ssssssss hhh   hhhh


TARGET    249   GFNFAATLEC PIIFFCRNNG YAISTPTSEQ YRGDGIAARG PGYGIMSIRV
2bfdA     204   gfnfaatlec piiffcrnng yaistptseq yrgdgiaarg pgygimsirv
                                                                     
TARGET          hhhhhhhh   ssssssss                    hhhh hhh  sssss
2bfdA           hhhhhhhh   ssssssss                    hhhh hhh  sssss


TARGET    299   DGNDVFAVYN ATKEARRRAV AENQPFLIEA MTYRIGHHST SDDSSAYRSV
2bfdA     254   dgndvfavyn atkearrrav aenqpfliea mtyrig---- ----------
                                                                     
TARGET          ss  hhhhhh hhhhhhhhhh hh   sssss ss                   
2bfdA           ss  hhhhhh hhhhhhhhhh hh   sssss ss                   


TARGET    349   DEVNYWDKQD HPISRLRHYL LSQGWWDEEQ EKAWRKQSRR KVMEAFEQAE
2bfdA     292   -------std hpisrlrhyl lsqgwwdeeq ekawrkqsrr kvmeafeqae
                                                                      
TARGET                      hhhhhhhh    h    hhh hhhhhhhhhh hhhhhhhhhh
2bfdA                       hhhhhhhh    h    hhh hhhhhhhhhh hhhhhhhhhh


TARGET    399   RKPKPNPNLL FSDVYQEMPA QLRKQQESLA RHLQTYGEHY PLDHFDK   
2bfdA     354   rkpkpnpnll fsdvyqempa qlrkqqesla rhlqtygehy pldhfdk-  
                                                                     
TARGET          h                   h hhhhhhhhhh hhhhh                
2bfdA           h                   h hhhhhhhhhh hhhhh

Prediction for 2r8o_A

TARGET    152           DL -VFG-QYREA ---GVLMYRD --YPLELFMA QCYGNISDLG
2r8oA     52    pswadr--dr fvlsnghgsm liysllhltg ydlpmeelkn -f-rql----
                                                                     
TARGET                              s    ssssss              sssssssss
2r8oA                    s ssss     h hhhhhhhh        hhhh            


TARGET    187   KGRQMPVHYG CK-ERHFVTI SSPLATQIPQ AVGAAYAAKR AN--------
2r8oA     94    -hsktpghpe vgytagvett tgplgqgian avgmaiaekt laaqfnrpgh
                                                                     
TARGET          s  sssss        hhhhh h     hhhh hhhhhhhhhh h         
2r8oA                                       hhhh hhhhhhhhhh hhhhh     


TARGET    228   --ANRVVICY FGEGAASEGD AHAGFNFAAT LEC-PIIFFC RNNGYAISTP
2r8oA     143   divdhytyaf mgdgcmmegi shevcslagt lklgkliafy ddngisidgh
                                                                     
TARGET                ssss s hhhh   h hhhhhhhhhh h    sssss ss sss  ss
2r8oA                sssss s hhhh   h hhhhhhhhhh h   ssssss ss sss  ss


TARGET    275   TSEQYRGDGI AARGPGYGIM SIR-VDGNDV FAVYNATKEA RRRAVAENQP
2r8oA     193   vegwft-ddt amrfeaygwh virdidghda asikraveea ra---vtdkp
                                                                     
TARGET          s        h hhhhhhh  s sss sss  h hhhhhhhhhh h        s
2r8oA           s        h hhhhhh   s ss  sss  h hhhhhhhhhh hh       s


TARGET    324   FLIEAMTYRI GHHSTSDDSS ----AYRSVD EVNYWDKQ - ----------
2r8oA     239   sllmcktiig fgspnkagth dshgaplgda eialtreqlg wkyapfeips
                                                                      
TARGET          sssssss                       hh hhhhhhhh             
2r8oA           sssssss               hh      hh hhhhhhhhh           h

iTasser

2bfd_A

This prediction is based on several templates fount by iTasser itself.

Protocol Swissmodel

2.Evaluation of models

As we want to compare the predicted models to the existing PDB entry we need the PDB file for 1U5B. Here only chain A is important, as the sequence for chain A was used to create the models. Therefore chain B has to be removed from the PDB file. Herefore we used a program names ExtractChains.pl provided by <ref>http://www.rosettacommons.org/guide/PDB+Manipulation+Scripts</ref>.

Modeller

Numeric evaluation

Comparison to experimental structure

experimental structure	model with template	C-alpha RMSD
1U5B_A	2BFD_A	2.3 [4]
1U5B_A	2R8O_A	7.3 [5]
1U5B_A	1DTW_A, 2BFE_A, 2BFB_A, 2BFD_A, 1GPU_A, 2O1X_A, 2R8O_A	1.2 [6]

Swissmodel

Numeric evaluation

QMEAN4 global scores

QMEANscore4

2bfd_A	2r8o_A
0.67	0.271

QMEAN Z-Score

2bfd_A	2r8o_A
-1.604	-6.943
Z-Score plot1 2bfd_A	Z-Score plot1 2r8o_A
Z-Score plot2 2bfd_A	Z-Score plot2 2r8o_A

Score components

2bfd_A	2r8o_A
score components 2bfd_A	score components 2r8o_A

Local scores

2bfd_A	2r8o_A
Coloring by residue error 2bfd_A	Coloring by residue error 2r8o_A
Residue error plot 2bfd_A	Residue error plot 2r8o_A

Global scores: QMEAN4:

	2bfd_A		2r8o_A
Scoring function term	Raw score	Z-score	Raw score	Z-score
C_beta interaction energy	-162.66	0.54	-47.91	-1.49
All-atom pairwise energy	-10811.93	0.35	-2558.65	-1.98
Solvation energy	-27.04	-1.02	10.53	-4.08
Torsion angle energy	-75.78	-1.45	18.95	-4.99
QMEAN4 score	0.670	-1.60	0.271	-6.94

Local Model Quality Estimation

2bfd_A	2r8o_A
Local Model Quality Estimation 2bfd_A	Local Model Quality Estimation 2r8o_A

Comparison to experimental structure

experimental structure	model with template	C-alpha RMSD
1U5B_A	2BFD_A	1.1 [7]
1U5B_A	2R8O_A	2.9 [8]

Homology based structure predictions BCKDHA

Contents

1.Calculation of models

Modeller

SWISS-MODEL

iTasser

2.Evaluation of models

Modeller

Numeric evaluation

Comparison to experimental structure

Swissmodel

Numeric evaluation

Comparison to experimental structure

iTasser

Numeric evaluation

Navigation menu

Views

Personal tools

Bioinformatik navigation

MediaWiki navigation

Search

Tools