Difference between revisions of "Sequence-based predictions"
(→Jpred3) |
(→Comparison with DSSP) |
||
Line 61: | Line 61: | ||
− | Because, the PDB sequence is not complete, the dssp assignment is also incomplete. The interessting parts - the signal peptide and the cytoplasmic part - which are predicted as disordert are not |
+ | Because, the PDB sequence is not complete, the dssp assignment is also incomplete. The interessting parts - the signal peptide and the cytoplasmic part - which are predicted as disordert are not covered by DSSP. PSIPRED and JPred predicted the transmembrane region well and also made no assignment to the - as disordert predicted - N- and C-terminus. |
UniProt: ---------------------------E<font color="#008000">EEEEEEEEE</font>E----EEE--<font color="#008000">EEEEEE</font>--<font color="#008000">EEEEE</font> |
UniProt: ---------------------------E<font color="#008000">EEEEEEEEE</font>E----EEE--<font color="#008000">EEEEEE</font>--<font color="#008000">EEEEE</font> |
Revision as of 16:54, 7 June 2011
Contents
Secondary structure prediction
PSIPRED
PSI-PRED use the PSI-BLAST output as input for a neuronal network which has a single hidden layer and a feed-forward back-propagation architecture to predict the secondary structure. PSI-PRED predict a alpha/beta structure. The transmembrane region is predicted as a beta region.
PSIPRED HFORMAT (PSIPRED V3.0) Conf: 999851589999999877513567886245556456636899750389988756755687 Pred: CCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCCCCCCEEEEEEEECCEEEEE AA: MGPRARPALLLLMLLQTAVLQGRLLRSHSLHYLFMGASEQDLGLSLFEALGYVDDQLFVF 10 20 30 40 50 60 Conf: 318998225536664688990669998865311211002358577441156788603899 Pred: ECCCCCCEEECCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCCCCCEEEE AA: YDHESRRVEPRTPWVSSRISSQMWLQLSQSLKGWDHMFTVDFWTIMENHNHSKESHTLQV 70 80 90 100 110 120 Conf: 987799319835459889765910588728988756689786135787788899999876 Pred: EEEEEEECCCEEEEEEEEEECCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHH AA: ILGCEMQEDNSTEGYWKYGYDGQDHLEFCPDTLDWRAAEPRAWPTKLEWERHKIRARQNR 130 140 150 160 170 180 Conf: 310271499889888616322000378810000468999601699981450765189996 Pred: HHHCCCHHHHHHHHHHCCCCCCCCCCCCCEEEECCCCCCCEEEEEEEEEECCCCEEEEEE AA: AYLERDCPAQLQQLLELGRGVLDQQVPPLVKVTHHVTSSVTTLRCRALNYYPQNITMKWL 190 200 210 220 230 240 Conf: 288106667520025355899875899999965999872169986699998826885259 Pred: ECCEECCCCCCCCCCCEECCCCCEEEEEEEEECCCCCCCEEEEEECCCCCCCEEEEEECC AA: KDKQPMDAKEFEPKDVLPNGDGTYQGWITLAVPPGEEQRYTCQVEHPGLDQPLIVIWEPS 250 260 270 280 290 300 Conf: 999711124320001367777622367764115889887620212359 Pred: CCCCCEEEEEEEEEEEEEEEEEEEEEEEEEECCCCCCCCCCEEECCCC AA: PSGTLVIGVISGIAVFVVILFIGILFIILRKRQGSRGAMGHYVLAERE 310 320 330 340
Jpred3
Jpred uses the Jnet algorithm which provides "a three-state (α-helix, β-strand and coil) prediction of secondary structure at an accuracy of 81.5%" <ref>http://nar.oxfordjournals.org/content/36/suppl_2/W197.abstract</ref>.
Jpred found in it's first blast search a lot of homologous hits with an e-value range from e-163 to 4e-44. There are some self hits included. We continued to the prediction which is:
Seq: MGPRARPALLLLMLLQTAVLQGRLLRSHSLHYLFMGASEQDLGLSLFEALGYVDD SS: ------HHHHHHHHHHHHH---------EEEEEEEEE-------EEEEEEEEE-- Seq: QLFVFYDHESRRVEPRTPWVSSRISSQMWLQLSQSLKGWDHMFTVDFWTIMENHN SS: EEEEEE-----EEEE----------HHHHHHHHHHHHHHHHHHHHHHHHHH---- Seq: HSKESHTLQVILGCEMQEDNSTEGYWKYGYDGQDHLEFCPDTLDWRAAEPRAWPT SS: -----EEEEEEEEEE------EEEEEEE-----EEEEEE----EEE-------HH Seq: KLEWERHKIRARQNRAYLERDCPAQLQQLLELGRGVLDQQVPPLVKVTHHVTSSV SS: HHHHH--HHHHHHHHHH------HHHHHHHHHH-H-------EEEEE-------- Seq: TTLRCRALNYYPQNITMKWLKDKQPMDAKEFEPKDVLPNGDGTYQGWITLAVPPG SS: -EEEEEEE------EEEEEEE----------EE----------EEEEEEEEE--- Seq: EEQRYTCQVEHPGLDQPLIVIWEPSPSGTLVIGVISGIAVFVVILFIGILFIILR SS: ---EEEEEEEE------EEEEE---------HHHHHHHHHHHHHHHHHHHHHHHH Seq: KRQGSRGAMGHYVLAERE SS: HH----------------
Comparison with DSSP
Because, the PDB sequence is not complete, the dssp assignment is also incomplete. The interessting parts - the signal peptide and the cytoplasmic part - which are predicted as disordert are not covered by DSSP. PSIPRED and JPred predicted the transmembrane region well and also made no assignment to the - as disordert predicted - N- and C-terminus.
UniProt: ---------------------------EEEEEEEEEEE----EEE--EEEEEE--EEEEE DSSP: --EEEEEEEEEEB-SS-SSB--EEEEEETTEEEEE PSIPRED: CCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCCCCCCEEEEEEEECCEEEEE JPred: ------HHHHHHHHHHHHH---------EEEEEEEEE-------EEEEEEEEE--EEEEE AA: MGPRARPALLLLMLLQTAVLQGRLLRSHSLHYLFMGASEQDLGLSLFEALGYVDDQLFVF DSSPSeq: RSHSLHYLFMGASEQDLGLSLFEALGYVDDQLFVF 10 20 30 40 50 60 UniProt: EEEEE--EEE--------TTTHHHHHHHHHHHHHHHHHHHHHHHHHHTTT-EEE--EEEE DSSP: EESSS--EEE-STTS-SSTTTTHHHHHHHHHHHHHHHHHHHHHHHHHTTT-SSS--EEEE PSIPRED: ECCCCCCEEECCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCCCCCEEEE JPred: E-----EEEE----------HHHHHHHHHHHHHHHHHHHHHHHHHH---------EEEEE AA: YDHESRRVEPRTPWVSSRISSQMWLQLSQSLKGWDHMFTVDFWTIMENHNHSKESHTLQV DSSPSEQ: YDHESRRVEPRTPWVSSRISSQMWLQLSQSLKGWDHMFTVDFWTIMENHNHSKESHTLQV 70 80 90 100 110 120 UniProt: EEEEEE-----EEEEEEEEE--EEEEEEEHHH-EEEEEE---HHHHHHHH---HHHHHHH DSSP: EEEEEE-TTS-EEEEEEEEETTEEEEEEEGGGTEEEESSGGGHHHHHHHHSSTHHHHHHH PSIPRED: EEEEEEECCCEEEEEEEEEECCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHH JPred: EEEEE------EEEEEEE-----EEEEEE----EEE-------HHHHHHH--HHHHHHHH AA: ILGCEMQEDNSTEGYWKYGYDGQDHLEFCPDTLDWRAAEPRAWPTKLEWERHKIRARQNR DSSPSEQ: ILGaEMQEDNSTEGYWKYGYDGQDHLEFCPDTLDWRAAEPRAWPTKLEWERHKIRARQNR 130 140 150 160 170 180 UniProt: HHHH-HHHHHHHHHHHHHTTT-------EEEEEEEE----EEEEEEEEEEEEE--EEEEE DSSP: HHHHTHHHHHHHHHHHHHTTTSS--B--EEEEEEEE-SS-EEEEEEEEEEBSS--EEEEE PSIPRED: HHHCCCHHHHHHHHHHCCCCCCCCCCCCCEEEECCCCCCCEEEEEEEEEECCCCEEEEEE JPred: HH------HHHHHHHHHH-H-------EEEEE---------EEEEEEE------EEEEEE AA: AYLERDCPAQLQQLLELGRGVLDQQVPPLVKVTHHVTSSVTTLRCRALNYYPQNITMKWL DSSPSEQ: AYLERDaPAQLQQLLELGRGVLDQQVPPLVKVTHHVTSSVTTLRbRALNYYPQNITMKWL 190 200 210 220 230 240 UniProt: E------HHH----EEEE-----EEEEEEEEE---HHHHEEEEEE---EEE-EEEE---- DSSP: ETTEE--GGGS---EEEE-TTS-EEEEEEEEE-TTGGGGEEEEEE-TTSSS-EEEE- PSIPRED: ECCEECCCCCCCCCCCEECCCCCEEEEEEEEECCCCCCCEEEEEECCCCCCCEEEEEECC JPred: E----------EE----------EEEEEEEEE------EEEEEEEE------EEEEE--- AA: KDKQPMDAKEFEPKDVLPNGDGTYQGWITLAVPPGEEQRYTCQVEHPGLDQPLIVIWEPS DSSPSEQ: KDKQPMDAKEFEPKDVLPNGDGTYQGWITLAVPPGEEQRYTbQVEHPGLDQPLIVIW 250 260 270 280 290 300 UniProt: ------------------------------------------------ DSSP: PSIPRED: CCCCCEEEEEEEEEEEEEEEEEEEEEEEEEECCCCCCCCCCEEECCCC JPred: ------HHHHHHHHHHHHHHHHHHHHHHHHHH---------------- AA: PSGTLVIGVISGIAVFVVILFIGILFIILRKRQGSRGAMGHYVLAERE DSSPSEQ: 310 320 330 340
Prediction of disordered regions
DISOPRED
For the prediction, we used the DISOPRED-Server at http://bioinf.cs.ucl.ac.uk/disopred/
Disopred predictes two disordert residues at the signal peptide and a disordert region at the end of the sequence which is located inside the cell.
AA:Target sequence Pred:Residue disorder prediction(.)= ordered residue(*)=Disordered residue conf:997600000000000000000000000000000000000000000000000000000000 pred:**.......................................................... AA:MGPRARPALLLLMLLQTAVLQGRLLRSHSLHYLFMGASEQDLGLSLFEALGYVDDQLFVF 10 20 30 40 50 60 conf:000120011000000000000000000000000000000000000000000000000000 pred:............................................................ AA:YDHESRRVEPRTPWVSSRISSQMWLQLSQSLKGWDHMFTVDFWTIMENHNHSKESHTLQV 70 80 90 100 110 120 conf:000000000000000000000000000000000000000000000000000000000000 pred:............................................................ AA:ILGCEMQEDNSTEGYWKYGYDGQDHLEFCPDTLDWRAAEPRAWPTKLEWERHKIRARQNR 130 140 150 160 170 180 conf:000000000000000000000002456777878777766530000000000000000000 pred:..............................*.*........................... AA:AYLERDCPAQLQQLLELGRGVLDQQVPPLVKVTHHVTSSVTTLRCRALNYYPQNITMKWL 190 200 210 220 230 240 conf:000035555545543000000000000000000000000000000000000001354667 pred:............................................................ AA:KDKQPMDAKEFEPKDVLPNGDGTYQGWITLAVPPGEEQRYTCQVEHPGLDQPLIVIWEPS 250 260 270 280 290 300 conf:777766643300000000000000047889999999999999898999 pred:...........................********************* AA:PSGTLVIGVISGIAVFVVILFIGILFIILRKRQGSRGAMGHYVLAERE 310 320 330 340 DISOPRED predictions for a false positive rate threshold of: 2%
POODLE
POODLE stands for Prediction Of Order and Disorder by machine LEarning.
POODLE provides three different predictions
- POODLE-S: short disorder regions prediction
- POODLE-L: long disorder regions prediction (longer 40 residues)
- unfolded protein prediction
All POODLE variants predict a disordert region at the end of the protein which contains a transmembrane region (pos: 307-330), this shows an evidance for a disordert region at the C-Terminus. But also, all variants predict a short disordered region at the begining of the sequence which is a part of the signal peptid (pos: 1-22).
POODLE-I
POODLE-I (series only) predicted 4 disordert regions within the protein sequence.
MGPRARPALLLLMLLQTAVLQGRLLRSHSLHYLFMGASEQDLGLSLFEALGYVDDQLFVF **************---------------------------------------------- YDHESRRVEPRTPWVSSRISSQMWLQLSQSLKGWDHMFTVDFWTIMENHNHSKESHTLQV -------**********---******------*--------------------------- ILGCEMQEDNSTEGYWKYGYDGQDHLEFCPDTLDWRAAEPRAWPTKLEWERHKIRARQNR ------------------------------------------------------------ AYLERDCPAQLQQLLELGRGVLDQQVPPLVKVTHHVTSSVTTLRCRALNYYPQNITMKWL ---------------------***************------------------------ KDKQPMDAKEFEPKDVLPNGDGTYQGWITLAVPPGEEQRYTCQVEHPGLDQPLIVIWEPS ----*********----------------------------------------******* PSGTLVIGVISGIAVFVVILFIGILFIILRKRQGSRGAMGHYVLAERE ************************************************
POODLE-S
POODLE-S (using missing residues) predict 6 short disordert regions within the protein sequence.
MGPRARPALLLLMLLQTAVLQGRLLRSHSLHYLFMGASEQDLGLSLFEALGYVDDQLFVF -**************--------------------------------------------- YDHESRRVEPRTPWVSSRISSQMWLQLSQSLKGWDHMFTVDFWTIMENHNHSKESHTLQV -------**********---******---------------------------------- ILGCEMQEDNSTEGYWKYGYDGQDHLEFCPDTLDWRAAEPRAWPTKLEWERHKIRARQNR ------------------------------------------------------------ AYLERDCPAQLQQLLELGRGVLDQQVPPLVKVTHHVTSSVTTLRCRALNYYPQNITMKWL ---------------------***************------------------------ KDKQPMDAKEFEPKDVLPNGDGTYQGWITLAVPPGEEQRYTCQVEHPGLDQPLIVIWEPS ----*********----------------------------------------******* PSGTLVIGVISGIAVFVVILFIGILFIILRKRQGSRGAMGHYVLAERE *--------------------------------********-------
POODLE-S (using High B-Factor residues) predict 2 short disordert regions within the protein sequence.
MGPRARPALLLLMLLQTAVLQGRLLRSHSLHYLFMGASEQDLGLSLFEALGYVDDQLFVF -*-***------------------------------------------------------ YDHESRRVEPRTPWVSSRISSQMWLQLSQSLKGWDHMFTVDFWTIMENHNHSKESHTLQV ------------------------------------------------------------ ILGCEMQEDNSTEGYWKYGYDGQDHLEFCPDTLDWRAAEPRAWPTKLEWERHKIRARQNR ------------------------------------------------******------ AYLERDCPAQLQQLLELGRGVLDQQVPPLVKVTHHVTSSVTTLRCRALNYYPQNITMKWL ------------------------------------------------------------ KDKQPMDAKEFEPKDVLPNGDGTYQGWITLAVPPGEEQRYTCQVEHPGLDQPLIVIWEPS ------------------------------------------------------------ PSGTLVIGVISGIAVFVVILFIGILFIILRKRQGSRGAMGHYVLAERE ------------------------------------------------
POODLE-L
POODLE-L predict a disorderd region from 296 to the end.
MGPRARPALLLLMLLQTAVLQGRLLRSHSLHYLFMGASEQDLGLSLFEALGYVDDQLFVF ------------------------------------------------------------ YDHESRRVEPRTPWVSSRISSQMWLQLSQSLKGWDHMFTVDFWTIMENHNHSKESHTLQV ------------------------------------------------------------ ILGCEMQEDNSTEGYWKYGYDGQDHLEFCPDTLDWRAAEPRAWPTKLEWERHKIRARQNR ------------------------------------------------------------ AYLERDCPAQLQQLLELGRGVLDQQVPPLVKVTHHVTSSVTTLRCRALNYYPQNITMKWL ------------------------------------------------------------ KDKQPMDAKEFEPKDVLPNGDGTYQGWITLAVPPGEEQRYTCQVEHPGLDQPLIVIWEPS ------------------------------------------------------****** PSGTLVIGVISGIAVFVVILFIGILFIILRKRQGSRGAMGHYVLAERE ************************************************
IUPRED
The short term prediction predict 5 short regions. Also disordert residues at the beginin in the signal peptide.
MGPRARPALLLLMLLQTAVLQGRLLRSHSLHYLFMGASEQDLGLSLFEALGYVDDQLFVF ***--------------------------------------------------------- YDHESRRVEPRTPWVSSRISSQMWLQLSQSLKGWDHMFTVDFWTIMENHNHSKESHTLQV ------------------------------------------------------------ ILGCEMQEDNSTEGYWKYGYDGQDHLEFCPDTLDWRAAEPRAWPTKLEWERHKIRARQNR ------------------------------------------------------------ AYLERDCPAQLQQLLELGRGVLDQQVPPLVKVTHHVTSSVTTLRCRALNYYPQNITMKWL ------------------------------------------------------------ KDKQPMDAKEFEPKDVLPNGDGTYQGWITLAVPPGEEQRYTCQVEHPGLDQPLIVIWEPS ---------********----------***--------*-****---------------- PSGTLVIGVISGIAVFVVILFIGILFIILRKRQGSRGAMGHYVLAERE ---------------------------------------------***
The long term prediction predictet 7 disordert residues, but just one short region.
MGPRARPALLLLMLLQTAVLQGRLLRSHSLHYLFMGASEQDLGLSLFEALGYVDDQLFVF ------------------------------------------------------------ YDHESRRVEPRTPWVSSRISSQMWLQLSQSLKGWDHMFTVDFWTIMENHNHSKESHTLQV ------------------------------------------------------------ ILGCEMQEDNSTEGYWKYGYDGQDHLEFCPDTLDWRAAEPRAWPTKLEWERHKIRARQNR ------------------------------------------------------------ AYLERDCPAQLQQLLELGRGVLDQQVPPLVKVTHHVTSSVTTLRCRALNYYPQNITMKWL ------------------------------------------------------------ KDKQPMDAKEFEPKDVLPNGDGTYQGWITLAVPPGEEQRYTCQVEHPGLDQPLIVIWEPS ---------******-------------------------*------------------- PSGTLVIGVISGIAVFVVILFIGILFIILRKRQGSRGAMGHYVLAERE ------------------------------------------------
The prediction of sturcured regions predicts one globular domain from 1-348. This means, that the whole protein is structured. This is a contradiction to the prediction of POODLE, but because of the weak evidance given by the other IUPRED-methods not a real contradiction to the other results of IUPRED.
META-Disorder
Prediction of transmembrane alpha-helices and signal peptides
TMHMM
Phobius and PolyPhobius
For Phobius and PolyPhobius, we used http://phobius.sbc.su.se/ with standart settings.
Phobius
Phobius predicts very accurate as seen below. The transmembrane region is predicted just 1-2 residues upstream from the annotated region. The same holds for the topological domains before and after the transmembrane region. Also the signal peptid is correctly predicted.
PREDICTED ANNOTATION ID sp|Q30201|HFE_HUMAN FT SIGNAL 1 21 | 1-20 FT REGION 1 7 N-REGION. FT REGION 8 16 H-REGION. FT REGION 17 21 C-REGION. FT TOPO_DOM 22 304 NON CYTOPLASMIC. | 23-306 FT TRANSMEM 305 329 | 307-330 FT TOPO_DOM 330 348 CYTOPLASMIC. | 331-348
PolyPhobius
PolyPhobius also predicts very accurate but in our case not as accurate as Phobius.
PREDICTED ANNOTATION ID sp|Q30201|HFE_HUMAN FT SIGNAL 1 23 | 1-20 FT REGION 1 5 N-REGION. FT REGION 6 19 H-REGION. FT REGION 20 23 C-REGION. FT TOPO_DOM 24 304 NON CYTOPLASMIC. | 23-306 FT TRANSMEM 305 329 | 307-330 FT TOPO_DOM 330 348 CYTOPLASMIC. | 331-348
OCTOPUS and SPOCTOPUS
Both, OCTOPUS and SPOCTOPUS predict the signal peptide and the transmembrane region correctly.
SignalP
TargetP
TargetP does not predict the signal peptide of the HFE-protein.
### targetp v1.1 prediction results ################################## Number of query sequences: 1 Cleavage site predictions not included. Using NON-PLANT networks. Name Len mTP SP other Loc RC ---------------------------------------------------------------------- sp_Q30201_HFE_HUMAN 348 0.433 0.912 0.004 S 3 ---------------------------------------------------------------------- cutoff 0.000 0.000 0.000
Prediction of GO terms
Generel
HFE is annotated with 27 different GO Terms which are <ref>http://www.ebi.ac.uk/QuickGO/GProtein?ac=Q30201</ref>:
GOID | GO Term | Aspect |
---|---|---|
GO:0002474 | antigen processing and presentation of peptide antigen via MHC class I | Process |
GO:0005515 | protein binding | Function |
GO:0005737 | cytoplasm | Component |
GO:0005769 | early endosome | Component |
GO:0005886 | plasma membrane | Component |
GO:0005887 | integral to plasma membrane | Component |
GO:0006461 | protein complex assembly | Process |
GO:0006810 | transport | Process |
GO:0006811 | ion transport | Process |
GO:0006826 | iron ion transport | Process |
GO:0006879 | cellular iron ion homeostasis | Process |
GO:0006898 | receptor-mediated endocytosis | Process |
GO:0006955 | immune response | Process |
GO:0007565 | female pregnancy | Process |
GO:0010106 | cellular response to iron ion starvation | Process |
GO:0016020 | membrane | Component |
GO:0016021 | integral to membrane | Component |
GO:0019882 | antigen processing and presentation | Process |
GO:0031410 | cytoplasmic vesicle | Component |
GO:0042446 | hormone biosynthetic process | Process |
GO:0042612 | MHC class I protein complex | Component |
GO:0045177 | apical part of cell | Component |
GO:0045178 | basal part of cell | Component |
GO:0048471 | perinuclear region of cytoplasm | Component |
GO:0055037 | recycling endosome | Component |
GO:0055072 | iron ion homeostasis | Process |
GO:0060586 | multicellular organismal iron ion homeostasis | Process |
GOPET
Gopet predicted 2 GO-Terms which have no overlab to the annotation.
GOID | Aspect | Confidence | GO Term |
---|---|---|---|
GO:0004872 | Molecular Function | 91% | receptor activity |
GO:0030106 | Molecular Function | 88% | MHC class I receptor activity |
Pfam
ProtFun 2.2
ProtFun is an ab initio prediction server.
Functional category Prob Odds
Amino_acid_biosynthesis 0.011 0.484
Biosynthesis_of_cofactors 0.105 1.452
Cell_envelope => 0.633 10.377
Cellular_processes 0.095 1.297
Central_intermediary_metabolism 0.231 3.663
Energy_metabolism 0.059 0.659
Fatty_acid_metabolism 0.016 1.265
Purines_and_pyrimidines 0.583 2.400
Regulatory_functions 0.013 0.079
Replication_and_transcription 0.019 0.073
Translation 0.079 1.801
Transport_and_binding 0.732 1.785
Enzyme/nonenzyme Prob Odds
Enzyme 0.208 0.727
Nonenzyme => 0.792 1.110
Enzyme class Prob Odds
Oxidoreductase (EC 1.-.-.-) 0.084 0.404
Transferase (EC 2.-.-.-) 0.062 0.179
Hydrolase (EC 3.-.-.-) 0.135 0.425
Lyase (EC 4.-.-.-) 0.049 1.054
Isomerase (EC 5.-.-.-) 0.010 0.321
Ligase (EC 6.-.-.-) 0.042 0.827
Gene Ontology category Prob Odds
Signal_transducer 0.201 0.939
Receptor 0.353 2.076
Hormone 0.002 0.365
Structural_protein 0.005 0.190
Transporter 0.024 0.219
Ion_channel 0.008 0.147
Voltage-gated_ion_channel 0.002 0.085
Cation_channel 0.010 0.221
Transcription 0.036 0.283
Transcription_regulation 0.018 0.147
Stress_response 0.274 3.108
Immune_response => 0.381 4.486
Growth_factor 0.013 0.943
Metal_ion_transport 0.009 0.02