Sequence-based predictions
Contents
Secondary structure prediction
PSIPRED
PSI-PRED use the PSI-BLAST output as input for a neuronal network which has a single hidden layer and a feed-forward back-propagation architecture to predict the secondary structure. PSI-PRED predicts a alpha/beta structure. The transmembrane region is predicted as a beta region.
PSIPRED HFORMAT (PSIPRED V3.0) Conf: 999851589999999877513567886245556456636899750389988756755687 Pred: CCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCCCCCCEEEEEEEECCEEEEE AA: MGPRARPALLLLMLLQTAVLQGRLLRSHSLHYLFMGASEQDLGLSLFEALGYVDDQLFVF 10 20 30 40 50 60 Conf: 318998225536664688990669998865311211002358577441156788603899 Pred: ECCCCCCEEECCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCCCCCEEEE AA: YDHESRRVEPRTPWVSSRISSQMWLQLSQSLKGWDHMFTVDFWTIMENHNHSKESHTLQV 70 80 90 100 110 120 Conf: 987799319835459889765910588728988756689786135787788899999876 Pred: EEEEEEECCCEEEEEEEEEECCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHH AA: ILGCEMQEDNSTEGYWKYGYDGQDHLEFCPDTLDWRAAEPRAWPTKLEWERHKIRARQNR 130 140 150 160 170 180 Conf: 310271499889888616322000378810000468999601699981450765189996 Pred: HHHCCCHHHHHHHHHHCCCCCCCCCCCCCEEEECCCCCCCEEEEEEEEEECCCCEEEEEE AA: AYLERDCPAQLQQLLELGRGVLDQQVPPLVKVTHHVTSSVTTLRCRALNYYPQNITMKWL 190 200 210 220 230 240 Conf: 288106667520025355899875899999965999872169986699998826885259 Pred: ECCEECCCCCCCCCCCEECCCCCEEEEEEEEECCCCCCCEEEEEECCCCCCCEEEEEECC AA: KDKQPMDAKEFEPKDVLPNGDGTYQGWITLAVPPGEEQRYTCQVEHPGLDQPLIVIWEPS 250 260 270 280 290 300 Conf: 999711124320001367777622367764115889887620212359 Pred: CCCCCEEEEEEEEEEEEEEEEEEEEEEEEEECCCCCCCCCCEEECCCC AA: PSGTLVIGVISGIAVFVVILFIGILFIILRKRQGSRGAMGHYVLAERE 310 320 330 340
Jpred3
Jpred uses the Jnet algorithm which provides "a three-state (a-helix, ß-strand and coil) prediction of secondary structure at an accuracy of 81.5%" <ref>http://nar.oxfordjournals.org/content/36/suppl_2/W197.abstract</ref>.
Jpred found in it's first blast search a lot of homologous hits with an e-value range from e-163 to 4e-44. There are some self hits included. We continued to the prediction which is:
Seq: MGPRARPALLLLMLLQTAVLQGRLLRSHSLHYLFMGASEQDLGLSLFEALGYVDD SS: ------HHHHHHHHHHHHH---------EEEEEEEEE-------EEEEEEEEE-- Seq: QLFVFYDHESRRVEPRTPWVSSRISSQMWLQLSQSLKGWDHMFTVDFWTIMENHN SS: EEEEEE-----EEEE----------HHHHHHHHHHHHHHHHHHHHHHHHHH---- Seq: HSKESHTLQVILGCEMQEDNSTEGYWKYGYDGQDHLEFCPDTLDWRAAEPRAWPT SS: -----EEEEEEEEEE------EEEEEEE-----EEEEEE----EEE-------HH Seq: KLEWERHKIRARQNRAYLERDCPAQLQQLLELGRGVLDQQVPPLVKVTHHVTSSV SS: HHHHH--HHHHHHHHHH------HHHHHHHHHH-H-------EEEEE-------- Seq: TTLRCRALNYYPQNITMKWLKDKQPMDAKEFEPKDVLPNGDGTYQGWITLAVPPG SS: -EEEEEEE------EEEEEEE----------EE----------EEEEEEEEE--- Seq: EEQRYTCQVEHPGLDQPLIVIWEPSPSGTLVIGVISGIAVFVVILFIGILFIILR SS: ---EEEEEEEE------EEEEE---------HHHHHHHHHHHHHHHHHHHHHHHH Seq: KRQGSRGAMGHYVLAERE SS: HH----------------
Comparison with DSSP
Because, the PDB sequence is not complete, the dssp assignment is also incomplete. The interessting parts - the signal peptide and the cytoplasmic part - which are predicted as disordered are not covered by DSSP. PSIPRED and JPred predicted the transmembrane region well and also made no assignment to the - as disordered predicted - N- and C-terminus.
UniProt: ---------------------------EEEEEEEEEEE----EEE--EEEEEE--EEEEE DSSP: --EEEEEEEEEEB-SS-SSB--EEEEEETTEEEEE PSIPRED: CCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCCCCCCEEEEEEEECCEEEEE JPred: ------HHHHHHHHHHHHH---------EEEEEEEEE-------EEEEEEEEE--EEEEE AA: MGPRARPALLLLMLLQTAVLQGRLLRSHSLHYLFMGASEQDLGLSLFEALGYVDDQLFVF DSSPSeq: RSHSLHYLFMGASEQDLGLSLFEALGYVDDQLFVF 10 20 30 40 50 60 UniProt: EEEEE--EEE--------TTTHHHHHHHHHHHHHHHHHHHHHHHHHHTTT-EEE--EEEE DSSP: EESSS--EEE-STTS-SSTTTTHHHHHHHHHHHHHHHHHHHHHHHHHTTT-SSS--EEEE PSIPRED: ECCCCCCEEECCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCCCCCEEEE JPred: E-----EEEE----------HHHHHHHHHHHHHHHHHHHHHHHHHH---------EEEEE AA: YDHESRRVEPRTPWVSSRISSQMWLQLSQSLKGWDHMFTVDFWTIMENHNHSKESHTLQV DSSPSEQ: YDHESRRVEPRTPWVSSRISSQMWLQLSQSLKGWDHMFTVDFWTIMENHNHSKESHTLQV 70 80 90 100 110 120 UniProt: EEEEEE-----EEEEEEEEE--EEEEEEEHHH-EEEEEE---HHHHHHHH---HHHHHHH DSSP: EEEEEE-TTS-EEEEEEEEETTEEEEEEEGGGTEEEESSGGGHHHHHHHHSSTHHHHHHH PSIPRED: EEEEEEECCCEEEEEEEEEECCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHH JPred: EEEEE------EEEEEEE-----EEEEEE----EEE-------HHHHHHH--HHHHHHHH AA: ILGCEMQEDNSTEGYWKYGYDGQDHLEFCPDTLDWRAAEPRAWPTKLEWERHKIRARQNR DSSPSEQ: ILGaEMQEDNSTEGYWKYGYDGQDHLEFCPDTLDWRAAEPRAWPTKLEWERHKIRARQNR 130 140 150 160 170 180 UniProt: HHHH-HHHHHHHHHHHHHTTT-------EEEEEEEE----EEEEEEEEEEEEE--EEEEE DSSP: HHHHTHHHHHHHHHHHHHTTTSS--B--EEEEEEEE-SS-EEEEEEEEEEBSS--EEEEE PSIPRED: HHHCCCHHHHHHHHHHCCCCCCCCCCCCCEEEECCCCCCCEEEEEEEEEECCCCEEEEEE JPred: HH------HHHHHHHHHH-H-------EEEEE---------EEEEEEE------EEEEEE AA: AYLERDCPAQLQQLLELGRGVLDQQVPPLVKVTHHVTSSVTTLRCRALNYYPQNITMKWL DSSPSEQ: AYLERDaPAQLQQLLELGRGVLDQQVPPLVKVTHHVTSSVTTLRbRALNYYPQNITMKWL 190 200 210 220 230 240 UniProt: E------HHH----EEEE-----EEEEEEEEE---HHHHEEEEEE---EEE-EEEE---- DSSP: ETTEE--GGGS---EEEE-TTS-EEEEEEEEE-TTGGGGEEEEEE-TTSSS-EEEE- PSIPRED: ECCEECCCCCCCCCCCEECCCCCEEEEEEEEECCCCCCCEEEEEECCCCCCCEEEEEECC JPred: E----------EE----------EEEEEEEEE------EEEEEEEE------EEEEE--- AA: KDKQPMDAKEFEPKDVLPNGDGTYQGWITLAVPPGEEQRYTCQVEHPGLDQPLIVIWEPS DSSPSEQ: KDKQPMDAKEFEPKDVLPNGDGTYQGWITLAVPPGEEQRYTbQVEHPGLDQPLIVIW 250 260 270 280 290 300 UniProt: ------------------------------------------------ DSSP: PSIPRED: CCCCCEEEEEEEEEEEEEEEEEEEEEEEEEECCCCCCCCCCEEECCCC JPred: ------HHHHHHHHHHHHHHHHHHHHHHHHHH---------------- AA: PSGTLVIGVISGIAVFVVILFIGILFIILRKRQGSRGAMGHYVLAERE DSSPSEQ: 310 320 330 340
Prediction of disordered regions
DISOPRED
For the prediction, we used the DISOPRED-Server at http://bioinf.cs.ucl.ac.uk/disopred/
Disopred predictes two disordered residues at the signal peptide and a disordered region at the end of the sequence which is located inside the cell.
AA:Target sequence Pred:Residue disorder prediction(.)= ordered residue(*)=Disordered residue conf:997600000000000000000000000000000000000000000000000000000000 pred:**.......................................................... AA:MGPRARPALLLLMLLQTAVLQGRLLRSHSLHYLFMGASEQDLGLSLFEALGYVDDQLFVF 10 20 30 40 50 60 conf:000120011000000000000000000000000000000000000000000000000000 pred:............................................................ AA:YDHESRRVEPRTPWVSSRISSQMWLQLSQSLKGWDHMFTVDFWTIMENHNHSKESHTLQV 70 80 90 100 110 120 conf:000000000000000000000000000000000000000000000000000000000000 pred:............................................................ AA:ILGCEMQEDNSTEGYWKYGYDGQDHLEFCPDTLDWRAAEPRAWPTKLEWERHKIRARQNR 130 140 150 160 170 180 conf:000000000000000000000002456777878777766530000000000000000000 pred:..............................*.*........................... AA:AYLERDCPAQLQQLLELGRGVLDQQVPPLVKVTHHVTSSVTTLRCRALNYYPQNITMKWL 190 200 210 220 230 240 conf:000035555545543000000000000000000000000000000000000001354667 pred:............................................................ AA:KDKQPMDAKEFEPKDVLPNGDGTYQGWITLAVPPGEEQRYTCQVEHPGLDQPLIVIWEPS 250 260 270 280 290 300 conf:777766643300000000000000047889999999999999898999 pred:...........................********************* AA:PSGTLVIGVISGIAVFVVILFIGILFIILRKRQGSRGAMGHYVLAERE 310 320 330 340 DISOPRED predictions for a false positive rate threshold of: 2%
POODLE
POODLE stands for Prediction Of Order and Disorder by machine LEarning.
POODLE provides three different predictions
- POODLE-S: short disorder regions prediction
- POODLE-L: long disorder regions prediction (longer 40 residues)
- unfolded protein prediction
All POODLE variants predict a disordered region at the end of the protein which contains a transmembrane region (pos: 307-330), this shows an evidance for a disordered region at the C-Terminus. But also, all variants predict a short disordered region at the beginning of the sequence which is a part of the signal peptid (pos: 1-22).
POODLE-I
POODLE-I (series only) predicted 4 disordered regions within the protein sequence.
MGPRARPALLLLMLLQTAVLQGRLLRSHSLHYLFMGASEQDLGLSLFEALGYVDDQLFVF **************---------------------------------------------- YDHESRRVEPRTPWVSSRISSQMWLQLSQSLKGWDHMFTVDFWTIMENHNHSKESHTLQV -------**********---******------*--------------------------- ILGCEMQEDNSTEGYWKYGYDGQDHLEFCPDTLDWRAAEPRAWPTKLEWERHKIRARQNR ------------------------------------------------------------ AYLERDCPAQLQQLLELGRGVLDQQVPPLVKVTHHVTSSVTTLRCRALNYYPQNITMKWL ---------------------***************------------------------ KDKQPMDAKEFEPKDVLPNGDGTYQGWITLAVPPGEEQRYTCQVEHPGLDQPLIVIWEPS ----*********----------------------------------------******* PSGTLVIGVISGIAVFVVILFIGILFIILRKRQGSRGAMGHYVLAERE ************************************************
POODLE-S
POODLE-S (using missing residues) predicts 6 short disordered regions within the protein sequence.
MGPRARPALLLLMLLQTAVLQGRLLRSHSLHYLFMGASEQDLGLSLFEALGYVDDQLFVF -**************--------------------------------------------- YDHESRRVEPRTPWVSSRISSQMWLQLSQSLKGWDHMFTVDFWTIMENHNHSKESHTLQV -------**********---******---------------------------------- ILGCEMQEDNSTEGYWKYGYDGQDHLEFCPDTLDWRAAEPRAWPTKLEWERHKIRARQNR ------------------------------------------------------------ AYLERDCPAQLQQLLELGRGVLDQQVPPLVKVTHHVTSSVTTLRCRALNYYPQNITMKWL ---------------------***************------------------------ KDKQPMDAKEFEPKDVLPNGDGTYQGWITLAVPPGEEQRYTCQVEHPGLDQPLIVIWEPS ----*********----------------------------------------******* PSGTLVIGVISGIAVFVVILFIGILFIILRKRQGSRGAMGHYVLAERE *--------------------------------********-------
POODLE-S (using High B-Factor residues) predicts 2 short disordered regions within the protein sequence.
MGPRARPALLLLMLLQTAVLQGRLLRSHSLHYLFMGASEQDLGLSLFEALGYVDDQLFVF -*-***------------------------------------------------------ YDHESRRVEPRTPWVSSRISSQMWLQLSQSLKGWDHMFTVDFWTIMENHNHSKESHTLQV ------------------------------------------------------------ ILGCEMQEDNSTEGYWKYGYDGQDHLEFCPDTLDWRAAEPRAWPTKLEWERHKIRARQNR ------------------------------------------------******------ AYLERDCPAQLQQLLELGRGVLDQQVPPLVKVTHHVTSSVTTLRCRALNYYPQNITMKWL ------------------------------------------------------------ KDKQPMDAKEFEPKDVLPNGDGTYQGWITLAVPPGEEQRYTCQVEHPGLDQPLIVIWEPS ------------------------------------------------------------ PSGTLVIGVISGIAVFVVILFIGILFIILRKRQGSRGAMGHYVLAERE ------------------------------------------------
POODLE-L
POODLE-L predicts a disordered region from 296 to the end.
MGPRARPALLLLMLLQTAVLQGRLLRSHSLHYLFMGASEQDLGLSLFEALGYVDDQLFVF ------------------------------------------------------------ YDHESRRVEPRTPWVSSRISSQMWLQLSQSLKGWDHMFTVDFWTIMENHNHSKESHTLQV ------------------------------------------------------------ ILGCEMQEDNSTEGYWKYGYDGQDHLEFCPDTLDWRAAEPRAWPTKLEWERHKIRARQNR ------------------------------------------------------------ AYLERDCPAQLQQLLELGRGVLDQQVPPLVKVTHHVTSSVTTLRCRALNYYPQNITMKWL ------------------------------------------------------------ KDKQPMDAKEFEPKDVLPNGDGTYQGWITLAVPPGEEQRYTCQVEHPGLDQPLIVIWEPS ------------------------------------------------------****** PSGTLVIGVISGIAVFVVILFIGILFIILRKRQGSRGAMGHYVLAERE ************************************************
IUPRED
The short term prediction predicts 5 short regions. There are also disordered residues at the beginning and in the signal peptide.
MGPRARPALLLLMLLQTAVLQGRLLRSHSLHYLFMGASEQDLGLSLFEALGYVDDQLFVF ***--------------------------------------------------------- YDHESRRVEPRTPWVSSRISSQMWLQLSQSLKGWDHMFTVDFWTIMENHNHSKESHTLQV ------------------------------------------------------------ ILGCEMQEDNSTEGYWKYGYDGQDHLEFCPDTLDWRAAEPRAWPTKLEWERHKIRARQNR ------------------------------------------------------------ AYLERDCPAQLQQLLELGRGVLDQQVPPLVKVTHHVTSSVTTLRCRALNYYPQNITMKWL ------------------------------------------------------------ KDKQPMDAKEFEPKDVLPNGDGTYQGWITLAVPPGEEQRYTCQVEHPGLDQPLIVIWEPS ---------********----------***--------*-****---------------- PSGTLVIGVISGIAVFVVILFIGILFIILRKRQGSRGAMGHYVLAERE ---------------------------------------------***
The long term prediction predicted 7 disordered residues, but just one short region.
MGPRARPALLLLMLLQTAVLQGRLLRSHSLHYLFMGASEQDLGLSLFEALGYVDDQLFVF ------------------------------------------------------------ YDHESRRVEPRTPWVSSRISSQMWLQLSQSLKGWDHMFTVDFWTIMENHNHSKESHTLQV ------------------------------------------------------------ ILGCEMQEDNSTEGYWKYGYDGQDHLEFCPDTLDWRAAEPRAWPTKLEWERHKIRARQNR ------------------------------------------------------------ AYLERDCPAQLQQLLELGRGVLDQQVPPLVKVTHHVTSSVTTLRCRALNYYPQNITMKWL ------------------------------------------------------------ KDKQPMDAKEFEPKDVLPNGDGTYQGWITLAVPPGEEQRYTCQVEHPGLDQPLIVIWEPS ---------******-------------------------*------------------- PSGTLVIGVISGIAVFVVILFIGILFIILRKRQGSRGAMGHYVLAERE ------------------------------------------------
The prediction of sturcured regions predicts one globular domain from 1-348. This means, that the whole protein is structured. This is a contradiction to the prediction of POODLE, but because of the weak evidence given by the other IUPRED-methods not a real contradiction to the other results of IUPRED.
META-Disorder
For this task, we used the Server at https://www.predictprotein.org.
predicted secondary structure composision
sec str type | H | E | L |
---|---|---|---|
% in protein | 27.30 | 28.74 | 43.97 |
<A NAME="prof_body_pred">Prediction (brief)</A> <a id="show_landscape" class="toggle">(Show Landscape View)</a> <a id="show_portrait" style="display: none;" class="toggle">(Show Portrait View)</a>
....,....1....,....2....,....3....,....4....,....5....,....6 <A HREF="#AA">AA </A> MGPRARPALLLLMLLQTAVLQGRLLRSHSLHYLFMGASEQDLGLSLFEALGYVDDQLFVF <A HREF="#OHEL">OBS_sec</A> <A HREF="#PHEL">PROF_sec</A> HHHHHHHHHHHHHHHHH EEEEEEEE EEEEEEEE EEEE <A HREF="#RI_S">Rel_sec</A> 973114566667777521210010233203576754315666754156877522514676 <A HREF="#SUBsec">SUB_sec</A> LL....HHHHHHHHHH..............EEEEE...LLLLLL..EEEEEE..L..EEE <A HREF="#Obie">O_3_acc</A> bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb <A HREF="#Pbie">P_3_acc</A> ee ebbbbbbbbbbbbbbbbb b bbbbbbbbbbb ee e bbbbbbbbbbbbbbb <A HREF="#RI_A">Rel_acc</A> 520200358998765332221110210035146355212110221032547240111431 <A HREF="#SUBacc">SUB_acc</A> e......bbbbbbbb..............b.bb.bb............bbb.b....b.. ....,....7....,....8....,....9....,....10...,....11...,....12 <A HREF="#AA">AA </A> YDHESRRVEPRTPWVSSRISSQMWLQLSQSLKGWDHMFTVDFWTIMENHNHSKESHTLQV <A HREF="#OHEL">OBS_sec</A> <A HREF="#PHEL">PROF_sec</A> E HHHH HHHHHHHHHHHHHHHHHHHHHHHHHHHH EEEEE <A HREF="#RI_S">Rel_sec</A> 305544434531132102560013554332222224555556777754113566223556 <A HREF="#SUBsec">SUB_sec</A> ..LL.....L........LL....HH..........HHHHHHHHHHH....LLL...EEE <A HREF="#Obie">O_3_acc</A> bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb <A HREF="#Pbie">P_3_acc</A> b be ee ebbeeeeeee b ebbe beebe bb b bb bbe e eee bbbbb <A HREF="#RI_A">Rel_acc</A> 300011201101101221011402211211141022121015023021211231111312 <A HREF="#SUBacc">SUB_acc</A> .....................e.........e.........b.................. ....,....13...,....14...,....15...,....16...,....17...,....18 <A HREF="#AA">AA </A> ILGCEMQEDNSTEGYWKYGYDGQDHLEFCPDTLDWRAAEPRAWPTKLEWERHKIRARQNR <A HREF="#OHEL">OBS_sec</A> <A HREF="#PHEL">PROF_sec</A> EEEEEE EEEEEE EEEEE EEEE HHHHHHH HHHHHHHHH <A HREF="#RI_S">Rel_sec</A> 531232367530002354105311245404432132024100102444200024578877 <A HREF="#SUBsec">SUB_sec</A> E......LLL......E...L.....E...........................HHHHHH <A HREF="#Obie">O_3_acc</A> bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb <A HREF="#Pbie">P_3_acc</A> bbbb b eeee e bb bbbebe bb bb ee eb bb eeb e eeebee eeebe b <A HREF="#RI_A">Rel_acc</A> 221521223210110011102012141301210301101000011022211111112102 <A HREF="#SUBacc">SUB_acc</A> ...b.....................b.................................. ....,....19...,....20...,....21...,....22...,....23...,....24 <A HREF="#AA">AA </A> AYLERDCPAQLQQLLELGRGVLDQQVPPLVKVTHHVTSSVTTLRCRALNYYPQNITMKWL <A HREF="#OHEL">OBS_sec</A> <A HREF="#PHEL">PROF_sec</A> HHHHHHHHHHHHHHHHH EEE EEEEEEE EEEEEE <A HREF="#RI_S">Rel_sec</A> 777655467787654203110245666500010234686504777642134551367875 <A HREF="#SUBsec">SUB_sec</A> HHHHHH.HHHHHHH.........LLLLL........LLLL..EEEE.....LL..EEEEE <A HREF="#Obie">O_3_acc</A> bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb <A HREF="#Pbie">P_3_acc</A> ebb e b ebbeeb e bee beeeeee eb eeee b bbbbbbbbbb eb b bb <A HREF="#RI_A">Rel_acc</A> 203232623063231420210131211111201100121012809171230111636350 <A HREF="#SUBacc">SUB_acc</A> ......b...b....e..........................b.b.b.......b.b.b. ....,....25...,....26...,....27...,....28...,....29...,....30 <A HREF="#AA">AA </A> KDKQPMDAKEFEPKDVLPNGDGTYQGWITLAVPPGEEQRYTCQVEHPGLDQPLIVIWEPS <A HREF="#OHEL">OBS_sec</A> <A HREF="#PHEL">PROF_sec</A> E EEEEEEEEE EEEEEEEE EEEEE <A HREF="#RI_S">Rel_sec</A> 245302444322331012666312368887860676742678874135666514451246 <A HREF="#SUBsec">SUB_sec</A> ..L...............LLL....EEEEEEE.LLLL..EEEEE...LLLLL...E...L <A HREF="#Obie">O_3_acc</A> bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb <A HREF="#Pbie">P_3_acc</A> eeee eeee e ee beeee bbb bb b b eee bbb b bee e ebbb b ee <A HREF="#RI_A">Rel_acc</A> 221221203312100110211002010325232023213108383001031101020101 <A HREF="#SUBacc">SUB_acc</A> .............................b...........b.b................ ....,....31...,....32...,....33...,....34...,... <A HREF="#AA">AA </A> PSGTLVIGVISGIAVFVVILFIGILFIILRKRQGSRGAMGHYVLAERE <A HREF="#OHEL">OBS_sec</A> <A HREF="#PHEL">PROF_sec</A> EEEEEEHHHHHHHHHHHHHEEEEEEEE <A HREF="#RI_S">Rel_sec</A> 777522333201334455455432023454202346645544556678 <A HREF="#SUBsec">SUB_sec</A> LLLL............HH.HH.......E......LL.LL..LLLLLL <A HREF="#Obie">O_3_acc</A> bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb <A HREF="#Pbie">P_3_acc</A> eee bbbbbbbbbbbbbbbbbbbbbbbbb eeeeeee e eee <A HREF="#RI_A">Rel_acc</A> 121010326654777899899766653421221022000111000347 <A HREF="#SUBacc">SUB_acc</A> ........bbbbbbbbbbbbbbbbbb.b..................ee