Difference between revisions of "Reference Sequence BCKDHA"

From Bioinformatikpedia
(Fasta Sequence of the BCKDHA gene)
 
(Mutated sequence)
 
(42 intermediate revisions by 2 users not shown)
Line 1: Line 1:
 
 
== Sequence ==
 
== Sequence ==
  +
* Uniprot:
  +
>sp|P12694|ODBA_HUMAN 2-oxoisovalerate dehydrogenase subunit alpha, mitochondrial OS=Homo sapiens GN=BCKDHA PE=1 SV=2
  +
  +
<tt>
  +
MAVAIAAARVWRLNRGLSQAALLLLRQPGARGLARSHPPRQQQQFSSLDDKPQFPGASAE<br>
  +
FIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYKSMTLLNTMDRILY<br>
  +
ESQRQGRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYRDYPLELFMAQCYG<br>
  +
NISDLGKGRQMPVHYGCKERHFVTISSPLATQIPQAVGAAYAAKRANANRVVICYFGEGA<br>
  +
ASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIAARGPGYGIMSIRVDG<br>
  +
NDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRSVDEVNYWDKQDHP<br>
  +
ISRLRHYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNLLFSDVYQEMPAQL<br>
  +
RKQQESLARHLQTYGEHYPLDHFDK
  +
</tt>
  +
  +
Sequence info: [http://www.uniprot.org/uniprot/P12694]
  +
The Uniprot sequence is 445 aa long, as is contains the transit peptide sequence from position 1-45.
  +
  +
*PDB:
  +
>1U5B:A|PDBID|CHAIN|SEQUENCE
  +
  +
<tt>
  +
SSLDDKPQFPGASAEFIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYKSMTLLNTMDRILYESQRQ<br>
  +
GRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYRDYPLELFMAQCYGNISDLGKGRQMPVHYGCKERHFVTI<br>
  +
SSPLATQIPQAVGAAYAAKRANANRVVICYFGEGAASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIA<br>
  +
ARGPGYGIMSIRVDGNDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRSVDEVNYWDKQDHPISRLR<br>
  +
HYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNLLFSDVYQEMPAQLRKQQESLARHLQTYGEHYPLDHFDK<br>
  +
</tt>
  +
  +
Sequence info: [http://www.pdb.org/pdb/explore/remediatedSequence.do?structureId=1U5B]
  +
  +
== Mutated sequence ==
  +
  +
The following sequence shows the sequence inclusive all point mutations (missense/nonsense) listed in HGMD. (green: signal sequence)
  +
  +
<tt>
  +
> bckdha 445 aminoacids; Mw=50481.62Da
  +
  +
<font color=green>
  +
MAVAIAAARVWRLNRGLSQAALLLLRQPGARGLARSHPPRQQQQF</font>SSLDDKPQFPGASAE<br>
  +
FIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYKSMTLLNTMDRILY<br>
  +
ESQRQGRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYRDYPLELFMAQCYG<br>
  +
NISDLGKGRQMPVHYGCKERHFVTISSPLATQIPQAVGAAYAAKRANANRVVICYFGEGA<br>
  +
ASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIAARGPGYGIMSIRVDG<br>
  +
NDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRSVDEVNYWDKQDHP<br>
  +
ISRLRHYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNLLFSDVYQEMPAQL<br>
  +
RKQQESLARHLQTYGEHYPLDHFDK*
  +
</tt>
  +
  +
  +
The following sequence is the reference sequence used by dbSNP. Note that this sequence is longer that 400 amino acids (protein length) and even longer than 445 amino acids (protein plus signal sequence(green)). It contains additional amino acids both at the beginning and the end of the sequence (blue).
  +
  +
<tt>
  +
<font color=blue>LRECRTAEWLLAK</font><font color=green>MAVAIAAARVWRLNRGLSQAALLLLRQPGARGLARSHPPRQQQQF</font>SS<br>
  +
LDDKPQFPGASAEFIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYK<br>
  +
SMTLLNTMDRILYESQRQGRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYR<br>
  +
DYPLELFMAQCYGNISDLGKGRQMPVHYGCKERHFVTISSPLATQIPQAVGAAYAAKRAN<br>
  +
ANRVVICYFGEGAASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIAAR<br>
  +
GPGYGIMSIRVDGNDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRS<br>
  +
VDEVNYWDKQDHPISRLRHYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNL<br>
  +
LFSDVYQEMPAQLRKQQESLARHLQTYGEHYPLDHFDK<font color=blue>.DLLSPPPPILSYPER.PHSKG<br>
  +
SRGT.QHTTVFPSQLPLKYSAARAAATLHPCSSRLLHCQGTASAAVAEAPSAPSSPVVTV<br>
  +
PSPRGWVRAHSGLEAPLGMGWTWQVSLWNLRRCEWPAEVTNKLHLCAWLSTKKKKKK</font>
  +
</tt>
  +
  +
  +
This sequence has an additional 13 amino acids at the beginning, which should be taken care of when comparing the SNP positions with the positions retrieved by HGMD.
  +
  +
go to Task 2: [[Sequence_Alignments]]
  +
  +
go to Task 5: [[Mapping_SNPs_BCKDHA| Mapping SNPs]]
   
  +
back to [[Maple syrup urine disease]] main page
>gi|260593636:5001-32217 Homo sapiens branched chain keto acid dehydrogenase E1, alpha polypeptide (BCKDHA), RefSeqGene on chromosome 19
 
CTACGTGAGTGCCGGACCGCTGAGTGGTTGTTAGCCAAGATGGCGGTAGCGATCGCTGCAGCGAGGGTCT
 
GGCGGCTAAACCGTGGTTTGAGCCAGGCTGCCCTCCTGCTGCTGCGGCAGCCTGGGGCTCGGGGACTGGC
 
TAGATCTGTGAGTACCTGGGCCCCAGGCGGTTTTCCCAAAGGGGATTAGGGATGTAAAGGCTATCTTCAG
 
AGTGTGGGGTCCCTGAAGGATATGAAGGAAGGGCTGTCACAAAAGGGAAAAAGAGTGGGAGACTCCTTGG
 
AGAAGACACCGAAGGGAAGATCTGCCACTTCCTTAGGAGACGAAAAGAACTTCCACTTCTTTAGGAGAGG
 
AAAAAACCCGTTTGCAACTTCTTTGTGGGCCCTGAGGGAGACGACTTGAAGAGAGAGATGGGGGTTCCTT
 
ATTAGCTGCCACTGGTATTCAGGGGAGGCTCCTTGGAGGAACAGGGGAAGACCGCAAGACCTTCTCCCTC
 
AGGGTTTGGAGAATGAAACTGACTTTGTAAGGGAGGGACTTTCTTCTGGGCAAAGATCCCTCTTCAAGAA
 
GAAGAAAGAAGAGAACAGGTCCCTGTCACAGGCTGCATAGGGCCTTGTATTTCTTTGCTCCTTTGCCACA
 
ATTCTGATTTTAAAATGAATTGTGTAAAAAATTTGTTTGATTCCTGTGTCTCTTGATCGAATATCAAGAC
 
AGAGAATTCGTGTGTTCTGGACACTGTACAGAGGAGACGATTAATATTTGTTGAATTAATGAGGGGAAAA
 
AAGGAGGGAGAGAGGTACAGACAGGGCACTTTGCCCACATTTCTTAGAAACACTTGAGAAGGCTTTGTTT
 
CTTGTCAAAAGGGTCACAATCCCTTCAGGTGGATGCTGCTAAGGGCAAGATTACACTGCAAAAAACCTAG
 
GTGGAAGCCTATCTTTAGACGGGGAGAGGCAATGGGTTGCAGAAGAGCTTTGAACTACAGTTGACAGATA
 
GGAGAGTAGTGGTTACATGGGTGGATGCATGTCCTTGCTCTGCCGCTTTCCACTTGTGTAGCCTTGGCCA
 
ACGGACTGCACTTCTCTGGATCTATGGAGCTAATTTTCCTCTCTAAAATATTAACAGTGGCCACCTCATA
 
GGGCTGTGAAAAATCAATATGTAAAATGCTAAGGACTGGACCTGGCACATCCTAAGGGCAATAGGAATGT
 
TGATAGTGAGGGTGGAGGGAAGGGACAGAAACTAACCTTGGAGATGGGGAGAAGAAAGGGGTGACCTTGG
 
GAAGGCCTGGATTGGTGAGGCCCTGGGGAAAAAAATCTGAAAAGATATTGTGAGGTTGAGTCTGATCCTC
 
TGATCCCCAATCCATAAGGAGGGGAAGATGGAATCTGTAGAAAGAAGCTCTGGGTAGGGGAACAAACGGG
 
AGGAGGGTGAAGAAGATGGGCCAGGAGTAGGACGTAGTAGGTATCTAGGAACGGAGTCACTTGAAACAGG
 
TAAATACAGAAGGGCTGCAGGAGCAAGGTGTGGGTAGGGCAGTCTCTTGCCTTTGGGCTGGAGACCTTTT
 
CACTTTTTTTTTTTTTTTTTTTTTTGAGCAGTTCATATCCTCCACAGCTGATCTCAGTCCCAGGGGCCCA
 
TTGGCTGGGCAAGTGGGGCAGGAGGTGGGCAGGACTGGGGCAGGCTAGGGAAAGGTCATTTATGGCTGAT
 
TAAATCTTTCTCTCCTGTCTCAATCTCTTGATTACCTTTCCTCTTGCTTCCCCTACTTCATTTTTGGTGG
 
GGAGTGGGGAGGGAAGAAGTCCCTACTTTTTATTTATATATATTTAAATTTGTATTATCTTAAGAGTTAT
 
TGTTTTTTCCCCTTTATTTCTCCCACCTGTTCTTTCTCTCTTCTTCTTTCTTTCCTCTCCTTCTCCTCTC
 
CTCTCCTCCCCTCTCCTTTTTCTTTTCTTTTCTTTTTCCCTTTCCTTTCCTTTCCTTTTCTCTTCTCTTT
 
TCTCTCTCTCTTTCTTTCTGTACCTCTCTTTCCTTTTTCTTTTTTTTTCTTTTTTGAGACAGAGTTTCAC
 
TCTCTCGCCTAGGCTGGAGTGCAGCAGCATGATCTTGACTTACTGCAATCTCTGCCTGCCGAGTTCAAGC
 
AATTCTCCTGCCTCAGCCTCCCTAGTAGCTGGGATTACAGGCATGCACTATCACGCCTGGCTAATTTTTG
 
TATTTTTGGTAGGGACGAGGTTTCACCATGTTGTCCAGGCTGGTCTCGAACTTCTGGCCTCAAGTTCCGC
 
CTGTCTTGGCCTCCAAAAGCATGAGCCACTGCGCCTAGCCTTTTCTTTTTTTGAGACAAGGTCTCACTCT
 
GTCACCCAGGCTGGAGTGAGTGCAGTGGTGTGATCATGGCTCACTGCAGCCTCAACCTCCTGGGCTCAAG
 
CAATCCCCCCACCTCCACCTCCTAAGTAGCTGGGACCACAGGCACGTGCTACTACGCCTGGCTAATTTTT
 
AATTTTTTTTTTTTTTTTTGAGACGGAGTCTTGCTCTGTCACCCAAGCTGGAGTGCAGTGGCACGATCTC
 
GGCTCACTGCAACCTCCACTTCCCAGGTTCAAGTGATTCTCCTGCCTCAGCCTCCCAAATAGCTGGGACT
 
ACTGGCGTGCGCCACCATACCCAGTTAATTTTTGTATTTTTAGTAGAGATGGGGTTTCACCATGTTGACC
 
AGGCTGGTCTTGAACTCCTGATCTAAGGTGATCTGCCCGCCTCGGCCTCCCAAAGTGCTGGGATTACAGG
 
CATAAGCCACTGTGCCTGGCTGCGCCTGGCTAATTTTTAAATCTTTGTAGAGATGGGGTTTTGCCATGTT
 
TTCCAGGCTGGTCTTGAACTCCTGGACTCAAGCGATCTTCCTGCCTCAGCCTCCCAAAGTGCAGGATGAC
 
AAGCGTGAGCCACCTTGCCTGGCCATATTATTAAATTATCATGAGTTTAAGTTTAAGTAGCCACAAGTGG
 
GCTGGGTGCGGTGGCTCATGCCTGTAATCCCAGCATTTTGAGAGGCCAAGGTGGGCGGGTTGCCTGAGGC
 
CAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAAAAATTAGCT
 
GGGCGTAATGGCGTGTGCCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGAAGAATCGCTTGAACCTGG
 
GAGGCGGAGATTGCAGTGAGCCGAGATCGCACCACTATACCCCAGCCTAGGTGACAGAGCAGCCCTCTGT
 
CTCAAAATAAGATAGCCACAAGTGGTTGGTGGCTGCCACATTAGATGGTGCAGGGTTTTTGTTTTGTTTT
 
GTTTTTTGTTTTTTCGAGGTGGAGTTTTGCTCTCATTGCCTAGGCTGGAGTGCAATGGCATGATCTTGGC
 
TCACCACAACCTCCTCTTCCTGGGTTCAAGCGATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGATTACA
 
GGCATGCACCACCATGCCTGGCTAATTTTGTATTTTTAGTAGAGACAGGGTTTCTCCATGTTGATCAGGC
 
TGGTCTTGAACTCCCGACCTCAGGCGACCCACCCACCTCGGCCTCCCAGAGTGCTGGGATTACAGGTGTG
 
AGCCACCATGCCTGGCTAAGATGATGCAGTTTTATCCAGGTGGAGATGAAGAAATAGTCAGGGAAAGGTT
 
GAGAGGCTGCATATTGTAGACAGATTGGATTACTGGGTTCACTGGGGAAGATTGGATTAGAAGGAGGAGT
 
GGGGCTGCTTTACTTGTCCCCGACTTCTTGCCCCTGCTCTGCCCACTCCTTCATTGAATACCCTGCTGCC
 
CTGCAGATGTGTTTGCCTGATTTTCACCTCCCTTCCTGTCTCTGTTATCATAGCAGACGTTTTAGTGCTC
 
AGGGAAAGGAGGAGGAAGGAAACTCAGATTGTCAAGGAATTGTGAGTCAGTTTCTCAGCCACTACCCCAG
 
GCCTGGGCTTGGCTAAGTCATTTCACCTCTCTGACCCTAGCTAGTGGGGTGAATAGAGTCCCTACCCTGT
 
TGGAAGTCAGGAAGCCCCTGGAGCTCGCTCTGTGCTAATTATCGGTGACTATTTCTGTGACCCAGCTCTT
 
CAGGGGCAGCCTGTCTGGGGGTTCCAGTGCCCTGTGCAGTATCTCCAAGTGGTAGGTACTCAGTGTATGT
 
ATTAGTCCGTTTTCAGGCTGCTAATAAAAACACACCCAAGACTGGGTAATTTATAAAGAAAAAGAGGCTT
 
AATGGACTCACAGTTCCACATGGCTGGGGAAGCCTCACAAACATGGCGGAAGGCAAATGAGGAGCAAAGT
 
CACATCTTACATGGTGGCAGGCAGACAGAGTGCTTGCAGGGGAACTCCCCTTTATAAAACCATCAGATCT
 
CGTGAGACTTATTCAGTATAATGAGAACAGCACGTGAAAGATCCACCCCCATAATTCAATTACCTCCCAC
 
TGGCTCCTTCCCACAACATGTGGGAATTATGGGAGCTATAATTCAAGATGAGATTTGAGTGGGGACAGAG
 
ACAAACCATGTCAGTGTGTATTGTGGGATGAATGGAGATGGTCATTACAGTCTTCTAAGTGTATGGGTCC
 
ATCTCTTACTTGCTAGTGTGTTAAGAAACCTGGCCTCCTTTTTCACATCTTGAGGCACTGAATGGTTTAC
 
AAATCCAACTCTAATTTTTTATTATATTTAATACACCAATAGTGCAAGGCCTTTATTGGTAGTACTCTCC
 
TATTACAGATGGAGAGACTGAGGCTCAGAGAGGTGAGGTCTGAACTTGGAGCTCTGGACTCCAAAACCCA
 
GTTTCTTTTCATGGTACCAGGCTGCCAGGAGGAGAAAGGATGGAGCTTAGATTCAAGTCTTGCAACATTC
 
TTCTTAGTCCAGTATTGGCTTATTTTGTTTTATATTTTTTTGAGACGGAGTCTTGCTCTGTTACCCAGGC
 
TGGAGTGCAGTGGCGTGATTTCTGCTCACTGCAACCTCTGCCTCCTGGGTTAAGCGATTGTCCTGCCTCC
 
CCCTCTCGAGTAGCTGGGACTACAGGTGTGTGCCACCGTGTCCGGCTAATTTTTATATTTTTAATAGAGA
 
CAGGGTTTTGCCATGTTGGCCAGGCTGGTCTCGAACTCCTGGCCTCAACTGATCCACCCACCTCAGCCTC
 
CCAAAGTGCTGGGGTTACAGGCGTGAGCCACCATGTGTGGCCGGGCAGGTGATTTTTCTTGTCTTTCGCC
 
TCTTTCCACTTCAGTCCCCCTGGCCATCTTTCCCTTCCTTGAATGTGCCTCCTCCCACCTGGTGGCCTTT
 
GTAGGTATGAGGTCCTTCAGATGATGGCTGACATGACACTTCCTCAGAGAAGCGTTCCAGACCCCAAACA
 
CCAGGTCTCTGTCTGTGAAACAGGGACAGTGATACCTGCCTAATCTGCACGCCTTTCCCTTCTCACGAGA
 
TATTCTGGGAAGGGGGACTCAACTCTTTGCTCTGTTTGAGGGCAGAGTGCCTTTGAGCTCAGGAGAGAGT
 
GGGTTTGGGTGGCCTTAGCTACCCTAGATACGGAGGACTTGGAGTTAACTGGCACCATCCACAAAGCCTT
 
GCATGCACTGGACCTGAACAAGTTGCTTAGCCCTTGAGGCGGCGCCTCAGCTACCACCTGTCTTTGATGG
 
AGTTATGATAACAGTGCCTGCCCCGTCTGGATGCCTTATGGGTTCAGTGAGCACATAGGTGTAAATAAAG
 
TGCTTTGAGTGGCACCTCAGCATGTAGTTATTCTCTGATTGACCTATAGCAGCCATGCCAGTCTTCAGGG
 
TCCCAGGGCGCCAGGCTTTCTTCTCTTGCCTTGGAGCCTTGGTTTTTCCCTAGGACTGAAGTGTTCTGAC
 
CCCTTGGCTGAACTCCCCCTTCTTTAGGAAGCTTTCCCTGGTGTGCCATCTGATGCTGGTGCTTCCTGGA
 
GGGGTACTATCTGATCCTAGCTGGTGAGGCATTCCTGCCAGAGCTGCGGCAGGTGCTTTCCCCGCCTCTC
 
CACCAGCCTGTGAGGTGGATGCCATCCCCACGAGGGCACGCAGGTTCACAGAGACTGGCTCCTGTCCAAG
 
GCCGCGTGGCTGGAAAGAGACAGAACCAGATTCAAACACCAGCAGTTTAACCCTAGAGACCCTCATCCTA
 
CCTCCTATGTGGGTAACCACTATACATCCTGCCTCCTCTGAAATCTCCCGTTTTTTTTTATTTTAAAATT
 
TTATGTGAGTATTTACTTATTTATTTGCTGCTGTTGTTATTGTTTTGAGACAGAGTCTTGCTCTGTTTTC
 
ACCCAGGCTGGAGTGCAGTGGTGCGATCTTGGCTCACTGCAACCTCTGCCTCCTCGGCTCAAGCGATTCT
 
CGTGCCTCAGTCTCCCGAGTAGCTGGGACTACAGACATGCACCACCATGCCTGGCTAATTTTTGTGTCTT
 
TAGTAGAGATGGGGTTTCGCCATATTGGCCAGGCTGGTCTTGAACTCCTGGCTTCGAGTGAACTGCCTGC
 
CCTGGCCTCCCAAAGTATTGAGATTATAAGCATAAGCCACTGTGCCCGGCCAAAATTTTATTTATTTATT
 
TATTTTTTTGAGATGTAGTCTTGCTCTGTCACCCAGGCTGGAGTGCAGTGGCACAATCTCGGCTCACTGC
 
AAGCTCTGCCTCCCAGGTTGACGCCATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGACTACAGGCGCCC
 
GCCACCACGCCCGGCTAATTTTTTGTATTTTTAGTAGAGATAGGGTTTCACCTTATTAGCCAGGATGGTC
 
TCGATCTCCCGACCTCATGATCCGCCCGCCTCGGTCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACC
 
ACGCCCAGCCTAATTTTATTTATTAATTTATTTGAGATGGAGTCTCACTCTGTCACCCAGGCTGGAGTGC
 
AGTGGTGCGATCTTGGCTCACTGCAACCTCCACCTCCCAGGTTCAAGCGATTCTCCTGCCTCAGCCTCCT
 
GAGTAGCTGGGACTATAGGTGCGTGCCACCACGCCTGGCTAATTTTTGTATTTTTAGTAGAGATGGGGTT
 
TCACCATCGTGATCAGCCTGACAGGCTGGTCATGAACTCCTGACCTTAAGTGATTCACCTGCCTCAGCCT
 
CCCAAATCTGCTGGGATTACAGGCATGAGCCACCGCACCCAGCCATATTATTTCTGTATTTTCATTTGGC
 
TTACCTGTCAACTTGACACTTTTTTGTTTTTTTAGGCTAGTCAGGTGAAGCAATGTGAGTAAGATCACCT
 
GTCTTGATTATAACTTGTGTGTTCAGTGTTTGTCATTCATCCCTCCCTGTCTCTAGCCTTCAGTTCAGGG
 
CATGATTTGGAGCAAATGTTCAGTATCTGTTTAACAAGAACAATCATGAGAAATACTTACTCACTACTTG
 
TGTTTATGAAACATTATGTGCCAGGCACTGGGCTGAGCAATCCACACAGGTCAACTCGGTTAATCCTCAT
 
GACAACCCCTGAGATAAGTACTTATACCACCCTCGGTGTACACAGGAGGAAACTGAGGCTCAGGGAGTTT
 
TTCTTTAAAAATTTTTTCTTTTTTTTTTCCTTGAGACAGGTCTTACTTTGTCACCCAGGCTGGAGTGCAG
 
TGGTGTGATCTTGGCTCACTGCAGCCTCTTGGGCTCCAGCGATCCTCCCACCTCAGCCCCTCAAGTAGTT
 
GGGGAGTACAGGTGTGCACCACCATGCCCAGCTAATTTTTTGTAGGGATGGGGTCTTGCCATGTTATCCA
 
GGCTGGTCTTGAACTCCTGGGCTCAAGCGATCCTCCCTCCTTGGCCTCCCAAAGTGCTGGTATTACAAGT
 
GTGAACCACCACACCTGGCCTTAAATTTTATTTATATATTTATTTTTGTAGAGACGAGGTCTCACTATGT
 
TTCCCAGGCTGGCCTCGAACTCCTGGACTCAAGTGATCCTCTTGCCTCAGCCTCCAGAGTAGCTAGAATT
 
ACAGGCGTCTGCCACTGCACTTGGCTCAGGGAGTTTAAGTCACATAATCAGGCAGCGGTCTGCTCAGTTA
 
GAGTCCATGCGCTTAGCCATGGTGCCTTCAAGTGACTCCACCAGGGTCACCCCAGTACTTCCTGGGCAGA
 
CCAGACCATCCAGAGCGGGCAGGTTGTGCTTTCCTGCATGGTGGGAGAGGACGTGGCATGAAGCTTTGTG
 
TAAATCTGTGCAGCCTCTGCCATTGTAAATCACTGGCCACCGGGACAGACTCCCCCACCCCCTTCCAAAA
 
ACTAGAGGAACAGATGGGTGGAGTCGCATCCTTCCTCTGCAGGCATTTGGCCACCCACTCCCATTATGGA
 
GCATTTCCTCAGGACCAGGCACAGCGTGGAGCCCTTTGCATAGCTCACTGAATTGGGGCAGCAACCCTGG
 
GGCTGGGGACTACCATTATCTCCACCAGCCCAGCAAAGAGGGGAGGAGGGTACAGCCGAGACCCAGCTGC
 
AGTATGTGGAGCACACTGGGGTGCCGTGAAGATAGCTGTCCAGCCTCTAGAGGCTGACAGTCCTGCCCTG
 
ACACAGTGGTTTGTTCTCAGACTCTGGCCTCCCCTCTTAGTGCCCGTCTGAAGACTTCAGCCTTCGGTCC
 
TCTTCCATCAGCATGCCGTCTCCCCTGTGCCTCAGTCTCCCTCTCTGGAGTATCTGCTGGGGCCACAGTT
 
TGGTTTTATTGTTCTTTGAGACAGGGTATCACTTTGTTGCCCAGGCTGGAGTGCAATGGTGCAGTCATGG
 
CTCACTGCAGCCTTGACCTCCTGGGCTCAAGCCATCCTCTCACCTCAGCCTCCCGCAACTCCAGTATCTG
 
GACTACATGTAAATGCCACCATTCCCAGCTAATTTTTTTTTTTTTTTTTAGATAGAGTCTCACTCTTGTC
 
ACCCAGGCTGGAGTGCAATGGTGCCATCTTGGCTCACTGCAACATCCGCCTCCCAGGTTCAAGCCATTCT
 
CCTGCCTCAGCCTCCCAAGTAGCTGGGATTACAGGCACCTACCATCATGCACGGCTGTTTTTTGTATTTT
 
TAGTAGAGACAGGGTTTCACTATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCTGCCTGC
 
CTCGGACTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCACGCCCAGCCTTCCTGGCTAATTTAAAAA
 
TTTTTCAGAGACGAGGGTTTTGCCTAGGCTGGACTCACTATGTTGCCTAGGCTGATCTCGAACCACTGGC
 
CTTAAGTGATCCTCCCACCTCAGCCTCCCAAAGTACTGGGATTACAGGCATGAGCCGCTGCACATCGCCA
 
CAGTTTGATTTTAGTATCCACTTTAAGGTGCAGTTAGTTCTTCCCAGCTCCTCTTTTCCCTATCTCATAC
 
AGTGTAGTCACTTCCCTTTCTCCCACTTCTCACCTTCCTAGATTTTGTTTTTCTCTCTCTTACCAAGTTT
 
TGAGTATGTAATACGTCCCAGATCGGTGCTGAACCCCTGGTGTGAGTTAACTTAGCAACTTTTCACGTGG
 
TGTTCTGCAGCCTTCTTACAGATAAGGAAATTCAGAGAGGTGAAGTGTTTTTGCAGAGAGCCACACAATG
 
AGCTAAGAAATTAGAACCCAGGGCAGTTGAACTCCAGAGCCAGGGAACTTAACAGCCCATTCATGCATTC
 
ACTCAGGAATTCCCTCTTGAGTGCCTGCTGTGAAGGCAACCTGGACGCTGTCTCTGCCTTCCTGGGATTG
 
TGGTGTGTGTGGGAGACAGACACGAAACAAAGCAATCTTCTAGGTCAGTGTCTGGCTATGCTGGGTACTC
 
AGAAGGGGGAAGCACTGTGCTGCAAGAGGCCGGCTGTTTCCTCCTCACAAGGGGAGGCCTGCAGTGACAT
 
TGCCAGGCCTGAATCTGCCCTGCCATTGGGAGTGTAATTTCCTCACGACTACAAATGCAAATAAATACAC
 
ACACTGAACATGCTACCCCGCTAGAACTGTGGACAGCTTGTTCACTGCAGTTAAAGGAAGACGGAGGGAC
 
ACATTTGAGCACGCATGTGTGAGATCCAGCTTGTGGCAAACCCTGCCATTCTTTAACATCTGTGGGAAGT
 
GAAGCCACCATTACTGACCTCGGAGGAATCCCGGGAGTAGTGGGAAGAAAAGTTGCAGATGTTTGAAAAT
 
CAGGCTGCAGGGGCAGTTGGGGTGAGGAGCATGGATGTCTAGGAGCCTGAATCTCAGCGTTGGAAGCCCA
 
GTTCTGATGTAATTTTTTTTTTTTTTTTTTTTTTTGAGACAGGGTCTCACTTTGTCCCCCAGGCTGGAGT
 
GCAGTGGCACAGTCTCAGCTCACTACAGCCTCATCCTTCCAGGTTCAAGCAGTCCTCCTGCCTCAGCCTC
 
CCAAGTAGCTGGGACCACAGGCATGTGTCACCATGCCTTTCTAATGTTTGTATTTTTTTGTGGAGACAGG
 
GTTTCTCACTGTTGCCCAGGCTGGTCTCAAACTCCTGAGCTCATGTGATCCTCCTGCCTTGGCCTTCCAA
 
AGTGCTAGGATTACAGGCGTAAGCCACTGCCCAGCATCCCCACCTTTTTTTTTTTTTTTTTAAATTTGTG
 
GTTTGGTGCAGGTTATTTTACCTCCCCAGGCCTCGGTTTCCTCATCTACTCAAGGGCAATGGCAGATAAT
 
CAACCTCACAAATTCCCCGAGGAGTTAGTAGGCTTGTGCCTGCCAGGCTCAGCTGGCTTACAGCAGGTGC
 
TTCATAAGGGGGAGCTATTGCAACGCCTCGGCTGGTTCTAGGCTGCCTAGCTCCAGAGCTAAGAGTCCCA
 
GTGGGTTCATATCCCTTTGAGCATTTATAGGCTTCTCACAGGTCTCCTGGGCTTTCTCAGCCATTCCTCA
 
TCTGACCTGAGGAGGTGTGATGTCCTTGCAGGACTCCTCCAGCCCCTCTGAGCTTTTTTTTTTGTTTGAG
 
ACAGCATGTTGCTCTGTTGCCCAGGCTGGAGTGCAGTGGTGTGATCATGGCTTACTGCAGCCTCCACCTC
 
CTGGGGTCAAGCGATTCTCCCACCTCAGCCTTCCAAGTAGCTGGGACTATAGGTGTATGTTACCATGCTT
 
GAGTAATTTTTAAAATTTTTTGTAGAGATGTGATCTCAGGCTGGTCTTGAACTCCTGGGCTCACCTGGGC
 
TCAAGTAATCCTTGAGCCCAAAGTGTTGGGATTACAGATATGAGCCACCTTGCCAGGCCCTGAGCTCCTT
 
CTGTCTTCCATGAAGGACACACTATTCCTTGCCTTTTGTCTTTTTTTGTGTGTGTGGCAGGGTCTCTGTT
 
GCCCAGGCTGGGATGTGTTGGTGCAATCTGGGCTTATTGCAACCTCCGCTTCCCGGATTCAAGTGATTCT
 
CCTGCCTCAGCCTCTTGAGTAGCTGGGATTACAGGTGTGCGCAACCACGCCCAGCTAATTTTTGTATTTT
 
TGGTAGACATGGGGTTTCACCATGTTGGCCAGGCTGGTCTCAGCCTCCCGAAGTGTTGGGATTACAGGCA
 
TAAGCCACTGTGCCTGGCCACTTTTTGTCTTTTGCATGTTCTATACACCTTGCCGAGAATGTGCTTTCCC
 
TGCCTCCTGGTTATCCCTCAGGTTTCAGCTCTAACATCACTTCCCCAGGGAAGCCTTACCCTCGCTTTCT
 
TCTGCTATAGGATAATATGAACCACCATCATTGTTACTTGTTTAGTGTCTGCCCTCCCTGCTAGGCAGCA
 
AGGACAGATATCTTGCCTGTCTTAGGTCCCCAGAACCCAGCACAGGGCTTGGAACATGTCAGGTGCTCAA
 
ATATTTGTTGAATGAGTCAGTCTGGGCTCGCAGAGGGAGCTGGGACCAGAGTAGAGTGGACACTGAAGCG
 
TATGTAGCAGTTTGCCAGGCGATAGGGAAAGAGACCTTCTTGTAAAGCAAGCAGAGATGAGAGGGAGAAA
 
TGAGAATTGCTAAGATTGTTTTGAGCATTTGTTAGGTGGCTTACAGATGCTTTGCTGAACATTTTATTCA
 
TGTGATCTTGTTTAAGCCTCTCATTTCCATGAGGGGTTCTCCATTTTTATAGACAAGGAAGGGGACACGT
 
GGAGAGGGGAAGTTACTTTGCCATATCACCCTCTAAGGGATTTTTTTTTTTTTTTTTTTTTTTGAGATGG
 
AGTCTCACTGTCACCAGGCTAGAGTGCTATGGCGCAATCTCGGCTCACTGCAACCCCCACCTCCCGGGTT
 
CAAGTGATTCTCTTGCCTCAGCCTCCTGAGTAGCTGGGACTACAGGCATGTGCCACCACATCCGGCTAAT
 
TTTTCTATTTTTAGTGGGTACAGGGTTTCGCCATGTTGGCCACATTGATCTCGAACTCCTGACCTCAAGT
 
GAGATTCGCCTGCCTCTGCCTCCTAAAGGCAGTAAGGGAATTGAACCTAGGCAGTCTGACTCTTAAGAGC
 
TCAAAGACTTAAGCTCTGTCCTGAGTAGCTTCGCTGGTGTATGGTATAGGGATCAGAAGAGAGCTGGTCT
 
GTATGCCTAGAGGGCATCCTGCAGTGGGCATGTGTGTCTGTGGTCACAGTGGGAGAGAGAGAGGAGGGGG
 
CATCTGTGGCCAGAATGGGGAGGCCTCGAATGCCAGGATGAAGAGTTTGGGCCTATGGTAATGAGAATAG
 
TAATAGCTACCAGTTATTGACTTGCCAGCCACTTGATTGAGTCATTTAACCTCACAACAGTCCTATGGGA
 
TAGGTGTTAACTGTTTTCATCCCAGTTTTTCAGATGGGGAAACTGAGACACAGAAGTTGTAGTCAGTTAT
 
CCAAAGTGTCGCAGTGAGAAAGTGCTGGAGTTGGGGCTCAGACCCTGTGAGTCAGGCCCCAGAGTTCACT
 
GGGGCCACATGCTCAACCACCATGCCGCCTGCCTGCCGCCGGGGCTGAGCCAGCGGAAACCTGGGTGCTG
 
CTTCTGATGCAGGTGGTCTCCTCTGCTCTCTTCCCCAGCACCCCCCCAGGCAGCAGCAGCAGTTTTCATC
 
TCTGGATGACAAGCCCCAGTTCCCAGGGGCCTCGGCGGAGTTTATAGATAAGTTGGAATTCATCCAGCCC
 
AACGTCATCTCTGGAATCCCCATCTACCGCGTCATGGACCGGCAAGGCCAGATCATCAACCCCAGCGAGG
 
ACCCCCACGTGAGAGGCGGCCTCCCCCACTTCCCGTGCCCCCCACGCCCAGGCCCCTTGCCTGTCTCCTC
 
TCTGGTCCCAACTGCCCCACGTCTATCTGTGCCTCCACCCGCAGCTGCCGAAGGAGAAGGTGCTGAAGCT
 
CTACAAGAGCATGACACTGCTTAACACCATGGACCGCATCCTCTATGAGTCTCAGCGGCAGGTGCGTGGG
 
GACAGGACTAGGGGCGGGGGGCTGGAATTACCTGAGGTCCCCTACCTGTGTTTGGGCCAAAGGAATGGCT
 
CCCAAGGAGGACAGATTCTTTTTTGGGGGGGTTCCATAGAGTTCTGAGGGTTCTCTTGGGACTCTGGAGG
 
AGAAGTGTGAGAGCAGTCCTGCCTTAGGGCAGAAGGATGGACCTGATCACTAAAAAACACCTACTGTGTG
 
CCTGGCTGGGGGCTCGCCTGTGTGGGGAGAGGGCAGGGAGAAGGCCTGACAGCCCTTCCTCTTGGAAATG
 
CACAGTCCTGAGAAACAGGGTGGGTCCCCAGTGCTTCCTGGAGTGCTCCAGGGGACCGCCCTAGGCCCAT
 
GTGAAATGGGCTGTGATCCCCTGATAAATCCTAGTATCAGGGACAAGGATCCAGTCAAGCCAGTGTCAAC
 
AAAGAGGGAAGTTATTGACTCCAATGACCATACATCCAGAGGGTCCTGTTTTGGGGCACAGCTAGATCCA
 
GGGGGTCTTCCAGGCAGGAAGTAGAGGGAAGGCGGAAGGCCAGACATGGCACATCCCAGCTATCCCACAC
 
CCCCACCTTTATAGAGCTTTCTGGGTAGTCCCACCTAGTGACATCTGCATACATTTCCTGGGCCACTCCT
 
AGCTGCAGGGGAGACTGAGAAATGTATTAATAATACTTTAGTCCCGGCAGTTGCTGCCTGCGGGAGATTG
 
AGACCTCTGTTAACAGAGGAGGGGAATAGTATACTGGGAAGGCAACCAGGAGCCTCTGCCAGTAGCCCCG
 
AGTCCTGATCCTCAGGGGGCTGGTTGCAGGCTCTTACCCACGCTAACCTGGACAGTCCTGCCAGGCCCTC
 
AGTACTCAGCACCGCCCACTCACTTGGCACTGGAATGTGCATTAACTTTTAATTGGCTCCCCACTTAGAA
 
GAAAGCACACATTCATCAGTTTCCCCCACAGCAACGGGAAAGCCAGGCCTGGGAACACATGGAGGAGGGC
 
ACCGCCGAGCCGCGAGGAGCTCTGGCCCCGGGGTGTCCCTCAGGGTGCTGGCCAGGCTGCCTTTGTTCCG
 
CTTTACACATGGCCCTGTGTGATCTCATGTCCATCCGTATGCCAGGGCTGCCCAGAGCCCCAGCCAGACT
 
GCTTGTCAGAGCCCTGCTTGGCTTCTCGCCCTGGATGTCCCACAGCAAACTAATCCTGTCTGAACTTGAC
 
CTCGACATCTTCCTCCCAGGCGTGCTCCTTTGCCTCACCTAGTCCGTTTCTCAGTGAAGAGCACCACCAT
 
CCACCTGCTTGCCCACACCAGAAACCTCATCTCATCCCTGTCTCCTCCCTCCCCCTTGCTATCCCTTTCT
 
CCCCTCTCATTCCCTCTGTCTGTAGATATTTCTTTTTTTTTTTTTTTTTTACTTTTGAGATGGAGTTTTG
 
CTCTTGTTGCCCAGGCTGGAGTGCAATGGCACGATCTTGGCTCACCGCCTCCCTGGTTCAAGCGATTCTC
 
CTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCATGCGCCACCACACCTGGCTAATTTTGTATTTTTA
 
GTAGAGATGGGGTTTCTCCATGTTGGTCAGTCTGGTCTCGAACTCCCTACCTCAGGTGATCTGCCTGCCT
 
TGGCCTCCCAAAATGGTAGGATTATAGGTGTGAGCCACTGCACCCAGCTGTAGCTACTTCTTTTCTTTTA
 
TGTTATTTTTATTTATTTATTTAGAGACGGAGTCCACCTCTGTCGCCCAGGCTGGAGTGCAGTGGCGCAA
 
TCTCGGCTCACTGCAACCTCCGCCTCCCAGGTTCAAGTGATTCTCCTGCCTTAGCCCCCCAAGTAGCTGG
 
GATTACAGGCGCCCACCACCACACTCGGCTAATTTTTGTATTTTTAGTAGAGATGGGGTTTCACCACATT
 
GGCCAGGCTAGTCTGGAACTCCTGACCTCAAGTGATCCGCCTGCCTTGGCCTCCCAAAGTACCGGGATTA
 
CAGGTGTGAGCCACCGTGCCTAGCCTGTCTGTAGATACATCTTTAGGAAGTGCTCAGCTGCTCAGCAGCA
 
CACTCTGAGACCCAGCCCCATCCTTTCTCTGCTTCACTGTTGTTGGTGAGTTGACCTTCGTGCTCTGCTT
 
GTGGCCTCGTGTTCACAGCATGGCTGCTGCATGTCCAAGCTTCATGGCTGTCTGCAAGGCAGGAAGAAGT
 
AGGGAGTGGGTAGGGCTGGCACCAGCAGCAGTGGTCTCTTCTATCCAGAAAGCAAAAGCAGACTTCCACC
 
AAAATATCTGGCTAAAACTGTGTCCCATGGCCACCTCTGGCTGCAAGGGAGGCCAGGGACCTGAGTGCGT
 
GTCTTTTTGAGGCTCAGGTTAGGGGATTTGGGAGAAGGGAACTGGGCACTGGTGTGGCCAGCTGACAGGG
 
TCTGCTGCCCCCTAGCACAGCCGACTCAGCTTCCTCAGACACATCCACCTGCTCCTCTCCCGCCTCCCTC
 
TGCTACCAAGCTGGGCCTCTGCCCCCTCTCTCCTGGCTTGCTGCAATCAGCCTTGATGACTCCATCCATT
 
TACCTCACTGCAGCCTGAGAGCAATTCTAAAACACAGGTCTGACCACATTGCTCCCTTGAGAAGACTGTC
 
GGCAGGTTACCACTATTCTCAGAGTAAAATCAATGTATCGTTCCCTTTGTTATCTGGGTGCCATCTCCTC
 
TCCCTCCTCATCTTTCCCCACCCCTACCACTGTATGTCCCAGCCCTTCTGGCCTCATGGTCTATCATTCC
 
CTCTGCCTGGAACCCTCCACCCACCCCATCTTTGCGCACTGGTCCTCACCGATGCCCTTCCGTGGTGGAA
 
CAGGTGCTTCCTTTGGGCTTCTATCACAGCAACTCGATCCCTCTGGCAGTTCTAAGCAGTCTGGCAGCGT
 
CTTCTTAAAGGCAGGAACCCCAGCATGGCCTGGCCCAGGACTGGCCTTCAGGAGGACTGTGAATTCATGG
 
AGGTGTTGGAAGCTGGGCAGGATTTGGATGGCTCTCTGTCATTGCCCAGCATAACCAATTGTGGGACCCC
 
GGTCCCCTCTACACCCCCAGGGCCGGATCTCCTTCTACATGACCAACTATGGTGAGGAGGGCACGCACGT
 
GGGGAGTGCCGCCGCCCTGGACAACACGGACCTGGTGTTTGGCCAGTACCGGGAGGCAGGTACGTCTGTC
 
CGTGGTTTGGCCCTGTGGTCCCCATTGAAGTGTACACTTTAGTTTTTCTGAGTGTTCTTCCAGGAGCAGC
 
ATAGTGCCAGGAAGGTCTAAGGAGCTGGAAGAAGAGCTTTCCAAGCAAGAAAGAGACAAGCCAGGGTTAG
 
GACTCTGGAGGGGGCAGTCTTCCCACTCTGAAGGCCAGAAAGTGTAGTGGCTCAGGGTCTGTGCTCTTAA
 
CCCTTTGTTACCAGTTGTCAGGTTTTGGGCAAGTCACTTCCCTTCTCTGAGCCTCAGTTTTCTAATCTCC
 
AGAAGAGGGATTATGACGGTACCTGCATCCAAGGGTCGCTGGGAGGATGCAGTGAAGTCAACGTTCGCTG
 
TGGGTGTGGCCTGTGCAGGAGACCCTGTCTGCCTGGTGCAGTGATCCCCTGGCCCCTCCTCAGGTGCTGT
 
ATCCTTCGGGGAGCAGGAGCAGGTGGAAGAACCTTGACAGACCTGAGTGCGGTGCAAGATTAGAGGGGCC
 
CGGCAAGCCCAGATGAGTGAAAGGAAGGAAGGGTCAAAGACCAGGAATGAAGAAAATCATTGAGCAGGAA
 
TGAGTGCCTGGGAGCGAAGGAGTGAGCCCAGGGAGCAGTGAGTGGGGTCTTCCAGGGAGCTTGTTGGTGT
 
GGGGCCCTGCATGGGGACTACCCTCAGAGTGTCAGCGAGCACTGATAGGGCCAGGAAGGCATATCTTAAC
 
CCTTTGTCTTAGATGCTTGGGGCAAGTTGCTTAACTGCCCTGTGCCTCTGTTTCCTCATTCGTAAGGGGG
 
ATGGTAGCCCTTCCCAAGACAGCTGTTGAGGGCAGTCTTGGGGCAGTTGGCAAAGTGCCCTCAGCACAGA
 
GTAAGCGCTCAGGGCCAGGGGTTGGTGTCATGATCGGGGACTCGCCCTGGGACAGCCCTTCAGCAGGGCC
 
AGACCACGTGGCTGTCCAGAGCCCCCATGGACCAGCATGGCTTGCAGCCTGCTCCTGGACTGTCTACTGC
 
TCACAGCTGGACCAGGCCCTTGGCCCCACATGCTCTAGGAGGGCAAACGCCCTGTGGTGTTTGTCTAAGG
 
TCTAGTGTCATCCGTGCGGAAAAAGATTTCCAGACACCAGGCAGCAAGCTGGGCCCCCGCCAGCTGTGCT
 
GAGCACTGCACCTGTTTTTGCTGAAGGGTTCTCTTGTGAGGGCACCAGGCAGGACCGAGCAAGCCTCATG
 
GGAATCATGGGAGCCGCTGAGCTCTCCACATGGGAGAGCTGGCCGCCTCTGCAAGAATGACTGTCCAGCA
 
CTGAGGGTCAGCAGAAGTGTCTGGGCAGCTGCAAATGGGACTCATGCAGAGGCAGCATGGTGCAGGGTAA
 
GGAGTTGAGTCACATCCACTCTAGACTTCACTTGGCAGGCCACTCACCTTGGTGACGCTGTCTGTCGTCT
 
GTGAAGTGAGAAGAGTGGTGTTCATCATGATAGGTAGTGCCTCCCTGGGCTCTTGACACCAGAGTGCTGT
 
TCTTTTTTTCTTCTCCTTTTTTTTTTTTGAGATGGAATCTCGCTCTGTTGCCCAGGCTGGAGTGCAGTGG
 
CGCGATCTCGGCTCACTGCAAGCTCCACCTCCTGGGTTCACGCCATTCTCCTGCTTCAGCCTCCCGAGTA
 
GCTGGGACTACAGGCGCCCACCACCATGCCTGGCTAATTTTTTGTATTTTTAGTAGAGGCGGGTTTCACC
 
ATGTTAGCCAGGATGGTCTTGATCTCCTGACCTTGCGATCCGCCCTCCTCCACCTCCTAAAGTGCTGGGA
 
TTACAGGCGTGAGCCACCATGCCTGGCCTTTTTTTTTTTTTTTGAGGTGGAGTCTCACTCTGTCGCCCAG
 
GCTGGAGTACGGTGGCGCAGTCTTGGCTCATTGCAACTTCTGCCTTCCAGGTTCAGGTGATTCTCCTGCC
 
TCAGACTCCGGAGTAGCTGGAATTAAAGGCATGCACCACCATGCCTGGCTAATTTTTGTATTTTTAGTAG
 
AGATGGGGTTTCACCACGTTGGTCAGGCTGGTTTCAAACTCCTGACCTCAAGTGATTCGTTTGCCTTGGC
 
CTCCCAAAGTGCTGGGGATTACAGGCATGAGCCACCGCGCCGGGCCAGCCAAGGTGCTGTTATATGTGCT
 
CTTTAGCCTCGGAGACTGTCATTTCCCCTCACTGCACTGCAGAGCGAGGGCACACCGGGTGAGGTTAGGG
 
CAGGCCCCAGTGGTGTTGCCTCATTGAGGCACCTCCCCCATCCATGTGAGGCAGGAGAGGCAAGCATTAC
 
CATCAACTCTAAAAGCCAGGTAAGAACCTGAGTCAACAAAGGTGAGGTCATGGGCCCAGGCCACATTCTC
 
CTAAGGAGTGGAGCCGAGTCTTTTTATGGCAGTCTGACCCAGAGACAGCTCTCGCTGTTTCCCACCCTTG
 
GCTGTCTTCCCAGGGCTATATGTGGAGTATCAGTGGTGGCACTGGCAGGGAGTGGCGCCCGGTGAAAAGG
 
AGTGTTAAGAGGAAGCACGTGCCAGGTGTGGTGGCTCACGCCTGTAATCCCAGCACTTCGGGAGGCTGAG
 
GCGGGAGGATTGCTTGAGCCCAGGAGTTTGATACCAGCCTGGGCAACATGGCGAGACCCCCATCTCTAAA
 
AAATTAAAAAATTAGCCAGGCGTGGTGGTGCACACCTGTAGTCCCAGCTACTCAGGAAGCTGAGTGGGAG
 
GATTGCTTGAGCCCAGAGGTTCAAGACTGCAGTGAGCTATGATTCCACCACCGAACTCCAGTTTGGGTGA
 
CAGAGTAAGATCCTGTCTCTGAGACCAACAAGAAAAGAGGTAGCATGTGTCAAGCAAGAGCAGCAGTTCC
 
CACTGGACCTGGAGTTTCCTGGTGCAAATCAGCCTGTGAGTCTGTATTTTCTTTTTTCTTCAAGACGGGG
 
TCTTACTCTGTCACCCAGGTTGAAGTGCAGTGGCATGATCTCGGCTCACTGCAACCTCTGCCTCTTAGGA
 
TCAAGTGATCCTCCCACCTCAGCCCCCTGAATAGCTGGGACCACAGGTGCACGCCACCACACCTGGCTAA
 
TTTTTTTTTTGATTTTTGGTAGAGACGGGGTTTCGCTGTGTTGCCCAGGCTGGTCTCAAACTCCTGAGCT
 
CAAGCAGTCCACCTGCCTCACCCTTCCAAAGTGCTGGGATTACAGGCACGAGCCACTGAGCCTGACAAGT
 
TTGTACTTTTAGCACGTTCCCCTGCCCCTGCCCAGTCCTTCTCACATGGGTGGACCTAGTACCACATTCT
 
GAGAAAGCCTGATTCCAAGGTCCTGTATCAGTCCCTTGGACCTTCTTTGACCCTCTACAAGACCTTTCTT
 
TCCCTCCTTAACCAGAGCAACTCCTGTTTCTATCTGTTACATATGCAGCACAGCCACATCTTGTGTGGAT
 
TTGTATTATGTCCATTCAAAAGCATGTGTCAGAAAATACCAGTAAACCTGTATTAAAGTTGGATGTTGAA
 
GTAGTGCTAACCACCCTCAACTCTAACTAAAATATCAACAAGTGGGCTGGGCGTGGTGGCTCACACCTGT
 
AATTGCAGCACTTTGGGAGGCTGAGGTGGGTGGATCACCTGAGGTCGGGAGTTTGAGACCAGCCTGACCA
 
ACATGGAGAAACCCCGTCTCTCCTGAAAATGCAAAAATTAGCTGGATGTGGTGGCACATGCCTGTAATCC
 
CAGCTACTCAGGAGGCTGAGGCAGGAGAATCGCTTGAACCCAGGAGGCAGAGGTTGCAGTGAGCCAAGAT
 
TGCGCCATTGCATTCTGGGCAACAAGAGCAAAACTCCGTCTCAAAAAAAAAAAAAGAAAGAAAATTAGGT
 
AGATCTAGTGGTGCACACCTGGAATCCCAGCTATCTGGGAGGCTGAGGCATAAGAATTGCTTGAACCTGG
 
AAGGTGGAGGTTGCAGTGAGCTCAGATTGCGCCACTGCACTCCAGCCTGGGTGGCCAAGCCAGACTCTGT
 
CTCAAAAAAAAAAAAAAAAAAAAAAAAAGGTAACTGCTGTGTGTAATGTGATCTCCATTAAATTTGTATC
 
TAAAGAGCGGGTTCTGCGGTAAAGAAAGGATTTGGAAAGCACTGTACTAGAGAGCTCCCTCTGAATAATG
 
TTCCCGGTCTGAGTGCTTGCAGCATTGATCCACAGCGCTGCCCCTAAAGTGTTGGTGTGAAGGTTGTCCC
 
ACTCTTCAGGCGGGGCAAAGAGCTTGTCTGAGGGCAATCACAAACCCCCTTCTGTCTTCTAACTTCGGAG
 
GAAGGGCCCCCTTAACCTGGAGCGGTTCCCTTTGATGGGGAGGTGGGTGTTCACCAGAGGTTTTGTTTGA
 
AGAAAAGGTTTTACTGCCATACATGGGTTAGAGTACTTCTGTAGTCCAGAAGTTGTAAATTAGTGGTCTA
 
TGCTGTTATTTGGTTCATAATGATGATTTTTTTTTTTTTTCGAGACAGGGTCTTGCTTTGTAGCCCAGAT
 
TGGAGTGCAGTGGCACGATCTCAGCTGCCTGCAGCCTTGAACTTCTGGGCTGAAGCAATCCTCTGCCTCA
 
GCCTCCCAAGTAGCTGGAACCACAGGCACACATCACCATGCCTGGCTAATGTTTTGATTTTTTTTTTTTT
 
TTGTAGAAGCGAGGCCTCACCATGTTGCTCAGGCTGGTCTCCAACTCCTGGGCTCAAGCGATCCTCCAAC
 
GTCAGCCTCTCAAAGTGCTGGGATTACAGGCATGAACCACCATGCCTGGCCATAGTAATGATTTATGGTT
 
AGAATATTTCTTAAAGGTAATTGAATTAGTTGAAAGCACTTAAATACCAGACAATATCTCACTAAGAGAA
 
AGAAAGAAAAAGGCTAGATTTCTGCTTTCTTTTGAAAAATCAGATTGGCCTTGCTGGGCAGAGTCAGTCA
 
GTCTGAACATCAGTCTTCCTCTTAAAACAAGCCTGAGCTTTCCTGTCTGCCTGCCAGCATGCTGCAGGTC
 
ACCCACAGGGCTGAACTGTCCCCCTGTACTGCCCACTCGGCTAACCATTGCCTCCTCCCCTCCTAGGTGT
 
GCTGATGTATCGGGACTACCCCCTGGAACTATTCATGGCCCAGTGCTATGGCAACATCAGTGACTTGGGC
 
AAGGGGCGCCAGATGCCTGTCCACTACGGCTGCAAGGAACGCCACTTCGTCACTATCTCCTCTCCACTGG
 
CCACGCAGATCCCTCAGGGTGAGGATGCATGCCCTGTACCTTGCACATGTGCAGACCAATGTCACACCCC
 
TGTCCAGGCCTCAGCTCTTTTGCCTGCCTTCTGGGTGGAATTCTGGGTTGGTGACACCACTAGAGCCCTG
 
GTCTGTGCTCCAGACTTCTTCTTCTTATTTTTTTCTTTTTTGAGACGGAGTCTTGCACTGTTGCCCAGGC
 
TAGAGTGCAGTGGTGCTATCGCAGCTCACTGCAGCCTCCTTCTCCCAGCTTCAAGCAATTCTCCTGCCTC
 
AGCCTCCCGATTAGCTGAGACTACAGGTGTGCACCACCACGCCCAGCTAATTTTTGTATTTTTAATAGAG
 
ACAGGGTTTCACCGTGTTGGCCAGGCTGGTCTCAAACTCCCGGCCTCAGGTGATCCACCCGCCTCAGCCT
 
CTCAAAGTGCTGGGATTACAGGCGTTTAAGCCATTGTGCCCAGCCCACTTTTTTTTTTTTTTTTTTTTTT
 
TTTTTTTTTTTGGTCTGAGACGGGGTTTCACTCTCGTCAGCCAGGTTGGAGTGCAATGGTGCAATCTCGG
 
CTCACTGTAACCTCCACCTCCCAGGCTCAAGTGATTCTCCTGCCTTAGCCTCCTGAGTAGCTGGGATTAT
 
GGGCATATGCCATTGTGCTTGGCTAAATTTTTGTATTTTTTGTAGAGATAAGGTTTCACCATTTTGGCCA
 
GGCTGGTCTTGAACCTGAGTTCAAGTGATCTGCCCACCTTGGCCTCTCAAAGTGCTGGGATTACAGATGT
 
GAGCCACTGCACCTGGCCTGTGTTCCAGACTTTGAATGCACCTCTTGTCTGGGACCCTTCAGAAGAGGGC
 
TCTGTTGCTGTCCCCACCCCATTCAGGGGCCTGAAGCCTGTGTGACCACACTCTGGATTTCCCAACCACA
 
TGCCTACTGCATTTGTCTGCTCACAAACATAGCCAACGTGTGCACCAGGTTCCACTGTGCCAATGCTTCA
 
CGTTCATCATCTAATAAGGACCCTCATTGTGGCCATGGGGACAGAGGTTATTATTTGCTCTTGTTCCTGT
 
TTTCAAGAGCAGGATGCCCGTGTTTTGTGTGTGTGTGAGAGACAGCAAGTGAATGGGAAAAAAAAAAGTG
 
CAGGAGCCTGGCCAACATGGCAAAGCCCTGTCCCTACTAAACATACAAAAATTAGCTTGGCATGGTGGTG
 
CACACCTGTAATCCTAGCTACTCAGGAGGCTGAAGCAGGAGGATTGCTTGAACCTGGGAGGCAGAGTTTG
 
CAGTGAGCTGAGATCGTGCCATTGCACTCTAGCCAAAGTGAGATCCTGTCTCAAAAAAAAAAAAAAGGAA
 
ATTGAAGTCAAAGAGGGAAGTCACTCGCTAACGGTCATGTGGCACAGAGGGAGAGGAGCAGGGACTCAAG
 
TCATTGACTGTGTACCTCCAGAGCCTGGGTTCTTGGCCCTGACCTTTGTCTGCCTCAGTCCTCAGAGTCA
 
AGTAAGCTTCCCCAAAGCTTACCGTAGGTGCCCTGAAGTCCCTTCCCTTATAAGAGCTTTTTTGGGAAGC
 
AAACCCACTGGGTGCCAGGCCTGCACCCTCATAGTGTGGAGAACAAGGAATCGGAGCCCCTTCCTGTCCT
 
CCGGGGCCTGTAGCCTGATCTAGGGTTGGTCTGGGCCCTGGCTTCTTTGCCCTAGGGCAGCTGAGATGCA
 
TGAGTTTGTGTCTCTCCAGGCCTCAGCACTGTGCCCAGCACAGCAGGAGGCTCCACACCCCTTAGCTCAT
 
AGTGTTCTTTTGTTGAGGAACAAGAAGCAGTTTGAGCCTCTTAGGTGGGAGGTGGGCAGGGAAGCTTTCT
 
CAGAGGAGGGGGCATTTAGGGGGCATTTGAGCTAAAGGTAATAGGGTTTCCACAAGGCTGGGGCGTGGCC
 
CTAACCTTCCCTCTGGGTCCCTGCTAAGTCACTTCAAATACACGAAGTTCCTGCCATGTGCCGGTGCCAC
 
TCCAAGTGCAGGGAACAAAAGCAACGCAAACCTCAGCCCTCCTGGTGCTGCCATTGTGGTGGGGGAGACG
 
GACAGTAACAGAGATGGAGAGGGAACTCCTGGAGTATATGAGGGTGGGGGTCCTGTAGGGAGCAGCCAAG
 
CCAGCCGGGTGTGGTGCACTGTAGGGGTGGCTGGGGTCCGTCACTGGAGTGACACTGAAGTATGGACCAG
 
AGGATGTGAGGGGGGAGCTGTGTGGTGTCTTGAGGCAGGTGTGAGCCTGGCGTGTTTGAGGAGCTGGGAG
 
GAGCCCAGAGGGGCAGGTGTGGAGTGAGCAGGTGTGGAGTGAGCGGGTGTAGGGTGAGCAGGTGTGGGGT
 
GGTTGGAGGTAAAGTCCAGGAGGTGGTGGGTGGGGGTGCAGATGGTGCAGGTTGGGTGGGGCCTCAAGGC
 
CTTTGCCTGTTGCAGGGATGTCGACTTTTACTCAAAGTGGGACAGGAGCCGTGGGTGGGGAGGAGTCTGC
 
TCTTCCTTGGCCTGGCTCACCTCTTCATGGAATCCGCGGGCTGCCCTGTGGATAAGGGATTGGGCAGTGT
 
GTGAAGGGCTAGTAAGGAGGCTGCTCAGTAACTTGGGCTTGACCAGGGTGGTGCAGAGGAGGAGGTAAGA
 
GGGGTTAGGTTCTGAAGGTGGAGTAGATGTGGAGAATGGCGGGAAGTTTGGCTTGGAGATGATACTGGAC
 
TCTTCAGCCTGAGCAGTTGGAAGGGTGGAGTTGCTTTCAGCTGAGCAAGAGGAGACCTGGGTGGGCAGAT
 
GCTGGGGGGCATCAGGAGCTGAGGTGTTTCCTTTCCCCTTGAAGCCTGGTGACTGCTGGCCTGAGCCACG
 
CTTGAGCCGTGGGTCATGTGAGTGTGAATGAGTGTGAGTGCATGTGAGTCTCCGCCCCTGCTCACCACCC
 
TCTCATCCCCTGCAGCGGTGGGGGCGGCGTACGCAGCCAAGCGGGCCAATGCCAACAGGGTCGTCATCTG
 
TTACTTCGGCGAGGGGGCAGCCAGTGAGGGGGACGCCCATGCCGGCTTCAACTTCGCTGCCACACTTGAG
 
TGCCCCATCATCTTCTTCTGCCGGAACAATGGCTACGCCATCTCCACGCCCACCTCTGAGCAGTATCGCG
 
GCGATGGCATTGGTATGGGCTCTGCTGGCTGCTCCCCACCCCGCTGGGATCATCTCCTTCCCTCCCCAAT
 
CCTGCCACCTTCCTGCCACCCCTACCCTCCTTCCTGGTTCTCGTCCTGTGTCCTGTGGCGTCTGGCACTT
 
GGTCAGCCACAGGAGTTGAGGTCCTGAGCACTCAGCCTTGCTCTCTGTCCTCTCCCTGCTCGTCCCCTTG
 
GCCTCGTGCATGTTCCTTATCTCAGCCCTGGCCTGACCTGCCTTCTCTGTGTCCCCACAGCAGCACGAGG
 
CCCCGGGTATGGCATCATGTCAATCCGCGTGGATGGTAATGATGTGTTTGCCGTATACAACGCCACAAAG
 
GAGGCCCGACGGCGGGCTGTGGCAGAGAACCAGCCCTTCCTCATCGAGGCCATGACCTACAGGTGCCTGC
 
CGCTCCCCCCGTCAGCACCCCCACAGCACTGACAGCCACCGTAGCATCTTCCTCATATCGATCACTGTCT
 
CCAAAACATGGCCTTATCACCTGATGTTGCATCTCCCCCTTGCCTTTATTCCGTTTCCACTCCTCCTTCC
 
CTAGTTCATCCCCCATCCTCCCTCCTGACCCCCACTCCAGGGAGCCCACACTGACCTGGGGCCCCTTGCC
 
CCTGTGCAGGATCGGGCACCACAGCACCAGTGACGACAGTTCAGCGTACCGCTCGGTGGATGAGGTCAAT
 
TACTGGGATAAACAGGACCACCCCATCTCCCGGCTGCGGCACTATCTGCTGAGCCAAGGCTGGTGGGATG
 
AGGAGCAGGAGAAGGCCTGGAGGAAGCAGTCCCGCAGGAAGGTGAGGGTGCCCCGCCCGGGAGGGTGTGC
 
TGGGGGCTGCTGCGGCCTGCAGAGCTTGGGAAGGATTTGTGGAACACCGAACTGGGAGGCTCAGGGATAA
 
CCCCAGTGATGTCTCAGATGTGGCCTGTGGAGCCAGGCTGCTGGGGCCATGTGTGTCCTGGCTCTGTGTC
 
CTTGGCATGTTGCCCTCCACTTCCTTGTCTTTGAAAGGGGATGCTGGTGGTGCCCATTTCAGAAACTGGT
 
TTGAGACTGGGCATGGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCCGAGGTGGCAGATCACTTG
 
AGGTCAGGAGCTTGAGAACAGCCTGGCCAAACATGGTCAAGCCCTGTCTCTACTAAAAATACAAAAATTA
 
GGGCTGGGCGCAGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGCGGACAGATCACCTGAG
 
GTCGGGAGTTTGAGACCAGCCTGGCCAACATGGTGAAACCTCGTCTCTAATAAAAATACAAAAATTAGCC
 
GGGCATGGTGCCGGGTGCCTGTAATCCCAGCTACTTGGGAGGCCGAGGCAGGAGAATTGCTTGAACCTGG
 
GAGGTAGAGGTTGCAGTGAGCTGAGATCGTACCATTGCACTCCAGCCTGGGCAACGAGAGCAAGACTCCA
 
TCTCAAAAAAAAAAATTAGCCAGGCGTGGTGGTGCGTGCCTGTAGTCCCAGCTACTCAGGAAGGTGAGGC
 
AGAAGAATCGCTTGAACCCGGGAGGTGGAACTTGCAGTGATTGGAGATGGCGCCACTGCACTCCAGCCTG
 
GGCAACAGAGCAAGACTATCTCTTAAAAAAAAAAAAAAGCTGGTTTGAGGATTAGGTGGCAAACCGTGGT
 
GGAGTGTACATGCCCTGCCCAGAGTCAGTGTGCCACCCTCCCTGGCCCGGTCATTAGGACCCAGGGCCCG
 
TGCAGCCCAGTGGTGAGGGCAGCACCCGGCATAGGGCAGCACCGTGTGTCCTGTCCCTGCCCTTCTCTGT
 
GCCTCAGTTTCCTCATTGCTTAGCTGTCTACCTCTTAGGGCTGCTGAAAGCCTTAAATGATTCCACACAC
 
TTGGATAGCACCTGATCCCTGCTGGGATTTGAGGGTTTTCATTACACTTCTGCTAGGATAATCATTTCCA
 
TTGTCGAGGTGGGAACACAAAGGCTTGGAGTGGTTAATTCCTTGCCAAGGCCCCGCAGGAGGAAGCAGGG
 
TCCTGCATGGGAGGCCGGCTAGCCTGCCCACTGCCCCATGTCCCCACAGGTGATGGAGGCCTTTGAGCAG
 
GCCGAGCGGAAGCCCAAACCCAACCCCAACCTACTCTTCTCAGACGTGTATCAGGAGATGCCCGCCCAGC
 
TCCGCAAGCAGCAGGAGTCTCTGGCCCGCCACCTGCAGACCTACGGGGAGCACTACCCACTGGATCACTT
 
CGATAAGTGAGACCTGCTCAGCCCACCCCCACCCATCCTCAGCTACCCCGAGAGGTAGCCCCACTCTAAG
 
GGGAGCAGGGGGACCTGACAGCACACCACTGTCTTCCCCAGTCAGCTCCCTCTAAAATACTCAGCGGCCA
 
GGGCGGCTGCCACTCTTCACCCCTGCTCCTCCCGGCTGTTACATTGTCAGGGGACAGCATCTGCAGCAGT
 
TGCTGAGGCTCCGTCAGCCCCCTCTTCACCTGTTGTTACAGTGCCTTCTCCCAGGGGCTGGGTGAGGGCA
 
CATTCAGGACTAGAAGCCCCTCTGGGCATGGGGTGGACATGGCAGGTCAGCCTGTGGAACTTGCGCAGGT
 
GCGAGTGGCCAGCAGAGGTCACGAATAAACTGCATCTCTGCGCCTGGCTCTCTACCA
 

Latest revision as of 22:44, 16 June 2011

Sequence

  • Uniprot:

>sp|P12694|ODBA_HUMAN 2-oxoisovalerate dehydrogenase subunit alpha, mitochondrial OS=Homo sapiens GN=BCKDHA PE=1 SV=2

MAVAIAAARVWRLNRGLSQAALLLLRQPGARGLARSHPPRQQQQFSSLDDKPQFPGASAE
FIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYKSMTLLNTMDRILY
ESQRQGRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYRDYPLELFMAQCYG
NISDLGKGRQMPVHYGCKERHFVTISSPLATQIPQAVGAAYAAKRANANRVVICYFGEGA
ASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIAARGPGYGIMSIRVDG
NDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRSVDEVNYWDKQDHP
ISRLRHYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNLLFSDVYQEMPAQL
RKQQESLARHLQTYGEHYPLDHFDK

Sequence info: [1] The Uniprot sequence is 445 aa long, as is contains the transit peptide sequence from position 1-45.

  • PDB:

>1U5B:A|PDBID|CHAIN|SEQUENCE

SSLDDKPQFPGASAEFIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYKSMTLLNTMDRILYESQRQ
GRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYRDYPLELFMAQCYGNISDLGKGRQMPVHYGCKERHFVTI
SSPLATQIPQAVGAAYAAKRANANRVVICYFGEGAASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIA
ARGPGYGIMSIRVDGNDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRSVDEVNYWDKQDHPISRLR
HYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNLLFSDVYQEMPAQLRKQQESLARHLQTYGEHYPLDHFDK

Sequence info: [2]

Mutated sequence

The following sequence shows the sequence inclusive all point mutations (missense/nonsense) listed in HGMD. (green: signal sequence)

> bckdha 445 aminoacids; Mw=50481.62Da

MAVAIAAARVWRLNRGLSQAALLLLRQPGARGLARSHPPRQQQQFSSLDDKPQFPGASAE
FIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYKSMTLLNTMDRILY
ESQRQGRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYRDYPLELFMAQCYG
NISDLGKGRQMPVHYGCKERHFVTISSPLATQIPQAVGAAYAAKRANANRVVICYFGEGA
ASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIAARGPGYGIMSIRVDG
NDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRSVDEVNYWDKQDHP
ISRLRHYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNLLFSDVYQEMPAQL
RKQQESLARHLQTYGEHYPLDHFDK*


The following sequence is the reference sequence used by dbSNP. Note that this sequence is longer that 400 amino acids (protein length) and even longer than 445 amino acids (protein plus signal sequence(green)). It contains additional amino acids both at the beginning and the end of the sequence (blue).

LRECRTAEWLLAKMAVAIAAARVWRLNRGLSQAALLLLRQPGARGLARSHPPRQQQQFSS
LDDKPQFPGASAEFIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYK
SMTLLNTMDRILYESQRQGRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYR
DYPLELFMAQCYGNISDLGKGRQMPVHYGCKERHFVTISSPLATQIPQAVGAAYAAKRAN
ANRVVICYFGEGAASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIAAR
GPGYGIMSIRVDGNDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRS
VDEVNYWDKQDHPISRLRHYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNL
LFSDVYQEMPAQLRKQQESLARHLQTYGEHYPLDHFDK.DLLSPPPPILSYPER.PHSKG
SRGT.QHTTVFPSQLPLKYSAARAAATLHPCSSRLLHCQGTASAAVAEAPSAPSSPVVTV
PSPRGWVRAHSGLEAPLGMGWTWQVSLWNLRRCEWPAEVTNKLHLCAWLSTKKKKKK


This sequence has an additional 13 amino acids at the beginning, which should be taken care of when comparing the SNP positions with the positions retrieved by HGMD.

go to Task 2: Sequence_Alignments

go to Task 5: Mapping SNPs

back to Maple syrup urine disease main page