Difference between revisions of "Reference Sequence BCKDHA"

From Bioinformatikpedia
(Sequence)
(Mutated sequence)
 
(9 intermediate revisions by the same user not shown)
Line 4: Line 4:
   
 
<tt>
 
<tt>
MAVAIAAARVWRLNRGLSQAALLLLRQPGARGLARSHPPRQQQQFSSLDDKPQFPGASAE
+
MAVAIAAARVWRLNRGLSQAALLLLRQPGARGLARSHPPRQQQQFSSLDDKPQFPGASAE<br>
FIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYKSMTLLNTMDRILY
+
FIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYKSMTLLNTMDRILY<br>
ESQRQGRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYRDYPLELFMAQCYG
+
ESQRQGRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYRDYPLELFMAQCYG<br>
NISDLGKGRQMPVHYGCKERHFVTISSPLATQIPQAVGAAYAAKRANANRVVICYFGEGA
+
NISDLGKGRQMPVHYGCKERHFVTISSPLATQIPQAVGAAYAAKRANANRVVICYFGEGA<br>
ASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIAARGPGYGIMSIRVDG
+
ASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIAARGPGYGIMSIRVDG<br>
NDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRSVDEVNYWDKQDHP
+
NDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRSVDEVNYWDKQDHP<br>
ISRLRHYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNLLFSDVYQEMPAQL
+
ISRLRHYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNLLFSDVYQEMPAQL<br>
 
RKQQESLARHLQTYGEHYPLDHFDK
 
RKQQESLARHLQTYGEHYPLDHFDK
 
</tt>
 
</tt>
Line 21: Line 21:
   
 
<tt>
 
<tt>
SSLDDKPQFPGASAEFIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYKSMTLLNTMDRILYESQRQ
+
SSLDDKPQFPGASAEFIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYKSMTLLNTMDRILYESQRQ<br>
GRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYRDYPLELFMAQCYGNISDLGKGRQMPVHYGCKERHFVTI
+
GRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYRDYPLELFMAQCYGNISDLGKGRQMPVHYGCKERHFVTI<br>
SSPLATQIPQAVGAAYAAKRANANRVVICYFGEGAASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIA
+
SSPLATQIPQAVGAAYAAKRANANRVVICYFGEGAASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIA<br>
ARGPGYGIMSIRVDGNDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRSVDEVNYWDKQDHPISRLR
+
ARGPGYGIMSIRVDGNDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRSVDEVNYWDKQDHPISRLR<br>
HYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNLLFSDVYQEMPAQLRKQQESLARHLQTYGEHYPLDHFDK
+
HYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNLLFSDVYQEMPAQLRKQQESLARHLQTYGEHYPLDHFDK<br>
 
</tt>
 
</tt>
   
 
Sequence info: [http://www.pdb.org/pdb/explore/remediatedSequence.do?structureId=1U5B]
 
Sequence info: [http://www.pdb.org/pdb/explore/remediatedSequence.do?structureId=1U5B]
   
  +
== Mutated sequence ==
more about [[Sequence_Alignments]]
 
  +
  +
The following sequence shows the sequence inclusive all point mutations (missense/nonsense) listed in HGMD. (green: signal sequence)
  +
  +
<tt>
  +
> bckdha 445 aminoacids; Mw=50481.62Da
  +
  +
<font color=green>
  +
MAVAIAAARVWRLNRGLSQAALLLLRQPGARGLARSHPPRQQQQF</font>SSLDDKPQFPGASAE<br>
  +
FIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYKSMTLLNTMDRILY<br>
  +
ESQRQGRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYRDYPLELFMAQCYG<br>
  +
NISDLGKGRQMPVHYGCKERHFVTISSPLATQIPQAVGAAYAAKRANANRVVICYFGEGA<br>
  +
ASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIAARGPGYGIMSIRVDG<br>
  +
NDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRSVDEVNYWDKQDHP<br>
  +
ISRLRHYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNLLFSDVYQEMPAQL<br>
  +
RKQQESLARHLQTYGEHYPLDHFDK*
  +
</tt>
  +
  +
  +
The following sequence is the reference sequence used by dbSNP. Note that this sequence is longer that 400 amino acids (protein length) and even longer than 445 amino acids (protein plus signal sequence(green)). It contains additional amino acids both at the beginning and the end of the sequence (blue).
  +
  +
<tt>
  +
<font color=blue>LRECRTAEWLLAK</font><font color=green>MAVAIAAARVWRLNRGLSQAALLLLRQPGARGLARSHPPRQQQQF</font>SS<br>
  +
LDDKPQFPGASAEFIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYK<br>
  +
SMTLLNTMDRILYESQRQGRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYR<br>
  +
DYPLELFMAQCYGNISDLGKGRQMPVHYGCKERHFVTISSPLATQIPQAVGAAYAAKRAN<br>
  +
ANRVVICYFGEGAASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIAAR<br>
  +
GPGYGIMSIRVDGNDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRS<br>
  +
VDEVNYWDKQDHPISRLRHYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNL<br>
  +
LFSDVYQEMPAQLRKQQESLARHLQTYGEHYPLDHFDK<font color=blue>.DLLSPPPPILSYPER.PHSKG<br>
  +
SRGT.QHTTVFPSQLPLKYSAARAAATLHPCSSRLLHCQGTASAAVAEAPSAPSSPVVTV<br>
  +
PSPRGWVRAHSGLEAPLGMGWTWQVSLWNLRRCEWPAEVTNKLHLCAWLSTKKKKKK</font>
  +
</tt>
  +
  +
  +
This sequence has an additional 13 amino acids at the beginning, which should be taken care of when comparing the SNP positions with the positions retrieved by HGMD.
  +
  +
go to Task 2: [[Sequence_Alignments]]
  +
  +
go to Task 5: [[Mapping_SNPs_BCKDHA| Mapping SNPs]]
   
 
back to [[Maple syrup urine disease]] main page
 
back to [[Maple syrup urine disease]] main page

Latest revision as of 22:44, 16 June 2011

Sequence

  • Uniprot:

>sp|P12694|ODBA_HUMAN 2-oxoisovalerate dehydrogenase subunit alpha, mitochondrial OS=Homo sapiens GN=BCKDHA PE=1 SV=2

MAVAIAAARVWRLNRGLSQAALLLLRQPGARGLARSHPPRQQQQFSSLDDKPQFPGASAE
FIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYKSMTLLNTMDRILY
ESQRQGRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYRDYPLELFMAQCYG
NISDLGKGRQMPVHYGCKERHFVTISSPLATQIPQAVGAAYAAKRANANRVVICYFGEGA
ASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIAARGPGYGIMSIRVDG
NDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRSVDEVNYWDKQDHP
ISRLRHYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNLLFSDVYQEMPAQL
RKQQESLARHLQTYGEHYPLDHFDK

Sequence info: [1] The Uniprot sequence is 445 aa long, as is contains the transit peptide sequence from position 1-45.

  • PDB:

>1U5B:A|PDBID|CHAIN|SEQUENCE

SSLDDKPQFPGASAEFIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYKSMTLLNTMDRILYESQRQ
GRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYRDYPLELFMAQCYGNISDLGKGRQMPVHYGCKERHFVTI
SSPLATQIPQAVGAAYAAKRANANRVVICYFGEGAASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIA
ARGPGYGIMSIRVDGNDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRSVDEVNYWDKQDHPISRLR
HYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNLLFSDVYQEMPAQLRKQQESLARHLQTYGEHYPLDHFDK

Sequence info: [2]

Mutated sequence

The following sequence shows the sequence inclusive all point mutations (missense/nonsense) listed in HGMD. (green: signal sequence)

> bckdha 445 aminoacids; Mw=50481.62Da

MAVAIAAARVWRLNRGLSQAALLLLRQPGARGLARSHPPRQQQQFSSLDDKPQFPGASAE
FIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYKSMTLLNTMDRILY
ESQRQGRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYRDYPLELFMAQCYG
NISDLGKGRQMPVHYGCKERHFVTISSPLATQIPQAVGAAYAAKRANANRVVICYFGEGA
ASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIAARGPGYGIMSIRVDG
NDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRSVDEVNYWDKQDHP
ISRLRHYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNLLFSDVYQEMPAQL
RKQQESLARHLQTYGEHYPLDHFDK*


The following sequence is the reference sequence used by dbSNP. Note that this sequence is longer that 400 amino acids (protein length) and even longer than 445 amino acids (protein plus signal sequence(green)). It contains additional amino acids both at the beginning and the end of the sequence (blue).

LRECRTAEWLLAKMAVAIAAARVWRLNRGLSQAALLLLRQPGARGLARSHPPRQQQQFSS
LDDKPQFPGASAEFIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYK
SMTLLNTMDRILYESQRQGRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYR
DYPLELFMAQCYGNISDLGKGRQMPVHYGCKERHFVTISSPLATQIPQAVGAAYAAKRAN
ANRVVICYFGEGAASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIAAR
GPGYGIMSIRVDGNDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRS
VDEVNYWDKQDHPISRLRHYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNL
LFSDVYQEMPAQLRKQQESLARHLQTYGEHYPLDHFDK.DLLSPPPPILSYPER.PHSKG
SRGT.QHTTVFPSQLPLKYSAARAAATLHPCSSRLLHCQGTASAAVAEAPSAPSSPVVTV
PSPRGWVRAHSGLEAPLGMGWTWQVSLWNLRRCEWPAEVTNKLHLCAWLSTKKKKKK


This sequence has an additional 13 amino acids at the beginning, which should be taken care of when comparing the SNP positions with the positions retrieved by HGMD.

go to Task 2: Sequence_Alignments

go to Task 5: Mapping SNPs

back to Maple syrup urine disease main page