Difference between revisions of "Reference Sequence BCKDHA"
(→Sequence searches) |
(→Mutated sequence) |
||
(31 intermediate revisions by 2 users not shown) | |||
Line 4: | Line 4: | ||
<tt> |
<tt> |
||
− | MAVAIAAARVWRLNRGLSQAALLLLRQPGARGLARSHPPRQQQQFSSLDDKPQFPGASAE |
+ | MAVAIAAARVWRLNRGLSQAALLLLRQPGARGLARSHPPRQQQQFSSLDDKPQFPGASAE<br> |
− | FIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYKSMTLLNTMDRILY |
+ | FIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYKSMTLLNTMDRILY<br> |
− | ESQRQGRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYRDYPLELFMAQCYG |
+ | ESQRQGRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYRDYPLELFMAQCYG<br> |
− | NISDLGKGRQMPVHYGCKERHFVTISSPLATQIPQAVGAAYAAKRANANRVVICYFGEGA |
+ | NISDLGKGRQMPVHYGCKERHFVTISSPLATQIPQAVGAAYAAKRANANRVVICYFGEGA<br> |
− | ASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIAARGPGYGIMSIRVDG |
+ | ASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIAARGPGYGIMSIRVDG<br> |
− | NDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRSVDEVNYWDKQDHP |
+ | NDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRSVDEVNYWDKQDHP<br> |
− | ISRLRHYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNLLFSDVYQEMPAQL |
+ | ISRLRHYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNLLFSDVYQEMPAQL<br> |
RKQQESLARHLQTYGEHYPLDHFDK |
RKQQESLARHLQTYGEHYPLDHFDK |
||
</tt> |
</tt> |
||
Line 21: | Line 21: | ||
<tt> |
<tt> |
||
− | SSLDDKPQFPGASAEFIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYKSMTLLNTMDRILYESQRQ |
+ | SSLDDKPQFPGASAEFIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYKSMTLLNTMDRILYESQRQ<br> |
− | GRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYRDYPLELFMAQCYGNISDLGKGRQMPVHYGCKERHFVTI |
+ | GRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYRDYPLELFMAQCYGNISDLGKGRQMPVHYGCKERHFVTI<br> |
− | SSPLATQIPQAVGAAYAAKRANANRVVICYFGEGAASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIA |
+ | SSPLATQIPQAVGAAYAAKRANANRVVICYFGEGAASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIA<br> |
− | ARGPGYGIMSIRVDGNDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRSVDEVNYWDKQDHPISRLR |
+ | ARGPGYGIMSIRVDGNDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRSVDEVNYWDKQDHPISRLR<br> |
− | HYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNLLFSDVYQEMPAQLRKQQESLARHLQTYGEHYPLDHFDK |
+ | HYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNLLFSDVYQEMPAQLRKQQESLARHLQTYGEHYPLDHFDK<br> |
</tt> |
</tt> |
||
Sequence info: [http://www.pdb.org/pdb/explore/remediatedSequence.do?structureId=1U5B] |
Sequence info: [http://www.pdb.org/pdb/explore/remediatedSequence.do?structureId=1U5B] |
||
+ | == Mutated sequence == |
||
+ | The following sequence shows the sequence inclusive all point mutations (missense/nonsense) listed in HGMD. (green: signal sequence) |
||
− | == Sequence Alignments == |
||
+ | <tt> |
||
− | === Sequence searches === |
||
+ | > bckdha 445 aminoacids; Mw=50481.62Da |
||
− | * FASTA |
||
− | ../bin/fasta36 sequence.fasta database > FastaOutput.txt |
||
+ | <font color=green> |
||
− | * BLAST |
||
+ | MAVAIAAARVWRLNRGLSQAALLLLRQPGARGLARSHPPRQQQQF</font>SSLDDKPQFPGASAE<br> |
||
− | blastall -p blastp -d database -i sequence.fasta > BlastOutput.txt |
||
+ | FIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYKSMTLLNTMDRILY<br> |
||
+ | ESQRQGRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYRDYPLELFMAQCYG<br> |
||
+ | NISDLGKGRQMPVHYGCKERHFVTISSPLATQIPQAVGAAYAAKRANANRVVICYFGEGA<br> |
||
+ | ASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIAARGPGYGIMSIRVDG<br> |
||
+ | NDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRSVDEVNYWDKQDHP<br> |
||
+ | ISRLRHYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNLLFSDVYQEMPAQL<br> |
||
+ | RKQQESLARHLQTYGEHYPLDHFDK* |
||
+ | </tt> |
||
+ | |||
+ | |||
+ | The following sequence is the reference sequence used by dbSNP. Note that this sequence is longer that 400 amino acids (protein length) and even longer than 445 amino acids (protein plus signal sequence(green)). It contains additional amino acids both at the beginning and the end of the sequence (blue). |
||
+ | |||
+ | <tt> |
||
+ | <font color=blue>LRECRTAEWLLAK</font><font color=green>MAVAIAAARVWRLNRGLSQAALLLLRQPGARGLARSHPPRQQQQF</font>SS<br> |
||
+ | LDDKPQFPGASAEFIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYK<br> |
||
+ | SMTLLNTMDRILYESQRQGRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYR<br> |
||
+ | DYPLELFMAQCYGNISDLGKGRQMPVHYGCKERHFVTISSPLATQIPQAVGAAYAAKRAN<br> |
||
+ | ANRVVICYFGEGAASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIAAR<br> |
||
+ | GPGYGIMSIRVDGNDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRS<br> |
||
+ | VDEVNYWDKQDHPISRLRHYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNL<br> |
||
+ | LFSDVYQEMPAQLRKQQESLARHLQTYGEHYPLDHFDK<font color=blue>.DLLSPPPPILSYPER.PHSKG<br> |
||
+ | SRGT.QHTTVFPSQLPLKYSAARAAATLHPCSSRLLHCQGTASAAVAEAPSAPSSPVVTV<br> |
||
+ | PSPRGWVRAHSGLEAPLGMGWTWQVSLWNLRRCEWPAEVTNKLHLCAWLSTKKKKKK</font> |
||
+ | </tt> |
||
− | * PSIBLAST |
||
− | blastpgp -i sequence.fasta -j iterations -h evalueCutoff -d database > PsiblastOutput.txt |
||
+ | This sequence has an additional 13 amino acids at the beginning, which should be taken care of when comparing the SNP positions with the positions retrieved by HGMD. |
||
− | * HHSearch |
||
− | hhsearch -i query -d database -o output.txt |
||
+ | go to Task 2: [[Sequence_Alignments]] |
||
+ | go to Task 5: [[Mapping_SNPs_BCKDHA| Mapping SNPs]] |
||
− | database = /data/blast/nr/nr |
||
back to [[Maple syrup urine disease]] main page |
back to [[Maple syrup urine disease]] main page |
Latest revision as of 22:44, 16 June 2011
Sequence
- Uniprot:
>sp|P12694|ODBA_HUMAN 2-oxoisovalerate dehydrogenase subunit alpha, mitochondrial OS=Homo sapiens GN=BCKDHA PE=1 SV=2
MAVAIAAARVWRLNRGLSQAALLLLRQPGARGLARSHPPRQQQQFSSLDDKPQFPGASAE
FIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYKSMTLLNTMDRILY
ESQRQGRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYRDYPLELFMAQCYG
NISDLGKGRQMPVHYGCKERHFVTISSPLATQIPQAVGAAYAAKRANANRVVICYFGEGA
ASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIAARGPGYGIMSIRVDG
NDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRSVDEVNYWDKQDHP
ISRLRHYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNLLFSDVYQEMPAQL
RKQQESLARHLQTYGEHYPLDHFDK
Sequence info: [1] The Uniprot sequence is 445 aa long, as is contains the transit peptide sequence from position 1-45.
- PDB:
>1U5B:A|PDBID|CHAIN|SEQUENCE
SSLDDKPQFPGASAEFIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYKSMTLLNTMDRILYESQRQ
GRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYRDYPLELFMAQCYGNISDLGKGRQMPVHYGCKERHFVTI
SSPLATQIPQAVGAAYAAKRANANRVVICYFGEGAASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIA
ARGPGYGIMSIRVDGNDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRSVDEVNYWDKQDHPISRLR
HYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNLLFSDVYQEMPAQLRKQQESLARHLQTYGEHYPLDHFDK
Sequence info: [2]
Mutated sequence
The following sequence shows the sequence inclusive all point mutations (missense/nonsense) listed in HGMD. (green: signal sequence)
> bckdha 445 aminoacids; Mw=50481.62Da
MAVAIAAARVWRLNRGLSQAALLLLRQPGARGLARSHPPRQQQQFSSLDDKPQFPGASAE
FIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYKSMTLLNTMDRILY
ESQRQGRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYRDYPLELFMAQCYG
NISDLGKGRQMPVHYGCKERHFVTISSPLATQIPQAVGAAYAAKRANANRVVICYFGEGA
ASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIAARGPGYGIMSIRVDG
NDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRSVDEVNYWDKQDHP
ISRLRHYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNLLFSDVYQEMPAQL
RKQQESLARHLQTYGEHYPLDHFDK*
The following sequence is the reference sequence used by dbSNP. Note that this sequence is longer that 400 amino acids (protein length) and even longer than 445 amino acids (protein plus signal sequence(green)). It contains additional amino acids both at the beginning and the end of the sequence (blue).
LRECRTAEWLLAKMAVAIAAARVWRLNRGLSQAALLLLRQPGARGLARSHPPRQQQQFSS
LDDKPQFPGASAEFIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYK
SMTLLNTMDRILYESQRQGRISFYMTNYGEEGTHVGSAAALDNTDLVFGQYREAGVLMYR
DYPLELFMAQCYGNISDLGKGRQMPVHYGCKERHFVTISSPLATQIPQAVGAAYAAKRAN
ANRVVICYFGEGAASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIAAR
GPGYGIMSIRVDGNDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGHHSTSDDSSAYRS
VDEVNYWDKQDHPISRLRHYLLSQGWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNL
LFSDVYQEMPAQLRKQQESLARHLQTYGEHYPLDHFDK.DLLSPPPPILSYPER.PHSKG
SRGT.QHTTVFPSQLPLKYSAARAAATLHPCSSRLLHCQGTASAAVAEAPSAPSSPVVTV
PSPRGWVRAHSGLEAPLGMGWTWQVSLWNLRRCEWPAEVTNKLHLCAWLSTKKKKKK
This sequence has an additional 13 amino acids at the beginning, which should be taken care of when comparing the SNP positions with the positions retrieved by HGMD.
go to Task 2: Sequence_Alignments
go to Task 5: Mapping SNPs
back to Maple syrup urine disease main page