Difference between revisions of "Mapping mutations of ARS A"
From Bioinformatikpedia
(→The mutation map) |
|||
Line 283: | Line 283: | ||
* 136: The mutationas are different amino acid substitutions. P -> L is annotated in HGMD and P -> A is annotated in dbSNP. |
* 136: The mutationas are different amino acid substitutions. P -> L is annotated in HGMD and P -> A is annotated in dbSNP. |
||
* 82: Is mutation is identical in both databases and leads to a substitution: P -> L. |
* 82: Is mutation is identical in both databases and leads to a substitution: P -> L. |
||
+ | |||
+ | [[File:Mut map 1auk.png | 400px | center | thumb | Structure of ARSA. Synonymous mutations are shown in green, missense/nonsense mutations in red. The active site is depicted in yellow.]] |
Revision as of 15:01, 16 June 2011
HGMD
Accession Number | Codon change | Amino acid change | position |
CM042298 | cGAC-AAC | Asp-Asn | 29 |
CM990171 | cGGC-AGC | Gly-Ser | 32 |
CM065974 | CTG-CCG | Leu-Pro | 52 |
CM990172 | CTG-CCG | Leu-Pro | 68 |
CM950092 | CCG-CTG | Pro-Leu | 82 |
CM940096 | CGG-CAG | Arg-Gln | 84 |
CM990173 | tCGG-TGG | Arg-Trp | 84 |
CM940097 | GGC-GAC | Gly-Asp | 86 |
CM990174 | gCCC-GCC | Pro-Ala | 94 |
CM970109 | AGC-AAC | Ser-Asn | 95 |
CM910049 | TCC-TTC | Ser-Phe | 96 |
CM910050 | GGC-GAC | Gly-Asp | 99 |
CM990175 | GGC-GTC | Gly-Val | 99 |
CM970110 | aGGA-AGA | Gly-Arg | 119 |
CM940098 | cGGC-AGC | Gly-Ser | 122 |
CM980118 | CTG-CCG | Leu-Pro | 135 |
CM940099 | CCC-CTC | Pro-Leu | 136 |
CM990176 | gCCC-TCC | Pro-Ser | 136 |
CM004461 | tCGA-GGA | Arg-Gly | 143 |
CM990177 | CCG-CTG | Pro-Leu | 148 |
CM970111 | cGAC-TAC | Asp-Tyr | 152 |
CM962419 | CAGg-CAC | Gln-His | 153 |
CM940100 | GGC-GAC | Gly-Asp | 154 |
CM940101 | CCC-CGC | Pro-Arg | 155 |
CM032834 | CCC-CTC | Pro-Leu | 155 |
CM042299 | cTGC-CGC | Cys-Arg | 156 |
CM940102 | CCT-CGT | Pro-Arg | 167 |
CM940103 | cGAC-AAC | Asp-Asn | 169 |
CM950093 | TGT-TAT | Cys-Tyr | 172 |
CM910051 | ATC-AGC | Ile-Ser | 179 |
CM032835 | CTG-CAG | Leu-Gln | 181 |
CM950094 | CAGc-CAC | Gln-His | 190 |
CM990178 | gCCC-ACC | Pro-Thr | 191 |
CM990179 | TGGc-TGA | Trp-Term | 193 |
CM950095 | TAC-TGC | Tyr-Cys | 201 |
CM930039 | GCC-GTC | Ala-Val | 212 |
CM050538 | cTTC-GTC | Phe-Val | 219 |
CM930040 | GCC-GTC | Ala-Val | 224 |
CM990180 | cCAC-TAC | His-Tyr | 227 |
CM940104 | cCCT-ACT | Pro-Thr | 231 |
CM940105 | cCGC-TGC | Arg-Cys | 244 |
CM970112 | CGC-CAC | Arg-His | 244 |
CM930041 | cGGG-AGG | Gly-Arg | 245 |
CM034715 | TTT-TCT | Phe-Ser | 247 |
CM970113 | TCC-TAC | Ser-Tyr | 250 |
CM024340 | gGAG-AAG | Glu-Lys | 253 |
CM960078 | gGAT-CAT | Asp-His | 255 |
CM930042 | ACG-ATG | Thr-Met | 274 |
CM074714 | ACT-ATT | Thr-Ile | 279 |
CM993444 | aGAC-TAC | Asp-Tyr | 281 |
CM023013 | gACC-CCC | Thr-Pro | 286 |
CM990181 | CGT-CAT | Arg-His | 288 |
CM940106 | gCGT-TGT | Arg-Cys | 288 |
CM042300 | cGGC-AGC | Gly-Ser | 293 |
CM044574 | GGC-GAC | Gly-Asp | 293 |
CM042301 | TGC-TAC | Cys-Tyr | 294 |
CM930043 | TCC-TAC | Ser-Tyr | 295 |
CM980119 | TTG-TCG | Leu-Ser | 298 |
CM990182 | TGT-TTT | Cys-Phe | 300 |
CM032836 | cTAC-CAC | Tyr-His | 306 |
HM060041 | cGAG-AAG | Glu-Lys | 307 |
CM990183 | GGC-GAC | Gly-Asp | 308 |
CM962420 | GGC-GTC | Gly-Val | 308 |
CM930044 | cGGT-AGT | Gly-Ser | 309 |
CM950096 | CGA-CAA | Arg-Gln | 311 |
CM001061 | GAGc-GAT | Glu-Asp | 312 |
CM970114 | tGCC-ACC | Ala-Thr | 314 |
CM004546 | TGGc-TGA | Trp-Term | 318 |
CM032837 | cGGC-AGC | Gly-Ser | 325 |
CM990184 | ACC-ATC | Thr-Ile | 327 |
CM065973 | cGAG-TAG | Glu-Term | 329 |
CM940107 | GAC-GTC | Asp-Val | 335 |
CM890013 | AAT-AGT | Asn-Ser | 350 |
CM970115 | AAGa-AAC | Lys-Asn | 367 |
CM940108 | CGG-CAG | Arg-Gln | 370 |
CM940109 | tCGG-TGG | Arg-Trp | 370 |
CM940110 | CCG-CTG | Pro-Leu | 377 |
CM930045 | cGAG-AAG | Glu-Lys | 382 |
CM970116 | cCGT-TGT | Arg-Cys | 384 |
CM980120 | CGG-CAG | Arg-Gln | 390 |
CM940111 | gCGG-TGG | Arg-Trp | 390 |
CM910052 | ACT-AGT | Thr-Ser | 391 |
CM980121 | tCAC-TAC | His-Tyr | 397 |
CM065972 | cAGT-GGT | Ser-Gly | 406 |
CM012065 | ACC-ATC | Thr-Ile | 408 |
CM940112 | ACT-ATT | Thr-Ile | 409 |
CM990185 | gCCC-ACC | Pro-Thr | 425 |
CM940113 | CCG-CTG | Pro-Leu | 426 |
CM970117 | CTC-CCC | Leu-Pro | 428 |
CM032838 | TAT-TCT | Tyr-Ser | 429 |
CM990186 | GCC-GTC | Ala-Val | 464 |
CM034716 | GCT-GGT | Ala-Gly | 469 |
CM940114 | gCAG-TAG | Gln-Term | 486 |
CM044573 | cTGT-GGT | Cys-Gly | 489 |
>sp|P15289|ARSA_HUMAN
MGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRFT
DFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLTGM
AGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIP
LLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSFAE
RSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLLRC
GKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDLSP
LLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPACHASSSL
TAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGPSQVARG
EDPALQICCHPGCTPRPACCHCPDPHA
dbSNP
The "SNP" search for ARSA yielded 123 known human mutations for the protein. These are more than selecting the SNPO over the GeneView interface... This is because there exist two isoforms of ARSA and we manually checked that a lot of the other mutations belonged to the other isoform (NP_001078897). We used the isoform, which we have used so far (NP_000478). We chose the gene sequence within GeneView such that it matched the sequence version, that we used so far.
SNP ID | SNP type | nucleotide (mutation) | amino acid (mutation) | nucleotide (reference) | amino acid (reference) | position |
---|---|---|---|---|---|---|
rs6151428 | missense | A | His [H] | G | Arg [R] | 496 |
rs117341984 | missense | A | Arg [R] | G | Gly [G] | 447 |
rs6151427 | missense | G | Ser [S] | A | Asn [N] | 440 |
rs6151425 | synonymous | T | Asp [D] | C | Asp [D] | 381 |
rs6151422 | missense | G | Val [V] | T | Phe [F] | 356 |
rs113990230 | synonymous | C | His [H] | T | His [H] | 206 |
rs62001867 | missense | A | Thr [T] | G | Ala [A] | 205 |
rs34457249 | synonymous | T | Pro [P] | C | Pro [P] | 195 |
rs6151415 | missense | T | Cys [C] | G | Trp [W] | 193 |
rs113209108 | synonymous | T | Ser [S] | C | Ser [S] | 186 |
rs6151412 | synonymous | T | His [H] | C | His [H] | 151 |
rs60504011 | missense | G | Ala [A] | C | Pro [P] | 136 |
rs6151411 | missense | G | Leu [L] | C | Pro [P] | 82 |
rs6151410 | missense | T | Gly [G] | C | Gly [G] | 79 |
>sp|P15289|ARSA_HUMAN
MGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRFT
DFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLTGM
AGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIP
LLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSFAE
RSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLLRC
GKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDLSP
LLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPACHASSSL
TAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGPSQVARG
EDPALQICCHPGCTPRPACCHCPDPHA
The mutation map
>sp|P15289|ARSA_HUMAN
MGAPRSLLLALAAGLAVARPPNIVLIFADDLGGGDLGCYGHPSSTTPNLDQLLAGGLRFT
DFYVPVSLLTPSRAALLTGGLPPPRRGGYPGVLVPPSSGGGPLEEVTVAEVLAARGYLTGG
AGGWHLGVGPEGAFLLPPHQGFHRRLGIPPSHHDQGPCNLTCFPPATPPDDGCCQGLVPII
LLANLSSEAQQPWWPPLEARYYAFAAHLMADAARQDRPFFLYYAAHHHHYPPFSGQSFAE
RSGRRGFFDSSMEEDDAVGTLMTAIGDLGLLEETTVIFTTDDGPETTRRSRGGGCSLLLCC
KGTTYYEGGRREAAAFWPGHIAPGGTTELASSLDDLPTLAALAGAPLPNNTLDGFFLSP
LLLGTGKKPRRSLFFYPPYPDDERRVFAVRRTKYKAHHFTQGSAHSSTTTDPACHASSSL
TAHEPPPLLYLSKDPGENYNNLGGVAGGTPEVLQALKQLQLLKAALDAAATFGPSQVARG
EDPALQICCCPGCTPRRACCHCPDPHA
There are 3 identical mutated residues, that are annotated in both databases are at position:
- 193: The mutations are different. In HGMD, the mutation results in a premature stop codon, thus the main part of the whole protein is truncated. In dbSNP, there is a amino acid substitution (W -> C).
- 136: The mutationas are different amino acid substitutions. P -> L is annotated in HGMD and P -> A is annotated in dbSNP.
- 82: Is mutation is identical in both databases and leads to a substitution: P -> L.