Difference between revisions of "Researching SNPs TSD"
(→HGMD) |
(→dbSNP) |
||
Line 23: | Line 23: | ||
== dbSNP == |
== dbSNP == |
||
+ | dbSNP is available as build 135 and was last updated Oct 2011 and consits of 292,031,791 submissions. It stores the information on diverse DNA variations such as single base nucleotide substitutions and short deletion and insertion polymorphisms <ref name="dbSNPsummary"> http://www.ncbi.nlm.nih.gov/projects/SNP/snp_summary.cgi </ref>. |
||
− | A search for all mutations reveals 579 entries. Some rs IDs are obsolete and redundant thus there are 526 unique and up to date SNPs alltogether for HEXA in ''homo sapiens''. |
||
+ | |||
− | There are 18 unique non-synonymous mutations and 52 unique missense mutations. |
||
+ | A search for all mutations reveals 579 entries. Some rs IDs are obsolete and redundant thus there are 526 unique and up to date SNPs alltogether for HEXA in ''homo sapiens''. 406 of those are in an intron region, 14 mRNA utr, 14 nonsense or stop gained. |
||
+ | There are 18 unique non-synonymous mutations and 51 unique missense mutations. What kind of mutations the remaining 23 entries are remains in disguise. |
||
== SNPdbe == |
== SNPdbe == |
Revision as of 16:24, 9 June 2012
Oh it was gorgeousness and gorgeosity made flesh. The trombones crunched redgold under my bed, and behind my gulliver the trumpets three-wise silverflamed, and there by the door the timps rolling through my guts and out again crunched like candy thunder. Oh, it was wonder of wonders. And then, a bird of like rarest spun heavenmetal, or like silvery wine flowing in a spaceship, gravity all nonsense now, came the violin solo above all the other strings, and those strings were like a cage of silk round my bed. Then flute and oboe bored, like worms of like platinum, into the thick thick toffee gold and silver. I was in such bliss, my brothers.
-A Clockwork Orange
The journal for this task can be found here.
Contents
Sequence mapping
The different databases use different sequences as basis for the indices of their SNP data. In the following, the reference protein sequence remains P06865, however all databases base their annotations on nucleotide sequences as well. While the final annotations will only be displayed mapped onto the protein sequence, NM_000520.4 will be used as a nucleotide reference sequence in the background. This entry describes an mRNA of HEXA and is also linked to by the Uniprot entry of P06865.
HGMD lists NM_000520.3 as reference, which is a previous version of NM_000520.4 that was chosen as reference for this task. A Needleman-Wunsch pairwise sequence alignment between the two nucleotide sequences in the entries shows that there are two single nucleotide differences in the last third of the sequence and that the more current version of the entry is 117 nucleotides longer at the beginning of the sequence. Since this region is annotated to belong to an exon, the question remains whether this has an effect on the protein sequence. A short comparison shows that there is a single differing residue at position 436 where a Val in NM_000520.3 is subsituted by an Ile in NM_000520.4. However since HGMD does not list a SNP at this position, this is not an issue.
A table here?, with the different reference sequences per database? snpdbe has a lot tough! HGMD lists 'NM_000520.3' as
Mutations
Mutations are described by the simple scheme of TODO. Nonsense mutations were ignored, if present in the database at all, since it would not make sense to predict an effect for these in the following tasks.
HGMD
Database description
dbSNP
dbSNP is available as build 135 and was last updated Oct 2011 and consits of 292,031,791 submissions. It stores the information on diverse DNA variations such as single base nucleotide substitutions and short deletion and insertion polymorphisms <ref name="dbSNPsummary"> http://www.ncbi.nlm.nih.gov/projects/SNP/snp_summary.cgi </ref>.
A search for all mutations reveals 579 entries. Some rs IDs are obsolete and redundant thus there are 526 unique and up to date SNPs alltogether for HEXA in homo sapiens. 406 of those are in an intron region, 14 mRNA utr, 14 nonsense or stop gained. There are 18 unique non-synonymous mutations and 51 unique missense mutations. What kind of mutations the remaining 23 entries are remains in disguise.
SNPdbe
OMIM
don't know yet where to look
SNPedia
Mutation map
References
<references/>