Difference between revisions of "Task 5 - Mapping SNPs Canavan"
(→Coding SNPs) |
(→dbSNP) |
||
(76 intermediate revisions by 2 users not shown) | |||
Line 1: | Line 1: | ||
== First impression == |
== First impression == |
||
− | + | ==Protocol== |
|
Further information can be found in the [[CD_task5_protocol|protocol]]. |
Further information can be found in the [[CD_task5_protocol|protocol]]. |
||
+ | |||
+ | |||
+ | == Sources for mutations == |
||
===HGMD=== |
===HGMD=== |
||
+ | |||
− | 74 (79 in 2012 professional) total for cDNA sequence NM_000049.2 and amino acid sequence NP_000040.1, out of which |
||
+ | <table> |
||
− | * 47 missense/nonsense |
||
+ | <tr> |
||
− | * 5 splicing |
||
+ | <td width="60%" valign="top"> |
||
− | * 12 small deletions |
||
+ | The aim of ther Human Gene Mutation Database (HGMD) ist to collect known gene variants that are associated with human inherited diseases. All listed mutations are based on published results, that stand as a validation for the listed mutations. The database is manually curated and relies on the information given by authors in their work. |
||
− | * 2 small insertions |
||
+ | It is stated on the HGMD website that "''Many published mutation searches identify more than one genetic change in a single patient. In such cases, the relationship between a given lesion and the clinical phenotype has not always been immediately clear,[..]. The possibility of unintentional inclusion of some lesions with little or no pathological significance can therefore not be ruled out.''" Listed mutations therefore must not be causal for any disease |
||
− | * 1 indel |
||
+ | |||
− | * 7 gross deletions |
||
+ | </td> |
||
+ | <td> |
||
+ | For Canavan Disease there are 74 mutations listed (79 in the 2012 professional version): |
||
+ | |||
[http://www.hgmd.cf.ac.uk/ac/gene.php?gene=ASPA link to search] |
[http://www.hgmd.cf.ac.uk/ac/gene.php?gene=ASPA link to search] |
||
+ | <table cellspacing=0 align="center" cellpadding=5> |
||
− | === SNPdbe === |
||
+ | <tr> |
||
− | * 55 total, includes predicted functional effect. 29 of these 55 labelled as involved in Canavan Disease. |
||
+ | <td style="border-bottom:solid;border-right:solid;font-weight:bold">Type</td> |
||
+ | <td style="border-bottom:solid;border-right:solid;font-weight:bold">Description</td> |
||
+ | <td style="border-bottom:solid;font-weight:bold">Amount</td> |
||
+ | </tr> |
||
+ | |||
+ | <tr> |
||
+ | <td style="border-right:solid">missense/nonsense</td> |
||
+ | <td style="border-right:solid">base-pair substitution that results in a triplet change</td> |
||
+ | <td>47</td> |
||
+ | </tr> |
||
+ | |||
+ | <tr><td style="border-right:solid">splicing</td> |
||
+ | <td style="border-right:solid">mutations that alter splicing of the gene</td> |
||
+ | <td>5</td> |
||
+ | </tr> |
||
+ | |||
+ | <tr> |
||
+ | <td style="border-right:solid">small deletions</td> |
||
+ | <td style="border-right:solid">Micro-deletions (20 bp or less)</td> |
||
+ | <td>12</td> |
||
+ | </tr> |
||
+ | |||
+ | <tr> |
||
+ | <td style="border-right:solid">small insertions</td> |
||
+ | <td style="border-right:solid">Micro-insertions (20 bp or less)</td> |
||
+ | <td>2</td> |
||
+ | </tr> |
||
+ | |||
+ | <tr> |
||
+ | <td style="border-right:solid">small indels</td> |
||
+ | <td style="border-right:solid">Micro-indels (20 bp or less)</td> |
||
+ | <td>1</td> |
||
+ | </tr> |
||
+ | |||
+ | <tr> |
||
+ | <td style="border-right:solid">gross deletions</td> |
||
+ | <td style="border-right:solid">Deletions of >20 bp</td> |
||
+ | <td>7</td> |
||
+ | </tr> |
||
+ | </table> |
||
+ | |||
+ | |||
+ | </td></tr></table> |
||
+ | |||
+ | ===[http://www.rostlab.org/services/snpdbe/ SNPdbe] === |
||
+ | |||
+ | The nsSNP database of functional effects (SNPdbe) collects information on SNPs from a variety of accessible webservices and additionally aims at providing a functional annotation. For reach variant, SNPdbe lists its predicted effect (SIFT, Snap) and also experimentally derived functional effects. |
||
+ | |||
+ | For Aspartoacylase (P45381 and NP_000040), there are 55 results in total: |
||
+ | * 29 labelled as involved in Canavan Disease |
||
+ | * 12 with experimental evidence |
||
[http://www.rostlab.org/services/snpdbe/dosearch.php?id=name&val=aspa&organism2=human&organism1= link to search] |
[http://www.rostlab.org/services/snpdbe/dosearch.php?id=name&val=aspa&organism2=human&organism1= link to search] |
||
− | === dbSNP === |
+ | === [http://www.ncbi.nlm.nih.gov/projects/SNP/ dbSNP] === |
+ | |||
− | * same identifiers for sequence as HGMD |
||
+ | The Single Nucleotide Polymorphism database (dbSNP) aims at being a "central repository for both single base nucleotide subsitutions and short deletion and insertion polymorphisms". It also lists synonymous variants, that are not listed in any other web resources. |
||
+ | |||
+ | For the gene "aspa" there are 508 SNPs annotated in the whole gene region (including 5' and 3' flanking regions): |
||
+ | * 458 for NP_000040.1 (coding SNPs: 35) |
||
+ | * 493 for NP_001121557.1 (coding SNPs: 35) |
||
* "synonymous-codon"[Function_Class] AND ASPA[GENE] AND "human"[ORGN] AND "snp"[SNP_CLASS] yields only 9 results |
* "synonymous-codon"[Function_Class] AND ASPA[GENE] AND "human"[ORGN] AND "snp"[SNP_CLASS] yields only 9 results |
||
+ | |||
− | * 505 results for SNPs in general in human |
||
− | ** 458 for NP_000040.1 (coding: 23) |
||
− | ** 493 for NP_001121557.1 (coding: 23) |
||
[http://www.ncbi.nlm.nih.gov/projects/SNP/snp_ref.cgi?showRare=on&chooseRs=coding&locusId=443&mrna=NM_001128085.1&ctg=NT_010718.16&prot=NP_001121557.1&orien=forward&refresh=refresh= link to search] |
[http://www.ncbi.nlm.nih.gov/projects/SNP/snp_ref.cgi?showRare=on&chooseRs=coding&locusId=443&mrna=NM_001128085.1&ctg=NT_010718.16&prot=NP_001121557.1&orien=forward&refresh=refresh= link to search] |
||
− | ===SNPedia === |
+ | ===[http://www.snpedia.com/index.php/SNPedia SNPedia] === |
− | links to other pages |
||
+ | SNPedia comes up in wiki style and collects information on human genetics from other web resouces, citing peer-reviewed scientific publications. One can browse for genes, genomes, phenotypes and even medical substances. |
||
− | === OMIM === |
||
+ | |||
− | *12 allelic variants listed for Aspartoacylase => all linkes to Canavan Disease |
||
+ | For the gene "ASPA" it lists 9 variants out of which three are associated with Canavan disease. |
||
− | [http://omim.org/allelicVariant/608034=link to ASPA variants] |
||
+ | |||
+ | [http://www.snpedia.com/index.php/ASPA links to search] |
||
+ | |||
+ | === [http://www.omim.org/ OMIM] === |
||
+ | |||
+ | OMIM stands for Online Mendelian Inheritance in Man and represents a "compendium of human genes and genetic phenotypes". Vast information are given on human mendelian diseases and associated genes that are all based on referenced publications. The database is updated daily and also lists links to other genetic resources. |
||
+ | |||
+ | For Aspartoacylase there are 12 allelic variants listed, which are all linked to Canavan Disease. |
||
+ | |||
+ | [http://omim.org/allelicVariant/608034 link to ASPA variants] |
||
== Coding SNPs == |
== Coding SNPs == |
||
+ | <figure id="venny">[[File:overlap_venny.png|thumb|right|200px|<b><xr nolink id="venny"/></b><br> Overlap of contained SNPs between the databases.]]</figure> |
||
+ | |||
+ | From the above mentioned SNP databases we extracted all coding SNPs for Aspartoacylase. The venn diagram in <xr id="venny"/> shows the overlap of the contained SNPs in the three biggest database sources. |
||
+ | |||
+ | We used the sequence position of the listed SNPs as an identifier to create a unique list of known SNPs. <xr id="coding_snp_table"/> shows the resulting list of unique SNPs. For each polymorphism, the source is given (i.e., from which DB the SNP was extracted), as well as annotated validation, if available. |
||
+ | |||
+ | The resulting table contains 79 SNPs in total, out of which 48 are reported to cause the Canavan Disease. |
||
− | From the above mentioned SNP databases we extracted all coding SNPs for Aspartoacylase. We used the sequence position of the listed SNPs as identifier to create a unique list of known SNPs. <xr id="coding_snp_table"/> shows the resulting list of unique SNPS. For each polymorphism, the source is given (from which DB the SNP was extracted), as well as annotated validation. |
||
<figtable id="coding_snp_table"> |
<figtable id="coding_snp_table"> |
||
<table cellspacing=0 align="center" cellpadding=5> |
<table cellspacing=0 align="center" cellpadding=5> |
||
− | <caption align="center"><b><xr nolink id="coding_snp_table"/></b> In this table all SNPs for Aspartoacylase are listed that could be found in HGMD, dbSNPs, |
+ | <caption align="center"><b><xr nolink id="coding_snp_table"/></b> In this table all SNPs for Aspartoacylase are listed that could be found in HGMD, dbSNPs, SNPdbe and OMIM. Mutations in red are reported to cause the Canavan Disease.</caption> |
<tr> |
<tr> |
||
<td style="border-bottom:solid;border-right:solid;"><b>Residue Position</b></td> |
<td style="border-bottom:solid;border-right:solid;"><b>Residue Position</b></td> |
||
Line 50: | Line 126: | ||
</tr> |
</tr> |
||
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"> |
<td style="border-right:solid" align="left">4</td> |
<td style="border-right:solid" align="left">4</td> |
||
<td style="border-right:solid" align="left">rs142041344 </td> |
<td style="border-right:solid" align="left">rs142041344 </td> |
||
− | <td style="border-right:solid" align="left">dbSNP<br> |
+ | <td style="border-right:solid" align="left">dbSNP<br> SNPdbe</td> |
<td style="border-right:solid" align="left">1000Genomes </td> |
<td style="border-right:solid" align="left">1000Genomes </td> |
||
<td style="border-right:solid" align="left">missense</td> |
<td style="border-right:solid" align="left">missense</td> |
||
Line 59: | Line 135: | ||
</tr> |
</tr> |
||
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"> |
<td style="border-right:solid" align="left" >14</td> |
<td style="border-right:solid" align="left" >14</td> |
||
<td style="border-right:solid" align="left">CM063852</td> |
<td style="border-right:solid" align="left">CM063852</td> |
||
Line 67: | Line 143: | ||
<td style="color:red;border-right:solid;border-color:black" align="left">V14G</td></tr> |
<td style="color:red;border-right:solid;border-color:black" align="left">V14G</td></tr> |
||
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >16</td><td style="border-right:solid" align="left"><br> CM960084</td><td style="border-right:solid" align="left">SNPdbe<br> HGMD</td><td style="border-right:solid" align="left">Kaul (1996) Am J Hum Genet 59</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">I16T</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >18</td><td style="border-right:solid" align="left">CM067343</td><td style="border-right:solid" align="left">HGMD</td><td style="border-right:solid" align="left">Zeng (2006) Adv Exp Med Biol 576</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">G18R</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >21</td><td style="border-right:solid" align="left">CM001608</td><td style="border-right:solid" align="left">SNPdbe<br> HGMD</td><td style="border-right:solid" align="left">SHGMD857217istermans (2000) Eur J Hum Genet 8</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">H21P</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >24</td><td style="border-right:solid" align="left">rs104894551 <br> CM023602</td><td style="border-right:solid" align="left">dbSNP<br> SNPdbe<br> OMIM<br> HGMD</td><td style="border-right:solid" align="left"> Multiple independent submissions to the refSNP cluster<br> Zeng (2002) J Inherit Metab Dis 25</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">E24G</td> |
</tr> |
</tr> |
||
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >26</td><td style="border-right:solid" align="left">rs145616193 </td><td style="border-right:solid" align="left">dbSNP</td><td style="border-right:solid" align="left"></td><td style="border-right:solid" align="left">synonymous</td><td style="border-right:solid" align="left">T26T</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >27</td><td style="border-right:solid" align="left">CM960085</td><td style="border-right:solid" align="left">SNPdbe<br> HGMD</td><td style="border-right:solid" align="left"> Kaul (1996) Am J Hum Genet 59</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">G27R</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >33</td><td style="border-right:solid" align="left">rs138158568 </td><td style="border-right:solid" align="left">dbSNP<br> SNPdbe</td><td style="border-right:solid" align="left"></td><td style="border-right:solid" align="left">missense</td><td style="border-right:solid" align="left">H33R</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >53</td><td style="border-right:solid" align="left">rs17850703 </td><td style="border-right:solid" align="left">dbSNP<br> SNPdbe</td><td style="border-right:solid" align="left"></td><td style="border-right:solid" align="left">missense</td><td style="border-right:solid" align="left">T53A</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >57</td><td style="border-right:solid" align="left">CM001609</td><td style="border-right:solid" align="left">SNPdbe<br> HGMD</td><td style="border-right:solid" align="left">Sistermans (2000) Eur J Hum Genet 8</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">A57T</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >68</td><td style="border-right:solid" align="left"> CM023603</td><td style="border-right:solid" align="left">SNPdbe<br> HGMD</td><td style="border-right:solid" align="left">Zeng (2002) J Inherit Metab Dis 25</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">D68A</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >71</td><td style="border-right:solid" align="left">rs104894553 <br> CM060201</td><td style="border-right:solid" align="left">dbSNP<br> SNPdbe<br> HGMD</td><td style="border-right:solid" align="left">multiple independent submissions to the refSNP cluster<br>Janson (2006) Ann Neurol 59</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">R71H</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >71_2</td><td style="border-right:solid" align="left"></td><td style="border-right:solid" align="left">SNPdbe</td><td style="border-right:solid" align="left"></td><td style="border-right:solid" align="left">missense</td><td style="border-right:solid" align="left">R71K</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >82</td><td style="border-right:solid" align="left">rs80099330 </td><td style="border-right:solid" align="left">dbSNP<br> SNPdbe</td><td style="border-right:solid" align="left">Multiple independent submissions to the refSNP cluster<br>Validated by frequency or genotype data<br>1000 Genome project</td><td style="border-right:solid" align="left">missense</td><td style="border-right:solid" align="left">M82T</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >93</td><td style="border-right:solid" align="left">rs144639820 </td><td style="border-right:solid" align="left">dbSNP</td><td style="border-right:solid" align="left">Validated by frequency or genotype data</td><td style="border-right:solid" align="left">synonymous</td><td style="border-right:solid" align="left">A93A</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >109</td><td style="border-right:solid" align="left">CM990192</td><td style="border-right:solid" align="left">HGMD</td><td style="border-right:solid" align="left">Elpeleg (1999) J Inherit Metab Dis 22</td><td style="border-right:solid" align="left">nonsense</td><td style="color:red;border-right:solid;border-color:black" align="left">Y109Ter</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >111</td><td style="border-right:solid" align="left">rs181347986 </td><td style="border-right:solid" align="left">dbSNP<br> SNPdbe</td><td style="border-right:solid" align="left"> 1000 Genome project</td><td style="border-right:solid" align="left">missense</td><td style="border-right:solid" align="left">I111V</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >114</td><td style="border-right:solid" align="left">CM023014</td><td style="border-right:solid" align="left">SNPdbe<br> HGMD</td><td style="border-right:solid" align="left">Olsen (2002) J Med Genet 39</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">D114Y</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >114_2</td><td style="border-right:solid" align="left">CM960086</td><td style="border-right:solid" align="left">HGMD</td><td style="border-right:solid" align="left">Kaul (1996) Am J Hum Genet 59</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">D114E</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >121</td><td style="border-right:solid" align="left">rs148451498 </td><td style="border-right:solid" align="left">dbSNP<br> SNPdbe</td><td style="border-right:solid" align="left"></td><td style="border-right:solid" align="left">missense</td><td style="border-right:solid" align="left">N121D</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >121_2</td><td style="border-right:solid" align="left">CM063846</td><td style="border-right:solid" align="left">HGMD</td><td style="border-right:solid" align="left">Zeng (2006) Mol Genet Metab 89</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">N121I</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >123</td><td style="border-right:solid" align="left">CM960087</td><td style="border-right:solid" align="left">SNPdbe<br> HGMD</td><td style="border-right:solid" align="left"> Kaul (1996) Am J Hum Genet 59</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">G123E</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >143</td><td style="border-right:solid" align="left">rs199565861</td><td style="border-right:solid" align="left">dbSNP<br> SNPdbe</td><td style="border-right:solid" align="left"> 1000 Genome project</td><td style="border-right:solid" align="left">missense</td><td style="border-right:solid" align="left">I143V</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >143_2</td><td style="border-right:solid" align="left">CM063849</td><td style="border-right:solid" align="left">HGMD</td><td style="border-right:solid" align="left">Zeng (2006) Mol Genet Metab 89</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">I143F</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >143_3</td><td style="border-right:solid" align="left">CM980125</td><td style="border-right:solid" align="left">HGMD</td><td style="border-right:solid" align="left">Kobayashi (1998) Hum Mutat S1, S308</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">I143T</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >152</td><td style="border-right:solid" align="left">rs104894548 <br> CM950102</td><td style="border-right:solid" align="left">dbSNP<br> SNPdbe<br> OMIM<br> HGMD</td><td style="border-right:solid" align="left"> Multiple independent submissions to the refSNP cluster<br>Kaul (1995) Hum Mutat 5</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">C152R</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >152_2</td><td style="border-right:solid" align="left">CM023604</td><td style="border-right:solid" align="left">HGMD</td><td style="border-right:solid" align="left">Zeng (2002) J Inherit Metab Dis 25</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">C152W</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >152_3</td><td style="border-right:solid" align="left">CM960088</td><td style="border-right:solid" align="left">HGMD</td><td style="border-right:solid" align="left">Kaul (1996) Am J Hum Genet 59, 95</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">C152Y</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >153</td><td style="border-right:solid" align="left">rs141755746 </td><td style="border-right:solid" align="left">dbSNP</td><td style="border-right:solid" align="left"></td><td style="border-right:solid" align="left">synonymous</td><td style="border-right:solid" align="left">Y153Y</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >154</td><td style="border-right:solid" align="left">rs147193431 <br> rs2228435</td><td style="border-right:solid" align="left">dbSNP<br> SNPdbe</td><td style="border-right:solid" align="left"></td><td style="border-right:solid" align="left">missense</td><td style="border-right:solid" align="left">V154I</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >157</td><td style="border-right:solid" align="left">rs140357187 </td><td style="border-right:solid" align="left">dbSNP<br> SNPdbe</td><td style="border-right:solid" align="left"></td><td style="border-right:solid" align="left">missense</td><td style="border-right:solid" align="left">I157T</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >164</td><td style="border-right:solid" align="left"></td><td style="border-right:solid" align="left">SNPdbe</td><td style="border-right:solid" align="left"></td><td style="border-right:solid" align="left">missense</td><td style="border-right:solid" align="left">Y164F</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >166</td><td style="border-right:solid" align="left">CM063847</td><td style="border-right:solid" align="left">HGMD</td><td style="border-right:solid" align="left">Zeng (2006) Mol Genet Metab 89</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">T166I</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >168</td><td style="border-right:solid" align="left"> CM001610</td><td style="border-right:solid" align="left">SNPdbe<br> HGMD</td><td style="border-right:solid" align="left">Sistermans (2000) Eur J Hum Genet 8</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">R168H</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >168_2</td><td style="border-right:solid" align="left">CM960089</td><td style="border-right:solid" align="left">HGMD</td><td style="border-right:solid" align="left">Kaul (1996) Am J Hum Genet 59</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">R168C</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >170</td><td style="border-right:solid" align="left">rs144321760 </td><td style="border-right:solid" align="left">dbSNP<br> SNPdbe</td><td style="border-right:solid" align="left">Validated by frequency or genotype data<br> by freq</td><td style="border-right:solid" align="left">missense</td><td style="border-right:solid" align="left">I170T</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >178</td><td style="border-right:solid" align="left"></td><td style="border-right:solid" align="left">SNPdbe</td><td style="border-right:solid" align="left"></td><td style="border-right:solid" align="left">missense</td><td style="border-right:solid" align="left">E178A</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >181_2</td><td style="border-right:solid" align="left">CM063850</td><td style="border-right:solid" align="left">HGMD</td><td style="border-right:solid" align="left">Zeng (2006) Mol Genet Metab 89</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">P181L</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >181</td><td style="border-right:solid" align="left">CM001611</td><td style="border-right:solid" align="left">SNPdbe<br> HGMD</td><td style="border-right:solid" align="left">Sistermans (2000) Eur J Hum Genet 8</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">P181T</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >183</td><td style="border-right:solid" align="left">CM990193</td><td style="border-right:solid" align="left">SNPdbe<br> HGMD</td><td style="border-right:solid" align="left"> Elpeleg (1999) J Inherit Metab Dis 22</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">P183H</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >184</td><td style="border-right:solid" align="left">CM023605</td><td style="border-right:solid" align="left">HGMD</td><td style="border-right:solid" align="left">Zeng (2002) J Inherit Metab Dis 25</td><td style="border-right:solid" align="left">nonsense</td><td style="color:red;border-right:solid;border-color:black" align="left">Q184Ter</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >186</td><td style="border-right:solid" align="left">CM990194</td><td style="border-right:solid" align="left">SNPdbe<br> HGMD</td><td style="border-right:solid" align="left">Elpeleg (1999) J Inherit Metab Dis 22</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">V186F</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >195</td><td style="border-right:solid" align="left">CM990195</td><td style="border-right:solid" align="left">SNPdbe<br> HGMD</td><td style="border-right:solid" align="left">Elpeleg (1999) J Inherit Metab Dis 22</td><td style="border-right:solid" align="left">missense</td> |
+ | <td style="color:red;border-right:solid;border-color:black" align="left">M195R</td></tr> |
||
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >202</td><td style="border-right:solid" align="left">rs147763700 </td><td style="border-right:solid" align="left">dbSNP<br> SNPdbe</td><td style="border-right:solid" align="left"></td><td style="border-right:solid" align="left">missense</td><td style="border-right:solid" align="left">A202S</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >213</td><td style="border-right:solid" align="left">CM055097</td><td style="border-right:solid" align="left">HGMD</td><td style="border-right:solid" align="left">Tacke (2005) Neuropediatrics 36</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">K213E</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >214</td><td style="border-right:solid" align="left">CM023606</td><td style="border-right:solid" align="left">HGMD</td><td style="border-right:solid" align="left">Zeng (2002) J Inherit Metab Dis 25</td><td style="border-right:solid" align="left">nonsense</td><td style="color:red;border-right:solid;border-color:black" align="left">E214Ter</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >218</td><td style="border-right:solid" align="left">rs104894549<br>CM950103 </td><td style="border-right:solid" align="left">dbSNP<br>HGMD</td><td style="border-right:solid" align="left"> Multiple independent submissions to the refSNP cluster<br>Shaag (1995) Am J Hum Genet 57</td><td style="border-right:solid" align="left">nonsense</td><td style="color:red;border-right:solid;border-color:black" align="left">C218Ter</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >220</td><td style="border-right:solid" align="left">rs139053885 </td><td style="border-right:solid" align="left">dbSNP<br> SNPdbe</td><td style="border-right:solid" align="left"></td><td style="border-right:solid" align="left">missense</td><td style="border-right:solid" align="left">I220T</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >226_2</td><td style="border-right:solid" align="left">CM086530</td><td style="border-right:solid" align="left">HGMD</td><td style="border-right:solid" align="left">Di Pietro (2008) Clin Biochem 41</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">I226T</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >226</td><td style="border-right:solid" align="left">rs201887670</td><td style="border-right:solid" align="left">dbSNP</td><td style="border-right:solid" align="left"></td><td style="border-right:solid" align="left">missense</td><td style="border-right:solid" align="left">I226K</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >231</td><td style="border-right:solid" align="left">rs104894550 <br> CM994594</td><td style="border-right:solid" align="left">dbSNP<br> SNPdbe<br> OMIM<br> HGMD</td><td style="border-right:solid" align="left"> Multiple independent submissions to the refSNP cluster<br>Rady (1999) Am J Med Genet 87</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">Y231C</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >231_2</td><td style="border-right:solid" align="left">CM940123</td><td style="border-right:solid" align="left">HGMD</td><td style="border-right:solid" align="left">Kaul (1994) Am J Hum Genet 55</td><td style="border-right:solid" align="left">nonsense</td><td style="color:red;border-right:solid;border-color:black" align="left">Y231Ter</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >235</td><td style="border-right:solid" align="left">rs149842031 </td><td style="border-right:solid" align="left">dbSNP<br> SNPdbe</td><td style="border-right:solid" align="left">Multiple independent submissions to the refSNP cluster <br> Validated by frequency or genotype data <br>1000 Genome project</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">E235K</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >236</td><td style="border-right:solid" align="left">rs149189911 </td><td style="border-right:solid" align="left">dbSNP</td><td style="border-right:solid" align="left"></td><td style="border-right:solid" align="left">synonymous</td><td style="border-right:solid" align="left">N236N</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >239</td><td style="border-right:solid" align="left">rs145085349 </td><td style="border-right:solid" align="left">dbSNP<br> SNPdbe</td><td style="border-right:solid" align="left"></td><td style="border-right:solid" align="left">missense</td><td style="border-right:solid" align="left">I239T</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >244</td><td style="border-right:solid" align="left">CM023607</td><td style="border-right:solid" align="left">SNPdbe<br> HGMD</td><td style="border-right:solid" align="left">Zeng (2002) J Inherit Metab Dis 25</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">H244R</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >244_2</td><td style="border-right:solid" align="left">CM063848</td><td style="border-right:solid" align="left">HGMD</td><td style="border-right:solid" align="left">Zeng (2006) Mol Genet Metab 89</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">H244L</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >249</td><td style="border-right:solid" align="left">rs104894552 <br> CM023015</td><td style="border-right:solid" align="left">dbSNP<br> SNPdbe<br> HGMD</td><td style="border-right:solid" align="left"> Multiple independent submissions to the refSNP cluster<br>Olsen (2002) J Med Genet 39</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">D249V</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >270</td><td style="border-right:solid" align="left">rs200126822</td><td style="border-right:solid" align="left">dbSNP<br> SNPdbe</td><td style="border-right:solid" align="left">1000 Genome project </td><td style="border-right:solid" align="left">missense</td><td style="border-right:solid" align="left">I270T</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >272</td><td style="border-right:solid" align="left">CM063851</td><td style="border-right:solid" align="left">HGMD</td><td style="border-right:solid" align="left">Zeng (2006) Mol Genet Metab 89</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">L272P</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >274</td><td style="border-right:solid" align="left">CM950104</td><td style="border-right:solid" align="left">SNPdbe<br> HGMD</td><td style="border-right:solid" align="left">Shaag (1995) Am J Hum Genet 57</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">G274R</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >277</td><td style="border-right:solid" align="left">rs78677072 </td><td style="border-right:solid" align="left">dbSNP</td><td style="border-right:solid" align="left"> Multiple independent submissions to the refSNP cluster <br>1000 Genome project</td><td style="border-right:solid" align="left">synonymous</td><td style="border-right:solid" align="left">T277T</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >278</td><td style="border-right:solid" align="left">rs140581464 </td><td style="border-right:solid" align="left">dbSNP<br> SNPdbe</td><td style="border-right:solid" align="left"> Multiple independent submissions to the refSNP cluster <br> Validated by frequency or genotype data <br>1000Genome project</td><td style="border-right:solid" align="left">missense</td><td style="border-right:solid" align="left">V278M</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >279</td><td style="border-right:solid" align="left">rs145717248 </td><td style="border-right:solid" align="left">dbSNP<br> SNPdbe</td><td style="border-right:solid" align="left"></td><td style="border-right:solid" align="left">missense</td><td style="border-right:solid" align="left">Y279H</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >280</td><td style="border-right:solid" align="left">rs148081446 </td><td style="border-right:solid" align="left">dbSNP</td><td style="border-right:solid" align="left"> Multiple independent submissions to the refSNP cluster <br> Validated by frequency or genotype data <br>1000 Genome project</td><td style="border-right:solid" align="left">synonymous</td><td style="border-right:solid" align="left">P280P</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >280_2</td><td style="border-right:solid" align="left">CM990197</td><td style="border-right:solid" align="left">HGMD</td><td style="border-right:solid" align="left">Elpeleg (1999) J Inherit Metab Dis 22</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">P280S</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >280_3</td><td style="border-right:solid" align="left">VAR_039088</td><td style="border-right:solid" align="left">SNPdbe</td><td style="border-right:solid" align="left">Elpeleg (1999) J Inherit Metab Dis 22</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">P280L</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >281</td><td style="border-right:solid" align="left">rs141858640 </td><td style="border-right:solid" align="left">dbSNP<br> SNPdbe</td><td style="border-right:solid" align="left"></td><td style="border-right:solid" align="left">missense</td><td style="border-right:solid" align="left">V281M</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >285</td><td style="border-right:solid" align="left">rs28940279 <br> CM930046</td><td style="border-right:solid" align="left">dbSNP<br> SNPdbe<br> OMIM<br> HGMD</td><td style="border-right:solid" align="left"> Multiple independent submissions to the refSNP cluster <br> Validated by frequency or genotype data<br> Kaul (1993) Nat Genet 5</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">E285A</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >285_2</td><td style="border-right:solid" align="left"></td><td style="border-right:solid" align="left">SNPdbe</td><td style="border-right:solid" align="left"></td><td style="border-right:solid" align="left">missense</td><td style="border-right:solid" align="left">E285D</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >286</td><td style="border-right:solid" align="left">rs138062143 </td><td style="border-right:solid" align="left">dbSNP</td><td style="border-right:solid" align="left"></td><td style="border-right:solid" align="left">synonymous</td><td style="border-right:solid" align="left">A286A</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >287</td><td style="border-right:solid" align="left">CM990198</td><td style="border-right:solid" align="left">SNPdbe<br> HGMD</td><td style="border-right:solid" align="left">Elpeleg (1999) J Inherit Metab Dis 22</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">A287T</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >288</td><td style="border-right:solid" align="left"></td><td style="border-right:solid" align="left">SNPdbe</td><td style="border-right:solid" align="left"></td><td style="border-right:solid" align="left">missense</td><td style="border-right:solid" align="left">Y288F</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >288_2</td><td style="border-right:solid" align="left">CM034717</td><td style="border-right:solid" align="left">HGMD</td><td style="border-right:solid" align="left">Surendran (2003) Mol Genet Metab 80</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">Y288C</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >295</td><td style="border-right:solid" align="left">CM950105</td><td style="border-right:solid" align="left">SNPdbe<br> HGMD</td><td style="border-right:solid" align="left">Shaag (1995) Am J Hum Genet 57</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">F295S</td></tr> |
− | <tr bgcolor="# |
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >305</td><td style="border-right:solid" align="left">rs28940574 <br> CM940124</td><td style="border-right:solid" align="left">dbSNP<br> SNPdbe<br> OMIM<br> HGMD</td><td style="border-right:solid" align="left"> Multiple independent submissions to the refSNP cluster<br>Kaul (1994) Am J Hum Genet 55</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">A305E</td></tr> |
+ | |||
+ | <tr bgcolor="#CAE1FF"><td style="border-right:solid" align="left" >310</td><td style="border-right:solid" align="left"></td><td style="border-right:solid" align="left">SNPdbe</td><td style="border-right:solid" align="left"></td><td style="border-right:solid" align="left">missense</td><td style="border-right:solid" align="left">C310G</td></tr> |
||
+ | |||
+ | <tr bgcolor="#F5F5DC"><td style="border-right:solid" align="left" >314</td><td style="border-right:solid" align="left">CM023608</td><td style="border-right:solid" align="left">HGMD</td><td style="border-right:solid" align="left">Zeng (2002) J Inherit Metab Dis 25</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">Ter314W</td></tr> |
||
− | <tr bgcolor="#BFEFFF"><td style="border-right:solid" align="left" >314</td><td style="border-right:solid" align="left">CM023608</td><td style="border-right:solid" align="left">HGMD</td><td style="border-right:solid" align="left">Zeng (2002) J Inherit Metab Dis 25</td><td style="border-right:solid" align="left">missense</td><td style="color:red;border-right:solid;border-color:black" align="left">Ter314W</td></tr> |
||
</table> |
</table> |
||
</figtable> |
</figtable> |
||
+ | == SNP Visualization == |
||
+ | The coding SNPs are written below the reference sequence. An 'X' denotes a nonsense mutation coding a stop codon |
||
Line 239: | Line 321: | ||
K |
K |
||
GPKDSEDSYDIIFDLHNTTSNMGCTLILEDSRNNFLIQMFHYIKTSLAPL 150 |
GPKDSEDSYDIIFDLHNTTSNMGCTLILEDSRNNFLIQMFHYIKTSLAPL 150 |
||
− | + | X V Y D E V |
|
E I F |
E I F |
||
+ | T |
||
PCYVYLIEHPSLKYATTRSIAKYPVGIEVGPQPQGVLRADILDQMRKMIK 200 |
PCYVYLIEHPSLKYATTRSIAKYPVGIEVGPQPQGVLRADILDQMRKMIK 200 |
||
− | RYI T F I H T A T |
+ | RYI T F I H T A T HX F R |
W C L |
W C L |
||
+ | Y |
||
HALDFIHHFNEGKEFPPCAIEVYKIIEKVDYPRDENGEIAAIIHPNLQDQ 250 |
HALDFIHHFNEGKEFPPCAIEVYKIIEKVDYPRDENGEIAAIIHPNLQDQ 250 |
||
− | S |
+ | S EX X T K C KN T R V |
− | + | T X L |
|
DWKPLHPGDPMFLTLDGKTIPLGGDCTVYPVFVNEAAYYEKKEAFAKTTK 300 |
DWKPLHPGDPMFLTLDGKTIPLGGDCTVYPVFVNEAAYYEKKEAFAKTTK 300 |
||
T P R TMHPM AATF S |
T P R TMHPM AATF S |
||
Line 252: | Line 336: | ||
LTLNAKSIRCCLH- 314 |
LTLNAKSIRCCLH- 314 |
||
E G W |
E G W |
||
+ | |||
+ | == Hotspots == |
||
+ | |||
+ | ===Visual inspection=== |
||
+ | |||
+ | For the visual inspection, we looked at two kinds of reported SNPs: first, those reported by the HGMD, so they are reported to cause the Canavan Disease. Second, SNPs from dbSNP and xx that are not reported to cause the Canavan Disease (however, they are also not reported to NOT cause it). |
||
+ | |||
+ | For the HGMD data, SNPs were pretty much scattered all over the structure. It was surprising for us that even many SNPs on the surface of the protein, not even close to the dimer interaction site, were also reported to cause the Canavan Disease. See <xr id="hgmd_all" /> and <xr id ="hgmd_surface"/>. |
||
+ | |||
+ | <figtable id="HGMD"> |
||
+ | <table align="center"> |
||
+ | <tr> |
||
+ | <td> |
||
+ | <figure id="hgmd_all">[[File:HGMD_SNPs.png|thumb|300px|<b><xr nolink id="hgmd_all"/></b><br> Mutations of Aspartoacylase which cause the Canavan Disease residues are coloured in red. Many of these mutations are close to the binding site (around the zinc ion), as can be expected.]]</figure> |
||
+ | </td> |
||
+ | <td> |
||
+ | <figure id="hgmd_surface">[[File:HGMD_SNPs_surface.png|thumb|300px|<b><xr nolink id="hgmd_surface"/></b><br> Mutations of Aspartoacylase in red - surface view. Many of these mutations that cause the Canavan Disease can be found both on the surface, far away from the binding site or the dimer interaction site, which we found surprising.]]</figure></td> |
||
+ | </tr> |
||
+ | </table> |
||
+ | </figtable> |
||
+ | |||
+ | |||
+ | For SNPs NOT reported to cause the Canavan Disease, SNPs again were scattered all over the structure. We had expected to visually be able to see some concentration of mutations in certain - apparently functionally unimportant or structurally flexible - regions, but could not validate this expectation with the given data, at least not visually. See <xr id="others_all" /> and <xr id ="others_surface"/>. |
||
+ | |||
+ | <figtable id="others"> |
||
+ | <table align="center"> |
||
+ | <tr> |
||
+ | <td> |
||
+ | <figure id="others_all">[[File:SNPs_no_Canavan_report.png|thumb|340px|<b><xr nolink id="others_all"/></b><br> Mutations of Aspartoacylase residues are coloured in blue. Visually, we cannot detect a pattern or hotspots.]]</figure> |
||
+ | </td> |
||
+ | <td> |
||
+ | <figure id="others_surface">[[File:SNPs_no_Canavan_report_surface.png|thumb|300px|<b><xr nolink id="others_surface"/></b><br> Mutations of Aspartoacylase in blue - surface view. Mutations that are not reported to cause the Canavan Disease can be found both on the surface, as seen in this picture, as well as on the inside of the protein (as seen on the left).]]</figure></td> |
||
+ | </tr> |
||
+ | </table> |
||
+ | </figtable> |
||
+ | |||
+ | ===Frequency Distribution=== |
||
+ | Since visual inspection did not bring further enlightenment, we had a look at the frequency distribution of the disease causing and non-disease causing SNPs. These can be found in <xr id = "can"/> and <xr id="no_can"/>. Neither for the disease causing, nor for the non-disease causing SNPs, we were able to identify distinct hotspot-regions. The only exception might be the last block of mutations in <xr id = "no_can"/> close to the end of the sequence. It is curious, though, that this sequence part should be evolutionary flexible, since it includes residue 288, which is part of the binding site (but keep in mind that maybe this mutation has just not been annotated to cause the Canavan Disease. It might do it, still). |
||
+ | <table align="center"> |
||
+ | <tr> |
||
+ | <td> |
||
+ | <figure id="can">[[File:Canavan.png|thumb|800px|<b><xr nolink id="can"/></b><br> Frequency distribution of disease-causing mutations along the sequence. Mutations are fairly evenly distributed.]]</figure> |
||
+ | </td> |
||
+ | </tr> |
||
+ | </table> |
||
+ | |||
+ | |||
+ | <table align="center"> |
||
+ | <tr> |
||
+ | <td> |
||
+ | <figure id="no_can">[[File:No_Canavan.png|thumb|800px|<b><xr nolink id="no_can"/></b><br> Frequency distribution of mutations not reported to cause Canavan Disease. Again, the distribution is fairly even, except for a slight accumulation towards the end of the sequence in regions 277-288.]]</figure></td> |
||
+ | </tr> |
||
+ | </table> |
Latest revision as of 13:47, 30 August 2012
Contents
First impression
Protocol
Further information can be found in the protocol.
Sources for mutations
HGMD
The aim of ther Human Gene Mutation Database (HGMD) ist to collect known gene variants that are associated with human inherited diseases. All listed mutations are based on published results, that stand as a validation for the listed mutations. The database is manually curated and relies on the information given by authors in their work. It is stated on the HGMD website that "Many published mutation searches identify more than one genetic change in a single patient. In such cases, the relationship between a given lesion and the clinical phenotype has not always been immediately clear,[..]. The possibility of unintentional inclusion of some lesions with little or no pathological significance can therefore not be ruled out." Listed mutations therefore must not be causal for any disease |
For Canavan Disease there are 74 mutations listed (79 in the 2012 professional version):
|
SNPdbe
The nsSNP database of functional effects (SNPdbe) collects information on SNPs from a variety of accessible webservices and additionally aims at providing a functional annotation. For reach variant, SNPdbe lists its predicted effect (SIFT, Snap) and also experimentally derived functional effects.
For Aspartoacylase (P45381 and NP_000040), there are 55 results in total:
- 29 labelled as involved in Canavan Disease
- 12 with experimental evidence
dbSNP
The Single Nucleotide Polymorphism database (dbSNP) aims at being a "central repository for both single base nucleotide subsitutions and short deletion and insertion polymorphisms". It also lists synonymous variants, that are not listed in any other web resources.
For the gene "aspa" there are 508 SNPs annotated in the whole gene region (including 5' and 3' flanking regions):
- 458 for NP_000040.1 (coding SNPs: 35)
- 493 for NP_001121557.1 (coding SNPs: 35)
- "synonymous-codon"[Function_Class] AND ASPA[GENE] AND "human"[ORGN] AND "snp"[SNP_CLASS] yields only 9 results
SNPedia
SNPedia comes up in wiki style and collects information on human genetics from other web resouces, citing peer-reviewed scientific publications. One can browse for genes, genomes, phenotypes and even medical substances.
For the gene "ASPA" it lists 9 variants out of which three are associated with Canavan disease.
OMIM
OMIM stands for Online Mendelian Inheritance in Man and represents a "compendium of human genes and genetic phenotypes". Vast information are given on human mendelian diseases and associated genes that are all based on referenced publications. The database is updated daily and also lists links to other genetic resources.
For Aspartoacylase there are 12 allelic variants listed, which are all linked to Canavan Disease.
Coding SNPs
<figure id="venny">
</figure>
From the above mentioned SNP databases we extracted all coding SNPs for Aspartoacylase. The venn diagram in <xr id="venny"/> shows the overlap of the contained SNPs in the three biggest database sources.
We used the sequence position of the listed SNPs as an identifier to create a unique list of known SNPs. <xr id="coding_snp_table"/> shows the resulting list of unique SNPs. For each polymorphism, the source is given (i.e., from which DB the SNP was extracted), as well as annotated validation, if available.
The resulting table contains 79 SNPs in total, out of which 48 are reported to cause the Canavan Disease.
<figtable id="coding_snp_table">
Residue Position | Identifier | Reference DB | Validation evidence | SNP Type | Mutation |
4 | rs142041344 | dbSNP SNPdbe |
1000Genomes | missense | C4R |
14 | CM063852 | HGMD | Zeng (2006) Mol Genet Metab 89 | missense | V14G |
16 | CM960084 | SNPdbe HGMD | Kaul (1996) Am J Hum Genet 59 | missense | I16T |
18 | CM067343 | HGMD | Zeng (2006) Adv Exp Med Biol 576 | missense | G18R |
21 | CM001608 | SNPdbe HGMD | SHGMD857217istermans (2000) Eur J Hum Genet 8 | missense | H21P |
24 | rs104894551 CM023602 | dbSNP SNPdbe OMIM HGMD | Multiple independent submissions to the refSNP cluster Zeng (2002) J Inherit Metab Dis 25 | missense | E24G |
26 | rs145616193 | dbSNP | synonymous | T26T | |
27 | CM960085 | SNPdbe HGMD | Kaul (1996) Am J Hum Genet 59 | missense | G27R |
33 | rs138158568 | dbSNP SNPdbe | missense | H33R | |
53 | rs17850703 | dbSNP SNPdbe | missense | T53A | |
57 | CM001609 | SNPdbe HGMD | Sistermans (2000) Eur J Hum Genet 8 | missense | A57T |
68 | CM023603 | SNPdbe HGMD | Zeng (2002) J Inherit Metab Dis 25 | missense | D68A |
71 | rs104894553 CM060201 | dbSNP SNPdbe HGMD | multiple independent submissions to the refSNP cluster Janson (2006) Ann Neurol 59 | missense | R71H |
71_2 | SNPdbe | missense | R71K | ||
82 | rs80099330 | dbSNP SNPdbe | Multiple independent submissions to the refSNP cluster Validated by frequency or genotype data 1000 Genome project | missense | M82T |
93 | rs144639820 | dbSNP | Validated by frequency or genotype data | synonymous | A93A |
109 | CM990192 | HGMD | Elpeleg (1999) J Inherit Metab Dis 22 | nonsense | Y109Ter |
111 | rs181347986 | dbSNP SNPdbe | 1000 Genome project | missense | I111V |
114 | CM023014 | SNPdbe HGMD | Olsen (2002) J Med Genet 39 | missense | D114Y |
114_2 | CM960086 | HGMD | Kaul (1996) Am J Hum Genet 59 | missense | D114E |
121 | rs148451498 | dbSNP SNPdbe | missense | N121D | |
121_2 | CM063846 | HGMD | Zeng (2006) Mol Genet Metab 89 | missense | N121I |
123 | CM960087 | SNPdbe HGMD | Kaul (1996) Am J Hum Genet 59 | missense | G123E |
143 | rs199565861 | dbSNP SNPdbe | 1000 Genome project | missense | I143V |
143_2 | CM063849 | HGMD | Zeng (2006) Mol Genet Metab 89 | missense | I143F |
143_3 | CM980125 | HGMD | Kobayashi (1998) Hum Mutat S1, S308 | missense | I143T |
152 | rs104894548 CM950102 | dbSNP SNPdbe OMIM HGMD | Multiple independent submissions to the refSNP cluster Kaul (1995) Hum Mutat 5 | missense | C152R |
152_2 | CM023604 | HGMD | Zeng (2002) J Inherit Metab Dis 25 | missense | C152W |
152_3 | CM960088 | HGMD | Kaul (1996) Am J Hum Genet 59, 95 | missense | C152Y |
153 | rs141755746 | dbSNP | synonymous | Y153Y | |
154 | rs147193431 rs2228435 | dbSNP SNPdbe | missense | V154I | |
157 | rs140357187 | dbSNP SNPdbe | missense | I157T | |
164 | SNPdbe | missense | Y164F | ||
166 | CM063847 | HGMD | Zeng (2006) Mol Genet Metab 89 | missense | T166I |
168 | CM001610 | SNPdbe HGMD | Sistermans (2000) Eur J Hum Genet 8 | missense | R168H |
168_2 | CM960089 | HGMD | Kaul (1996) Am J Hum Genet 59 | missense | R168C |
170 | rs144321760 | dbSNP SNPdbe | Validated by frequency or genotype data by freq | missense | I170T |
178 | SNPdbe | missense | E178A | ||
181_2 | CM063850 | HGMD | Zeng (2006) Mol Genet Metab 89 | missense | P181L |
181 | CM001611 | SNPdbe HGMD | Sistermans (2000) Eur J Hum Genet 8 | missense | P181T |
183 | CM990193 | SNPdbe HGMD | Elpeleg (1999) J Inherit Metab Dis 22 | missense | P183H |
184 | CM023605 | HGMD | Zeng (2002) J Inherit Metab Dis 25 | nonsense | Q184Ter |
186 | CM990194 | SNPdbe HGMD | Elpeleg (1999) J Inherit Metab Dis 22 | missense | V186F |
195 | CM990195 | SNPdbe HGMD | Elpeleg (1999) J Inherit Metab Dis 22 | missense | M195R |
202 | rs147763700 | dbSNP SNPdbe | missense | A202S | |
213 | CM055097 | HGMD | Tacke (2005) Neuropediatrics 36 | missense | K213E |
214 | CM023606 | HGMD | Zeng (2002) J Inherit Metab Dis 25 | nonsense | E214Ter |
218 | rs104894549 CM950103 | dbSNP HGMD | Multiple independent submissions to the refSNP cluster Shaag (1995) Am J Hum Genet 57 | nonsense | C218Ter |
220 | rs139053885 | dbSNP SNPdbe | missense | I220T | |
226_2 | CM086530 | HGMD | Di Pietro (2008) Clin Biochem 41 | missense | I226T |
226 | rs201887670 | dbSNP | missense | I226K | |
231 | rs104894550 CM994594 | dbSNP SNPdbe OMIM HGMD | Multiple independent submissions to the refSNP cluster Rady (1999) Am J Med Genet 87 | missense | Y231C |
231_2 | CM940123 | HGMD | Kaul (1994) Am J Hum Genet 55 | nonsense | Y231Ter |
235 | rs149842031 | dbSNP SNPdbe | Multiple independent submissions to the refSNP cluster Validated by frequency or genotype data 1000 Genome project | missense | E235K |
236 | rs149189911 | dbSNP | synonymous | N236N | |
239 | rs145085349 | dbSNP SNPdbe | missense | I239T | |
244 | CM023607 | SNPdbe HGMD | Zeng (2002) J Inherit Metab Dis 25 | missense | H244R |
244_2 | CM063848 | HGMD | Zeng (2006) Mol Genet Metab 89 | missense | H244L |
249 | rs104894552 CM023015 | dbSNP SNPdbe HGMD | Multiple independent submissions to the refSNP cluster Olsen (2002) J Med Genet 39 | missense | D249V |
270 | rs200126822 | dbSNP SNPdbe | 1000 Genome project | missense | I270T |
272 | CM063851 | HGMD | Zeng (2006) Mol Genet Metab 89 | missense | L272P |
274 | CM950104 | SNPdbe HGMD | Shaag (1995) Am J Hum Genet 57 | missense | G274R |
277 | rs78677072 | dbSNP | Multiple independent submissions to the refSNP cluster 1000 Genome project | synonymous | T277T |
278 | rs140581464 | dbSNP SNPdbe | Multiple independent submissions to the refSNP cluster Validated by frequency or genotype data 1000Genome project | missense | V278M |
279 | rs145717248 | dbSNP SNPdbe | missense | Y279H | |
280 | rs148081446 | dbSNP | Multiple independent submissions to the refSNP cluster Validated by frequency or genotype data 1000 Genome project | synonymous | P280P |
280_2 | CM990197 | HGMD | Elpeleg (1999) J Inherit Metab Dis 22 | missense | P280S |
280_3 | VAR_039088 | SNPdbe | Elpeleg (1999) J Inherit Metab Dis 22 | missense | P280L |
281 | rs141858640 | dbSNP SNPdbe | missense | V281M | |
285 | rs28940279 CM930046 | dbSNP SNPdbe OMIM HGMD | Multiple independent submissions to the refSNP cluster Validated by frequency or genotype data Kaul (1993) Nat Genet 5 | missense | E285A |
285_2 | SNPdbe | missense | E285D | ||
286 | rs138062143 | dbSNP | synonymous | A286A | |
287 | CM990198 | SNPdbe HGMD | Elpeleg (1999) J Inherit Metab Dis 22 | missense | A287T |
288 | SNPdbe | missense | Y288F | ||
288_2 | CM034717 | HGMD | Surendran (2003) Mol Genet Metab 80 | missense | Y288C |
295 | CM950105 | SNPdbe HGMD | Shaag (1995) Am J Hum Genet 57 | missense | F295S |
305 | rs28940574 CM940124 | dbSNP SNPdbe OMIM HGMD | Multiple independent submissions to the refSNP cluster Kaul (1994) Am J Hum Genet 55 | missense | A305E |
310 | SNPdbe | missense | C310G | ||
314 | CM023608 | HGMD | Zeng (2002) J Inherit Metab Dis 25 | missense | Ter314W |
</figtable>
SNP Visualization
The coding SNPs are written below the reference sequence. An 'X' denotes a nonsense mutation coding a stop codon
MTSCHIAEEHIQKVAIFGGTHGNELTGVFLVKHWLENGAEIQRTGLEVKP 50 R G T R P G TR R FITNPRAVKKCTRYIDCDLNRIFDLENLGKKMSEDLPYEVRRAQEINHLF 100 A T A H T A K GPKDSEDSYDIIFDLHNTTSNMGCTLILEDSRNNFLIQMFHYIKTSLAPL 150 X V Y D E V E I F T PCYVYLIEHPSLKYATTRSIAKYPVGIEVGPQPQGVLRADILDQMRKMIK 200 RYI T F I H T A T HX F R W C L Y HALDFIHHFNEGKEFPPCAIEVYKIIEKVDYPRDENGEIAAIIHPNLQDQ 250 S EX X T K C KN T R V T X L DWKPLHPGDPMFLTLDGKTIPLGGDCTVYPVFVNEAAYYEKKEAFAKTTK 300 T P R TMHPM AATF S S D C LTLNAKSIRCCLH- 314 E G W
Hotspots
Visual inspection
For the visual inspection, we looked at two kinds of reported SNPs: first, those reported by the HGMD, so they are reported to cause the Canavan Disease. Second, SNPs from dbSNP and xx that are not reported to cause the Canavan Disease (however, they are also not reported to NOT cause it).
For the HGMD data, SNPs were pretty much scattered all over the structure. It was surprising for us that even many SNPs on the surface of the protein, not even close to the dimer interaction site, were also reported to cause the Canavan Disease. See <xr id="hgmd_all" /> and <xr id ="hgmd_surface"/>.
<figtable id="HGMD">
<figure id="hgmd_all"></figure> | <figure id="hgmd_surface"></figure> |
</figtable>
For SNPs NOT reported to cause the Canavan Disease, SNPs again were scattered all over the structure. We had expected to visually be able to see some concentration of mutations in certain - apparently functionally unimportant or structurally flexible - regions, but could not validate this expectation with the given data, at least not visually. See <xr id="others_all" /> and <xr id ="others_surface"/>.
<figtable id="others">
<figure id="others_all"></figure> | <figure id="others_surface"></figure> |
</figtable>
Frequency Distribution
Since visual inspection did not bring further enlightenment, we had a look at the frequency distribution of the disease causing and non-disease causing SNPs. These can be found in <xr id = "can"/> and <xr id="no_can"/>. Neither for the disease causing, nor for the non-disease causing SNPs, we were able to identify distinct hotspot-regions. The only exception might be the last block of mutations in <xr id = "no_can"/> close to the end of the sequence. It is curious, though, that this sequence part should be evolutionary flexible, since it includes residue 288, which is part of the binding site (but keep in mind that maybe this mutation has just not been annotated to cause the Canavan Disease. It might do it, still).
<figure id="can"></figure> |
<figure id="no_can"></figure> |