Difference between revisions of "Dot-Bracket Notation"

From Protein Prediction 2 Winter Semester 2014
Line 5: Line 5:
 
The main task of the visualization is the following:
 
The main task of the visualization is the following:
   
The program gets an RNA secondary structure in Dot-Bracked Notation as input. This inputs consists of two strings where the first one is the RNA sequence, and the second string sequence of dots, round, and square brackets with the same length as the RNA sequence. For more information, please see [http://ultrastudio.org/en/Dot-Bracket_Notation Dot-Bracket Notation].
+
The program gets an RNA secondary structure in Dot-Bracked (Vienna) Notation as input. This inputs consists of two strings where the first one is the RNA sequence, and the second string sequence of dots, round, and square brackets with the same length as the RNA sequence. For more information, please see [http://ultrastudio.org/en/Dot-Bracket_Notation Dot-Bracket Notation].
   
 
From this input a graph-like visualization of the RNA's secondary structure is created. In this representation the nodes refer to the ribonucleotides and the edges are the hydrogen bonds that connect them. The graph is connected and undirected.
 
From this input a graph-like visualization of the RNA's secondary structure is created. In this representation the nodes refer to the ribonucleotides and the edges are the hydrogen bonds that connect them. The graph is connected and undirected.
Line 27: Line 27:
   
 
Dot-Bracket Notation is not the only popular way to represent RNA secondary structure. Other commonly used formats are:
 
Dot-Bracket Notation is not the only popular way to represent RNA secondary structure. Other commonly used formats are:
 
*[http://www.ncbi.nlm.nih.gov/pubmed/12088144 RNAML]
 
   
 
*[http://www.ibi.vu.nl/programs/k2nwww/static/data_formats.html PBseq]
 
*[http://www.ibi.vu.nl/programs/k2nwww/static/data_formats.html PBseq]
   
  +
*[http://www.ibi.vu.nl/programs/k2nwww/static/data_formats.html Connect (.ct)]
*[http://en.wikipedia.org/wiki/Stockholm_format STOCKHOLM]
 
   
 
= Progress =
 
= Progress =

Revision as of 17:23, 19 November 2014

The aim of the project is to visualize RNA secondary structures. RNAs are chains of ribonucleotides which form complex two-dimensional structures through the formation of hydrogen bonds between cytosine and guanine, between adenine and uracil and between guanine and uracil.

Core Functionality

The main task of the visualization is the following:

The program gets an RNA secondary structure in Dot-Bracked (Vienna) Notation as input. This inputs consists of two strings where the first one is the RNA sequence, and the second string sequence of dots, round, and square brackets with the same length as the RNA sequence. For more information, please see Dot-Bracket Notation.

From this input a graph-like visualization of the RNA's secondary structure is created. In this representation the nodes refer to the ribonucleotides and the edges are the hydrogen bonds that connect them. The graph is connected and undirected. The visualization will be done using the Force-Layout of D3.

Illustration of the desired core functionality of the secondary structure visualization.
Illustration of the desired core functionality of the secondary structure visualization.

For an example, please have a look at the figure on the right.

Adding additional functionality to the Visualization

After the core functionality is established, we plan to add additional functionality to the visualization. Current ideas are:

  • Display additional information on mouse-over
  • Drag nodes or even the whole visualization to enable examination of the structure from different angles
  • Make the visualization editable: Allow addition and deletion of nodes and edges and export the changes in a suited format.

Allow different input formats

Dot-Bracket Notation is not the only popular way to represent RNA secondary structure. Other commonly used formats are:

Progress

Task Implemented
Core Functionality No
Drag nodes No
Display additional info on mouse-over No
Edit graph and export changes No
RNAMBL format compatibility No
PBseq format compatibility No
STOCKHOLM format compatibility No

Source Code

Additional Links