Difference between revisions of "Project ideas"

From Protein Prediction 2 Winter Semester 2014
(Protein Viewer)
(Protein Viewer)
Line 14: Line 14:
 
* [http://biasmv.github.io/pv/ PV]
 
* [http://biasmv.github.io/pv/ PV]
 
* [http://istar.cse.cuhk.edu.hk/iview/ iView]
 
* [http://istar.cse.cuhk.edu.hk/iview/ iView]
*
 
 
Mentors: Björn Grüning (Galaxy) gruening. (at) .informatik.uni-freiburg.de<br>
 
Mentors: Björn Grüning (Galaxy) gruening. (at) .informatik.uni-freiburg.de<br>
 
Students: 4-5
 
Students: 4-5
   
[[File:pmv.jpg | 150px | center | PDB structure]]
+
[[File:pmv.jpg | 200px | center | PDB structure]]
   
 
==Gene Cluster Viewer==
 
==Gene Cluster Viewer==

Revision as of 20:51, 11 November 2014

Venn Diagram Viewer

Venn diagrams present a very popular method to display list comparisons. [Jvenn] is an interactive Venn diagram viewer written in JavaScript. The objective of this project would be to use the code base of Jvenn to make it compatible with BioJS2.0.
Literature: jvenn: an interactive Venn diagram viewer
Mentors: PP2_CS_2014 mentors
Students: 2

Jvenn example

Protein Viewer

Visualization of PDB files - 3D structures of protein sequences

Similar projects:

Mentors: Björn Grüning (Galaxy) gruening. (at) .informatik.uni-freiburg.de
Students: 4-5

PDB structure

Gene Cluster Viewer

The viewer is supposed to show the conserved gene order in prokaryotic genomes. The data will be derived from GenBank.

Source: Example for visualization
Mentors: Björn Grüning (Galaxy) gruening. (at) .informatik.uni-freiburg.de
Students: 2

Gene cluster

Dot-Bracket Notation 1

RNA secondary structure is often defined using Dot-Bracket Notation (DBN). Valid structures in DBN format are well-parenthesized words consisting of dots '.', opening '(' and closing ')' parentheses. Dotted positions are unpaired, whereas matching parenthesized positions represent base-pairing nucleotides. As the number of nucleotides interacting is always even (everyone must have a parter), the brackets must be balanced. Source: [Wikipedia: http://ultrastudio.org/en/Dot-Bracket_Notation]

Sources:

Mentors: Björn Grüning (Galaxy) gruening. (at) .informatik.uni-freiburg.de
Students: 2

RNA

Dot-Bracket Notation 2

This project deals with a slightly different representation of the Dot-Bracket Notation.

Sources:

Mentors: Björn Grüning (Galaxy) gruening. (at) .informatik.uni-freiburg.de
Students: 2

RNA

Pedigree Chart Visualization

A pedigree chart is a simple and easy to read diagram showing the occurrence and appearance or phenotypes of a particular gene in an organism and its ancestors. Pedigrees use a standardized set of symbols:

  • squares: males
  • circles: females
  • diamonds: the sex of the person is unknown
  • filled-in (darker) symbol: someone with the phenotype in question
  • shaded or half-filled symbol: heterozygotes
  • horizontal and a vertical line: connects parents to their offspring
  • ....

Literature:

Mentors: PP2_CS_2014 mentors
Students: 2

Pedigree chart

Sub-cellular localization in a cell

Archaea, Bacteria and Eukaryota form the three domains of life. Eukaryotic cells contain a nucleus and other membrane-bound organelles. The cells of archaea and bacteria in contrast are formed by a single compartment that is surrounded by the plasma membrane (Gram-negative bacteria have an additional outer membrane). The objective of this project is to visualize biological cells and highlight by a user selected sub-cellular compartments in a way that they stand out from the un-selected ones. Similar idea: The Compartments database
Mentors: PP2_CS_2014 mentors, Manuel Corpas (TGAC) mc. (at) .manuelcorpas.com
Students: 2

Pedigree chart

Force directed network (spring algorithm), Graph Viewer

The objective of this project is to visualize a network (large networks of >2000 nodes) in a way that the distance of a node from the rest of the network is determined by the number of nodes it is connected to => the more neighbors a node has the larger is its distance from the network. The component must allow zooming in/out, selection by the number of neighbors, coloring by various thresholds and other graph-related features.

Relevant sources:

Mentors: PP2_CS_2014 mentors, Yana Bromberg (Rutgers University), Björn Grüning (Galaxy) gruening. (at) .informatik.uni-freiburg.de

Students: 3-4

Graph

HSSP curve

The HHSP curve at a threshold of interest (HSSP value=0 is default) must be visualized in a 2D graph. Additionally, alignments of protein sequences, provided by the user, must be plotted on the graph.

Literature:

Mentors: PP2_CS_2014 mentors
Students: 2

HSSP curve

Graphical Model Editor

@Juanmi, can you please add a description here? Thanks :)

2D Chemical Components Visualizer

The goal is to automatically create 2D diagrams of chemical complexes with known 3D structure according to chemical drawing conventions.

Similar projects:

Mentors: Julian Heinrich (CSIRO) julian.heinrich. (at) .csiro.au, Björn Grüning (Galaxy) gruening. (at) .informatik.uni-freiburg.de
Students: 2-3

Poseview

Genome Browser

@Miguel, Manny: can you please add a description here?

Relevant sources:

BigWig and BigBed File Viewers

The idea came from Saket, but Ricardo might be working on it already. Wrote these guys on email/Skype and awaiting reply.

Visualization of iAnn events

The iAnn calendar is one of the most used tools to annotate and curate scientific announcements. The idea of this project is to visualize iAnn announcements in the following ways:

  • as an interactive map
  • a table
  • and e.g. a pie chart or histograms showing statistics by various keywords (dates, country, field, etc.)

Relevant sources:

Mentor: Manuel Corpas (TGAC) mc. (at) .manuelcorpas.com
Students: 2-3

Poseview

Visualization of events on the GOBLET platform

Similar idea as for iAnn events -> visualization of events based on keywords

Sources:

Mentor: Manuel Corpas (TGAC) mc. (at) .manuelcorpas.com
Students: 2-3

Visualization of FastQ formats

FASTQis a common file format storing sequencing read data together with its associated per base quality score. The objective of this project to visualize file in the fastQ format in an attractive ans easily interprtetable way. An example of a fastQ file is given below.

Sources:

Mentor: Manuel Corpas (TGAC) mc. (at) .manuelcorpas.com
Students: 2

Poseview


Parser for GenBank format and visualization of annotations