Difference between revisions of "FASTA"
From Bioinformatikpedia
(Created page with "==Basic Information== {| style="float: right; border: 1px solid #BBB; margin: .46em 0 0 .2em;" ! Author | David J. Lipman, William R. Pearson |- ! Year | 1985 |- ! Reference | […") |
|||
Line 15: | Line 15: | ||
|- |
|- |
||
! Method |
! Method |
||
+ | | look-up table & agglomerative alignment |
||
− | | |
||
|} |
|} |
||
FASTA is one of the heuristics to sequence alignments. |
FASTA is one of the heuristics to sequence alignments. |
||
+ | |||
+ | |||
+ | |||
+ | |||
Revision as of 16:30, 17 August 2011
Basic Information
Author | David J. Lipman, William R. Pearson |
---|---|
Year | 1985 |
Reference | Rapid and sensitive protein similarity searches |
Short description | Alignment heuristic |
Method | look-up table & agglomerative alignment |
FASTA is one of the heuristics to sequence alignments.
Details
The algorithm works in four steps:
- Identify regions of highest density in each sequence comparison: Searching for identical tuples (length 2) between the sequence and a look-up-table of the database. A match is declared, if a certain number of consecutive identical tuples (ktup-value) was found between two sequences. The matches can be visualized by diagonals in a matrix comparing two sequences. The best 10 local regions selected from all the diagonals put together are then saved.
- Rescan the regions taken using the scoring matrices. trimming the ends of the region to include only those contributing to the highest score.
- Optimal alignment of initial regions as a combination of compatible regions with maximal score.
- Use a banded Smith-Waterman algorithm to calculate an optimal score for alignment.