[Debian-med-packaging] Bug#808690: ITP: fsa -- Fast Statistical Alignment of protein, RNA or DNA sequences
Andreas Tille
tille at debian.org
Mon Dec 21 20:22:06 UTC 2015
Package: wnpp
Severity: wishlist
Owner: Andreas Tille <tille at debian.org>
* Package name : fsa
Version : 1.15.9
Upstream Author : Ariel Schwartz, Chuong Do, Robert Bradley, Jaeyoung Do, Colin Dewey, Ian Holmes, Lars Barquist
* URL : http://fsa.sourceforge.net/
* License : GPL
Programming Lang: C
Description : Fast Statistical Alignment of protein, RNA or DNA sequences
FSA is a probabilistic multiple sequence alignment algorithm which uses
a "distance-based" approach to aligning homologous protein, RNA or DNA
sequences. Much as distance-based phylogenetic reconstruction methods
like Neighbor-Joining build a phylogeny using only pairwise divergence
estimates, FSA builds a multiple alignment using only pairwise
estimations of homology. This is made possible by the sequence annealing
technique for constructing a multiple alignment from pairwise
comparisons, developed by Ariel Schwartz.
.
FSA brings the high accuracies previously available only for
small-scale analyses of proteins or RNAs to large-scale problems such as
aligning thousands of sequences or megabase-long sequences. FSA
introduces several novel methods for constructing better alignments:
* FSA uses machine-learning techniques to estimate gap and
substitution parameters on the fly for each set of input sequences.
This "query-specific learning" alignment method makes FSA very robust:
it can produce superior alignments of sets of homologous sequences
which are subject to very different evolutionary constraints.
* FSA is capable of aligning hundreds or even thousands of sequences
using a randomized inference algorithm to reduce the computational
cost of multiple alignment. This randomized inference can be over ten
times faster than a direct approach with little loss of accuracy.
* FSA can quickly align very long sequences using the "anchor
annealing" technique for resolving anchors and projecting them with
transitive anchoring. It then stitches together the alignment between
the anchors using the methods described above.
* The included GUI, MAD (Multiple Alignment Display), can display the
intermediate alignments produced by FSA, where each character is
colored according to the probability that it is correctly aligned
Remark: This package enhances the package t-coffee also maintained by
the Debian Med team. The packaging can be found at
git://anonscm.debian.org/debian-med/fsa.git
More information about the Debian-med-packaging
mailing list