[Debian-med-packaging] Bug#910357: ITP: segemehl -- short read mapping with gaps
Andreas Tille
tille at debian.org
Fri Oct 5 10:53:02 BST 2018
Package: wnpp
Severity: wishlist
Owner: Andreas Tille <tille at debian.org>
* Package name : segemehl
Version : 0.3
Upstream Author : Bioinformatik Leipzig
* URL : http://www.bioinf.uni-leipzig.de/Software/segemehl/
* License : GPL-3+
Programming Lang: C
Description : short read mapping with gaps
Segemehl is a software to map short sequencer reads to reference
genomes. Segemehl implements a matching strategy based on enhanced
suffix arrays (ESA). Segemehl accepts fasta and fastq queries (gzip’ed
and bgzip'ed). In addition to the alignment of reads from standard DNA-
and RNA-seq protocols, it also allows the mapping of bisulfite converted
reads (Lister and Cokus) and implements a split read mapping strategy.
The output of segemehl is a SAM or BAM formatted alignment file. In the
case of split-read mapping, additional BED files are written to the
disc. These BED files may be summarized with the postprocessing tool
haarz. In the case of the alignment of bisulfite converted reads, raw
methylation rates may also be called with haarz.
.
In brief, for each suffix of a read, segemehl aims to find the
best-scoring seed. Seeds might contain insertions, deletions, and
mismatches (differences). The number of differences allowed within a
single seed is user-controlled and is crucial for the runtime of the
program. Subsequently, seeds that undercut the user-defined E-value are
passed on to an exact semi-global alignment procedure. Finally, reads
with a minimum accuracy of percent are reported to the user.
Remark: This package is maintained by Debian Med Packaging Team at
https://salsa.debian.org/med-team/segemehl
More information about the Debian-med-packaging
mailing list