[Debian-med-packaging] Bug#961661: ITP: samclip -- filter SAM file for soft and hard clipped alignments
Andreas Tille
tille at debian.org
Wed May 27 12:43:46 BST 2020
Package: wnpp
Severity: wishlist
Subject: ITP: samclip -- filter SAM file for soft and hard clipped alignments
Package: wnpp
Owner: Andreas Tille <tille at debian.org>
Severity: wishlist
* Package name : samclip
Version : 0.4.0
Upstream Author : Torsten Seemann
* URL : https://github.com/tseemann/samclip
* License : GPL-3
Programming Lang: (C, C++, C#, Perl, Python, etc.)
Description : filter SAM file for soft and hard clipped alignments
Most short read aligners perform local alignment of reads to the
reference genome. Examples includes bwa mem, minimap2, and bowtie2
(unless in --end-to-end mode). This means the ends of the read may not
be part of the best alignment.
.
This can be caused by:
.
* adapter sequences (aren't in the reference)
* poor quality bases (mismatches only make the alignment score worse)
* structural variation in your sample compared to the reference
* reads overlapping the start and end of contigs (including
circular genomes)
.
Read aligners output a SAM file. Column 6 in this format stores the
CIGAR string. which describes which parts of the read aligned and which
didn't. The unaligned ends of the read can be "soft" or "hard" clipped,
denoted with S and H at each end of the CIGAR string. It is possible for
both types to be present, but that is not common. Soft and hard don't
mean anything biologically, they just refer to whether the full read
sequence is in the SAM file or not.
Remark: This package is maintained by Debian Med Packaging Team at
https://salsa.debian.org/med-team/samclip
More information about the Debian-med-packaging
mailing list