[med-svn] [Git][med-team/python-cutadapt][master] 5 commits: New upstream version 2.8

Steffen Möller gitlab at salsa.debian.org
Fri Jan 24 11:56:33 GMT 2020



Steffen Möller pushed to branch master at Debian Med / python-cutadapt


Commits:
e2da6662 by Steffen Moeller at 2020-01-24T12:50:13+01:00
New upstream version 2.8
- - - - -
18b81d78 by Steffen Moeller at 2020-01-24T12:50:13+01:00
routine-update: New upstream version

- - - - -
adb5e509 by Steffen Moeller at 2020-01-24T12:50:15+01:00
Update upstream source from tag 'upstream/2.8'

Update to upstream version '2.8'
with Debian dir 51a8f112f846c5abdeca7cb9a07880aafc293c12
- - - - -
1b0a01b1 by Steffen Moeller at 2020-01-24T12:50:30+01:00
Set upstream metadata fields: Bug-Database, Bug-Submit, Repository, Repository-Browse.
- - - - -
4c6471ae by Steffen Moeller at 2020-01-24T12:54:25+01:00
routine-update: Ready to upload to unstable

- - - - -


26 changed files:

- CHANGES.rst
- debian/changelog
- debian/control
- debian/upstream/metadata
- doc/guide.rst
- + mypy.ini
- src/cutadapt/__main__.py
- src/cutadapt/adapters.py
- src/cutadapt/filters.py
- src/cutadapt/modifiers.py
- src/cutadapt/parser.py
- src/cutadapt/pipeline.py
- src/cutadapt/report.py
- src/cutadapt/utils.py
- + tests/cut/linked-info.txt
- + tests/cut/linked-lowercase.fasta
- + tests/cut/revcomp-single-normalize.fastq
- + tests/cut/revcomp-single.fastq
- + tests/data/revcomp.1.fastq
- + tests/data/revcomp.2.fastq
- tests/test_adapters.py
- tests/test_commandline.py
- tests/test_modifiers.py
- tests/test_parser.py
- tests/test_trim.py
- tox.ini


Changes:

=====================================
CHANGES.rst
=====================================
@@ -2,6 +2,15 @@
 Changes
 =======
 
+v2.8 (2020-01-13)
+-----------------
+
+* :issue:`220`: With option ``--revcomp``, Cutadapt now searches both the read
+  and its reverse complement for adapters. The version that matches best is
+  kept. This can be used to “normalize” strandedness.
+* :issue:`430`: ``--action=lowercase`` now works with linked adapters
+* :issue:`431`: Info files can now be written even for linked adapters.
+
 v2.7 (2019-11-22)
 -----------------
 


=====================================
debian/changelog
=====================================
@@ -1,3 +1,13 @@
+python-cutadapt (2.8-1) unstable; urgency=medium
+
+  * Team upload.
+  * New upstream version
+  * Standards-Version: 4.5.0
+  * Set upstream metadata fields: Bug-Database, Bug-Submit, Repository,
+    Repository-Browse.
+
+ -- Steffen Moeller <moeller at debian.org>  Fri, 24 Jan 2020 12:50:31 +0100
+
 python-cutadapt (2.7-2) unstable; urgency=medium
 
   * Team upload.


=====================================
debian/control
=====================================
@@ -17,7 +17,7 @@ Build-Depends: debhelper-compat (= 12),
                python3-xopen (>= 0.5.0),
                python3-dnaio,
                cython3
-Standards-Version: 4.4.1
+Standards-Version: 4.5.0
 Vcs-Browser: https://salsa.debian.org/med-team/python-cutadapt
 Vcs-Git: https://salsa.debian.org/med-team/python-cutadapt.git
 Homepage: https://pypi.python.org/pypi/cutadapt


=====================================
debian/upstream/metadata
=====================================
@@ -1,23 +1,27 @@
 Reference:
- - Author: Marcel Martin
-   Title: >
+- Author: Marcel Martin
+  Title: >
     Cutadapt removes adapter sequences from high-throughput sequencing
     reads
-   Journal: EMBnet.journal
-   Year: 2015
-   Volume: 17
-   Number: 1
-   Pages: 10-12
-   DOI: 10.14806/ej.17.1.200
-   URL: http://journal.embnet.org/index.php/embnetjournal/article/view/200
-   eprint: >
+  Journal: EMBnet.journal
+  Year: 2015
+  Volume: 17
+  Number: 1
+  Pages: 10-12
+  DOI: 10.14806/ej.17.1.200
+  URL: http://journal.embnet.org/index.php/embnetjournal/article/view/200
+  eprint: >
     http://journal.embnet.org/index.php/embnetjournal/article/view/200/458
 Registry:
- - Name: OMICtools
-   Entry: OMICS_01086
- - Name: bio.tools
-   Entry: cutadapt
- - Name: SciCrunch
-   Entry: SCR_011841
- - Name: conda:bioconda
-   Entry: cutadapt
+- Name: OMICtools
+  Entry: OMICS_01086
+- Name: bio.tools
+  Entry: cutadapt
+- Name: SciCrunch
+  Entry: SCR_011841
+- Name: conda:bioconda
+  Entry: cutadapt
+Bug-Database: https://github.com/marcelm/cutadapt/issues
+Bug-Submit: https://github.com/marcelm/cutadapt/issues/new
+Repository: https://github.com/marcelm/cutadapt.git
+Repository-Browse: https://github.com/marcelm/cutadapt


=====================================
doc/guide.rst
=====================================
@@ -225,8 +225,9 @@ Adapter sequences :ref:`may also contain any IUPAC wildcard
 character <wildcards>` (such as ``N``).
 
 In addition, it is possible to :ref:`remove a fixed number of
-bases <cut-bases>` from the beginning or end of each read, and to :ref:`remove
-low-quality bases (quality trimming) <quality-trimming>` from the 3' and 5' ends.
+bases <cut-bases>` from the beginning or end of each read, to :ref:`remove
+low-quality bases (quality trimming) <quality-trimming>` from the 3' and 5' ends,
+and to :ref:`search for adapters also in the reverse-complemented reads <reverse-complement>`.
 
 
 Overview of adapter types
@@ -800,6 +801,43 @@ at both ends, use ``-g "ADAPTER;anywhere"``.
     :ref:`increase the minimum overlap length <random-matches>`.
 
 
+.. _reverse-complement:
+
+Searching reverse complements
+-----------------------------
+
+.. note::
+    Option ``--revcomp`` is added on a tentative basis. Its behaviour may
+    change in future releases.
+
+By default, Cutadapt expects adapters to be given in the same orientation (5' to 3') as the reads.
+
+To change this, use option ``--revcomp`` or its abbreviation ``--rc``. If given, Cutadapt searches
+both the read and its reverse complement for adapters. If the reverse complemented read yields
+a better match, then that version of the read is kept. That is, the output file will contain the
+reverse-complemented sequence. This can be used to “normalize” read orientation/strandedness.
+
+To determine which version of the read yields the better match, the full adapter search (possibly
+multiple rounds if ``--times`` is used) is done independently on both versions, and the version that
+results in the higher number of matching nucleotides is considered to be the better one.
+
+The name of a reverse-complemented read is changed by adding a space and ``rc`` to it. (Please
+file an issue if you would like this to be configurable.)
+
+The report will show the number of reads that were reverse-complemented, like this::
+
+    Total reads processed:  60
+    Reads with adapters:    50 (83.3%)
+    Reverse-complemented:   20 (33.3%)
+
+Here, 20 of the 50 reads with adapters were reverse-complemented, while the remaining
+50 - 20 = 30 reads contained an adapter in their original orientation.
+
+Option ``--revcomp`` is currently available only for single-end data.
+
+.. versionadded:: 2.8
+
+
 Specifying adapter sequences
 ============================
 
@@ -1850,7 +1888,10 @@ starts with something like this::
 
     Sequence: 'ACGTACGTACGTTAGCTAGC'; Length: 20; Trimmed: 2402 times.
 
-The meaning of this should be obvious.
+The meaning of this should be obvious. If option ``--revcomp`` was used,
+this line will additionally contain something like ``Reverse-complemented:
+984 times``. This says how many of the 2402 total matches were found on the
+reverse complement of the read.
 
 The next piece of information is this::
 
@@ -1934,13 +1975,20 @@ Format of the info file
 -----------------------
 
 When the ``--info-file`` command-line parameter is given, detailed
-information about the found adapters is written to the given file. The
-output is a tab-separated text file. Each line corresponds to one read
-of the input file (unless `--times` is used, see below). A row is written
-for *all* reads, even those that are discarded from the final output
-FASTA/FASTQ due to filtering options (such as ``--minimum-length``).
+information about where adapters were found in each read is written
+to the given file. It is a tab-separated text file that contains at
+least one row per input read. Normally, there is exactly one row per
+input read, but in the following cases, multiple rows may be output:
 
-The fields in each row are:
+  * The option ``--times`` is in use.
+  * A linked adapter is used.
+
+A row is written for *all* input reads, even those that are discarded
+from the final FASTA/FASTQ output due to filtering options
+(such as ``--minimum-length``). Which fields are output in each row
+depends on whether an adapter match was found in the read or not.
+
+The fields in a row that describes a match are:
 
 1. Read name
 2. Number of errors
@@ -1963,12 +2011,12 @@ Concatenating them yields the full sequence of quality values.
 If no adapter was found, the format is as follows:
 
 1. Read name
-2. The value -1
+2. The value -1 (use this to distinguish between match and non-match)
 3. The read sequence
 4. Quality values
 
 When parsing the file, be aware that additional columns may be added in
-the future. Note also that some fields can be empty, resulting in
+the future. Also, some fields can be empty, resulting in
 consecutive tabs within a line.
 
 If the ``--times`` option is used and greater than 1, each read can appear
accordingly for columns 9-11). For subsequent lines, the shown sequences are the
 ones that were used in subsequent rounds of adapter trimming, that is, they get
 successively shorter.
 
+Linked adapters appear with up to two rows for each read, one for each constituent
+adapter for which a match has been found. To be able to see which of the two
+adapters a row describes, the adapter name in column 8 is modified: If the row
+describes a match of the 5' adapter, the string ``;1`` is added. If it describes
+a match of the 3' adapter, the string ``;2`` is added. If there are two rows, the
+5' match always comes first.
+
+
 .. versionadded:: 1.9
     Columns 9-11 were added.
+
+.. versionadded:: 2.8
+    Linked adapters in info files work.
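
The info file format described above is easy to consume programmatically. The following
is a minimal sketch (not part of the upstream changes) that uses only the standard
library; the file name "info.txt", the function name read_info_file and the tuples it
yields are choices made here for illustration, while the column positions, the -1 marker
and the ";1"/";2" name suffixes for linked adapters follow the documentation above:

    # Minimal sketch of a parser for the info file documented above.
    # Column 2 is "-1" when no adapter was found; for linked adapters,
    # column 8 carries the adapter name with ";1" (5' part) or ";2" (3' part).
    def read_info_file(path):
        with open(path) as f:
            for line in f:
                fields = line.rstrip("\n").split("\t")
                read_name = fields[0]
                if fields[1] == "-1":
                    # No adapter found: read name, -1, read sequence, qualities
                    yield read_name, None, None
                    continue
                adapter_name, _, linked_part = fields[7].partition(";")
                yield read_name, adapter_name, linked_part or None

    for name, adapter, part in read_info_file("info.txt"):
        print(name, adapter, part)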


=====================================
mypy.ini
=====================================
@@ -0,0 +1,14 @@
+[mypy]
+warn_unused_configs = True
+
+[mypy-xopen]
+ignore_missing_imports = True
+
+[mypy-dnaio.*]
+ignore_missing_imports = True
+
+[mypy-cutadapt._align.*]
+ignore_missing_imports = True
+
+[mypy-cutadapt.qualtrim.*]
+ignore_missing_imports = True
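
Since mypy reads mypy.ini from the directory it is invoked in, the file above takes
effect whenever the checker is run from the unpacked source tree. How the project itself
invokes mypy is defined in tox.ini (also touched by this upload); purely as an
illustration, and assuming mypy is installed, it can also be driven from Python through
its documented api module:

    # Sketch only: run mypy over the sources so the mypy.ini above is picked up.
    # The path "src/cutadapt" matches the layout shown elsewhere in this diff.
    from mypy import api

    stdout, stderr, exit_status = api.run(["src/cutadapt"])
    print(stdout, end="")
    raise SystemExit(exit_status)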


=====================================
src/cutadapt/__main__.py
=====================================
@@ -34,11 +34,10 @@ For paired-end reads:
     cutadapt -a ADAPT1 -A ADAPT2 [options] -o out1.fastq -p out2.fastq in1.fastq in2.fastq
 
 Replace "ADAPTER" with the actual sequence of your 3' adapter. IUPAC wildcard
-characters are supported. The reverse complement is *not* automatically
-searched. All reads from input.fastq will be written to output.fastq with the
-adapter sequence removed. Adapter matching is error-tolerant. Multiple adapter
-sequences can be given (use further -a options), but only the best-matching
-adapter will be removed.
+characters are supported. All reads from input.fastq will be written to
+output.fastq with the adapter sequence removed. Adapter matching is
+error-tolerant. Multiple adapter sequences can be given (use further -a
+options), but only the best-matching adapter will be removed.
 
 Input may also be in FASTA format. Compressed input and output is supported and
 auto-detected from the file name (.gz, .xz, .bz2). Use the file name '-' for
@@ -58,19 +57,21 @@ import sys
 import time
 import logging
 import platform
+from typing import Tuple, Optional, Sequence, List, Any, Iterator, Union
 from argparse import ArgumentParser, SUPPRESS, HelpFormatter
 
 import dnaio
 
 from cutadapt import __version__
-from cutadapt.adapters import warn_duplicate_adapters
+from cutadapt.adapters import warn_duplicate_adapters, Adapter
 from cutadapt.parser import AdapterParser
-from cutadapt.modifiers import (LengthTagModifier, SuffixRemover, PrefixSuffixAdder,
+from cutadapt.modifiers import (Modifier, LengthTagModifier, SuffixRemover, PrefixSuffixAdder,
     ZeroCapper, QualityTrimmer, UnconditionalCutter, NEndTrimmer, AdapterCutter,
-    PairedAdapterCutterError, PairedAdapterCutter, NextseqQualityTrimmer, Shortener)
+    PairedAdapterCutterError, PairedAdapterCutter, NextseqQualityTrimmer, Shortener,
+    ReverseComplementer)
 from cutadapt.report import full_report, minimal_report
-from cutadapt.pipeline import (SingleEndPipeline, PairedEndPipeline, InputFiles, OutputFiles,
-    SerialPipelineRunner, ParallelPipelineRunner)
+from cutadapt.pipeline import (Pipeline, SingleEndPipeline, PairedEndPipeline, InputFiles,
+    OutputFiles, SerialPipelineRunner, ParallelPipelineRunner)
 from cutadapt.utils import available_cpu_count, Progress, DummyProgress, FileOpener
 from cutadapt.log import setup_logging, REPORT
 
@@ -108,7 +109,7 @@ class CommandLineError(Exception):
     pass
 
 
-def get_argument_parser():
+def get_argument_parser() -> ArgumentParser:
     # noqa: E131
     parser = CutadaptArgumentParser(usage=__doc__, add_help=False)
     group = parser.add_argument_group("Options")
@@ -185,6 +186,11 @@ def get_argument_parser():
             "lowercase: convert to lowercase; "
             "none: leave unchanged (useful with "
             "--discard-untrimmed). Default: %(default)s")
+    group.add_argument("--rc", "--revcomp", dest="reverse_complement", default=False,
+        action="store_true",
+        help="Check both the read and its reverse complement for adapter matches. If "
+            "match is on reverse-complemented version, output that one. "
+            "Default: check only read")
     group.add_argument("--no-trim", dest='action', action='store_const', const='none',
         help=SUPPRESS)  # Deprecated, use --action=none
     group.add_argument("--mask-adapter", dest='action', action='store_const', const='mask',
@@ -335,13 +341,13 @@ def get_argument_parser():
     return parser
 
 
-def parse_cutoffs(s):
-    """Parse a string INT[,INT] into a two-element list of integers
+def parse_cutoffs(s: str) -> Tuple[int, int]:
+    """Parse a string INT[,INT] into a pair of integers
 
     >>> parse_cutoffs("5")
-    [0, 5]
+    (0, 5)
     >>> parse_cutoffs("6,7")
-    [6, 7]
+    (6, 7)
     """
     try:
         cutoffs = [int(value) for value in s.split(",")]
@@ -353,10 +359,11 @@ def parse_cutoffs(s):
     elif len(cutoffs) != 2:
         raise CommandLineError("Expected one value or two values separated by comma for "
             "the quality cutoff")
-    return cutoffs
 
+    return (cutoffs[0], cutoffs[1])
 
-def parse_lengths(s):
+
+def parse_lengths(s: str) -> Tuple[Optional[int], ...]:
     """Parse [INT][:[INT]] into a pair of integers. If a value is omitted, use None
 
     >>> parse_lengths('25')
@@ -380,64 +387,38 @@ def parse_lengths(s):
     return tuple(values)
 
 
-def open_output_files(args, default_outfile, interleaved, file_opener):
+def open_output_files(
+    args, default_outfile, interleaved: bool, file_opener: FileOpener
+) -> OutputFiles:
     """
     Return an OutputFiles instance. If demultiplex is True, the untrimmed, untrimmed2, out and out2
     attributes are not opened files, but paths (out and out2 with the '{name}' template).
     """
 
-    def open1(path):
-        """Return opened file (or None if path is None)"""
-        if path is None:
-            return None
-        return file_opener.xopen(path, "wb")
-
-    def open2(path1, path2):
-        file1 = file2 = None
-        if path1 is not None:
-            file1 = file_opener.xopen(path1, "wb")
-            if path2 is not None:
-                file2 = file_opener.xopen(path2, "wb")
-        return file1, file2
-
-    rest_file = open1(args.rest_file)
-    info_file = open1(args.info_file)
-    wildcard = open1(args.wildcard_file)
+    rest_file = file_opener.xopen_or_none(args.rest_file, "wb")
+    info_file = file_opener.xopen_or_none(args.info_file, "wb")
+    wildcard = file_opener.xopen_or_none(args.wildcard_file, "wb")
 
     too_short = too_short2 = None
     if args.minimum_length is not None:
-        too_short, too_short2 = open2(args.too_short_output, args.too_short_paired_output)
+        too_short, too_short2 = file_opener.xopen_pair(
+            args.too_short_output, args.too_short_paired_output, "wb")
 
     too_long = too_long2 = None
     if args.maximum_length is not None:
-        too_long, too_long2 = open2(args.too_long_output, args.too_long_paired_output)
+        too_long, too_long2 = file_opener.xopen_pair(
+            args.too_long_output, args.too_long_paired_output, "wb")
 
     if int(args.discard_trimmed) + int(args.discard_untrimmed) + int(
             args.untrimmed_output is not None) > 1:
         raise CommandLineError("Only one of the --discard-trimmed, --discard-untrimmed "
             "and --untrimmed-output options can be used at the same time.")
 
-    demultiplex = args.output is not None and '{name}' in args.output
-
-    if args.paired_output is not None and (demultiplex != ('{name}' in args.paired_output)):
-        raise CommandLineError('When demultiplexing paired-end data, "{name}" must appear in '
-            'both output file names (-o and -p)')
-
-    demultiplex_combinatorial = (
-        args.output is not None
-        and args.paired_output is not None
-        and '{name1}' in args.output
-        and '{name2}' in args.output
-        and '{name1}' in args.paired_output
-        and '{name2}' in args.paired_output
-    )
-    if (demultiplex or demultiplex_combinatorial) and args.discard_trimmed:
+    demultiplex_mode = determine_demultiplex_mode(args)
+    if demultiplex_mode and args.discard_trimmed:
         raise CommandLineError("Do not use --discard-trimmed when demultiplexing.")
 
-    if demultiplex:
-        if demultiplex_combinatorial:
-            raise CommandLineError("You cannot combine {name} with {name1} and {name2}")
-
+    if demultiplex_mode == "normal":
         out = args.output
         untrimmed = args.output.replace('{name}', 'unknown')
         if args.untrimmed_output:
@@ -452,12 +433,11 @@ def open_output_files(args, default_outfile, interleaved, file_opener):
                 untrimmed2 = args.untrimmed_paired_output
             if args.discard_untrimmed:
                 untrimmed2 = None
-
         else:
             untrimmed2 = out2 = None
 
         assert out is not None and '{name}' in out and (out2 is None or '{name}' in out2)
-    elif demultiplex_combinatorial:
+    elif demultiplex_mode == "combinatorial":
         out = args.output
         out2 = args.paired_output
         if args.untrimmed_output or args.untrimmed_paired_output:
@@ -468,8 +448,9 @@ def open_output_files(args, default_outfile, interleaved, file_opener):
         else:
             untrimmed = untrimmed2 = 'unknown'
     else:
-        untrimmed, untrimmed2 = open2(args.untrimmed_output, args.untrimmed_paired_output)
-        out, out2 = open2(args.output, args.paired_output)
+        untrimmed, untrimmed2 = file_opener.xopen_pair(
+            args.untrimmed_output, args.untrimmed_paired_output, "wb")
+        out, out2 = file_opener.xopen_pair(args.output, args.paired_output, "wb")
         if out is None:
             out = default_outfile
 
@@ -485,13 +466,41 @@ def open_output_files(args, default_outfile, interleaved, file_opener):
         untrimmed2=untrimmed2,
         out=out,
         out2=out2,
-        demultiplex=demultiplex or demultiplex_combinatorial,
+        demultiplex=bool(demultiplex_mode),
         interleaved=interleaved,
         force_fasta=args.fasta,
     )
 
 
-def determine_paired_mode(args):
+def determine_demultiplex_mode(args) -> Union[str, bool]:
+    """Return one of "normal", "combinatorial" or False"""
+
+    demultiplex = args.output is not None and '{name}' in args.output
+
+    if args.paired_output is not None and (demultiplex != ('{name}' in args.paired_output)):
+        raise CommandLineError('When demultiplexing paired-end data, "{name}" must appear in '
+                               'both output file names (-o and -p)')
+
+    demultiplex_combinatorial = (
+        args.output is not None
+        and args.paired_output is not None
+        and '{name1}' in args.output
+        and '{name2}' in args.output
+        and '{name1}' in args.paired_output
+        and '{name2}' in args.paired_output
+    )
+    if demultiplex and demultiplex_combinatorial:
+        raise CommandLineError("You cannot combine {name} with {name1} and {name2}")
+
+    if demultiplex:
+        return "normal"
+    elif demultiplex_combinatorial:
+        return "combinatorial"
+    else:
+        return False
+
+
+def determine_paired(args) -> bool:
     """
     Determine whether we should work in paired-end mode.
     """
@@ -506,7 +515,7 @@ def determine_paired_mode(args):
         or args.too_long_paired_output)
 
 
-def determine_interleaved(args):
+def determine_interleaved(args) -> Tuple[bool, bool]:
     is_interleaved_input = False
     is_interleaved_output = False
     if args.interleaved:
@@ -518,7 +527,9 @@ def determine_interleaved(args):
     return is_interleaved_input, is_interleaved_output
 
 
-def input_files_from_parsed_args(inputs, paired, interleaved):
+def setup_input_files(
+    inputs: Sequence[str], paired: bool, interleaved: bool
+) -> Tuple[str, Optional[str]]:
     """
     Return tuple (input_filename, input_paired_filename)
     """
@@ -541,7 +552,7 @@ def input_files_from_parsed_args(inputs, paired, interleaved):
                 "but then you also need to provide two input files (you provided one) or "
                 "use --interleaved.")
         else:
-            input_paired_filename = inputs[1]
+            input_paired_filename = inputs[1]  # type: Optional[str]
     else:
         if len(inputs) == 2:
             raise CommandLineError(
@@ -553,7 +564,7 @@ def input_files_from_parsed_args(inputs, paired, interleaved):
     return input_filename, input_paired_filename
 
 
-def check_arguments(args, paired, is_interleaved_output):
+def check_arguments(args, paired: bool, is_interleaved_output: bool) -> None:
     if not paired:
         if args.untrimmed_paired_output:
             raise CommandLineError("Option --untrimmed-paired-output can only be used when "
@@ -597,7 +608,7 @@ def check_arguments(args, paired, is_interleaved_output):
         raise CommandLineError("--pair-adapters cannot be used with --times")
 
 
-def pipeline_from_parsed_args(args, paired, is_interleaved_output, file_opener):
+def pipeline_from_parsed_args(args, paired, file_opener) -> Pipeline:
     """
     Setup a processing pipeline from parsed command-line arguments.
 
@@ -605,38 +616,23 @@ def pipeline_from_parsed_args(args, paired, is_interleaved_output, file_opener):
 
     Return an instance of Pipeline (SingleEndPipeline or PairedEndPipeline)
     """
-    check_arguments(args, paired, is_interleaved_output)
     if args.action == 'none':
         args.action = None
 
-    adapter_parser = AdapterParser(
-        max_error_rate=args.error_rate,
-        min_overlap=args.overlap,
-        read_wildcards=args.match_read_wildcards,
-        adapter_wildcards=args.match_adapter_wildcards,
-        indels=args.indels,
-    )
-    try:
-        adapters = adapter_parser.parse_multi(args.adapters)
-        adapters2 = adapter_parser.parse_multi(args.adapters2)
-    except (FileNotFoundError, ValueError) as e:
-        raise CommandLineError(e)
-    warn_duplicate_adapters(adapters)
-    warn_duplicate_adapters(adapters2)
-    if args.debug:
-        for adapter in adapters + adapters2:
-            adapter.enable_debug()
+    adapters, adapters2 = adapters_from_args(args)
 
     # Create the processing pipeline
     if paired:
         pair_filter_mode = 'any' if args.pair_filter is None else args.pair_filter
-        pipeline = PairedEndPipeline(pair_filter_mode, file_opener)
+        pipeline = PairedEndPipeline(
+            pair_filter_mode, file_opener
+        )  # type: Any
     else:
         pipeline = SingleEndPipeline(file_opener)
 
     # When adapters are being trimmed only in R1 or R2, override the pair filter mode
     # as using the default of 'any' would regard all read pairs as untrimmed.
-    if paired and (not adapters2 or not adapters) and (
+    if isinstance(pipeline, PairedEndPipeline) and (not adapters2 or not adapters) and (
             args.discard_untrimmed or args.untrimmed_output or args.untrimmed_paired_output):
         pipeline.override_untrimmed_pair_filter = True
 
@@ -650,24 +646,16 @@ def pipeline_from_parsed_args(args, paired, is_interleaved_output, file_opener):
         cutoffs = parse_cutoffs(args.quality_cutoff)
         pipeline_add(QualityTrimmer(cutoffs[0], cutoffs[1], args.quality_base))
 
-    if args.pair_adapters:
-        try:
-            cutter = PairedAdapterCutter(adapters, adapters2, args.action)
-        except PairedAdapterCutterError as e:
-            raise CommandLineError("--pair-adapters: " + str(e))
-        pipeline.add_paired_modifier(cutter)
-    else:
-        adapter_cutter, adapter_cutter2 = None, None
-        if adapters:
-            adapter_cutter = AdapterCutter(adapters, args.times, args.action)
-        if adapters2:
-            adapter_cutter2 = AdapterCutter(adapters2, args.times, args.action)
-        if paired:
-            if adapter_cutter or adapter_cutter2:
-                pipeline.add(adapter_cutter, adapter_cutter2)
-        else:
-            if adapter_cutter:
-                pipeline.add(adapter_cutter)
+    add_adapter_cutter(
+        pipeline,
+        adapters,
+        adapters2,
+        paired,
+        args.pair_adapters,
+        args.action,
+        args.times,
+        args.reverse_complement,
+    )
 
     for modifier in modifiers_applying_to_both_ends_if_paired(args):
         pipeline_add(modifier)
@@ -691,7 +679,28 @@ def pipeline_from_parsed_args(args, paired, is_interleaved_output, file_opener):
     return pipeline
 
 
-def add_unconditional_cutters(pipeline, cut1, cut2, paired):
+def adapters_from_args(args) -> Tuple[List[Adapter], List[Adapter]]:
+    adapter_parser = AdapterParser(
+        max_error_rate=args.error_rate,
+        min_overlap=args.overlap,
+        read_wildcards=args.match_read_wildcards,
+        adapter_wildcards=args.match_adapter_wildcards,
+        indels=args.indels,
+    )
+    try:
+        adapters = adapter_parser.parse_multi(args.adapters)
+        adapters2 = adapter_parser.parse_multi(args.adapters2)
+    except (FileNotFoundError, ValueError) as e:
+        raise CommandLineError(e)
+    warn_duplicate_adapters(adapters)
+    warn_duplicate_adapters(adapters2)
+    if args.debug:
+        for adapter in adapters + adapters2:
+            adapter.enable_debug()
+    return adapters, adapters2
+
+
+def add_unconditional_cutters(pipeline: Pipeline, cut1: List[int], cut2: List[int], paired: bool):
     for i, cut_arg in enumerate([cut1, cut2]):
         # cut_arg is a list
         if not cut_arg:
@@ -705,8 +714,10 @@ def add_unconditional_cutters(pipeline, cut1, cut2, paired):
                 continue
             if i == 0:  # R1
                 if paired:
+                    assert isinstance(pipeline, PairedEndPipeline)
                     pipeline.add(UnconditionalCutter(c), None)
                 else:
+                    assert isinstance(pipeline, SingleEndPipeline)
                     pipeline.add(UnconditionalCutter(c))
             else:
                 # R2
@@ -714,7 +725,46 @@ def add_unconditional_cutters(pipeline, cut1, cut2, paired):
                 pipeline.add(None, UnconditionalCutter(c))
 
 
-def modifiers_applying_to_both_ends_if_paired(args):
+def add_adapter_cutter(
+    pipeline,
+    adapters,
+    adapters2,
+    paired: bool,
+    pair_adapters: bool,
+    action: str,
+    times: int,
+    reverse_complement: bool,
+):
+    if pair_adapters:
+        if reverse_complement:
+            raise CommandLineError("Cannot use --revcomp with --pair-adapters")
+        try:
+            cutter = PairedAdapterCutter(adapters, adapters2, action)
+        except PairedAdapterCutterError as e:
+            raise CommandLineError("--pair-adapters: " + str(e))
+        pipeline.add_paired_modifier(cutter)
+    else:
+        adapter_cutter, adapter_cutter2 = None, None
+        if adapters:
+            adapter_cutter = AdapterCutter(adapters, times, action)
+        if adapters2:
+            adapter_cutter2 = AdapterCutter(adapters2, times, action)
+        if paired:
+            if reverse_complement:
+                raise CommandLineError("--revcomp not implemented for paired-end reads")
+            if adapter_cutter or adapter_cutter2:
+                pipeline.add(adapter_cutter, adapter_cutter2)
+        elif adapter_cutter:
+            if reverse_complement:
+                modifier = ReverseComplementer(
+                    adapter_cutter
+                )  # type: Union[AdapterCutter,ReverseComplementer]
+            else:
+                modifier = adapter_cutter
+            pipeline.add(modifier)
+
+
+def modifiers_applying_to_both_ends_if_paired(args) -> Iterator[Modifier]:
     if args.length is not None:
         yield Shortener(args.length)
     if args.trim_n:
@@ -762,6 +812,8 @@ def main(cmdlineargs=None, default_outfile=sys.stdout.buffer):
         import cProfile
         profiler = cProfile.Profile()
         profiler.enable()
+    else:
+        profiler = None
 
     if args.quiet and args.report:
         parser.error("Options --quiet and --report cannot be used at the same time")
@@ -773,7 +825,7 @@ def main(cmdlineargs=None, default_outfile=sys.stdout.buffer):
             "--strip-f3, --maq, --bwa, --no-zero-cap. "
             "Use Cutadapt 1.18 or earlier to work with colorspace data.")
 
-    paired = determine_paired_mode(args)
+    paired = determine_paired(args)
     assert paired in (False, True)
 
     # Print the header now because some of the functions below create logging output
@@ -786,9 +838,10 @@ def main(cmdlineargs=None, default_outfile=sys.stdout.buffer):
         compression_level=args.compression_level, threads=0 if cores == 1 else None)
     try:
         is_interleaved_input, is_interleaved_output = determine_interleaved(args)
-        input_filename, input_paired_filename = input_files_from_parsed_args(args.inputs,
+        input_filename, input_paired_filename = setup_input_files(args.inputs,
             paired, is_interleaved_input)
-        pipeline = pipeline_from_parsed_args(args, paired, is_interleaved_output, file_opener)
+        check_arguments(args, paired, is_interleaved_output)
+        pipeline = pipeline_from_parsed_args(args, paired, file_opener)
         outfiles = open_output_files(args, default_outfile, is_interleaved_output, file_opener)
     except CommandLineError as e:
         parser.error(str(e))
@@ -838,7 +891,7 @@ def main(cmdlineargs=None, default_outfile=sys.stdout.buffer):
     else:
         report = full_report
     logger.log(REPORT, '%s', report(stats, elapsed, args.gc_content / 100))
-    if args.profile:
+    if profiler is not None:
         import pstats
         profiler.disable()
         pstats.Stats(profiler).sort_stats('time').print_stats(20)
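
To make the new --revcomp wiring above more concrete, here is a minimal sketch that uses
the classes from this upload directly, in the same way add_adapter_cutter() combines them
for single-end data. It assumes cutadapt 2.8 and dnaio are importable; the adapter
sequence, read name and read sequence are invented for the example:

    # Sketch (assumes cutadapt 2.8 + dnaio): wrap an AdapterCutter in the new
    # ReverseComplementer, as add_adapter_cutter() does when --revcomp is set.
    from dnaio import Sequence
    from cutadapt.adapters import BackOrFrontAdapter, Where
    from cutadapt.modifiers import AdapterCutter, ReverseComplementer

    adapter = BackOrFrontAdapter(sequence="AAGGTTCC", where=Where.BACK)
    cutter = AdapterCutter([adapter], 1, "trim")  # adapters, times, action
    revcomp = ReverseComplementer(cutter)

    # The adapter only matches the reverse complement of this made-up read, so
    # the reverse-complemented version is kept and " rc" is appended to its name.
    read = Sequence("read1", "GGAACCTTAAAAAAAAAA", "I" * 18)
    trimmed = revcomp(read, [])
    print(trimmed.name)      # read1 rc
    print(trimmed.sequence)  # TTTTTTTTTT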


=====================================
src/cutadapt/adapters.py
=====================================
@@ -7,6 +7,8 @@ The ...Match classes trim the reads.
 import logging
 from enum import Enum
 from collections import defaultdict
+from typing import Optional, Tuple, Sequence, Dict, Any, List
+from abc import ABC, abstractmethod
 
 from . import align
 
@@ -28,18 +30,24 @@ class Where(Enum):
     LINKED = 'linked'
 
 
-# TODO put this in some kind of "list of pre-defined adapter types" along with the info above
+class WhereToRemove(Enum):
+    PREFIX = 1
+    SUFFIX = 2
+    AUTO = 3
+
+
 WHERE_TO_REMOVE_MAP = {
-    Where.PREFIX: 'prefix',
-    Where.FRONT_NOT_INTERNAL: 'prefix',
-    Where.FRONT: 'prefix',
-    Where.BACK: 'suffix',
-    Where.SUFFIX: 'suffix',
-    Where.BACK_NOT_INTERNAL: 'suffix',
-    Where.ANYWHERE: 'auto',
+    Where.PREFIX: WhereToRemove.PREFIX,
+    Where.FRONT_NOT_INTERNAL: WhereToRemove.PREFIX,
+    Where.FRONT: WhereToRemove.PREFIX,
+    Where.BACK: WhereToRemove.SUFFIX,
+    Where.SUFFIX: WhereToRemove.SUFFIX,
+    Where.BACK_NOT_INTERNAL: WhereToRemove.SUFFIX,
+    Where.ANYWHERE: WhereToRemove.AUTO,
 }
 
 
+# TODO could become a property/attribute of the Adapter classes
 ADAPTER_TYPE_NAMES = {
     Where.BACK: "regular 3'",
     Where.BACK_NOT_INTERNAL: "non-internal 3'",
@@ -62,15 +70,15 @@ def returns_defaultdict_int():
 class EndStatistics:
     """Statistics about the 5' or 3' end"""
 
-    def __init__(self, adapter):
-        self.where = adapter.where
-        self.max_error_rate = adapter.max_error_rate
-        self.sequence = adapter.sequence
-        self.effective_length = adapter.effective_length
-        self.has_wildcards = adapter.adapter_wildcards
+    def __init__(self, adapter: "SingleAdapter"):
+        self.where = adapter.where  # type: Where
+        self.max_error_rate = adapter.max_error_rate  # type: float
+        self.sequence = adapter.sequence  # type: str
+        self.effective_length = adapter.effective_length  # type: int
+        self.has_wildcards = adapter.adapter_wildcards  # type: bool
         # self.errors[l][e] == n iff n times a sequence of length l matching at e errors was removed
-        self.errors = defaultdict(returns_defaultdict_int)
-        self._remove = adapter.remove
+        self.errors = defaultdict(returns_defaultdict_int)  # type: Dict[int, Dict[int, int]]
+        self._remove = adapter.remove  # type: Optional[WhereToRemove]
         self.adjacent_bases = {'A': 0, 'C': 0, 'G': 0, 'T': 0, '': 0}
 
     def __repr__(self):
@@ -82,7 +90,7 @@ class EndStatistics:
             self.adjacent_bases,
         )
 
-    def __iadd__(self, other):
+    def __iadd__(self, other: Any):
         if not isinstance(other, self.__class__):
             raise ValueError("Cannot compare")
         if (
@@ -119,16 +127,16 @@ class EndStatistics:
         """
         seq = self.sequence
         # FIXME this is broken for self._remove == 'auto'
-        if self._remove == 'prefix':
+        if self._remove == WhereToRemove.PREFIX:
             seq = seq[::-1]
         allowed_bases = 'CGRYSKMBDHVN' if self.has_wildcards else 'GC'
-        p = 1
+        p = 1.
         probabilities = [p]
         for i, c in enumerate(seq):
             if c in allowed_bases:
                 p *= gc_content / 2.
             else:
-                p *= (1 - gc_content) / 2
+                p *= (1. - gc_content) / 2.
             probabilities.append(p)
         return probabilities
 
@@ -137,13 +145,19 @@ class AdapterStatistics:
     """
     Statistics about an adapter. An adapter can work on the 5' end (front)
     or 3' end (back) of a read, and statistics for that are captured
-    separately.
+    separately in EndStatistics objects.
     """
 
-    def __init__(self, adapter, adapter2=None, where=None):
+    def __init__(
+        self,
+        adapter: "SingleAdapter",
+        adapter2: Optional["SingleAdapter"] = None,
+        where: Optional[Where] = None,
+    ):
         self.name = adapter.name
         self.where = where if where is not None else adapter.where
         self.front = EndStatistics(adapter)
+        self.reverse_complemented = 0
         if adapter2 is None:
             self.back = EndStatistics(adapter)
         else:
@@ -157,15 +171,26 @@ class AdapterStatistics:
             self.back,
         )
 
-    def __iadd__(self, other):
+    def __iadd__(self, other: "AdapterStatistics"):
         if self.where != other.where:  # TODO self.name != other.name or
             raise ValueError('incompatible objects')
         self.front += other.front
         self.back += other.back
+        self.reverse_complemented += other.reverse_complemented
         return self
 
 
-class Match:
+class Match(ABC):
+    @abstractmethod
+    def remainder_interval(self) -> Tuple[int, int]:
+        pass
+
+    @abstractmethod
+    def get_info_records(self) -> List[List]:
+        pass
+
+
+class SingleMatch(Match):
     """
     Representation of a single adapter matched to a single read.
     """
@@ -173,37 +198,48 @@ class Match:
         'adapter', 'read', 'length', '_trimmed_read', 'adjacent_base']
 
     # TODO Can remove_before be removed from the constructor parameters?
-    def __init__(self, astart, astop, rstart, rstop, matches, errors, remove_before, adapter, read):
+    def __init__(
+        self,
+        astart: int,
+        astop: int,
+        rstart: int,
+        rstop: int,
+        matches: int,
+        errors: int,
+        remove_before: bool,
+        adapter: "SingleAdapter",
+        read,
+    ):
         """
         remove_before -- True: remove bases before adapter. False: remove after
         """
-        self.astart = astart
-        self.astop = astop
-        self.rstart = rstart
-        self.rstop = rstop
-        self.matches = matches
-        self.errors = errors
-        self.adapter = adapter
+        self.astart = astart  # type: int
+        self.astop = astop  # type: int
+        self.rstart = rstart  # type: int
+        self.rstop = rstop  # type: int
+        self.matches = matches  # type: int
+        self.errors = errors  # type: int
+        self.adapter = adapter  # type: SingleAdapter
         self.read = read
         if remove_before:
             # Compute the trimmed read, assuming it’s a 'front' adapter
             self._trimmed_read = read[rstop:]
-            self.adjacent_base = ''
+            self.adjacent_base = ""  # type: str
         else:
             # Compute the trimmed read, assuming it’s a 'back' adapter
             self.adjacent_base = read.sequence[rstart - 1:rstart]
             self._trimmed_read = read[:rstart]
-        self.remove_before = remove_before
+        self.remove_before = remove_before  # type: bool
         # Number of aligned characters in the adapter. If there are
         # indels, this may be different from the number of characters
         # in the read.
-        self.length = astop - astart
+        self.length = astop - astart  # type: int
 
     def __repr__(self):
-        return 'Match(astart={}, astop={}, rstart={}, rstop={}, matches={}, errors={})'.format(
+        return 'SingleMatch(astart={}, astop={}, rstart={}, rstop={}, matches={}, errors={})'.format(
             self.astart, self.astop, self.rstart, self.rstop, self.matches, self.errors)
 
-    def wildcards(self, wildcard_char='N'):
+    def wildcards(self, wildcard_char: str = "N") -> str:
         """
         Return a string that contains, for each wildcard character,
         the character that it matches. For example, if the adapter
@@ -217,7 +253,7 @@ class Match:
                 self.rstart + i < len(self.read.sequence)]
         return ''.join(wildcards)
 
-    def rest(self):
+    def rest(self) -> str:
         """
         Return the part of the read before this match if this is a
         'front' (5') adapter,
@@ -229,10 +265,20 @@ class Match:
         else:
             return self.read.sequence[self.rstop:]
 
-    def get_info_record(self):
+    def remainder_interval(self) -> Tuple[int, int]:
+        """
+        Return an interval (start, stop) that describes the part of the read that would
+        remain after trimming
+        """
+        if self.remove_before:
+            return self.rstop, len(self.read.sequence)
+        else:
+            return 0, self.rstart
+
+    def get_info_records(self) -> List[List]:
         seq = self.read.sequence
         qualities = self.read.qualities
-        info = (
+        info = [
             self.read.name,
             self.errors,
             self.rstart,
@@ -240,23 +286,23 @@ class Match:
             seq[0:self.rstart],
             seq[self.rstart:self.rstop],
             seq[self.rstop:],
-            self.adapter.name
-        )
+            self.adapter.name,
+        ]
         if qualities:
-            info += (
+            info += [
                 qualities[0:self.rstart],
                 qualities[self.rstart:self.rstop],
                 qualities[self.rstop:]
-            )
+            ]
         else:
-            info += ('', '', '')
+            info += ["", "", ""]
 
-        return info
+        return [info]
 
     def trimmed(self):
         return self._trimmed_read
 
-    def update_statistics(self, statistics):
+    def update_statistics(self, statistics: AdapterStatistics):
         """Update AdapterStatistics in place"""
         if self.remove_before:
             statistics.front.errors[self.rstop][self.errors] += 1
@@ -268,13 +314,26 @@ class Match:
                 statistics.back.adjacent_bases[''] = 1
 
 
-def _generate_adapter_name(_start=[1]):
+def _generate_adapter_name(_start=[1]) -> str:
     name = str(_start[0])
     _start[0] += 1
     return name
 
 
-class Adapter:
+class Adapter(ABC):
+    def __init__(self, *args, **kwargs):
+        pass
+
+    @abstractmethod
+    def enable_debug(self):
+        pass
+
+    @abstractmethod
+    def match_to(self, read):
+        pass
+
+
+class SingleAdapter(Adapter):
     """
     This class can find a single adapter characterized by sequence, error rate,
     type etc. within reads.
@@ -282,11 +341,12 @@ class Adapter:
     where --  A Where enum value. This influences where the adapter is allowed to appear within the
         read.
 
-    remove -- describes which part of the read to remove if the adapter was found:
-          * "prefix" (for a 3' adapter)
-          * "suffix" (for a 5' adapter)
-          * "auto" for a 5'/3' mixed adapter (if the match involves the first base of the read, it
-            is assumed to be a 5' adapter and a 3' otherwise)
+    remove -- a WhereToRemove enum value. This describes which part of the read to remove if the
+        adapter was found:
+          * WhereToRemove.PREFIX (for a 3' adapter)
+          * WhereToRemove.SUFFIX (for a 5' adapter)
+          * WhereToRemove.AUTO for a 5'/3' mixed adapter (if the match involves the first base of
+            the read, it is assumed to be a 5' adapter and a 3' otherwise)
           * None: One of the above is chosen depending on the 'where' parameter
 
     sequence -- The adapter sequence as string. Will be converted to uppercase.
@@ -308,19 +368,28 @@ class Adapter:
         unique number.
     """
 
-    def __init__(self, sequence, where, remove=None, max_error_rate=0.1, min_overlap=3,
-            read_wildcards=False, adapter_wildcards=True, name=None, indels=True):
-        self._debug = False
-        self.name = _generate_adapter_name() if name is None else name
-        self.sequence = sequence.upper().replace('U', 'T')
+    def __init__(
+        self,
+        sequence: str,
+        where: Where,
+        remove: Optional[WhereToRemove] = None,
+        max_error_rate: float = 0.1,
+        min_overlap: int = 3,
+        read_wildcards: bool = False,
+        adapter_wildcards: bool = True,
+        name: Optional[str] = None,
+        indels: bool = True,
+    ):
+        super().__init__()
+        self._debug = False  # type: bool
+        self.name = _generate_adapter_name() if name is None else name  # type: str
+        self.sequence = sequence.upper().replace("U", "T")  # type: str
         if not self.sequence:
             raise ValueError("Adapter sequence is empty")
-        self.where = where
-        if remove not in (None, 'prefix', 'suffix', 'auto'):
-            raise ValueError('remove parameter must be "prefix", "suffix", "auto" or None')
-        self.remove = WHERE_TO_REMOVE_MAP[where] if remove is None else remove
-        self.max_error_rate = max_error_rate
-        self.min_overlap = min(min_overlap, len(self.sequence))
+        self.where = where  # type: Where
+        self.remove = WHERE_TO_REMOVE_MAP[where] if remove is None else remove  # type: WhereToRemove
+        self.max_error_rate = max_error_rate  # type: float
+        self.min_overlap = min(min_overlap, len(self.sequence))  # type: int
         iupac = frozenset('XACGTURYSWKMBDHVN')
         if adapter_wildcards and not set(self.sequence) <= iupac:
             for c in self.sequence:
@@ -329,9 +398,9 @@ class Adapter:
                         'not a valid IUPAC code. Use only characters '
                         'XACGTURYSWKMBDHVN.'.format(c, self.sequence))
         # Optimization: Use non-wildcard matching if only ACGT is used
-        self.adapter_wildcards = adapter_wildcards and not set(self.sequence) <= set('ACGT')
-        self.read_wildcards = read_wildcards
-        self.indels = indels
+        self.adapter_wildcards = adapter_wildcards and not set(self.sequence) <= set("ACGT")  # type: bool
+        self.read_wildcards = read_wildcards  # type: bool
+        self.indels = indels  # type: bool
         if self.is_anchored and not self.indels:
             aligner_class = align.PrefixComparer if self.where is Where.PREFIX else align.SuffixComparer
             self.aligner = aligner_class(
@@ -357,7 +426,7 @@ class Adapter:
             )
 
     def __repr__(self):
-        return '<Adapter(name={name!r}, sequence={sequence!r}, where={where}, '\
+        return '<SingleAdapter(name={name!r}, sequence={sequence!r}, where={where}, '\
             'remove={remove}, max_error_rate={max_error_rate}, min_overlap={min_overlap}, '\
             'read_wildcards={read_wildcards}, '\
             'adapter_wildcards={adapter_wildcards}, '\
@@ -420,12 +489,12 @@ class Adapter:
 
         if match_args is None:
             return None
-        if self.remove == 'auto':
+        if self.remove == WhereToRemove.AUTO:
             # guess: if alignment starts at pos 0, it’s a 5' adapter
             remove_before = match_args[2] == 0  # index 2 is rstart
         else:
-            remove_before = self.remove == 'prefix'
-        match = Match(*match_args, remove_before=remove_before, adapter=self, read=read)
+            remove_before = self.remove == WhereToRemove.PREFIX
+        match = SingleMatch(*match_args, remove_before=remove_before, adapter=self, read=read)
 
         assert match.length >= self.min_overlap
         return match
@@ -437,7 +506,7 @@ class Adapter:
         return AdapterStatistics(self)
 
 
-class BackOrFrontAdapter(Adapter):
+class BackOrFrontAdapter(SingleAdapter):
     """A 5' or 3' adapter.
 
     This is separate from the Adapter class so that a specialized match_to
@@ -450,7 +519,7 @@ class BackOrFrontAdapter(Adapter):
     def __init__(self, *args, **kwargs):
         super().__init__(*args, **kwargs)
         assert self.where is Where.BACK or self.where is Where.FRONT
-        self._remove_before = self.remove == 'prefix'
+        self._remove_before = self.remove is WhereToRemove.PREFIX
 
     def match_to(self, read):
         """
@@ -477,21 +546,19 @@ class BackOrFrontAdapter(Adapter):
         if alignment is None:
             return None
 
-        match = Match(*alignment, remove_before=self._remove_before, adapter=self, read=read)
+        match = SingleMatch(*alignment, remove_before=self._remove_before, adapter=self, read=read)
         return match
 
 
-class LinkedMatch:
+class LinkedMatch(Match):
     """
     Represent a match of a LinkedAdapter
     """
-    def __init__(self, front_match, back_match, adapter):
-        """
-        One of front_match and back_match must be not None!
-        """
-        self.front_match = front_match
-        self.back_match = back_match
-        self.adapter = adapter
+    def __init__(self, front_match: SingleMatch, back_match: SingleMatch, adapter: "LinkedAdapter"):
+        assert front_match is not None or back_match is not None
+        self.front_match = front_match  # type: SingleMatch
+        self.back_match = back_match  # type: SingleMatch
+        self.adapter = adapter  # type: LinkedAdapter
 
     def __repr__(self):
         return '<LinkedMatch(front_match={!r}, back_match={}, adapter={})>'.format(
@@ -536,8 +603,25 @@ class LinkedMatch:
         if self.back_match:
             statistics.back.errors[len(self.back_match.read) - self.back_match.rstart][self.back_match.errors] += 1
 
+    def remainder_interval(self) -> Tuple[int, int]:
+        matches = [match for match in [self.front_match, self.back_match] if match is not None]
+        return remainder(matches)
+
+    def get_info_records(self) -> List[List]:
+        records = []
+        for match, namesuffix in [
+            (self.front_match, ";1"),
+            (self.back_match, ";2"),
+        ]:
+            if match is None:
+                continue
+            record = match.get_info_records()[0]
+            record[7] = self.adapter.name + namesuffix
+            records.append(record)
+        return records
+
 
-class LinkedAdapter:
+class LinkedAdapter(Adapter):
     """
     """
     def __init__(
@@ -548,6 +632,7 @@ class LinkedAdapter:
         back_required,
         name,
     ):
+        super().__init__()
         self.front_required = front_required
         self.back_required = back_required
 
@@ -592,7 +677,7 @@ class LinkedAdapter:
         return None
 
 
-class MultiAdapter:
+class MultiAdapter(Adapter):
     """
     Represent multiple adapters of the same type at once and use an index data structure
     to speed up matching. This acts like a "normal" Adapter as it provides a match_to
@@ -610,6 +695,7 @@ class MultiAdapter:
 
     def __init__(self, adapters):
         """All given adapters must be of the same type, either Where.PREFIX or Where.SUFFIX"""
+        super().__init__()
         if not adapters:
             raise ValueError("Adapter list is empty")
         self._where = adapters[0].where
@@ -723,18 +809,21 @@ class MultiAdapter:
             else:
                 assert self._where is Where.SUFFIX
                 rstart, rstop = len(read) - best_length, len(read)
-            return Match(
+            return SingleMatch(
                 astart=0,
                 astop=len(best_adapter.sequence),
                 rstart=rstart,
                 rstop=rstop,
                 matches=best_m,
                 errors=best_e,
-                remove_before=best_adapter.remove == 'prefix',
+                remove_before=best_adapter.remove is WhereToRemove.PREFIX,
                 adapter=best_adapter,
                 read=read
             )
 
+    def enable_debug(self):
+        pass
+
 
 def warn_duplicate_adapters(adapters):
     d = dict()
@@ -745,3 +834,20 @@ def warn_duplicate_adapters(adapters):
                 "Please make sure that this is what you want.",
                 adapter.sequence, ADAPTER_TYPE_NAMES[adapter.where])
         d[key] = adapter.name
+
+
+def remainder(matches: Sequence[Match]) -> Tuple[int, int]:
+    """
+    Determine which section of the read would not be trimmed. Return a tuple (start, stop)
+    that gives the interval of the untrimmed part relative to the original read.
+
+    matches must be non-empty
+    """
+    if not matches:
+        raise ValueError("matches must not be empty")
+    start = 0
+    for match in matches:
+        match_start, match_stop = match.remainder_interval()
+        start += match_start
+    length = match_stop - match_start
+    return (start, start + length)
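
As a small illustration of the new module-level remainder() helper (a sketch, not
upstream code): it only calls remainder_interval() on the objects it is given, and each
interval is relative to the read as it looked in that round of trimming, which is why
only the start offsets are accumulated:

    # Sketch: FakeMatch is a hypothetical stand-in that only provides
    # remainder_interval(), which is all remainder() needs.
    from cutadapt.adapters import remainder

    class FakeMatch:
        def __init__(self, interval):
            self._interval = interval

        def remainder_interval(self):
            return self._interval

    # Round 1: a 5' match removed the first 4 bases of a 50-base read -> (4, 50).
    # Round 2: a 3' match on the shortened read kept its first 30 bases -> (0, 30).
    print(remainder([FakeMatch((4, 50)), FakeMatch((0, 30))]))  # (4, 34)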


=====================================
src/cutadapt/filters.py
=====================================
@@ -14,6 +14,8 @@ The read is then assumed to have been "consumed", that is, either written
 somewhere or filtered (should be discarded).
 """
 from abc import ABC, abstractmethod
+from typing import List
+from .adapters import Match
 
 
 # Constants used when returning from a Filter’s __call__ method to improve
@@ -403,11 +405,11 @@ class InfoFileWriter(SingleEndFilter):
     def __init__(self, file):
         self.file = file
 
-    def __call__(self, read, matches):
+    def __call__(self, read, matches: List[Match]):
         if matches:
             for match in matches:
-                info_record = match.get_info_record()
-                print(*info_record, sep='\t', file=self.file)
+                for info_record in match.get_info_records():
+                    print(*info_record, sep='\t', file=self.file)
         else:
             seq = read.sequence
             qualities = read.qualities if read.qualities is not None else ''


=====================================
src/cutadapt/modifiers.py
=====================================
@@ -4,19 +4,34 @@ A modifier must be callable and typically implemented as a class with a
 __call__ method.
 """
 import re
+from typing import Sequence, List, Optional
 from abc import ABC, abstractmethod
 from collections import OrderedDict
+
 from .qualtrim import quality_trim_index, nextseq_trim_index
-from .adapters import Where, MultiAdapter
+from .adapters import Where, MultiAdapter, Match, remainder
+from .utils import reverse_complemented_sequence
+
+
+class Modifier(ABC):
+    @abstractmethod
+    def __call__(self, read, matches: List[Match]):
+        pass
+
 
+class PairedModifier(ABC):
+    @abstractmethod
+    def __call__(self, read1, read2, matches1, matches2):
+        pass
 
-class PairedModifier:
+
+class PairedModifierWrapper(PairedModifier):
     """
     Wrapper for modifiers that work on both reads in a paired-end read
     """
     paired = True
 
-    def __init__(self, modifier1, modifier2):
+    def __init__(self, modifier1: Optional[Modifier], modifier2: Optional[Modifier]):
         """Set one of the modifiers to None to work on R1 or R2 only"""
         self._modifier1 = modifier1
         self._modifier2 = modifier2
@@ -33,12 +48,6 @@ class PairedModifier:
         return self._modifier1(read1, matches1), self._modifier2(read2, matches2)
 
 
-class Modifier(ABC):
-    @abstractmethod
-    def __call__(self, read, matches):
-        pass
-
-
 class AdapterCutter(Modifier):
     """
     Repeatedly find one of multiple adapters in reads.
@@ -116,30 +125,8 @@ class AdapterCutter(Modifier):
         return best_match
 
     @staticmethod
-    def remainder(matches):
-        """
-        Determine which part of the read was not trimmed. Return a tuple (start, stop)
-        that gives the interval of the untrimmed part relative to the original read.
-
-        matches is a list of Match objects. The original read is assumed to be
-        matches[0].read
-        """
-        # Start with the full read
-        read = matches[0].read
-        start, stop = 0, len(read)
-        for match in matches:
-            if match.remove_before:
-                # Length of the prefix that was removed
-                start += match.rstop
-            else:
-                # Length of the suffix that was removed
-                stop -= len(match.read) - match.rstart
-        return (start, stop)
-
-    @staticmethod
-    def masked_read(trimmed_read, matches):
-        start, stop = AdapterCutter.remainder(matches)
-        read = matches[0].read
+    def masked_read(read, trimmed_read, matches: Sequence[Match]):
+        start, stop = remainder(matches)
         # TODO modification in place
         trimmed_read.sequence = (
             'N' * start
@@ -149,19 +136,19 @@ class AdapterCutter(Modifier):
         return trimmed_read
 
     @staticmethod
-    def lowercased_read(trimmed_read, matches):
-        start, stop = AdapterCutter.remainder(matches)
-        read_sequence = matches[0].read.sequence
+    def lowercased_read(read, trimmed_read, matches: Sequence[Match]):
+        start, stop = remainder(matches)
+        read_sequence = read.sequence
         # TODO modification in place
         trimmed_read.sequence = (
             read_sequence[:start].lower()
             + read_sequence[start:stop].upper()
             + read_sequence[stop:].lower()
         )
-        trimmed_read.qualities = matches[0].read.qualities
+        trimmed_read.qualities = read.qualities
         return trimmed_read
 
-    def __call__(self, read, inmatches):
+    def __call__(self, read, inmatches: List[Match]):
         trimmed_read, matches = self.match_and_trim(read)
         if matches:
             self.with_adapters += 1
@@ -183,9 +170,9 @@ class AdapterCutter(Modifier):
         Return a pair (trimmed_read, matches), where matches is a list of Match instances.
         """
         matches = []
-        trimmed_read = read
         if self.action == 'lowercase':
-            trimmed_read.sequence = trimmed_read.sequence.upper()
+            read.sequence = read.sequence.upper()
+        trimmed_read = read
         for _ in range(self.times):
             match = AdapterCutter.best_match(self.adapters, trimmed_read)
             if match is None:
@@ -201,9 +188,9 @@ class AdapterCutter(Modifier):
             # read is already trimmed, nothing to do
             pass
         elif self.action == 'mask':
-            trimmed_read = self.masked_read(trimmed_read, matches)
+            trimmed_read = self.masked_read(read, trimmed_read, matches)
         elif self.action == 'lowercase':
-            trimmed_read = self.lowercased_read(trimmed_read, matches)
+            trimmed_read = self.lowercased_read(read, trimmed_read, matches)
             assert len(trimmed_read.sequence) == len(read)
         elif self.action is None:  # --no-trim
             trimmed_read = read[:]
@@ -211,11 +198,51 @@ class AdapterCutter(Modifier):
         return trimmed_read, matches
 
 
+class ReverseComplementer(Modifier):
+    """Trim adapters from a read and its reverse complement"""
+
+    def __init__(self, adapter_cutter: AdapterCutter, rc_suffix: Optional[str] = " rc"):
+        """
+        rc_suffix -- suffix to add to the read name if sequence was reverse-complemented
+        """
+        self.adapter_cutter = adapter_cutter
+        self.reverse_complemented = 0
+        self._suffix = rc_suffix
+
+    def __call__(self, read, inmatches: List[Match]):
+        reverse_read = reverse_complemented_sequence(read)
+
+        forward_trimmed_read, forward_matches = self.adapter_cutter.match_and_trim(read)
+        reverse_trimmed_read, reverse_matches = self.adapter_cutter.match_and_trim(reverse_read)
+
+        forward_match_count = sum(m.matches for m in forward_matches)
+        reverse_match_count = sum(m.matches for m in reverse_matches)
+        use_reverse_complement = reverse_match_count > forward_match_count
+
+        if use_reverse_complement:
+            self.reverse_complemented += 1
+            assert reverse_matches
+            trimmed_read, matches = reverse_trimmed_read, reverse_matches
+            if self._suffix:
+                trimmed_read.name += self._suffix
+        else:
+            trimmed_read, matches = forward_trimmed_read, forward_matches
+
+        if matches:
+            self.adapter_cutter.with_adapters += 1
+            for match in matches:
+                stats = self.adapter_cutter.adapter_statistics[match.adapter]
+                match.update_statistics(stats)
+                stats.reverse_complemented += bool(use_reverse_complement)
+            inmatches.extend(matches)
+        return trimmed_read
+
+
 class PairedAdapterCutterError(Exception):
     pass
 
 
-class PairedAdapterCutter:
+class PairedAdapterCutter(PairedModifier):
     """
     A Modifier that trims adapter pairs from R1 and R2.
     """
@@ -230,6 +257,7 @@ class PairedAdapterCutter:
 
         action -- What to do with a found adapter: None, 'trim', or 'mask'
         """
+        super().__init__()
         if len(adapters1) != len(adapters2):
             raise PairedAdapterCutterError(
                 "The number of reads to trim from R1 and R2 must be the same. "
@@ -294,10 +322,10 @@ class UnconditionalCutter(Modifier):
     If the length is positive, the bases are removed from the beginning of the read.
     If the length is negative, the bases are removed from the end of the read.
     """
-    def __init__(self, length):
+    def __init__(self, length: int):
         self.length = length
 
-    def __call__(self, read, matches):
+    def __call__(self, read, matches: List[Match]):
         if self.length > 0:
             return read[self.length:]
         elif self.length < 0:

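The new ReverseComplementer above implements the --revcomp behaviour announced in the 2.8 changelog: the read and its reverse complement are both trimmed and the orientation with more matching adapter bases is kept, with " rc" appended to the name when the reverse complement wins. A minimal standalone sketch of that selection step follows; it is not the cutadapt API, the naive substring count stands in for cutadapt's alignment, and the adapter sequence is illustrative (the read is read2/1 from tests/cut/revcomp-single.fastq below).

    # Minimal sketch of the strand-selection idea behind --revcomp.
    _COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

    def reverse_complement(seq):
        return seq.translate(_COMPLEMENT)[::-1]

    def naive_match_count(seq, adapter):
        # stand-in for cutadapt's alignment-based count of matching bases
        return len(adapter) if adapter in seq else 0

    def normalize_strand(name, seq, adapter, rc_suffix=" rc"):
        rc_seq = reverse_complement(seq)
        forward = naive_match_count(seq, adapter)
        reverse = naive_match_count(rc_seq, adapter)
        if reverse > forward:
            return name + rc_suffix, rc_seq   # keep the reverse complement
        return name, seq                      # keep the read as-is

    print(normalize_strand("read2/1", "CAACAGGCCACATTAGACATATCGGATGGT", "ACCATCCGAT"))
    # -> ('read2/1 rc', 'ACCATCCGATATGTCTAATGTGGCCTGTTG')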

=====================================
src/cutadapt/parser.py
=====================================
@@ -3,9 +3,10 @@ Parse adapter specifications
 """
 import re
 import logging
+from typing import Type, Optional, List, Tuple, Iterator, Any, Dict
 from xopen import xopen
 from dnaio.readers import FastaReader
-from .adapters import Where, WHERE_TO_REMOVE_MAP, Adapter, BackOrFrontAdapter, LinkedAdapter
+from .adapters import Where, WHERE_TO_REMOVE_MAP, Adapter, SingleAdapter, BackOrFrontAdapter, LinkedAdapter
 
 logger = logging.getLogger(__name__)
 
@@ -26,7 +27,14 @@ class AdapterSpecification:
     AdapterSpecification(name='a_name', restriction=None, sequence='ACGT', parameters={'anywhere': True}, cmdline_type='back')
     """
 
-    def __init__(self, name, restriction, sequence, parameters, cmdline_type):
+    def __init__(
+        self,
+        name: str,
+        restriction: Optional[str],
+        sequence: str,
+        parameters,
+        cmdline_type: str,
+    ):
         assert restriction in (None, "anchored", "noninternal")
         assert cmdline_type in ("front", "back", "anywhere")
         self.name = name
@@ -55,7 +63,7 @@ class AdapterSpecification:
         )
 
     @staticmethod
-    def expand_braces(sequence):
+    def expand_braces(sequence: str) -> str:
         """
         Replace all occurrences of ``x{n}`` (where x is any character) with n
         occurrences of x. Raise ValueError if the expression cannot be parsed.
@@ -95,16 +103,15 @@ class AdapterSpecification:
         return result
 
     @staticmethod
-    def _extract_name(spec):
+    def _extract_name(spec: str) -> Tuple[Optional[str], str]:
         """
         Parse an adapter specification given as 'name=adapt' into 'name' and 'adapt'.
         """
         fields = spec.split('=', 1)
+        name = None  # type: Optional[str]
         if len(fields) > 1:
             name, spec = fields
             name = name.strip()
-        else:
-            name = None
         spec = spec.strip()
         return name, spec
 
@@ -123,16 +130,16 @@ class AdapterSpecification:
     }
 
     @classmethod
-    def _parse_parameters(cls, spec):
+    def _parse_parameters(cls, spec: str):
         """Parse key=value;key=value;key=value into a dict"""
 
         fields = spec.split(';')
-        result = dict()
+        result = dict()  # type: Dict[str,Any]
         for field in fields:
             field = field.strip()
             if not field:
                 continue
-            key, equals, value = field.partition('=')
+            key, equals, value = field.partition('=')  # type: (str, str, Any)
             if equals == '=' and value == '':
                 raise ValueError('No value given')
             key = key.strip()
@@ -140,7 +147,7 @@ class AdapterSpecification:
                 raise KeyError('Unknown parameter {}'.format(key))
             # unabbreviate
             while cls.allowed_parameters[key] is not None:
-                key = cls.allowed_parameters[key]
+                key = cls.allowed_parameters[key]  # type: ignore
             value = value.strip()
             if value == '':
                 value = True
@@ -274,7 +281,7 @@ class AdapterParser:
         # kwargs: max_error_rate, min_overlap, read_wildcards, adapter_wildcards, indels
         self.default_parameters = kwargs
 
-    def _parse(self, spec: str, cmdline_type='back', name=None):
+    def _parse(self, spec: str, cmdline_type: str = "back", name: Optional[str] = None) -> Adapter:
         """
         Parse an adapter specification not using ``file:`` notation and return
         an object of an appropriate Adapter class.
@@ -298,7 +305,7 @@ class AdapterParser:
         return self._parse_not_linked(spec, name, cmdline_type)
 
     @staticmethod
-    def _normalize_ellipsis(spec1: str, spec2: str, cmdline_type):
+    def _normalize_ellipsis(spec1: str, spec2: str, cmdline_type) -> Tuple[str, str]:
         if not spec1:
             if cmdline_type == 'back':
                 # -a ...ADAPTER
@@ -318,23 +325,23 @@ class AdapterParser:
             raise ValueError("Expected either spec1 or spec2")
         return spec, cmdline_type
 
-    def _parse_not_linked(self, spec: str, name, cmdline_type):
-        spec = AdapterSpecification.parse(spec, cmdline_type)
-        where = spec.where()
+    def _parse_not_linked(self, spec: str, name: Optional[str], cmdline_type: str) -> Adapter:
+        aspec = AdapterSpecification.parse(spec, cmdline_type)
+        where = aspec.where()
         if not name:
-            name = spec.name
-        if spec.parameters.pop('anywhere', False):
-            spec.parameters['remove'] = WHERE_TO_REMOVE_MAP[where]
+            name = aspec.name
+        if aspec.parameters.pop('anywhere', False):
+            aspec.parameters['remove'] = WHERE_TO_REMOVE_MAP[where]
             where = Where.ANYWHERE
         parameters = self.default_parameters.copy()
-        parameters.update(spec.parameters)
+        parameters.update(aspec.parameters)
         if where in (Where.FRONT, Where.BACK):
-            adapter_class = BackOrFrontAdapter
+            adapter_class = BackOrFrontAdapter  # type: Type[Adapter]
         else:
-            adapter_class = Adapter
-        return adapter_class(sequence=spec.sequence, where=where, name=name, **parameters)
+            adapter_class = SingleAdapter
+        return adapter_class(sequence=aspec.sequence, where=where, name=name, **parameters)
 
-    def _parse_linked(self, spec1: str, spec2: str, name, cmdline_type):
+    def _parse_linked(self, spec1: str, spec2: str, name: Optional[str], cmdline_type: str) -> LinkedAdapter:
         """Return a linked adapter from two specification strings"""
 
         if cmdline_type == 'anywhere':
@@ -375,9 +382,9 @@ class AdapterParser:
         front_required = front_parameters.pop('required', front_required)
         back_required = back_parameters.pop('required', back_required)
 
-        front_adapter = Adapter(front_spec.sequence, where=front_spec.where(), name=None,
+        front_adapter = SingleAdapter(front_spec.sequence, where=front_spec.where(), name=None,
             **front_parameters)
-        back_adapter = Adapter(back_spec.sequence, where=back_spec.where(), name=None,
+        back_adapter = SingleAdapter(back_spec.sequence, where=back_spec.where(), name=None,
             **back_parameters)
 
         return LinkedAdapter(
@@ -388,7 +395,7 @@ class AdapterParser:
             name=name,
         )
 
-    def parse(self, spec, cmdline_type='back'):
+    def parse(self, spec: str, cmdline_type: str = 'back') -> Iterator[Adapter]:
         """
         Parse an adapter specification and yield appropriate Adapter classes.
         This works like the _parse_no_file() function above, but also supports the
@@ -407,7 +414,7 @@ class AdapterParser:
         else:
             yield self._parse(spec, cmdline_type, name=None)
 
-    def parse_multi(self, type_spec_pairs):
+    def parse_multi(self, type_spec_pairs: List[Tuple[str, str]]) -> List[Adapter]:
         """
         Parse all three types of commandline options that can be used to
         specify adapters. adapters must be a list of (str, str) pairs, where the first is
@@ -416,7 +423,7 @@ class AdapterParser:
 
         Return a list of appropriate Adapter classes.
         """
-        adapters = []
+        adapters = []  # type: List[Adapter]
         for cmdline_type, spec in type_spec_pairs:
             if cmdline_type not in {'front', 'back', 'anywhere'}:
                 raise ValueError('adapter type must be front, back or anywhere')

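AdapterSpecification.expand_braces, touched above, turns x{n} into n copies of x. A minimal re-implementation of that expansion, for illustration only (the real method additionally validates malformed expressions and raises ValueError):

    import re

    def expand_braces(sequence):
        # "T{3}" becomes "TTT": the character before "{n}" repeated n times
        return re.sub(r"(.)\{(\d+)\}", lambda m: m.group(1) * int(m.group(2)), sequence)

    assert expand_braces("ACGT{3}AA") == "ACGTTTAA"
    assert expand_braces("N{5}") == "NNNNN"
    assert expand_braces("ACGT") == "ACGT"   # nothing to expand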

=====================================
src/cutadapt/pipeline.py
=====================================
@@ -4,17 +4,19 @@ import sys
 import copy
 import logging
 import functools
+from typing import List, IO, Optional, BinaryIO, TextIO, Any, Tuple, Union
 from abc import ABC, abstractmethod
 from multiprocessing import Process, Pipe, Queue
 from pathlib import Path
 import multiprocessing.connection
+from multiprocessing.connection import Connection
 import traceback
 
 from xopen import xopen
 import dnaio
 
 from .utils import Progress, FileOpener
-from .modifiers import PairedModifier
+from .modifiers import Modifier, PairedModifier, PairedModifierWrapper
 from .report import Statistics
 from .filters import (Redirector, PairedRedirector, NoFilter, PairedNoFilter, InfoFileWriter,
     RestFileWriter, WildcardFileWriter, TooShortReadFilter, TooLongReadFilter, NContentFilter,
@@ -25,7 +27,7 @@ logger = logging.getLogger()
 
 
 class InputFiles:
-    def __init__(self, file1, file2=None, interleaved=False):
+    def __init__(self, file1: BinaryIO, file2: Optional[BinaryIO] = None, interleaved: bool = False):
         self.file1 = file1
         self.file2 = file2
         self.interleaved = interleaved
@@ -44,20 +46,20 @@ class OutputFiles:
     # TODO interleaving for the other file pairs (too_short, too_long, untrimmed)?
     def __init__(
         self,
-        out=None,
-        out2=None,
-        untrimmed=None,
-        untrimmed2=None,
-        too_short=None,
-        too_short2=None,
-        too_long=None,
-        too_long2=None,
-        info=None,
-        rest=None,
-        wildcard=None,
-        demultiplex=False,
-        interleaved=False,
-        force_fasta=None,
+        out: Optional[BinaryIO] = None,
+        out2: Optional[BinaryIO] = None,
+        untrimmed: Optional[BinaryIO] = None,
+        untrimmed2: Optional[BinaryIO] = None,
+        too_short: Optional[BinaryIO] = None,
+        too_short2: Optional[BinaryIO] = None,
+        too_long: Optional[BinaryIO] = None,
+        too_long2: Optional[BinaryIO] = None,
+        info: Optional[BinaryIO] = None,
+        rest: Optional[BinaryIO] = None,
+        wildcard: Optional[BinaryIO] = None,
+        demultiplex: bool = False,
+        interleaved: bool = False,
+        force_fasta: Optional[bool] = None,
     ):
         self.out = out
         self.out2 = out2
@@ -95,13 +97,14 @@ class Pipeline(ABC):
     n_adapters = 0
 
     def __init__(self, file_opener: FileOpener):
-        self._close_files = []
+        self._close_files = []  # type: List[IO]
         self._reader = None
-        self._filters = None
-        self._modifiers = []
-        self._outfiles = None
+        self._filters = []  # type: List[Any]
+        # TODO type should be Union[List[Modifier], List[PairedModifier]]
+        self._modifiers = []  # type: List[Union[Modifier, PairedModifier]]
+        self._outfiles = None  # type: Optional[OutputFiles]
         self._demultiplexer = None
-        self._textiowrappers = []
+        self._textiowrappers = []  # type: List[TextIO]
 
         # Filter settings
         self._minimum_length = None
@@ -117,7 +120,13 @@ class Pipeline(ABC):
             interleaved=infiles.interleaved, mode='r')
         self._set_output(outfiles)
 
-    def _open_writer(self, file, file2=None, force_fasta=None, **kwargs):
+    def _open_writer(
+        self,
+        file: BinaryIO,
+        file2: Optional[BinaryIO] = None,
+        force_fasta: Optional[bool] = None,
+        **kwargs
+    ):
         # TODO backwards-incompatible change (?) would be to use outfiles.interleaved
         # for all outputs
         if force_fasta:
@@ -125,7 +134,7 @@ class Pipeline(ABC):
         # file and file2 must already be file-like objects because we don’t want to
         # take care of threads and compression levels here.
         for f in (file, file2):
-            assert not (f is None and isinstance(f, (str, bytes, Path)))
+            assert not isinstance(f, (str, bytes, Path))
         return dnaio.open(file, file2=file2, mode='w', qualities=self.uses_qualities,
             **kwargs)
 
@@ -193,16 +202,18 @@ class Pipeline(ABC):
                     untrimmed_filter_wrapper(untrimmed_writer, DiscardUntrimmedFilter(), DiscardUntrimmedFilter()))
             self._filters.append(self._final_filter(outfiles))
 
-    def flush(self):
+    def flush(self) -> None:
         for f in self._textiowrappers:
             f.flush()
+        assert self._outfiles is not None
         for f in self._outfiles:
             if f is not None:
                 f.flush()
 
-    def close(self):
+    def close(self) -> None:
         for f in self._textiowrappers:
             f.close()  # This also closes the underlying files; a second close occurs below
+        assert self._outfiles is not None
         for f in self._outfiles:
             # TODO do not use hasattr
             if f is not None and f is not sys.stdin and f is not sys.stdout and hasattr(f, 'close'):
@@ -211,7 +222,8 @@ class Pipeline(ABC):
             self._demultiplexer.close()
 
     @property
-    def uses_qualities(self):
+    def uses_qualities(self) -> bool:
+        assert self._reader is not None
         return self._reader.delivers_qualities
 
     @abstractmethod
@@ -227,11 +239,11 @@ class Pipeline(ABC):
         pass
 
     @abstractmethod
-    def _final_filter(self, outfiles):
+    def _final_filter(self, outfiles: OutputFiles):
         pass
 
     @abstractmethod
-    def _create_demultiplexer(self, outfiles):
+    def _create_demultiplexer(self, outfiles: OutputFiles):
         pass
 
 
@@ -245,7 +257,7 @@ class SingleEndPipeline(Pipeline):
         super().__init__(file_opener)
         self._modifiers = []
 
-    def add(self, modifier):
+    def add(self, modifier: Modifier):
         if modifier is None:
             raise ValueError("Modifier must not be None")
         self._modifiers.append(modifier)
@@ -254,6 +266,7 @@ class SingleEndPipeline(Pipeline):
         """Run the pipeline. Return statistics"""
         n = 0  # no. of processed reads  # TODO turn into attribute
         total_bp = 0
+        assert self._reader is not None
         for read in self._reader:
             n += 1
             if n % 10000 == 0 and progress:
@@ -273,12 +286,12 @@ class SingleEndPipeline(Pipeline):
     def _untrimmed_filter_wrapper(self):
         return Redirector
 
-    def _final_filter(self, outfiles):
-        assert outfiles.out2 is None
+    def _final_filter(self, outfiles: OutputFiles):
+        assert outfiles.out2 is None and outfiles.out is not None
         writer = self._open_writer(outfiles.out, force_fasta=outfiles.force_fasta)
         return NoFilter(writer)
 
-    def _create_demultiplexer(self, outfiles):
+    def _create_demultiplexer(self, outfiles: OutputFiles):
         return Demultiplexer(outfiles.out, outfiles.untrimmed, qualities=self.uses_qualities,
             file_opener=self.file_opener)
 
@@ -314,30 +327,31 @@ class PairedEndPipeline(Pipeline):
         # Whether to ignore pair_filter mode for discard-untrimmed filter
         self.override_untrimmed_pair_filter = False
 
-    def add(self, modifier1, modifier2):
+    def add(self, modifier1: Optional[Modifier], modifier2: Optional[Modifier]):
         """
         Add a modifier for R1 and R2. One of them can be None, in which case the modifier
         will only be added for the respective read.
         """
         if modifier1 is None and modifier2 is None:
             raise ValueError("Not both modifiers can be None")
-        self._modifiers.append(PairedModifier(modifier1, modifier2))
+        self._modifiers.append(PairedModifierWrapper(modifier1, modifier2))
 
-    def add_both(self, modifier):
+    def add_both(self, modifier: Modifier):
         """
         Add one modifier for both R1 and R2
         """
         assert modifier is not None
-        self._modifiers.append(PairedModifier(modifier, copy.copy(modifier)))
+        self._modifiers.append(PairedModifierWrapper(modifier, copy.copy(modifier)))
 
-    def add_paired_modifier(self, paired_modifier):
-        """Add a Modifier without wrapping it in a PairedModifier"""
+    def add_paired_modifier(self, paired_modifier: PairedModifier):
+        """Add a Modifier (without wrapping it in a PairedModifierWrapper)"""
         self._modifiers.append(paired_modifier)
 
     def process_reads(self, progress: Progress = None):
         n = 0  # no. of processed reads
         total1_bp = 0
         total2_bp = 0
+        assert self._reader is not None
         for read1, read2 in self._reader:
             n += 1
             if n % 10000 == 0 and progress:
@@ -403,54 +417,66 @@ class PairedEndPipeline(Pipeline):
         self._maximum_length = value
 
 
-def reader_process(file, file2, connections, queue, buffer_size, stdin_fd):
+class ReaderProcess(Process):
     """
-    Read chunks of FASTA or FASTQ data from *file* and send to a worker.
-
-    queue -- a Queue of worker indices. A worker writes its own index into this
-        queue to notify the reader that it is ready to receive more data.
-    connections -- a list of Connection objects, one for each worker.
+    Read chunks of FASTA or FASTQ data (single-end or paired) and send to a worker.
 
-    The function repeatedly
+    The reader repeatedly
 
-    - reads a chunk from the file
+    - reads a chunk from the file(s)
     - reads a worker index from the Queue
     - sends the chunk to connections[index]
 
-    and finally sends "poison pills" (the value -1) to all connections.
+    and finally sends the stop token -1 ("poison pills") to all connections.
     """
 
-    def send_to_worker(chunk_index, chunk1, chunk2=None):
-        worker_index = queue.get()
-        connection = connections[worker_index]
+    def __init__(self, file, file2, connections, queue, buffer_size, stdin_fd):
+        """
+        queue -- a Queue of worker indices. A worker writes its own index into this
+            queue to notify the reader that it is ready to receive more data.
+        connections -- a list of Connection objects, one for each worker.
+        """
+        super().__init__()
+        self.file = file
+        self.file2 = file2
+        self.connections = connections
+        self.queue = queue
+        self.buffer_size = buffer_size
+        self.stdin_fd = stdin_fd
+
+    def run(self):
+        if self.stdin_fd != -1:
+            sys.stdin.close()
+            sys.stdin = os.fdopen(self.stdin_fd)
+        try:
+            with xopen(self.file, 'rb') as f:
+                if self.file2:
+                    with xopen(self.file2, 'rb') as f2:
+                        for chunk_index, (chunk1, chunk2) in enumerate(
+                                dnaio.read_paired_chunks(f, f2, self.buffer_size)):
+                            self.send_to_worker(chunk_index, chunk1, chunk2)
+                else:
+                    for chunk_index, chunk in enumerate(dnaio.read_chunks(f, self.buffer_size)):
+                        self.send_to_worker(chunk_index, chunk)
+
+            # Send poison pills to all workers
+            for _ in range(len(self.connections)):
+                worker_index = self.queue.get()
+                self.connections[worker_index].send(-1)
+        except Exception as e:
+            # TODO better send this to a common "something went wrong" Queue
+            for connection in self.connections:
+                connection.send(-2)
+                connection.send((e, traceback.format_exc()))
+
+    def send_to_worker(self, chunk_index, chunk1, chunk2=None):
+        worker_index = self.queue.get()
+        connection = self.connections[worker_index]
         connection.send(chunk_index)
         connection.send_bytes(chunk1)
         if chunk2 is not None:
             connection.send_bytes(chunk2)
 
-    if stdin_fd != -1:
-        sys.stdin.close()
-        sys.stdin = os.fdopen(stdin_fd)
-    try:
-        with xopen(file, 'rb') as f:
-            if file2:
-                with xopen(file2, 'rb') as f2:
-                    for chunk_index, (chunk1, chunk2) in enumerate(dnaio.read_paired_chunks(f, f2, buffer_size)):
-                        send_to_worker(chunk_index, chunk1, chunk2)
-            else:
-                for chunk_index, chunk in enumerate(dnaio.read_chunks(f, buffer_size)):
-                    send_to_worker(chunk_index, chunk)
-
-        # Send poison pills to all workers
-        for _ in range(len(connections)):
-            worker_index = queue.get()
-            connections[worker_index].send(-1)
-    except Exception as e:
-        # TODO better send this to a common "something went wrong" Queue
-        for worker_index in range(len(connections)):
-            connections[worker_index].send(-2)
-            connections[worker_index].send((e, traceback.format_exc()))
-
 
 class WorkerProcess(Process):
     """
@@ -535,7 +561,7 @@ class WorkerProcess(Process):
 
         return output_files
 
-    def _send_outfiles(self, outfiles, chunk_index, n_reads):
+    def _send_outfiles(self, outfiles: OutputFiles, chunk_index: int, n_reads: int):
         self._write_pipe.send(chunk_index)
         self._write_pipe.send(n_reads)
 
@@ -543,6 +569,7 @@ class WorkerProcess(Process):
             if f is None:
                 continue
             f.flush()
+            assert isinstance(f, io.BytesIO)
             processed_chunk = f.getvalue()
             self._write_pipe.send_bytes(processed_chunk)
 
@@ -575,7 +602,7 @@ class PipelineRunner(ABC):
     """
     A read processing pipeline
     """
-    def __init__(self, pipeline, progress):
+    def __init__(self, pipeline: Pipeline, progress: Progress):
         self._pipeline = pipeline
         self._progress = progress
 
@@ -618,48 +645,47 @@ class ParallelPipelineRunner(PipelineRunner):
         outfiles: OutputFiles,
         progress: Progress,
         n_workers: int,
-        buffer_size=4*1024**2,
+        buffer_size: int = 4 * 1024**2,
     ):
         super().__init__(pipeline, progress)
-        self._pipes = []  # the workers read from these
-        self._reader_process = None
-        self._outfiles = None
-        self._two_input_files = None
-        self._interleaved_input = None
         self._n_workers = n_workers
-        self._need_work_queue = Queue()
+        self._need_work_queue = Queue()  # type: Queue
         self._buffer_size = buffer_size
         self._assign_input(infiles.file1, infiles.file2, infiles.interleaved)
         self._assign_output(outfiles)
 
-    def _assign_input(self, file1, file2=None, interleaved=False):
-        if self._reader_process is not None:
-            raise RuntimeError('Do not call connect_io more than once')
+    def _assign_input(
+        self,
+        file1: BinaryIO,
+        file2: Optional[BinaryIO] = None,
+        interleaved: bool = False,
+    ):
         self._two_input_files = file2 is not None
         self._interleaved_input = interleaved
+        # the workers read from these connections
         connections = [Pipe(duplex=False) for _ in range(self._n_workers)]
-        self._pipes, connw = zip(*connections)
+        self._connections, connw = zip(*connections)
         try:
             fileno = sys.stdin.fileno()
         except io.UnsupportedOperation:
             # This happens during tests: pytest sets sys.stdin to an object
             # that does not have a file descriptor.
             fileno = -1
-        self._reader_process = Process(target=reader_process, args=(file1, file2, connw,
-            self._need_work_queue, self._buffer_size, fileno))
+        self._reader_process = ReaderProcess(file1, file2, connw,
+            self._need_work_queue, self._buffer_size, fileno)
         self._reader_process.daemon = True
         self._reader_process.start()
 
     @staticmethod
-    def can_output_to(outfiles):
+    def can_output_to(outfiles: OutputFiles) -> bool:
         return outfiles.out is not None and not outfiles.demultiplex
 
-    def _assign_output(self, outfiles):
+    def _assign_output(self, outfiles: OutputFiles):
         if not self.can_output_to(outfiles):
             raise ValueError()
         self._outfiles = outfiles
 
-    def _start_workers(self):
+    def _start_workers(self) -> Tuple[List[WorkerProcess], List[Connection]]:
         workers = []
         connections = []
         for index in range(self._n_workers):
@@ -669,7 +695,7 @@ class ParallelPipelineRunner(PipelineRunner):
                 index, self._pipeline,
                 self._two_input_files,
                 self._interleaved_input, self._outfiles,
-                self._pipes[index], conn_w, self._need_work_queue)
+                self._connections[index], conn_w, self._need_work_queue)
             worker.daemon = True
             worker.start()
             workers.append(worker)

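The reader_process function is refactored above into a ReaderProcess class, but the hand-off protocol stays the same: a worker announces readiness by putting its index on a queue, the reader sends each chunk down that worker's pipe, and a -1 token tells a worker to stop. A self-contained sketch of that pattern, with plain byte chunks instead of dnaio FASTQ chunks and illustrative names:

    from multiprocessing import Process, Pipe, Queue

    def worker(index, conn, ready_queue):
        while True:
            ready_queue.put(index)              # tell the reader we are ready
            chunk_index = conn.recv()
            if chunk_index == -1:               # poison pill: no more chunks
                break
            data = conn.recv_bytes()
            print("worker", index, "processed chunk", chunk_index, "-", len(data), "bytes")

    def feed_workers(chunks, senders, ready_queue):
        for chunk_index, chunk in enumerate(chunks):
            worker_index = ready_queue.get()    # wait for a free worker
            senders[worker_index].send(chunk_index)
            senders[worker_index].send_bytes(chunk)
        for _ in senders:                       # one poison pill per worker
            senders[ready_queue.get()].send(-1)

    if __name__ == "__main__":
        ready_queue = Queue()
        pipes = [Pipe(duplex=False) for _ in range(2)]
        workers = [Process(target=worker, args=(i, r, ready_queue))
                   for i, (r, _) in enumerate(pipes)]
        for w in workers:
            w.start()
        feed_workers([b"@r1\nACGT\n+\n!!!!\n", b"@r2\nTTTT\n+\n!!!!\n"],
                     [s for _, s in pipes], ready_queue)
        for w in workers:
            w.join()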

=====================================
src/cutadapt/report.py
=====================================
@@ -3,8 +3,10 @@ Routines for printing a report.
 """
 from io import StringIO
 import textwrap
-from .adapters import Where, EndStatistics, ADAPTER_TYPE_NAMES
-from .modifiers import QualityTrimmer, NextseqQualityTrimmer, AdapterCutter, PairedAdapterCutter
+from typing import Any, Optional, List
+from .adapters import Where, EndStatistics, AdapterStatistics, ADAPTER_TYPE_NAMES
+from .modifiers import (Modifier, PairedModifier, QualityTrimmer, NextseqQualityTrimmer,
+    AdapterCutter, PairedAdapterCutter, ReverseComplementer)
 from .filters import (NoFilter, PairedNoFilter, TooShortReadFilter, TooLongReadFilter,
     PairedDemultiplexer, CombinatorialDemultiplexer, Demultiplexer, NContentFilter, InfoFileWriter,
     WildcardFileWriter, RestFileWriter)
@@ -26,23 +28,24 @@ def add_if_not_none(a, b):
 
 
 class Statistics:
-    def __init__(self):
+    def __init__(self) -> None:
         """
         """
-        self.paired = None
+        self.paired = None  # type: Optional[bool]
+        self.did_quality_trimming = None  # type: Optional[bool]
         self.too_short = None
         self.too_long = None
         self.too_many_n = None
-        self.did_quality_trimming = None
+        self.reverse_complemented = None  # type: Optional[int]
         self.n = 0
         self.written = 0
         self.total_bp = [0, 0]
         self.written_bp = [0, 0]
         self.with_adapters = [0, 0]
         self.quality_trimmed_bp = [0, 0]
-        self.adapter_stats = [[], []]
+        self.adapter_stats = [[], []]  # type: List[List[AdapterStatistics]]
 
-    def __iadd__(self, other):
+    def __iadd__(self, other: Any):
         self.n += other.n
         self.written += other.written
 
@@ -55,6 +58,8 @@ class Statistics:
         elif self.did_quality_trimming != other.did_quality_trimming:
             raise ValueError('Incompatible Statistics: did_quality_trimming is not equal')
 
+        self.reverse_complemented = add_if_not_none(
+            self.reverse_complemented, other.reverse_complemented)
         self.too_short = add_if_not_none(self.too_short, other.too_short)
         self.too_long = add_if_not_none(self.too_long, other.too_long)
         self.too_many_n = add_if_not_none(self.too_many_n, other.too_many_n)
@@ -111,13 +116,13 @@ class Statistics:
         elif isinstance(w.filter, NContentFilter):
             self.too_many_n = w.filtered
 
-    def _collect_modifier(self, m):
+    def _collect_modifier(self, m: Modifier):
         if isinstance(m, PairedAdapterCutter):
             for i in 0, 1:
                 self.with_adapters[i] += m.with_adapters
                 self.adapter_stats[i] = list(m.adapter_statistics[i].values())
             return
-        if getattr(m, 'paired', False):
+        if isinstance(m, PairedModifier):
             modifiers_list = [(0, m._modifier1), (1, m._modifier2)]
         else:
             modifiers_list = [(0, m)]
@@ -128,6 +133,10 @@ class Statistics:
             elif isinstance(modifier, AdapterCutter):
                 self.with_adapters[i] += modifier.with_adapters
                 self.adapter_stats[i] = list(modifier.adapter_statistics.values())
+            elif isinstance(modifier, ReverseComplementer):
+                self.with_adapters[i] += modifier.adapter_cutter.with_adapters
+                self.adapter_stats[i] = list(modifier.adapter_cutter.adapter_statistics.values())
+                self.reverse_complemented = modifier.reverse_complemented
 
     @property
     def total(self):
@@ -157,6 +166,10 @@ class Statistics:
     def total_written_bp_fraction(self):
         return safe_divide(self.total_written_bp, self.total)
 
+    @property
+    def reverse_complemented_fraction(self):
+        return safe_divide(self.reverse_complemented, self.n)
+
     @property
     def too_short_fraction(self):
         return safe_divide(self.too_short, self.n)
@@ -262,7 +275,7 @@ class AdjacentBaseStatistics:
         return sio.getvalue()
 
 
-def full_report(stats, time, gc_content) -> str:
+def full_report(stats: Statistics, time: float, gc_content: float) -> str:  # noqa: C901
     """Print report to standard output."""
     if stats.n == 0:
         return "No reads processed!"
@@ -288,6 +301,9 @@ def full_report(stats, time, gc_content) -> str:
         Total reads processed:           {o.n:13,d}
         Reads with adapters:             {o.with_adapters[0]:13,d} ({o.with_adapters_fraction[0]:.1%})
         """)
+    if stats.reverse_complemented is not None:
+        report += "Reverse-complemented:            " \
+                  "{o.reverse_complemented:13,d} ({o.reverse_complemented_fraction:.1%})\n"
     if stats.too_short is not None:
         report += "{pairs_or_reads} that were too short:       {o.too_short:13,d} ({o.too_short_fraction:.1%})\n"
     if stats.too_long is not None:
@@ -324,6 +340,7 @@ def full_report(stats, time, gc_content) -> str:
             total_front = sum(adapter_statistics.front.lengths.values())
             total_back = sum(adapter_statistics.back.lengths.values())
             total = total_front + total_back
+            reverse_complemented = adapter_statistics.reverse_complemented
             where = adapter_statistics.where
             where_backs = (Where.BACK, Where.BACK_NOT_INTERNAL, Where.SUFFIX)
             where_fronts = (Where.FRONT, Where.FRONT_NOT_INTERNAL, Where.PREFIX)
@@ -348,9 +365,13 @@ def full_report(stats, time, gc_content) -> str:
                         len(adapter_statistics.back.sequence),
                         total_front, total_back))
             else:
-                print_s("Sequence: {}; Type: {}; Length: {}; Trimmed: {} times.".
+                print_s("Sequence: {}; Type: {}; Length: {}; Trimmed: {} times".
                     format(adapter_statistics.front.sequence, ADAPTER_TYPE_NAMES[adapter_statistics.where],
-                        len(adapter_statistics.front.sequence), total))
+                        len(adapter_statistics.front.sequence), total), end="")
+            if reverse_complemented is not None:
+                print_s("; Reverse-complemented: {} times".format(reverse_complemented))
+            else:
+                print_s()
             if total == 0:
                 print_s()
                 continue
@@ -396,7 +417,7 @@ def full_report(stats, time, gc_content) -> str:
     return sio.getvalue().rstrip()
 
 
-def minimal_report(stats, time, gc_content) -> str:
+def minimal_report(stats, _time, _gc_content) -> str:
     """Create a minimal tabular report suitable for concatenation"""
 
     def none(value):

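The report gains a "Reverse-complemented" line whose fraction goes through safe_divide, so empty input cannot trigger a division by zero. A small sketch of how that summary line is rendered; the behaviour of safe_divide shown here is an assumption, while the field widths match the format string in the hunk above:

    def safe_divide(numerator, denominator):
        # assumed behaviour: treat missing counters and empty input as zero
        if numerator is None or not denominator:
            return 0
        return numerator / denominator

    n = 6                     # reads processed
    reverse_complemented = 2  # reads kept as their reverse complement
    line = "Reverse-complemented:            {:13,d} ({:.1%})".format(
        reverse_complemented, safe_divide(reverse_complemented, n))
    print(line)   # 2 of 6 reads -> "(33.3%)"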

=====================================
src/cutadapt/utils.py
=====================================
@@ -154,6 +154,23 @@ class FileOpener:
     def xopen(self, path, mode):
         return xopen(path, mode, compresslevel=self.compression_level, threads=self.threads)
 
+    def xopen_or_none(self, path, mode):
+        """Return opened file or None if the path is None"""
+        if path is None:
+            return None
+        return self.xopen(path, mode)
+
+    def xopen_pair(self, path1, path2, mode):
+        file1 = file2 = None
+        if path1 is not None:
+            file1 = self.xopen(path1, mode)
+            if path2 is not None:
+                file2 = self.xopen(path2, mode)
+        elif path2 is not None:
+            raise ValueError("When giving paths for paired-end files, only providing the second"
+                " file is not supported")
+        return file1, file2
+
     def dnaio_open(self, *args, **kwargs):
         kwargs["opener"] = self.xopen
         return dnaio.open(*args, **kwargs)

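The two FileOpener helpers added above centralise the "open only if a path was given" logic. A hypothetical usage sketch; FileOpener's constructor defaults are assumed and the paths are illustrative:

    from cutadapt.utils import FileOpener

    opener = FileOpener()                                  # defaults assumed
    log = opener.xopen_or_none(None, "w")                  # no path given -> None
    out1, out2 = opener.xopen_pair("out.1.fastq.gz", "out.2.fastq.gz", "w")

    # Giving only the second path is rejected:
    try:
        opener.xopen_pair(None, "out.2.fastq.gz", "w")
    except ValueError as exc:
        print(exc)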

=====================================
tests/cut/linked-info.txt
=====================================
@@ -0,0 +1,9 @@
+r1 5' adapter and 3' adapter	0	0	10		AAAAAAAAAA	CCCCCCCCCCTTTTTTTTTTGGGGGGG	linkedadapter;1			
+r1 5' adapter and 3' adapter	0	10	20	CCCCCCCCCC	TTTTTTTTTT	GGGGGGG	linkedadapter;2			
+r2 without any adapter	-1	GGGGGGGGGGGGGGGGGGG	
+r3 5' adapter, partial 3' adapter	0	0	10		AAAAAAAAAA	CCCGGCCCCCTTTTT	linkedadapter;1			
+r3 5' adapter, partial 3' adapter	0	10	15	CCCGGCCCCC	TTTTT		linkedadapter;2			
+r4 only 3' adapter	-1	GGGGGGGGGGCCCCCCCCCCTTTTTTTTTTGGGGGGG	
+r5 only 5' adapter	0	0	10		AAAAAAAAAA	CCCCCCCCCCGGGGGGG	linkedadapter;1			
+r6 partial 5' adapter	-1	AAAAAACCCCCCCCCCTTTTTTTTTTGGGGGGG	
+r7 5' adapter plus preceding bases	-1	AACCGGTTTTAAAAAAAAAACCCCCCCCCCTTTTTTTTTTGGGGGGG	


=====================================
tests/cut/linked-lowercase.fasta
=====================================
@@ -0,0 +1,14 @@
+>r1 5' adapter and 3' adapter
+AAAAAAAAAACCCCCCCCCCTTTTTTTTTTGGGGGGG
+>r2 without any adapter
+GGGGGGGGGGGGGGGGGGG
+>r3 5' adapter, partial 3' adapter
+aaaaAAAAAACCCGGCCCCCTtttt
+>r4 only 3' adapter
+GGGGGGGGGGCCCCCCCCCCTTTTTTTTTTGGGGGGG
+>r5 only 5' adapter
+AAAAAAAAAACCCCCCCCCCGGGGGGG
+>r6 partial 5' adapter
+AAAAAACCCCCCCCCCTTTTTTTTTTGGGGGGG
+>r7 5' adapter plus preceding bases
+aaccggttttaaaaAAAAAACCCCCCCCCCTTTTTTttttggggggg

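The expected output above exercises --action=lowercase with linked adapters (issue 430). A minimal sketch of the underlying lower-casing step, mirroring lowercased_read from the modifiers.py hunk earlier but working on a plain string: the untrimmed interval (start, stop) stays upper-case and the adapter-matched flanks are lower-cased.

    def lowercase_outside(sequence, start, stop):
        # keep [start:stop) upper-case, lower-case everything an adapter matched
        return (sequence[:start].lower()
                + sequence[start:stop].upper()
                + sequence[stop:].lower())

    # illustrative coordinates, not taken from the test case above
    print(lowercase_outside("AAAATCGTCGTTTT", 4, 10))   # aaaaTCGTCGtttt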

=====================================
tests/cut/revcomp-single-normalize.fastq
=====================================
@@ -0,0 +1,24 @@
+@read1/1
+CCAGCTTAGACATATCGCCT
++
+G=1C(C1=J1J=C(18C(1(
+@read2/1 rc
+ACCATCCGATATGTCTAATGTGGCCTGTTG
++
+18C=((GC=C88(1JJ=JC88=C1(CJJJ8
+@read3/1
+CCAACTTGATATTAATAACATTagacaXataa
++
+111C1=8(G(8CJ8118=C8J=88G(J=8(=8
+@read4/1
+GACAGGCCGTTTGAATGTTGACGGGATGTT
++
+11=CCJJ(G===1==J8G1=8=J=CCGJ(J
+@read5
+CCAGCTTAGACATATCGCCT
++
+(J=1(G(J1GCC==CCG(=G
+@read6 rc
+ATGAGGAAGAAGTCAGATATGATCGAAATGTTG
++
+11G1=G8(CJJGC=G(JG1J=(=1(J88=J1(J


=====================================
tests/cut/revcomp-single.fastq
=====================================
@@ -0,0 +1,24 @@
+@read1/1
+CCAGCTTAGACATATCGCCT
++
+G=1C(C1=J1J=C(18C(1(
+@read2/1
+CAACAGGCCACATTAGACATATCGGATGGT
++
+8JJJC(1C=88CJ=JJ1(88C=CG((=C81
+@read3/1
+CCAACTTGATATTAATAACATTagacaXataa
++
+111C1=8(G(8CJ8118=C8J=88G(J=8(=8
+@read4/1
+GACAGGCCGTTTGAATGTTGACGGGATGTT
++
+11=CCJJ(G===1==J8G1=8=J=CCGJ(J
+@read5
+CCAGCTTAGACATATCGCCT
++
+(J=1(G(J1GCC==CCG(=G
+@read6
+CAACATTTCGATCATATCTGACTTCTTCCTCAT
++
+J(1J=88J(1=(=J1GJ(G=CGJJC(8G=1G11


=====================================
tests/data/revcomp.1.fastq
=====================================
@@ -0,0 +1,24 @@
+@read1/1
+ttatttgtctCCAGCTTAGACATATCGCCT
++
+(8J181J(J1G=1C(C1=J1J=C(18C(1(
+@read2/1
+CAACAGGCCACATTAGACATATCGGATGGTagacaaataa
++
+8JJJC(1C=88CJ=JJ1(88C=CG((=C81GJG8=J=8=(
+@read3/1
+tccgcactggCCAACTTGATATTAATAACATTagacaXataa
++
+1GC((GCG=(111C1=8(G(8CJ8118=C8J=88G(J=8(=8
+@read4/1
+GACAGGCCGTTTGAATGTTGACGGGATGTT
++
+11=CCJJ(G===1==J8G1=8=J=CCGJ(J
+@read5
+ttatttgtctCCAGCTTAGACATATCGCCT
++
+=GG=1GC(JG(J=1(G(J1GCC==CCG(=G
+@read6
+CAACATTTCGATCATATCTGACTTCTTCCTCATagacaaataa
++
+J(1J=88J(1=(=J1GJ(G=CGJJC(8G=1G118JC8(CC8((


=====================================
tests/data/revcomp.2.fastq
=====================================
@@ -0,0 +1,24 @@
+@read1/2
+GCTGGAGACAAATAACAGTGGAGTAGTTTT
++
+0FIF<<IF<B7''F7FB'0'<B'F'7F7BI
+@read2/2
+TGTGGCCTGTTGCAGTGGAGTAACTCCAGC
++
+'<07I'FIB'<<FBI''I0'BFB7F'<<0I
+@read3/2
+TGTTATTAATATCAAGTTGGCAGTG
++
+B7B'<<7BB77'F<'<0F7<0I000
+@read4/2
+CATCCCGTCAACATTCAAACGGCCTGTCCAagacaaataa
++
+7F0<'I0F<'F<B7FB7B<77'00BBF'BFIB<'<BF7<7
+@read5
+CAACAGGCCACATTAGACATATCGGATGGTagacaaataa
++
+7IF<F0B0''I'77<F'IB'<F7FI<<F0'F<7<7BBB'B
+@read6
+AGAAGCGGGGCGGGTGTCTCccagtgcgga
++
+B0FF7I''I7BBFF<<B'FBB0F<I<I700


=====================================
tests/test_adapters.py
=====================================
@@ -1,11 +1,11 @@
 import pytest
 
 from dnaio import Sequence
-from cutadapt.adapters import Adapter, Match, Where, LinkedAdapter
+from cutadapt.adapters import SingleAdapter, SingleMatch, Where, LinkedAdapter
 
 
 def test_issue_52():
-    adapter = Adapter(
+    adapter = SingleAdapter(
         sequence='GAACTCCAGTCACNNNNN',
         where=Where.BACK,
         remove='suffix',
@@ -14,7 +14,7 @@ def test_issue_52():
         read_wildcards=False,
         adapter_wildcards=True)
     read = Sequence(name="abc", sequence='CCCCAGAACTACAGTCCCGGC')
-    am = Match(astart=0, astop=17, rstart=5, rstop=21, matches=15, errors=2,
+    am = SingleMatch(astart=0, astop=17, rstart=5, rstop=21, matches=15, errors=2,
         remove_before=False, adapter=adapter, read=read)
     assert am.wildcards() == 'GGC'
     """
@@ -42,7 +42,7 @@ def test_issue_80():
     # This is correct, albeit a little surprising, since an alignment without
     # indels would have only two errors.
 
-    adapter = Adapter(
+    adapter = SingleAdapter(
         sequence="TCGTATGCCGTCTTC",
         where=Where.BACK,
         remove='suffix',
@@ -58,14 +58,14 @@ def test_issue_80():
 
 
 def test_str():
-    a = Adapter('ACGT', where=Where.BACK, remove='suffix', max_error_rate=0.1)
+    a = SingleAdapter('ACGT', where=Where.BACK, remove='suffix', max_error_rate=0.1)
     str(a)
     str(a.match_to(Sequence(name='seq', sequence='TTACGT')))
 
 
 def test_linked_adapter():
-    front_adapter = Adapter('AAAA', where=Where.PREFIX, min_overlap=4)
-    back_adapter = Adapter('TTTT', where=Where.BACK, min_overlap=3)
+    front_adapter = SingleAdapter('AAAA', where=Where.PREFIX, min_overlap=4)
+    back_adapter = SingleAdapter('TTTT', where=Where.BACK, min_overlap=3)
 
     linked_adapter = LinkedAdapter(
         front_adapter, back_adapter, front_required=True, back_required=False, name='name')
@@ -79,7 +79,7 @@ def test_linked_adapter():
 
 
 def test_info_record():
-    adapter = Adapter(
+    adapter = SingleAdapter(
         sequence='GAACTCCAGTCACNNNNN',
         where=Where.BACK,
         max_error_rate=0.12,
@@ -88,9 +88,9 @@ def test_info_record():
         adapter_wildcards=True,
         name="Foo")
     read = Sequence(name="abc", sequence='CCCCAGAACTACAGTCCCGGC')
-    am = Match(astart=0, astop=17, rstart=5, rstop=21, matches=15, errors=2, remove_before=False,
+    am = SingleMatch(astart=0, astop=17, rstart=5, rstop=21, matches=15, errors=2, remove_before=False,
         adapter=adapter, read=read)
-    assert am.get_info_record() == (
+    assert am.get_info_records() == [[
         "abc",
         2,
         5,
@@ -102,26 +102,26 @@ def test_info_record():
         '',
         '',
         '',
-    )
+    ]]
 
 
 def test_random_match_probabilities():
-    a = Adapter('A', where=Where.BACK, max_error_rate=0.1).create_statistics()
+    a = SingleAdapter('A', where=Where.BACK, max_error_rate=0.1).create_statistics()
     assert a.back.random_match_probabilities(0.5) == [1, 0.25]
     assert a.back.random_match_probabilities(0.2) == [1, 0.4]
 
     for s in ('ACTG', 'XMWH'):
-        a = Adapter(s, where=Where.BACK, max_error_rate=0.1).create_statistics()
+        a = SingleAdapter(s, where=Where.BACK, max_error_rate=0.1).create_statistics()
         assert a.back.random_match_probabilities(0.5) == [1, 0.25, 0.25**2, 0.25**3, 0.25**4]
         assert a.back.random_match_probabilities(0.2) == [1, 0.4, 0.4*0.1, 0.4*0.1*0.4, 0.4*0.1*0.4*0.1]
 
-    a = Adapter('GTCA', where=Where.FRONT, max_error_rate=0.1).create_statistics()
+    a = SingleAdapter('GTCA', where=Where.FRONT, max_error_rate=0.1).create_statistics()
     assert a.front.random_match_probabilities(0.5) == [1, 0.25, 0.25**2, 0.25**3, 0.25**4]
     assert a.front.random_match_probabilities(0.2) == [1, 0.4, 0.4*0.1, 0.4*0.1*0.4, 0.4*0.1*0.4*0.1]
 
 
 def test_add_adapter_statistics():
-    stats = Adapter('A', name='name', where=Where.BACK, max_error_rate=0.1).create_statistics()
+    stats = SingleAdapter('A', name='name', where=Where.BACK, max_error_rate=0.1).create_statistics()
     end_stats = stats.back
     end_stats.adjacent_bases['A'] = 7
     end_stats.adjacent_bases['C'] = 19
@@ -136,7 +136,7 @@ def test_add_adapter_statistics():
     end_stats.errors[20][1] = 66
     end_stats.errors[20][2] = 6
 
-    stats2 = Adapter('A', name='name', where=Where.BACK, max_error_rate=0.1).create_statistics()
+    stats2 = SingleAdapter('A', name='name', where=Where.BACK, max_error_rate=0.1).create_statistics()
     end_stats2 = stats2.back
     end_stats2.adjacent_bases['A'] = 43
     end_stats2.adjacent_bases['C'] = 31
@@ -164,8 +164,8 @@ def test_add_adapter_statistics():
 def test_issue_265():
     """Crash when accessing the matches property of non-anchored linked adapters"""
     s = Sequence('name', 'AAAATTTT')
-    front_adapter = Adapter('GGG', where=Where.FRONT)
-    back_adapter = Adapter('TTT', where=Where.BACK)
+    front_adapter = SingleAdapter('GGG', where=Where.FRONT)
+    back_adapter = SingleAdapter('TTT', where=Where.BACK)
     la = LinkedAdapter(front_adapter, back_adapter, front_required=False, back_required=False, name='name')
     assert la.match_to(s).matches == 3
 
@@ -173,6 +173,6 @@ def test_issue_265():
 @pytest.mark.parametrize("where", [Where.PREFIX, Where.SUFFIX])
 def test_no_indels_empty_read(where):
     # Issue #376
-    adapter = Adapter('ACGT', where=where, indels=False)
+    adapter = SingleAdapter('ACGT', where=where, indels=False)
     empty = Sequence('name', '')
     adapter.match_to(empty)


=====================================
tests/test_commandline.py
=====================================
@@ -530,6 +530,18 @@ def test_linked_discard_untrimmed_g(run):
     run('-g AAAAAAAAAA...TTTTTTTTTT --discard-untrimmed', 'linked-discard-g.fasta', 'linked.fasta')
 
 
+def test_linked_lowercase(run):
+    run('-a ^AACCGGTTTT...GGGGGGG$ -a ^AAAA...TTTT$ --times=2 --action=lowercase',
+        'linked-lowercase.fasta', 'linked.fasta')
+
+
+def test_linked_info_file(tmpdir):
+    info_path = str(tmpdir.join('info.txt'))
+    main(['-a linkedadapter=^AAAAAAAAAA...TTTTTTTTTT', '--info-file', info_path,
+        '-o', str(tmpdir.join('out.fasta')), datapath('linked.fasta')])
+    assert_files_equal(cutpath('linked-info.txt'), info_path)
+
+
 def test_linked_anywhere():
     with pytest.raises(SystemExit):
         with redirect_stderr():
@@ -695,3 +707,17 @@ def test_print_progress_to_tty(tmpdir, mocker):
 def test_adapter_order(run):
     run("-g ^AAACC -a CCGGG", "adapterorder-ga.fasta", "adapterorder.fasta")
     run("-a CCGGG -g ^AAACC", "adapterorder-ag.fasta", "adapterorder.fasta")
+
+
+ at pytest.mark.skip(reason="Not implemented")
+def test_reverse_complement_not_normalized(run):
+    run("--rc=yes -g ^TTATTTGTCT -g ^TCCGCACTGG",
+        "revcomp-notnormalized-single.fastq", "revcomp.1.fastq")
+
+
+def test_reverse_complement_normalized(run):
+    run(
+        "--revcomp -g ^TTATTTGTCT -g ^TCCGCACTGG",
+        "revcomp-single-normalize.fastq",
+        "revcomp.1.fastq",
+    )

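The new tests above drive the 2.8 features through the command-line entry point. A standalone sketch of equivalent invocations; it assumes main() is the argv-style entry point in cutadapt.__main__, as the test module uses, and the output paths are illustrative:

    from cutadapt.__main__ import main

    # Linked adapter with an info file (issue 431)
    main([
        "-a", "linkedadapter=^AAAAAAAAAA...TTTTTTTTTT",
        "--info-file", "info.txt",
        "-o", "out.fasta",
        "tests/data/linked.fasta",
    ])

    # Normalize strandedness with --revcomp (issue 220)
    main([
        "--revcomp", "-g", "^TTATTTGTCT", "-g", "^TCCGCACTGG",
        "-o", "revcomp-normalized.fastq",
        "tests/data/revcomp.1.fastq",
    ])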

=====================================
tests/test_modifiers.py
=====================================
@@ -52,8 +52,8 @@ def test_shortener():
 
 
 def test_adapter_cutter():
-    from cutadapt.adapters import Adapter, Where
-    a1 = Adapter('GTAGTCCCGC', where=Where.BACK)
-    a2 = Adapter('GTAGTCCCCC', where=Where.BACK)
+    from cutadapt.adapters import SingleAdapter, Where
+    a1 = SingleAdapter('GTAGTCCCGC', where=Where.BACK)
+    a2 = SingleAdapter('GTAGTCCCCC', where=Where.BACK)
     match = AdapterCutter.best_match([a1, a2], Sequence("name", "ATACCCCTGTAGTCCCC"))
     assert match.adapter is a2


=====================================
tests/test_parser.py
=====================================
@@ -2,7 +2,7 @@ from textwrap import dedent
 import pytest
 
 from dnaio import Sequence
-from cutadapt.adapters import Where, LinkedAdapter
+from cutadapt.adapters import Where, WhereToRemove, LinkedAdapter, SingleAdapter
 from cutadapt.parser import AdapterParser, AdapterSpecification
 
 
@@ -164,6 +164,7 @@ def test_linked_adapter_parameters():
 def test_linked_adapter_name():
     # issue #414
     a = AdapterParser()._parse("the_name=^ACG...TGT")
+    assert isinstance(a, LinkedAdapter)
     assert a.create_statistics().name == "the_name"
 
 
@@ -171,7 +172,8 @@ def test_anywhere_parameter():
     parser = AdapterParser(max_error_rate=0.2, min_overlap=4, read_wildcards=False,
         adapter_wildcards=False, indels=True)
     adapter = list(parser.parse('CTGAAGTGAAGTACACGGTT;anywhere', 'back'))[0]
-    assert adapter.remove == 'suffix'
+    assert isinstance(adapter, SingleAdapter)
+    assert adapter.remove == WhereToRemove.SUFFIX
     assert adapter.where is Where.ANYWHERE
     read = Sequence('foo1', 'TGAAGTACACGGTTAAAAAAAAAA')
     from cutadapt.modifiers import AdapterCutter


=====================================
tests/test_trim.py
=====================================
@@ -1,11 +1,11 @@
 from dnaio import Sequence
-from cutadapt.adapters import Adapter, Where
+from cutadapt.adapters import SingleAdapter, Where
 from cutadapt.modifiers import AdapterCutter
 
 
 def test_statistics():
     read = Sequence('name', 'AAAACCCCAAAA')
-    adapters = [Adapter('CCCC', Where.BACK, max_error_rate=0.1)]
+    adapters = [SingleAdapter('CCCC', Where.BACK, max_error_rate=0.1)]
     cutter = AdapterCutter(adapters, times=3)
     cutter(read, [])
     # TODO make this a lot simpler
@@ -25,7 +25,7 @@ def test_end_trim_with_mismatch():
     the hit and so the match is considered good. An insertion or substitution
     at the same spot is not a match.
     """
-    adapter = Adapter('TCGATCGATCGAT', Where.BACK, max_error_rate=0.1)
+    adapter = SingleAdapter('TCGATCGATCGAT', Where.BACK, max_error_rate=0.1)
 
     read = Sequence('foo1', 'AAAAAAAAAAATCGTCGATC')
     cutter = AdapterCutter([adapter], times=1)
@@ -46,7 +46,7 @@ def test_end_trim_with_mismatch():
 
 
 def test_anywhere_with_errors():
-    adapter = Adapter('CCGCATTTAG', Where.ANYWHERE, max_error_rate=0.1)
+    adapter = SingleAdapter('CCGCATTTAG', Where.ANYWHERE, max_error_rate=0.1)
     for seq, expected_trimmed in (
         ('AACCGGTTccgcatttagGATC', 'AACCGGTT'),
         ('AACCGGTTccgcgtttagGATC', 'AACCGGTT'),  # one mismatch


=====================================
tox.ini
=====================================
@@ -1,5 +1,5 @@
 [tox]
-envlist = flake8,py35,py36,py37,py38,docs
+envlist = flake8,py35,py36,py37,py38,mypy,docs
 requires = Cython>=0.29.13
 
 [testenv]
@@ -26,9 +26,14 @@ basepython = python3.6
 deps = flake8
 commands = flake8 src/ tests/
 
+[testenv:mypy]
+basepython = python3.6
+deps = mypy
+commands = mypy src/
+
 [travis]
 python =
-  3.6: py36, docs
+  3.6: py36, docs, mypy
 
 [coverage:run]
 parallel = True
@@ -43,6 +48,6 @@ source =
 
 [flake8]
 max-line-length = 120
-max-complexity = 23
+max-complexity = 20
 select = E,F,W,C90,W504
-extend_ignore = E128,E131,W503
+extend_ignore = E128,E131,W503,E203



View it on GitLab: https://salsa.debian.org/med-team/python-cutadapt/compare/760df09c274eaf00ed901787cc21855c726f1551...4c6471aea1780c2f4b5feb1eef928e6e5ac7e529
