[med-svn] [Git][med-team/python-deeptools][upstream] New upstream version 3.5.5+dfsg
Étienne Mollier (@emollier)
gitlab at salsa.debian.org
Sun May 12 18:57:22 BST 2024
Étienne Mollier pushed to branch upstream at Debian Med / python-deeptools
Commits:
bbcd084c by Étienne Mollier at 2024-05-12T19:43:03+02:00
New upstream version 3.5.5+dfsg
- - - - -
27 changed files:
- .github/CONTRIBUTING.md
- .github/test_and_build.yml
- .github/workflows/planemo.yml
- .github/workflows/test.yml
- .readthedocs.yaml
- CHANGES.txt
- README.md
- deeptools/alignmentSieve.py
- deeptools/bamPEFragmentSize.py
- deeptools/bigwigAverage.py
- deeptools/bigwigCompare.py
- deeptools/computeMatrix.py
- deeptools/computeMatrixOperations.py
- − deeptools/deepBlue.py
- deeptools/deeptools_list_tools.py
- deeptools/estimateReadFiltering.py
- deeptools/estimateScaleFactor.py
- deeptools/getFragmentAndReadSize.py
- deeptools/multiBamSummary.py
- deeptools/multiBigwigSummary.py
- deeptools/parserCommon.py
- deeptools/plotCorrelation.py
- deeptools/plotCoverage.py
- deeptools/plotHeatmap.py
- deeptools/plotPCA.py
- galaxy/wrapper/deepTools_macros.xml
- pyproject.toml
Changes:
=====================================
.github/CONTRIBUTING.md
=====================================
@@ -17,4 +17,4 @@ for contributing to the repository :**
## Testing
-* Please make sure that travis tests are passing
+* Please make sure that github actions are passing
=====================================
.github/test_and_build.yml
=====================================
@@ -2,7 +2,7 @@ channels:
- conda-forge
- bioconda
dependencies:
- - python >= 3.7
+ - python > 3.7
- numpy
- scipy
- flake8
=====================================
.github/workflows/planemo.yml
=====================================
@@ -49,7 +49,7 @@ jobs:
runs-on: ubuntu-latest
strategy:
matrix:
- python-version: ['3.7', '3.11']
+ python-version: ['3.8', '3.11']
steps:
- uses: actions/download-artifact at v3
with:
=====================================
.github/workflows/test.yml
=====================================
@@ -63,9 +63,9 @@ jobs:
micromamba activate test_and_build
rm -f dist/*
python -m build
- - uses: actions/upload-artifact at master
+ - uses: actions/upload-artifact at v3
with:
- name: "Dist files"
+ name: "distfiles"
path: "dist"
test-wheels:
name: test wheel
@@ -73,12 +73,12 @@ jobs:
needs: build-linux
strategy:
matrix:
- python-version: ['3.7','3.8','3.9','3.10', '3.11']
+ python-version: ['3.8','3.9','3.10', '3.11', '3.12']
steps:
- uses: actions/checkout at v3
- uses: actions/download-artifact at v3
with:
- name: "Dist files"
+ name: "distfiles"
path: ~/dist/
- uses: actions/setup-python at v4
with:
=====================================
.readthedocs.yaml
=====================================
@@ -3,11 +3,13 @@ version: 2
build:
os: ubuntu-22.04
tools:
- python: "3.11"
+ python: "3.12"
sphinx:
configuration: docs/conf.py
python:
install:
+ - method: pip
+ path: .
- requirements: docs/requirements.txt
=====================================
CHANGES.txt
=====================================
@@ -1,3 +1,19 @@
+3.5.5
+* drop support for python 3.7
+* doc fixes (argparse properly displayed, minor changes in installation instructions)
+* deepblue support stops
+* initiate deprecation of tight_layout in plotheatmap, in favor of constrained_layout. Minor changes in paddings, etc can occur (but for the better).
+* documentation changes to improve ESS tab, table constraints have been lifted & sphinx_rtd_theme to v2.0.0
+* upload artifact in gh test runner pinned to 3
+* Try to get the number of processors from sched_getaffinity, to avoid using to many in job submissions for example. #1199
+* Fix typo in estimateScaleFactor that fixes broken argparsing. #1286
+
+3.5.4
+* error handling and cases for bwAverage with >2 samples
+* Tick.label deprecation for mpl 3.8
+* minimal mpl version is 3.5
+* cicd update for pypi push
+
3.5.3
* requirement cap for matplotlib lifted (changes in plotting can occur)
* nose has been deprecated in favor of pytests
=====================================
README.md
=====================================
@@ -4,7 +4,6 @@
[![install with bioconda](https://img.shields.io/badge/install%20with-bioconda-brightgreen.svg?style=flat)](http://bioconda.github.io/recipes/deeptools/README.html)
[![European Galaxy server](https://img.shields.io/badge/usegalaxy-.eu-brightgreen?logo=)](https://usegalaxy.eu/root?tool_id=deeptools_compute_matrix)
![test](https://github.com/deeptools/deepTools/actions/workflows/test.yml/badge.svg)
-![planemo](https://github.com/deeptools/deepTools/actions/workflows/planemo.yml/badge.svg)
## User-friendly tools for exploring deep-sequencing data
@@ -34,35 +33,26 @@ Our [Gallery](http://deeptools.readthedocs.org/en/latest/content/example_gallery
deepTools are available for:
-* Command line usage (via pip/anaconda/github)
+* Command line usage (via pip / conda / github)
* Integration into Galaxy servers (via toolshed/API/web-browser)
-There are many easy ways to install deepTools. Details can be found [here](https://deeptools.readthedocs.io/en/latest/content/installation.html)
+There are many easy ways to install deepTools. More details can be found [here](https://deeptools.readthedocs.io/en/latest/content/installation.html).
-**Install by cloning this repository:**
+In Brief:
-You can install any one of the deepTools branches on command line (linux/mac) by cloning this git repository :
+**Install through pypi**
- $ git clone https://github.com/deeptools/deepTools
- $ cd deepTools
- $ python setup.py install
-
-By default, the script will install the python library and executable
-codes globally, which means you need to be root or administrator of
-the machine to complete the installation. If you need to
-provide a nonstandard install prefix, or any other nonstandard
-options, you can provide many command line options to the install
-script.
+ $ pip install deeptools
- $ python setup.py --help
+**Install via conda**
-For example, to install under a specific location use:
+ $ conda install -c bioconda deeptools
- $ python setup.py install --prefix <target directory>
+**Install by cloning the repository**
-To install into your home directory, use:
-
- $ python setup.py install --user
+ $ git clone https://github.com/deeptools/deepTools
+ $ cd deepTools
+ $ pip install .
<a name="galaxy"/></a>
### Galaxy Installation
=====================================
deeptools/alignmentSieve.py
=====================================
@@ -7,11 +7,8 @@ import sys
from deeptools import parserCommon
from deeptools.bamHandler import openBam
from deeptools.mapReduce import mapReduce
-try: # keep python 3.7 support.
- from importlib.metadata import version
-except ModuleNotFoundError:
- from importlib_metadata import version
from deeptools.utilities import getTLen, smartLabels, getTempFileName
+from importlib.metadata import version
def parseArguments():
=====================================
deeptools/bamPEFragmentSize.py
=====================================
@@ -18,10 +18,7 @@ import plotly.graph_objs as go
# own tools
from deeptools.parserCommon import writableFile
from deeptools.getFragmentAndReadSize import get_read_and_fragment_length
-try: # keep python 3.7 support.
- from importlib.metadata import version
-except ModuleNotFoundError:
- from importlib_metadata import version
+from importlib.metadata import version
def parse_arguments():
=====================================
deeptools/bigwigAverage.py
=====================================
@@ -1,13 +1,10 @@
#!/usr/bin/env python
# -*- coding: utf-8 -*-
-import argparse # to parse command line arguments
+import argparse
import sys
-import multiprocessing
-import os
import numpy as np
from deeptools import parserCommon
from deeptools import writeBedGraph_bam_and_bw
-import deeptools.deepBlue as db
debug = 0
@@ -15,9 +12,8 @@ debug = 0
def parse_arguments(args=None):
parentParser = parserCommon.getParentArgParse()
outputParser = parserCommon.output()
- dbParser = parserCommon.deepBlueOptionalArgs()
parser = argparse.ArgumentParser(
- parents=[parentParser, outputParser, dbParser],
+ parents=[parentParser, outputParser],
formatter_class=argparse.ArgumentDefaultsHelpFormatter,
description='This tool average multiple bigWig files based on the number '
'of mapped reads. To average the bigWig files, the genome is '
@@ -59,7 +55,7 @@ def parse_arguments(args=None):
def getType(fname):
"""
- Tries to determine if a file is a wiggle file from deepBlue or a bigWig file.
+ Tries to determine if a file is a wiggle file a bigWig file.
Returns 'wiggle' if the file name ends with .wig, otherwise 'bigwig'
"""
if fname.endswith(".wig") or fname.endswith(".wiggle"):
@@ -119,29 +115,6 @@ def main(args=None):
FUNC = average
function_args = {'scaleFactors': scaleFactors}
- # Preload deepBlue files, which need to then be deleted
- deepBlueFiles = []
- for idx, fname in enumerate(args.bigwigs):
- if db.isDeepBlue(fname):
- deepBlueFiles.append([fname, idx])
- if len(deepBlueFiles) > 0:
- sys.stderr.write("Preloading the following deepBlue files: {}\n".format(",".join([x[0] for x in deepBlueFiles])))
- foo = db.deepBlue(deepBlueFiles[0][0], url=args.deepBlueURL, userKey=args.userKey)
- regs = db.makeChromTiles(foo)
- for x in deepBlueFiles:
- x.extend([args, regs])
- if len(deepBlueFiles) > 1 and args.numberOfProcessors > 1:
- pool = multiprocessing.Pool(args.numberOfProcessors)
- res = pool.map_async(db.preloadWrapper, deepBlueFiles).get(9999999)
- else:
- res = list(map(db.preloadWrapper, deepBlueFiles))
-
- # substitute the file names with the temp files
- for (ftuple, r) in zip(deepBlueFiles, res):
- args.bigwigs[ftuple[1]] = r
- deepBlueFiles = [[x[0], x[1]] for x in deepBlueFiles]
- del regs
-
writeBedGraph_bam_and_bw.writeBedGraph(
[(b, getType(b)) for b in args.bigwigs],
args.outFileName, 0, FUNC,
@@ -154,12 +127,3 @@ def main(args=None):
smoothLength=False,
missingDataAsZero=not args.skipNonCoveredRegions,
extendPairedEnds=False)
-
- # Clean up temporary bigWig files, if applicable
- if not args.deepBlueKeepTemp:
- for k, v in deepBlueFiles:
- os.remove(args.bigwigs[v])
- else:
- for k, v in deepBlueFiles:
- foo = args.bigwigs[v]
- print("{} is stored in {}".format(k, foo))
=====================================
deeptools/bigwigCompare.py
=====================================
@@ -1,13 +1,9 @@
#!/usr/bin/env python
# -*- coding: utf-8 -*-
-import argparse # to parse command line arguments
-import sys
-import multiprocessing
-import os
+import argparse
from deeptools import parserCommon
from deeptools.getRatio import getRatio
from deeptools import writeBedGraph_bam_and_bw
-import deeptools.deepBlue as db
debug = 0
@@ -15,9 +11,8 @@ debug = 0
def parse_arguments(args=None):
parentParser = parserCommon.getParentArgParse()
outputParser = parserCommon.output()
- dbParser = parserCommon.deepBlueOptionalArgs()
parser = argparse.ArgumentParser(
- parents=[parentParser, outputParser, dbParser],
+ parents=[parentParser, outputParser],
formatter_class=argparse.ArgumentDefaultsHelpFormatter,
description='This tool compares two bigWig files based on the number '
'of mapped reads. To compare the bigWig files, the genome is '
@@ -104,7 +99,7 @@ def parse_arguments(args=None):
def getType(fname):
"""
- Tries to determine if a file is a wiggle file from deepBlue or a bigWig file.
+ Tries to determine if a file is a wiggle file or a bigWig file.
Returns 'wiggle' if the file name ends with .wig, otherwise 'bigwig'
"""
if fname.endswith(".wig") or fname.endswith(".wiggle"):
@@ -136,32 +131,6 @@ def main(args=None):
'scaleFactors': scaleFactors,
'pseudocount': args.pseudocount}
- # Preload deepBlue files, which need to then be deleted
- deepBlueFiles = []
- for idx, fname in enumerate([args.bigwig1, args.bigwig2]):
- if db.isDeepBlue(fname):
- deepBlueFiles.append([fname, idx])
- if len(deepBlueFiles) > 0:
- sys.stderr.write("Preloading the following deepBlue files: {}\n".format(",".join([x[0] for x in deepBlueFiles])))
- foo = db.deepBlue(deepBlueFiles[0][0], url=args.deepBlueURL, userKey=args.userKey)
- regs = db.makeChromTiles(foo)
- for x in deepBlueFiles:
- x.extend([args, regs])
- if len(deepBlueFiles) > 1 and args.numberOfProcessors > 1:
- pool = multiprocessing.Pool(args.numberOfProcessors)
- res = pool.map_async(db.preloadWrapper, deepBlueFiles).get(9999999)
- else:
- res = list(map(db.preloadWrapper, deepBlueFiles))
-
- # substitute the file names with the temp files
- for (ftuple, r) in zip(deepBlueFiles, res):
- if ftuple[1] == 0:
- args.bigwig1 = r
- else:
- args.bigwig2 = r
- deepBlueFiles = [[x[0], x[1]] for x in deepBlueFiles]
- del regs
-
writeBedGraph_bam_and_bw.writeBedGraph(
[(args.bigwig1, getType(args.bigwig1)),
(args.bigwig2, getType(args.bigwig2))],
@@ -176,17 +145,3 @@ def main(args=None):
missingDataAsZero=not args.skipNonCoveredRegions,
extendPairedEnds=False,
fixedStep=args.fixedStep)
-
- # Clean up temporary bigWig files, if applicable
- if not args.deepBlueKeepTemp:
- for k, v in deepBlueFiles:
- if v == 0:
- os.remove(args.bigwig1)
- else:
- os.remove(args.bigwig2)
- else:
- for k, v in deepBlueFiles:
- foo = args.bigwig1
- if v == 1:
- foo = args.bigwig2
- print("{} is stored in {}".format(k, foo))
=====================================
deeptools/computeMatrix.py
=====================================
@@ -3,18 +3,11 @@
import argparse
import sys
-import os
-import multiprocessing
-
from deeptools.parserCommon import writableFile, numberOfProcessors
-try: # keep python 3.7 support.
- from importlib.metadata import version
-except ModuleNotFoundError:
- from importlib_metadata import version
from deeptools import parserCommon
from deeptools import heatmapper
import deeptools.computeMatrixOperations as cmo
-import deeptools.deepBlue as db
+from importlib.metadata import version
def parse_arguments(args=None):
@@ -47,17 +40,16 @@ $ computeMatrix scale-regions --help
dest='command',
metavar='')
- dbParser = parserCommon.deepBlueOptionalArgs()
-
# scale-regions mode options
subparsers.add_parser(
'scale-regions',
formatter_class=argparse.ArgumentDefaultsHelpFormatter,
- parents=[computeMatrixRequiredArgs(),
- computeMatrixOutputArgs(),
- computeMatrixOptArgs(case='scale-regions'),
- parserCommon.gtf_options(),
- dbParser],
+ parents=[
+ computeMatrixRequiredArgs(),
+ computeMatrixOutputArgs(),
+ computeMatrixOptArgs(case='scale-regions'),
+ parserCommon.gtf_options()
+ ],
help="In the scale-regions mode, all regions in the BED file are "
"stretched or shrunken to the length (in bases) indicated by the user.",
usage='An example usage is:\n computeMatrix scale-regions -S '
@@ -70,8 +62,8 @@ $ computeMatrix scale-regions --help
parents=[computeMatrixRequiredArgs(),
computeMatrixOutputArgs(),
computeMatrixOptArgs(case='reference-point'),
- parserCommon.gtf_options(),
- dbParser],
+ parserCommon.gtf_options()
+ ],
help="Reference-point refers to a position within a BED region "
"(e.g., the starting point). In this mode, only those genomic"
"positions before (upstream) and/or after (downstream) of the "
@@ -402,28 +394,6 @@ def main(args=None):
hm = heatmapper.heatmapper()
- # Preload deepBlue files, which need to then be deleted
- deepBlueFiles = []
- for idx, fname in enumerate(args.scoreFileName):
- if db.isDeepBlue(fname):
- deepBlueFiles.append([fname, idx])
- if len(deepBlueFiles) > 0:
- sys.stderr.write("Preloading the following deepBlue files: {}\n".format(",".join([x[0] for x in deepBlueFiles])))
- regs = db.makeRegions(args.regionsFileName, args)
- for x in deepBlueFiles:
- x.extend([args, regs])
- if len(deepBlueFiles) > 1 and args.numberOfProcessors > 1:
- pool = multiprocessing.Pool(args.numberOfProcessors)
- res = pool.map_async(db.preloadWrapper, deepBlueFiles).get(9999999)
- else:
- res = list(map(db.preloadWrapper, deepBlueFiles))
-
- # substitute the file names with the temp files
- for (ftuple, r) in zip(deepBlueFiles, res):
- args.scoreFileName[ftuple[1]] = r
- deepBlueFiles = [[x[0], x[1]] for x in deepBlueFiles]
- del regs
-
scores_file_list = args.scoreFileName
hm.computeMatrix(scores_file_list, args.regionsFileName, parameters, blackListFileName=args.blackListFileName, verbose=args.verbose, allArgs=args)
if args.sortRegions not in ['no', 'keep']:
@@ -449,11 +419,3 @@ def main(args=None):
if args.outFileSortedRegions:
hm.save_BED(args.outFileSortedRegions)
-
- # Clean up temporary bigWig files, if applicable
- if not args.deepBlueKeepTemp:
- for k, v in deepBlueFiles:
- os.remove(args.scoreFileName[v])
- else:
- for k, v in deepBlueFiles:
- print("{} is stored in {}".format(k, args.scoreFileName[v]))
=====================================
deeptools/computeMatrixOperations.py
=====================================
@@ -6,10 +6,7 @@ import argparse
import sys
import os
import csv
-try: # keep python 3.7 support.
- from importlib.metadata import version
-except ModuleNotFoundError:
- from importlib_metadata import version
+from importlib.metadata import version
def parse_arguments():
=====================================
deeptools/deepBlue.py deleted
=====================================
@@ -1,286 +0,0 @@
-#!/usr/bin/env python
-try:
- # python 2
- import xmlrpclib
-except:
- # python 3
- import xmlrpc.client as xmlrpclib
-import time
-import tempfile
-import os.path
-import sys
-import pyBigWig
-from deeptools.utilities import mungeChromosome
-from deeptoolsintervals import GTF
-import datetime
-
-
-def isDeepBlue(fname):
- """
- Returns true if the file ends in .wig, .wiggle, or .bedgraph, since these indicate a file on the deepBlue server
- """
- if fname.endswith(".wig"):
- return True
- if fname.endswith(".wiggle"):
- return True
- if fname.endswith(".bedgraph"):
- return True
- if fname.startswith("http") or fname.startswith("ftp"):
- return False
- # For ENCODE samples, the "Name" is just the ENCODE sample ID, so as a fallback check for files that aren't there.
- if not os.path.exists(fname):
- return True
- return False
-
-
-def mergeRegions(regions):
- """
- Given a list of [(chrom, start, end), ...], merge all overlapping regions
-
- This returns a dict, where values are sorted lists of [start, end].
- """
- bar = sorted(regions)
- out = dict()
- last = [None, None, None]
- for reg in bar:
- if reg[0] == last[0] and reg[1] <= last[2]:
- if reg[2] > last[2]:
- last[2] = reg[2]
- continue
- else:
- if last[0]:
- if last[0] not in out:
- out[last[0]] = list()
- out[last[0]].append([last[1], last[2]])
- last = [reg[0], reg[1], reg[2]]
- if last[0] not in out:
- out[last[0]] = list()
- out[last[0]].append([last[1], last[2]])
- return out
-
-
-def makeTiles(db, args):
- """
- Given a deepBlue object, return a list of regions that will be queried
- """
- out = []
- for (k, v) in db.chromsTuple:
- start = 0
- while start <= v:
- end = start + args.binSize
- if end > v:
- end = v
- out.append([k, start, end])
- start += end + args.distanceBetweenBins
- return out
-
-
-def makeChromTiles(db):
- """
- Make a region for each chromosome
- """
- out = []
- for (k, v) in db.chromsTuple:
- out.append([k, 0, v])
- return out
-
-
-def makeRegions(BED, args):
- """
- Given a list of BED/GTF files, make a list of regions.
- These are vaguely extended as appropriate. For simplicity, the maximum of --beforeRegionStartLength
- and --afterRegionStartLength are tacked on to each end and transcripts are used for GTF files.
- """
- itree = GTF(BED, transcriptID=args.transcriptID, transcript_id_designator=args.transcript_id_designator)
- o = []
- extend = 0
- # The before/after stuff is specific to computeMatrix
- if "beforeRegionStartLength" in args:
- extend = max(args.beforeRegionStartLength, args.afterRegionStartLength)
- for chrom in itree.chroms:
- regs = itree.findOverlaps(chrom, 0, 4294967295) # bigWig files use 32 bit coordinates
- for reg in regs:
- o.append([chrom, max(0, reg[0] - extend), reg[1] + extend])
- del itree
- return o
-
-
-def preloadWrapper(foo):
- """
- This is a wrapper around the preload function for multiprocessing
- """
- args = foo[2]
- regs = foo[3]
- res = deepBlue(foo[0], url=args.deepBlueURL, userKey=args.userKey)
- return res.preload(regs, tmpDir=args.deepBlueTempDir)
-
-
-class deepBlue(object):
- def __init__(self, sample, url="http://deepblue.mpi-inf.mpg.de/xmlrpc", userKey="anonymous_key"):
- """
- Connect to the requested deepblue server with the given user key and request the specifed sample from it.
-
- >>> sample = "S002R5H1.ERX300721.H3K4me3.bwa.GRCh38.20150528.bedgraph"
- >>> db = deepBlue(sample) # doctest: +SKIP
- >>> assert db.chroms("chr1") == 248956422 # doctest: +SKIP
- """
- self.sample = sample
- self.url = url
- self.userKey = userKey
- self.server = xmlrpclib.Server(url, allow_none=True)
- self.info = None
- self.experimentID = None
- self.genome = None
- self.chromsDict = None
- self.chromsTuple = None
-
- # Set self.experimentID
- experimentID = self.getEID()
- if not experimentID:
- raise RuntimeError("The requested sample({}) has no associated experiment! If you did not intend to use samples on deepBlue, then it appears either you misspelled a file name or (if you're using BAM files for input) one of your BAM files is lacking a valid index.".format(sample))
-
- # Set self.info
- (status, resp) = self.server.info(self.experimentID, userKey)
- if status != "okay":
- raise RuntimeError("Received the following error while fetching information about '{}': {}".format(resp, sample))
- self.info = resp[0]
-
- # Set self.genome
- genome = self.getGenome()
- if not genome:
- raise RuntimeError("Unable to determine an appropriate genome for '{}'".format(sample))
-
- # Set self.chroms
- chroms = self.getChroms()
- if not chroms:
- raise RuntimeError("Unable to determine chromosome names/sizes for '{}'".format(sample))
-
- def getEID(self):
- """
- Given a sample name, return its associated experiment ID (or None on error).
-
- self.experimentID is then the internal ID (e.g., e52525)
- """
- (status, resps) = self.server.search(self.sample, "experiments", self.userKey)
- if status != "okay":
- raise RuntimeError("Received an error ({}) while searching for the experiment associated with '{}'".format(resps, self.sample))
- for resp in resps:
- if resp[1] == self.sample:
- self.experimentID = resp[0]
- return resp[0]
- return None
-
- def getGenome(self):
- """
- Determines and sets the genome assigned to a given sample. On error, this raises a runtime exception.
-
- self.genome is then the internal genome ID.
- """
- if "genome" in self.info.keys():
- self.genome = self.info["genome"]
- return self.genome
-
- def getChroms(self):
- """
- Determines and sets the chromosome names/sizes for a given sample. On error, this raises a runtime exception.
-
- self.chroms is then a dictionary of chromosome:length pairs
- """
- (status, resp) = self.server.chromosomes(self.genome, self.userKey)
- if status != "okay":
- raise RuntimeError("Received an error while fetching chromosome information for '{}': {}".format(self.sample, resp))
- self.chromsDict = {k: v for k, v in resp}
- self.chromsTuple = [(k, v) for k, v in resp]
- return resp
-
- def chroms(self, chrom=None):
- """
- Like the chroms() function in pyBigWig, returns either chromsDict (chrom is None) or the length of a given chromosome
- """
- if chrom is None:
- return self.chromsDict
- elif chrom in self.chromsDict:
- return self.chromsDict[chrom]
- return None
-
- def close(self):
- pass
-
- def preload(self, regions, tmpDir=None):
- """
- Given a sample and a set of regions, write a bigWig file containing the underlying signal.
-
- This function returns the file name, which needs to be deleted by the calling function at some point.
-
- This sends queries one chromosome at a time, due to memory limits on deepBlue
- """
- startTime = datetime.datetime.now()
- regions2 = mergeRegions(regions)
-
- # Make a temporary file
- f = tempfile.NamedTemporaryFile(delete=False, dir=tmpDir)
- fname = f.name
- f.close()
-
- # Start with the bigWig file
- bw = pyBigWig.open(fname, "w")
- bw.addHeader(self.chromsTuple, maxZooms=0) # This won't work in IGV!
-
- # Make a string out of everything in a resonable order
- for k, v in self.chromsTuple:
- # Munge chromosome names as appropriate
- chrom = mungeChromosome(k, regions2.keys())
- if not chrom:
- continue
- if chrom not in regions2 or len(regions2) == 0:
- continue
- regionsStr = "\n".join(["{}\t{}\t{}".format(k, reg[0], reg[1]) for reg in regions2[chrom]])
- regionsStr += "\n"
-
- # Send the regions
- (status, regionsID) = self.server.input_regions(self.genome, regionsStr, self.userKey)
- if status != "okay":
- raise RuntimeError("Received the following error while sending regions for '{}': {}".format(regionsID, self.sample))
-
- # Get the experiment information
- (status, queryID) = self.server.select_experiments(self.sample, k, None, None, self.userKey)
- if status != "okay":
- raise RuntimeError("Received the following error while running select_experiments on file '{}': {}".format(self.sample, queryID))
- if not queryID:
- raise RuntimeError("Somehow, we received None as a query ID (file '{}')".format(self.sample))
-
- # Intersect
- (status, intersectID) = self.server.intersection(queryID, regionsID, self.userKey)
- if status != "okay":
- raise RuntimeError("Received the following error while running intersection on file '{}': {}".format(self.sample, intersectID))
- if not intersectID:
- raise RuntimeError("Somehow, we received None as an intersect ID (file '{}')".format(self.sample))
-
- # Query the regions
- (status, reqID) = self.server.get_regions(intersectID, "START,END,VALUE", self.userKey)
- if status != "okay":
- raise RuntimeError("Received the following error while fetching regions in file '{}': {}".format(self.sample, reqID))
-
- # Wait for the server to process the data
- (status, info) = self.server.info(reqID, self.userKey)
- request_status = info[0]["state"]
- while request_status != "done" and request_status != "failed":
- time.sleep(0.1)
- (status, info) = self.server.info(reqID, self.userKey)
- request_status = info[0]["state"]
-
- # Get the actual data
- (status, resp) = self.server.get_request_data(reqID, self.userKey)
- if status != "okay":
- raise RuntimeError("Received the following error while fetching data in file '{}': {}".format(self.sample, resp))
-
- for intervals in resp.split("\n"):
- interval = intervals.split("\t")
- if interval[0] == '':
- continue
- bw.addEntries([k], [int(interval[0]) - 1], ends=[int(interval[1]) - 1], values=[float(interval[2])])
- bw.close()
- sys.stderr.write("{} done (took {})\n".format(self.sample, datetime.datetime.now() - startTime))
- sys.stderr.flush()
-
- return fname
=====================================
deeptools/deeptools_list_tools.py
=====================================
@@ -3,10 +3,7 @@
import argparse
import sys
-try: # keep python 3.7 support.
- from importlib.metadata import version
-except ModuleNotFoundError:
- from importlib_metadata import version
+from importlib.metadata import version
def parse_arguments(args=None):
=====================================
deeptools/estimateReadFiltering.py
=====================================
@@ -5,10 +5,7 @@ import sys
from deeptools import parserCommon, bamHandler, utilities
from deeptools.mapReduce import mapReduce
from deeptools.utilities import smartLabels
-try: # keep python 3.7 support.
- from importlib.metadata import version
-except ModuleNotFoundError:
- from importlib_metadata import version
+from importlib.metadata import version
def parseArguments():
=====================================
deeptools/estimateScaleFactor.py
=====================================
@@ -6,11 +6,7 @@ import sys
from deeptools.SES_scaleFactor import estimateScaleFactor
from deeptools.parserCommon import numberOfProcessors
-try: # keep python 3.7 support.
- from importlib.metadata import version
-except ModuleNotFoundError:
- from importlib_metadata import version
-
+from importlib.metadata import version
debug = 0
@@ -102,7 +98,7 @@ def main(args=None):
between to samples
"""
- args = parseArguments().parse_args(args)
+ args = parseArguments(args)
if len(args.bamfiles) > 2:
print("SES method to estimate scale factors only works for two samples")
exit(0)
=====================================
deeptools/getFragmentAndReadSize.py
=====================================
@@ -76,7 +76,8 @@ def get_read_and_fragment_length(bamFile, return_lengths=False, blackListFileNam
-------
d : dict
tuple of two dictionaries, one for the fragment length and the other
- for the read length. The dictionaries summarise the mean, median etc. values
+for the read length. The dictionaries summarise the mean, median etc. values
+
"""
bam_handle = bamHandler.openBam(bamFile)
=====================================
deeptools/multiBamSummary.py
=====================================
@@ -9,11 +9,7 @@ import numpy as np
import deeptools.countReadsPerBin as countR
from deeptools import parserCommon
from deeptools.utilities import smartLabels
-try: # keep python 3.7 support.
- from importlib.metadata import version
-except ModuleNotFoundError:
- from importlib_metadata import version
-
+from importlib.metadata import version
old_settings = np.seterr(all='ignore')
=====================================
deeptools/multiBigwigSummary.py
=====================================
@@ -5,15 +5,10 @@ import sys
import argparse
import os.path
import numpy as np
-import multiprocessing
from deeptools import parserCommon
from deeptools.utilities import smartLabels
import deeptools.getScorePerBigWigBin as score_bw
-import deeptools.deepBlue as db
-try: # keep python 3.7 support.
- from importlib.metadata import version
-except ModuleNotFoundError:
- from importlib_metadata import version
+from importlib.metadata import version
old_settings = np.seterr(all='ignore')
@@ -53,17 +48,16 @@ A detailed sub-commands help is available by typing:
metavar='')
parent_parser = parserCommon.getParentArgParse(binSize=False)
- dbParser = parserCommon.deepBlueOptionalArgs()
# bins mode options
subparsers.add_parser(
'bins',
formatter_class=argparse.ArgumentDefaultsHelpFormatter,
- parents=[multiBigwigSummaryArgs(case='bins'),
- parent_parser,
- parserCommon.gtf_options(suppress=True),
- dbParser
- ],
+ parents=[
+ multiBigwigSummaryArgs(case='bins'),
+ parent_parser,
+ parserCommon.gtf_options(suppress=True)
+ ],
help="The average score is based on equally sized bins "
"(10 kilobases by default), which consecutively cover the "
"entire genome. The only exception is the last bin of a chromosome, which "
@@ -79,11 +73,11 @@ A detailed sub-commands help is available by typing:
subparsers.add_parser(
'BED-file',
formatter_class=argparse.ArgumentDefaultsHelpFormatter,
- parents=[multiBigwigSummaryArgs(case='BED-file'),
- parent_parser,
- parserCommon.gtf_options(),
- dbParser
- ],
+ parents=[
+ multiBigwigSummaryArgs(case='BED-file'),
+ parent_parser,
+ parserCommon.gtf_options()
+ ],
help="The user provides a BED file that contains all regions "
"that should be considered for the analysis. A "
"common use is to compare scores (e.g. ChIP-seq scores) between "
@@ -230,33 +224,6 @@ def main(args=None):
"--outRawCounts. The resulting output will NOT be "
"useful with any deepTools program!\n")
- # Preload deepBlue files, which need to then be deleted
- deepBlueFiles = []
- for idx, fname in enumerate(args.bwfiles):
- if db.isDeepBlue(fname):
- deepBlueFiles.append([fname, idx])
- if len(deepBlueFiles) > 0:
- sys.stderr.write("Preloading the following deepBlue files: {}\n".format(",".join([x[0] for x in deepBlueFiles])))
- if 'BED' in args:
- regs = db.makeRegions(args.BED, args)
- else:
- foo = db.deepBlue(deepBlueFiles[0][0], url=args.deepBlueURL, userKey=args.userKey)
- regs = db.makeTiles(foo, args)
- del foo
- for x in deepBlueFiles:
- x.extend([args, regs])
- if len(deepBlueFiles) > 1 and args.numberOfProcessors > 1:
- pool = multiprocessing.Pool(args.numberOfProcessors)
- res = pool.map_async(db.preloadWrapper, deepBlueFiles).get(9999999)
- else:
- res = list(map(db.preloadWrapper, deepBlueFiles))
-
- # substitute the file names with the temp files
- for (ftuple, r) in zip(deepBlueFiles, res):
- args.bwfiles[ftuple[1]] = r
- deepBlueFiles = [[x[0], x[1]] for x in deepBlueFiles]
- del regs
-
num_reads_per_bin = score_bw.getScorePerBin(
args.bwfiles,
args.binSize,
@@ -312,11 +279,3 @@ def main(args=None):
args.outRawCounts.write(fmt.format(*tuple(row)))
"""
f.close()
-
- # Clean up temporary bigWig files, if applicable
- if not args.deepBlueKeepTemp:
- for k, v in deepBlueFiles:
- os.remove(args.bwfiles[v])
- else:
- for k, v in deepBlueFiles:
- print("{} is stored in {}".format(k, args.bwfiles[v]))
=====================================
deeptools/parserCommon.py
=====================================
@@ -1,9 +1,7 @@
import argparse
import os
-try: # keep python 3.7 support.
- from importlib.metadata import version
-except ModuleNotFoundError:
- from importlib_metadata import version
+from importlib.metadata import version
+import multiprocessing
def check_float_0_1(value):
@@ -344,8 +342,12 @@ def getParentArgParse(args=None, binSize=True, blackList=True):
def numberOfProcessors(string):
- import multiprocessing
- availProc = multiprocessing.cpu_count()
+ try:
+ # won't work on macOS or windows
+ # limit threads to what is available (e.g. grid submissions, issue #1199)
+ availProc = len(os.sched_getaffinity(0))
+ except AttributeError:
+ availProc = multiprocessing.cpu_count()
if string == "max/2": # default case
# by default half of the available processors are used
@@ -865,43 +867,6 @@ def heatmapperOptionalArgs(mode=['heatmap', 'profile'][0]):
return parser
-def deepBlueOptionalArgs():
-
- parser = argparse.ArgumentParser(add_help=False)
- dbo = parser.add_argument_group('deepBlue arguments', 'Options used only for remote bedgraph/wig files hosted on deepBlue')
- dbo.add_argument(
- '--deepBlueURL',
- help='For remote files bedgraph/wiggle files hosted on deepBlue, this '
- 'specifies the server URL. The default is '
- '"http://deepblue.mpi-inf.mpg.de/xmlrpc", which should not be '
- 'changed without good reason.',
- default='http://deepblue.mpi-inf.mpg.de/xmlrpc')
- dbo.add_argument(
- '--userKey',
- help='For remote files bedgraph/wiggle files hosted on deepBlue, this '
- 'specifies the user key to use for access. The default is '
- '"anonymous_key", which suffices for public datasets. If you need '
- 'access to a restricted access/private dataset, then request a '
- 'key from deepBlue and specify it here.',
- default='anonymous_key')
- dbo.add_argument(
- '--deepBlueTempDir',
- help='If specified, temporary files from preloading datasets from '
- 'deepBlue will be written here (note, this directory must exist). '
- 'If not specified, where ever temporary files would normally be written '
- 'on your system is used.',
- default=None)
- dbo.add_argument(
- '--deepBlueKeepTemp',
- action='store_true',
- help='If specified, temporary bigWig files from preloading deepBlue '
- 'datasets are not deleted. A message will be printed noting where these '
- 'files are and what sample they correspond to. These can then be used '
- 'if you wish to analyse the same sample with the same regions again.')
-
- return parser
-
-
def requiredLength(minL, maxL):
"""
This is an optional action that can be given to argparse.add_argument(..., nargs='+')
=====================================
deeptools/plotCorrelation.py
=====================================
@@ -10,13 +10,9 @@ matplotlib.rcParams['pdf.fonttype'] = 42
matplotlib.rcParams['svg.fonttype'] = 'none'
from deeptools import cm # noqa: F401
import matplotlib.pyplot as plt
-
+from importlib.metadata import version
from deeptools.correlation import Correlation
from deeptools.parserCommon import writableFile
-try: # keep python 3.7 support.
- from importlib.metadata import version
-except ModuleNotFoundError:
- from importlib_metadata import version
old_settings = np.seterr(all='ignore')
=====================================
deeptools/plotCoverage.py
=====================================
@@ -14,14 +14,10 @@ from deeptools import cm # noqa: F401
import matplotlib.pyplot as plt
import plotly.offline as py
import plotly.graph_objs as go
-
+from importlib.metadata import version
import deeptools.countReadsPerBin as countR
from deeptools import parserCommon
from deeptools.utilities import smartLabels
-try: # keep python 3.7 support.
- from importlib.metadata import version
-except ModuleNotFoundError:
- from importlib_metadata import version
old_settings = np.seterr(all='ignore')
=====================================
deeptools/plotHeatmap.py
=====================================
@@ -62,7 +62,7 @@ def process_args(args=None):
return args
-def prepare_layout(hm_matrix, heatmapsize, showSummaryPlot, showColorbar, perGroup, colorbar_position):
+def prepare_layout(hm_matrix, heatmapsize, showSummaryPlot, showColorbar, perGroup, colorbar_position, fig):
"""
prepare the plot layout
as a grid having as many rows
@@ -113,7 +113,7 @@ def prepare_layout(hm_matrix, heatmapsize, showSummaryPlot, showColorbar, perGro
# numbers to heatmapheigt fractions
height_ratio = np.concatenate([[sumplot_height, spacer_height], height_ratio])
- grids = gridspec.GridSpec(numrows, numcols, height_ratios=height_ratio, width_ratios=width_ratio)
+ grids = gridspec.GridSpec(numrows, numcols, height_ratios=height_ratio, width_ratios=width_ratio, figure=fig)
return grids
@@ -498,9 +498,6 @@ def plotMatrix(hm, outFileName,
else:
colorbar_position = 'side'
- grids = prepare_layout(hm.matrix, (heatmapWidth, heatmapHeight),
- showSummaryPlot, showColorbar, perGroup, colorbar_position)
-
# figsize: w,h tuple in inches
figwidth = heatmapWidth / 2.54
figheight = heatmapHeight / 2.54
@@ -521,9 +518,19 @@ def plotMatrix(hm, outFileName,
else:
total_figwidth += 1 / 2.54
- fig = plt.figure(figsize=(total_figwidth, figheight))
+ fig = plt.figure(figsize=(total_figwidth, figheight), constrained_layout=True)
fig.suptitle(plotTitle, y=1 - (0.06 / figheight))
+ grids = prepare_layout(
+ hm.matrix,
+ (heatmapWidth, heatmapHeight),
+ showSummaryPlot,
+ showColorbar,
+ perGroup,
+ colorbar_position,
+ fig
+ )
+
# color map for the summary plot (profile) on top of the heatmap
cmap_plot = plt.get_cmap('jet')
numgroups = hm.matrix.get_num_groups()
@@ -582,17 +589,6 @@ def plotMatrix(hm, outFileName,
iterNum = hm.matrix.get_num_samples()
iterNum2 = numgroups
ax_list = addProfilePlot(hm, plt, fig, grids, iterNum, iterNum2, perGroup, averageType, plot_type, yAxisLabel, color_list, yMin, yMax, None, None, colorbar_position, label_rotation)
- if len(yMin) > 1 or len(yMax) > 1:
- # replot with a tight layout
- import matplotlib.tight_layout as tl
- specList = tl.get_subplotspec_list(fig.axes, grid_spec=grids)
- renderer = tl.get_renderer(fig)
- kwargs = tl.get_tight_layout_figure(fig, fig.axes, specList, renderer, pad=1.08)
-
- for ax in ax_list:
- fig.delaxes(ax)
-
- ax_list = addProfilePlot(hm, plt, fig, grids, iterNum, iterNum2, perGroup, averageType, plot_type, yAxisLabel, color_list, yMin, yMax, kwargs['wspace'], kwargs['hspace'], colorbar_position, label_rotation)
if legend_location != 'none':
ax_list[-1].legend(loc=legend_location.replace('-', ' '), ncol=1, prop=fontP,
@@ -764,10 +760,10 @@ def plotMatrix(hm, outFileName,
fig.colorbar(img, cax=ax)
if box_around_heatmaps:
- plt.subplots_adjust(wspace=0.10, hspace=0.025, top=0.85, bottom=0, left=0.04, right=0.96)
+ fig.get_layout_engine().set(wspace=0.10, hspace=0.025, rect=(0.04, 0, 0.96, 0.85))
else:
# When no box is plotted the space between heatmaps is reduced
- plt.subplots_adjust(wspace=0.05, hspace=0.01, top=0.85, bottom=0, left=0.04, right=0.96)
+ fig.get_layout_engine().set(wspace=0.05, hspace=0.01, rect=(0.04, 0, 0.96, 0.85))
plt.savefig(outFileName, bbox_inches='tight', pad_inches=0.1, dpi=dpi, format=image_format)
plt.close()
=====================================
deeptools/plotPCA.py
=====================================
@@ -8,13 +8,9 @@ matplotlib.use('Agg')
matplotlib.rcParams['pdf.fonttype'] = 42
matplotlib.rcParams['svg.fonttype'] = 'none'
from deeptools import cm # noqa: F401
-
+from importlib.metadata import version
from deeptools.correlation import Correlation
from deeptools.parserCommon import writableFile
-try: # keep python 3.7 support.
- from importlib.metadata import version
-except ModuleNotFoundError:
- from importlib_metadata import version
def parse_arguments(args=None):
=====================================
galaxy/wrapper/deepTools_macros.xml
=====================================
@@ -1,7 +1,7 @@
<macros>
<token name="@THREADS@">--numberOfProcessors "\${GALAXY_SLOTS:-4}"</token>
- <token name="@TOOL_VERSION@">3.5.4</token>
+ <token name="@TOOL_VERSION@">3.5.5</token>
<token name="@GALAXY_VERSION@">22.05</token>
<xml name="requirements">
<requirements>
=====================================
pyproject.toml
=====================================
@@ -5,7 +5,7 @@ requires = [
[project]
name = "deepTools"
-version = "3.5.4"
+version = "3.5.5"
authors = [
{name="Fidel Ramirez"},
{name="Devon P Ryan"},
@@ -19,7 +19,7 @@ authors = [
{name="Thomas Manke"},
{email="bioinfo-core at ie-freiburg.mpg.de"}
]
-requires-python = ">=3.7"
+requires-python = ">=3.8"
dependencies = [
"numpy >= 1.9.0",
"scipy >= 0.17.0",
@@ -29,8 +29,7 @@ dependencies = [
"pyBigWig >= 0.2.1",
"py2bit >= 0.2.0",
"plotly >= 4.9",
- "deeptoolsintervals >= 0.1.8",
- "importlib-metadata" # python 3.7 support
+ "deeptoolsintervals >= 0.1.8"
]
description = "Useful tools for exploring deep sequencing data."
license = {file = "LICENSE.txt"}
View it on GitLab: https://salsa.debian.org/med-team/python-deeptools/-/commit/bbcd084c63144233ef6003646d2f7176be7fde5f
--
View it on GitLab: https://salsa.debian.org/med-team/python-deeptools/-/commit/bbcd084c63144233ef6003646d2f7176be7fde5f
You're receiving this email because of your account on salsa.debian.org.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://alioth-lists.debian.net/pipermail/debian-med-commit/attachments/20240512/a0c298bb/attachment-0001.htm>
More information about the debian-med-commit
mailing list