[med-svn] [Git][med-team/busco][master] 3 commits: New upstream version 5.3.1

Nilesh Patra (@nilesh) gitlab at salsa.debian.org
Tue Mar 29 18:42:59 BST 2022



Nilesh Patra pushed to branch master at Debian Med / busco


Commits:
973f8605 by Nilesh Patra at 2022-03-29T23:05:40+05:30
New upstream version 5.3.1
- - - - -
ecb6d80c by Nilesh Patra at 2022-03-29T23:05:44+05:30
Update upstream source from tag 'upstream/5.3.1'

Update to upstream version '5.3.1'
with Debian dir e277924ad0fcebd3bdcb29f6c59aae67fc1c716a
- - - - -
6c2f83bf by Nilesh Patra at 2022-03-29T23:07:09+05:30
Upload to unstable

- - - - -


9 changed files:

- CHANGELOG
- debian/changelog
- src/busco/BuscoPlacer.py
- src/busco/BuscoRunner.py
- src/busco/_version.py
- src/busco/analysis/BuscoAnalysis.py
- src/busco/analysis/GenomeAnalysis.py
- test_data/bacteria/expected_log.txt
- test_data/eukaryota/expected_log.txt


Changes:

=====================================
CHANGELOG
=====================================
@@ -1,3 +1,8 @@
+5.3.1
+- Issue #510 fixed
+- Issue #542 fixed
+- Bug fixes
+
 5.3.0
 - Introduce JSON summary file
 - Issue #405 fixed


=====================================
debian/changelog
=====================================
@@ -1,3 +1,9 @@
+busco (5.3.1-1) unstable; urgency=medium
+
+  * New upstream version 5.3.1
+
+ -- Nilesh Patra <nilesh at debian.org>  Tue, 29 Mar 2022 23:05:56 +0530
+
 busco (5.3.0-1) unstable; urgency=medium
 
   * New upstream version 5.3.0


=====================================
src/busco/BuscoPlacer.py
=====================================
@@ -132,7 +132,7 @@ class BuscoPlacer:
                     root_lineage
                 )
             )
-            dataset = (root_lineage.split("_")[0], None, None)
+            dataset = (root_lineage, None, None)
 
         return dataset, placement_file_versions
 

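Note on the BuscoPlacer.py hunk above: `str.split("_")[0]` keeps only the text before the first underscore, so the 5.3.0 code dropped any suffix from the root lineage name, while 5.3.1 passes the name through unchanged. The following is a minimal sketch of that difference. The value `bacteria_odb10` is an assumption chosen for illustration; the diff does not show what `root_lineage` actually holds at this point.

```python
# Hypothetical value; the diff does not show what root_lineage holds here.
root_lineage = "bacteria_odb10"

# 5.3.0 side of the hunk: keep only the part before the first underscore.
old_dataset = (root_lineage.split("_")[0], None, None)

# 5.3.1 side of the hunk: keep the name exactly as given.
new_dataset = (root_lineage, None, None)

print(old_dataset)
print(new_dataset)
```

If `root_lineage` contains no underscore, both expressions give the same tuple.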

=====================================
src/busco/BuscoRunner.py
=====================================
@@ -248,11 +248,13 @@ class BatchRunner:
             except BuscoError as be:
                 if "did not recognize any genes" in be.value:
                     type(self).batch_results.append(
-                        "{}\tNo genes found\n".format(os.path.basename(input_file))
+                        "{}\tNo genes found\t\t\t\t\t\t\t\t\t\n".format(
+                            os.path.basename(input_file)
+                        )
                     )
                 else:
                     type(self).batch_results.append(
-                        "{}\tRun failed; check logs\n".format(
+                        "{}\tRun failed; check logs\t\t\t\t\t\t\t\t\t\n".format(
                             os.path.basename(input_file)
                         )
                     )

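Note on the BuscoRunner.py hunk above: both failure messages now carry the same run of trailing tabs, so a failed input yields a row with as many tab-separated fields as the other failure row. The sketch below only checks that property on the two format strings copied from the hunk. The column layout of the batch summary itself is not shown in this diff, so the idea that the padding matches the width of successful rows is an assumption. The input path and the helper `field_count` are hypothetical.

```python
import os

# Format strings copied from the 5.3.1 side of the hunk above.
NO_GENES_ROW = "{}\tNo genes found\t\t\t\t\t\t\t\t\t\n"
RUN_FAILED_ROW = "{}\tRun failed; check logs\t\t\t\t\t\t\t\t\t\n"


def field_count(row: str) -> int:
    """Return the number of tab-separated fields in one summary row."""
    # Strip only the newline so that trailing empty fields are still counted.
    return len(row.rstrip("\n").split("\t"))


# Hypothetical input path, for illustration only.
input_file = "/tmp/genomes/sample1.fna"
name = os.path.basename(input_file)

no_genes = NO_GENES_ROW.format(name)
run_failed = RUN_FAILED_ROW.format(name)

# Both failure rows should report the same field count.
print(field_count(no_genes), field_count(run_failed))
```

With the 5.3.0 strings, each failure row had only two fields (file name and message).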

=====================================
src/busco/_version.py
=====================================
@@ -6,4 +6,4 @@ Copyright (c) 2016-2022, Evgeny Zdobnov (ez at ezlab.org)
 Licensed under the MIT license. See LICENSE.md file.
 
 """
-__version__ = "5.3.0"
+__version__ = "5.3.1"


=====================================
src/busco/analysis/BuscoAnalysis.py
=====================================
@@ -287,7 +287,10 @@ class BuscoAnalysis(metaclass=ABCMeta):
         return
 
     def reset(self):
-        self.hmmer_runner.reset()
+        if (
+            self.hmmer_runner
+        ):  # If final run has already been run, then the hmmer_runner object in the final runner object will still be set to None
+            self.hmmer_runner.reset()
 
     @property
     @abstractmethod


=====================================
src/busco/analysis/GenomeAnalysis.py
=====================================
@@ -92,7 +92,10 @@ class GenomeAnalysisProkaryotes(GenomeAnalysis):
 
     def reset(self):
         super().reset()
-        self.prodigal_runner.reset()
+        if (
+            self.prodigal_runner
+        ):  # If final run has already been run, then the prodigal_runner object in the final runner object will still be set to None
+            self.prodigal_runner.reset()
 
     @log("***** Run Prodigal on input to predict and extract genes *****", logger)
     def _run_prodigal(self):

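Note on the two `reset()` hunks above (BuscoAnalysis.py and GenomeAnalysis.py): both add the same guard, calling `reset()` on a runner only if the attribute is set, since per the upstream comment it can still be `None`. The sketch below shows the pattern in isolation. `Runner` and `Analysis` are hypothetical stand-ins written for this note, not BUSCO's real classes.

```python
class Runner:
    """Hypothetical stand-in for a tool runner (e.g. hmmer or prodigal)."""

    def __init__(self):
        self.reset_calls = 0

    def reset(self):
        self.reset_calls += 1


class Analysis:
    """Hypothetical stand-in for an analysis object holding an optional runner."""

    def __init__(self, runner=None):
        # The runner may still be None if it was never initialised.
        self.runner = runner

    def reset(self):
        # Same shape as the guard in the hunks: skip the call when unset.
        if self.runner:
            self.runner.reset()


# Without the guard this call would raise AttributeError on None.
Analysis().reset()

analysis = Analysis(Runner())
analysis.reset()
print(analysis.runner.reset_calls)
```

The upstream code relies on plain truthiness; `is not None` would be a stricter spelling of the same check.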

=====================================
test_data/bacteria/expected_log.txt
=====================================
@@ -1,97 +1,112 @@
-2022-02-24 16:54:19 INFO:	***** Start a BUSCO v5.3.0 analysis, current time: 02/24/2022 16:54:19 *****
-2022-02-24 16:54:19 INFO:	Configuring BUSCO with local environment
-2022-02-24 16:54:19 INFO:	Mode is genome
-2022-02-24 16:54:19 INFO:	'Force' option selected; overwriting previous results directory
-2022-02-24 16:54:19 INFO:	Downloading information on latest versions of BUSCO data...
-2022-02-24 16:54:19 WARNING:	Running Auto Lineage Selector as no lineage dataset was specified. This will take a little longer than normal. If you know what lineage dataset you want to use, please specify this in the config file or using the -l (--lineage-dataset) flag in the command line.
-2022-02-24 16:54:19 INFO:	Input file is /home/cegg/berkeley/BUSCO/busco_dev/test_data/bacteria/genome.fna
-2022-02-24 16:54:19 INFO:	No lineage specified. Running lineage auto selector.
+2022-03-24 15:33:58 INFO:	***** Start a BUSCO v5.3.1 analysis, current time: 03/24/2022 15:33:58 *****
+2022-03-24 15:33:58 INFO:	Configuring BUSCO with local environment
+2022-03-24 15:33:58 INFO:	Mode is genome
+2022-03-24 15:33:58 INFO:	Downloading information on latest versions of BUSCO data...
+2022-03-24 15:33:58 WARNING:	Running Auto Lineage Selector as no lineage dataset was specified. This will take a little longer than normal. If you know what lineage dataset you want to use, please specify this in the config file or using the -l (--lineage-dataset) flag in the command line.
+2022-03-24 15:33:58 INFO:	Input file is /home/cegg/berkeley/BUSCO/busco_dev/test_data/bacteria/genome.fna
+2022-03-24 15:33:58 INFO:	No lineage specified. Running lineage auto selector.
 
-2022-02-24 16:54:19 INFO:	***** Starting Auto Select Lineage *****
+2022-03-24 15:33:58 INFO:	***** Starting Auto Select Lineage *****
 	This process runs BUSCO on the generic lineage datasets for the domains archaea, bacteria and eukaryota. Once the optimal domain is selected, BUSCO automatically attempts to find the most appropriate BUSCO dataset to use based on phylogenetic placement.
 	--auto-lineage-euk and --auto-lineage-prok are also available if you know your input assembly is, or is not, an eukaryote. See the user guide for more information.
 	A reminder: Busco evaluations are valid when an appropriate dataset is used, i.e., the dataset belongs to the lineage of the species to test. Because of overlapping markers/spurious matches among domains, busco matches in another domain do not necessarily mean that your genome/proteome contains sequences from this domain. However, a high busco score in multiple domains might help you identify possible contaminations.
-2022-02-24 16:54:19 INFO:	Running BUSCO using lineage dataset archaea_odb10 (prokaryota, 2021-02-23)
-2022-02-24 16:54:19 INFO:	***** Run Prodigal on input to predict and extract genes *****
-2022-02-24 16:54:19 INFO:	Running Prodigal with genetic code 11 in single mode
-2022-02-24 16:54:19 INFO:	Running 1 job(s) on prodigal, starting at 02/24/2022 16:54:19
-2022-02-24 16:54:21 INFO:	[prodigal]	1 of 1 task(s) completed
-2022-02-24 16:54:21 INFO:	Genetic code 11 selected as optimal
-2022-02-24 16:54:21 INFO:	***** Run HMMER on gene sequences *****
-2022-02-24 16:54:21 INFO:	Running 194 job(s) on hmmsearch, starting at 02/24/2022 16:54:21
-2022-02-24 16:54:22 INFO:	[hmmsearch]	20 of 194 task(s) completed
-2022-02-24 16:54:22 INFO:	[hmmsearch]	39 of 194 task(s) completed
-2022-02-24 16:54:22 INFO:	[hmmsearch]	59 of 194 task(s) completed
-2022-02-24 16:54:22 INFO:	[hmmsearch]	78 of 194 task(s) completed
-2022-02-24 16:54:22 INFO:	[hmmsearch]	97 of 194 task(s) completed
-2022-02-24 16:54:23 INFO:	[hmmsearch]	117 of 194 task(s) completed
-2022-02-24 16:54:23 INFO:	[hmmsearch]	136 of 194 task(s) completed
-2022-02-24 16:54:23 INFO:	[hmmsearch]	156 of 194 task(s) completed
-2022-02-24 16:54:23 INFO:	[hmmsearch]	175 of 194 task(s) completed
-2022-02-24 16:54:23 INFO:	[hmmsearch]	194 of 194 task(s) completed
-2022-02-24 16:54:23 INFO:	Results:	C:5.2%[S:5.2%,D:0.0%],F:1.5%,M:93.3%,n:194	   
+2022-03-24 15:33:58 INFO:	Downloading file 'https://busco-data.ezlab.org/v5/data/lineages/archaea_odb10.2021-02-23.tar.gz'
+2022-03-24 15:33:58 INFO:	Decompressing file '/home/cegg/berkeley/BUSCO/busco_dev/busco_downloads/lineages/archaea_odb10.tar.gz'
+2022-03-24 15:33:59 INFO:	Running BUSCO using lineage dataset archaea_odb10 (prokaryota, 2021-02-23)
+2022-03-24 15:33:59 INFO:	***** Run Prodigal on input to predict and extract genes *****
+2022-03-24 15:33:59 INFO:	Running Prodigal with genetic code 11 in single mode
+2022-03-24 15:33:59 INFO:	Running 1 job(s) on prodigal, starting at 03/24/2022 15:33:59
+2022-03-24 15:34:00 INFO:	[prodigal]	1 of 1 task(s) completed
+2022-03-24 15:34:00 INFO:	Genetic code 11 selected as optimal
+2022-03-24 15:34:00 INFO:	***** Run HMMER on gene sequences *****
+2022-03-24 15:34:00 INFO:	Running 194 job(s) on hmmsearch, starting at 03/24/2022 15:34:00
+2022-03-24 15:34:01 INFO:	[hmmsearch]	20 of 194 task(s) completed
+2022-03-24 15:34:01 INFO:	[hmmsearch]	39 of 194 task(s) completed
+2022-03-24 15:34:02 INFO:	[hmmsearch]	59 of 194 task(s) completed
+2022-03-24 15:34:02 INFO:	[hmmsearch]	78 of 194 task(s) completed
+2022-03-24 15:34:02 INFO:	[hmmsearch]	97 of 194 task(s) completed
+2022-03-24 15:34:02 INFO:	[hmmsearch]	117 of 194 task(s) completed
+2022-03-24 15:34:02 INFO:	[hmmsearch]	156 of 194 task(s) completed
+2022-03-24 15:34:02 INFO:	[hmmsearch]	175 of 194 task(s) completed
+2022-03-24 15:34:02 INFO:	[hmmsearch]	194 of 194 task(s) completed
+2022-03-24 15:34:02 INFO:	Results:	C:5.2%[S:5.2%,D:0.0%],F:1.5%,M:93.3%,n:194	   
 
-2022-02-24 16:54:23 INFO:	Running BUSCO using lineage dataset bacteria_odb10 (prokaryota, 2020-03-06)
-2022-02-24 16:54:23 INFO:	***** Run Prodigal on input to predict and extract genes *****
-2022-02-24 16:54:23 INFO:	Genetic code 11 selected as optimal
-2022-02-24 16:54:23 INFO:	***** Run HMMER on gene sequences *****
-2022-02-24 16:54:23 INFO:	Running 124 job(s) on hmmsearch, starting at 02/24/2022 16:54:23
-2022-02-24 16:54:24 INFO:	[hmmsearch]	13 of 124 task(s) completed
-2022-02-24 16:54:25 INFO:	[hmmsearch]	25 of 124 task(s) completed
-2022-02-24 16:54:25 INFO:	[hmmsearch]	38 of 124 task(s) completed
-2022-02-24 16:54:25 INFO:	[hmmsearch]	50 of 124 task(s) completed
-2022-02-24 16:54:25 INFO:	[hmmsearch]	63 of 124 task(s) completed
-2022-02-24 16:54:25 INFO:	[hmmsearch]	75 of 124 task(s) completed
-2022-02-24 16:54:25 INFO:	[hmmsearch]	87 of 124 task(s) completed
-2022-02-24 16:54:25 INFO:	[hmmsearch]	100 of 124 task(s) completed
-2022-02-24 16:54:25 INFO:	[hmmsearch]	112 of 124 task(s) completed
-2022-02-24 16:54:25 INFO:	[hmmsearch]	124 of 124 task(s) completed
-2022-02-24 16:54:25 INFO:	Results:	C:21.0%[S:21.0%,D:0.0%],F:0.8%,M:78.2%,n:124	   
+2022-03-24 15:34:02 INFO:	Downloading file 'https://busco-data.ezlab.org/v5/data/lineages/bacteria_odb10.2020-03-06.tar.gz'
+2022-03-24 15:34:03 INFO:	Decompressing file '/home/cegg/berkeley/BUSCO/busco_dev/busco_downloads/lineages/bacteria_odb10.tar.gz'
+2022-03-24 15:34:03 INFO:	Running BUSCO using lineage dataset bacteria_odb10 (prokaryota, 2020-03-06)
+2022-03-24 15:34:03 INFO:	***** Run Prodigal on input to predict and extract genes *****
+2022-03-24 15:34:03 INFO:	Genetic code 11 selected as optimal
+2022-03-24 15:34:03 INFO:	***** Run HMMER on gene sequences *****
+2022-03-24 15:34:03 INFO:	Running 124 job(s) on hmmsearch, starting at 03/24/2022 15:34:03
+2022-03-24 15:34:04 INFO:	[hmmsearch]	13 of 124 task(s) completed
+2022-03-24 15:34:04 INFO:	[hmmsearch]	25 of 124 task(s) completed
+2022-03-24 15:34:04 INFO:	[hmmsearch]	38 of 124 task(s) completed
+2022-03-24 15:34:04 INFO:	[hmmsearch]	50 of 124 task(s) completed
+2022-03-24 15:34:05 INFO:	[hmmsearch]	87 of 124 task(s) completed
+2022-03-24 15:34:05 INFO:	[hmmsearch]	100 of 124 task(s) completed
+2022-03-24 15:34:05 INFO:	[hmmsearch]	112 of 124 task(s) completed
+2022-03-24 15:34:05 INFO:	[hmmsearch]	124 of 124 task(s) completed
+2022-03-24 15:34:05 INFO:	Results:	C:21.0%[S:21.0%,D:0.0%],F:0.8%,M:78.2%,n:124	   
 
-2022-02-24 16:54:25 INFO:	Running BUSCO using lineage dataset eukaryota_odb10 (eukaryota, 2020-09-10)
-2022-02-24 16:54:25 INFO:	Running 1 job(s) on metaeuk, starting at 02/24/2022 16:54:25
-2022-02-24 16:54:49 INFO:	[metaeuk]	1 of 1 task(s) completed
-2022-02-24 16:54:50 INFO:	***** Run HMMER on gene sequences *****
-2022-02-24 16:54:50 INFO:	Running 255 job(s) on hmmsearch, starting at 02/24/2022 16:54:50
-2022-02-24 16:54:51 INFO:	[hmmsearch]	26 of 255 task(s) completed
-2022-02-24 16:54:51 INFO:	[hmmsearch]	51 of 255 task(s) completed
-2022-02-24 16:54:51 INFO:	[hmmsearch]	77 of 255 task(s) completed
-2022-02-24 16:54:51 INFO:	[hmmsearch]	128 of 255 task(s) completed
-2022-02-24 16:54:51 INFO:	[hmmsearch]	153 of 255 task(s) completed
-2022-02-24 16:54:51 INFO:	[hmmsearch]	179 of 255 task(s) completed
-2022-02-24 16:54:52 INFO:	[hmmsearch]	204 of 255 task(s) completed
-2022-02-24 16:54:52 INFO:	[hmmsearch]	230 of 255 task(s) completed
-2022-02-24 16:54:52 INFO:	[hmmsearch]	255 of 255 task(s) completed
-2022-02-24 16:54:52 INFO:	Results:	C:1.2%[S:1.2%,D:0.0%],F:0.0%,M:98.8%,n:255	   
+2022-03-24 15:34:05 INFO:	Downloading file 'https://busco-data.ezlab.org/v5/data/lineages/eukaryota_odb10.2020-09-10.tar.gz'
+2022-03-24 15:34:07 INFO:	Decompressing file '/home/cegg/berkeley/BUSCO/busco_dev/busco_downloads/lineages/eukaryota_odb10.tar.gz'
+2022-03-24 15:34:09 INFO:	Running BUSCO using lineage dataset eukaryota_odb10 (eukaryota, 2020-09-10)
+2022-03-24 15:34:11 INFO:	Running 1 job(s) on metaeuk, starting at 03/24/2022 15:34:11
+2022-03-24 15:34:34 INFO:	[metaeuk]	1 of 1 task(s) completed
+2022-03-24 15:34:34 INFO:	***** Run HMMER on gene sequences *****
+2022-03-24 15:34:34 INFO:	Running 255 job(s) on hmmsearch, starting at 03/24/2022 15:34:34
+2022-03-24 15:34:35 INFO:	[hmmsearch]	26 of 255 task(s) completed
+2022-03-24 15:34:35 INFO:	[hmmsearch]	51 of 255 task(s) completed
+2022-03-24 15:34:35 INFO:	[hmmsearch]	77 of 255 task(s) completed
+2022-03-24 15:34:35 INFO:	[hmmsearch]	102 of 255 task(s) completed
+2022-03-24 15:34:36 INFO:	[hmmsearch]	128 of 255 task(s) completed
+2022-03-24 15:34:36 INFO:	[hmmsearch]	153 of 255 task(s) completed
+2022-03-24 15:34:36 INFO:	[hmmsearch]	179 of 255 task(s) completed
+2022-03-24 15:34:36 INFO:	[hmmsearch]	204 of 255 task(s) completed
+2022-03-24 15:34:36 INFO:	[hmmsearch]	230 of 255 task(s) completed
+2022-03-24 15:34:36 INFO:	[hmmsearch]	255 of 255 task(s) completed
+2022-03-24 15:34:36 INFO:	Results:	C:1.2%[S:1.2%,D:0.0%],F:0.0%,M:98.8%,n:255	   
 
-2022-02-24 16:54:52 INFO:	Extracting missing and fragmented buscos from the file refseq_db.faa...
-2022-02-24 16:55:08 INFO:	Running 1 job(s) on metaeuk, starting at 02/24/2022 16:55:08
-2022-02-24 16:55:41 INFO:	[metaeuk]	1 of 1 task(s) completed
-2022-02-24 16:55:41 INFO:	***** Run HMMER on gene sequences *****
-2022-02-24 16:55:41 INFO:	Running 252 job(s) on hmmsearch, starting at 02/24/2022 16:55:41
-2022-02-24 16:55:42 INFO:	[hmmsearch]	26 of 252 task(s) completed
-2022-02-24 16:55:43 INFO:	[hmmsearch]	51 of 252 task(s) completed
-2022-02-24 16:55:43 INFO:	[hmmsearch]	76 of 252 task(s) completed
-2022-02-24 16:55:43 INFO:	[hmmsearch]	101 of 252 task(s) completed
-2022-02-24 16:55:43 INFO:	[hmmsearch]	126 of 252 task(s) completed
-2022-02-24 16:55:43 INFO:	[hmmsearch]	152 of 252 task(s) completed
-2022-02-24 16:55:43 INFO:	[hmmsearch]	177 of 252 task(s) completed
-2022-02-24 16:55:45 INFO:	[hmmsearch]	202 of 252 task(s) completed
-2022-02-24 16:55:45 INFO:	[hmmsearch]	227 of 252 task(s) completed
-2022-02-24 16:55:45 INFO:	[hmmsearch]	252 of 252 task(s) completed
-2022-02-24 16:55:45 INFO:	Validating exons and removing overlapping matches
-2022-02-24 16:55:45 INFO:	Results:	C:1.2%[S:1.2%,D:0.0%],F:0.0%,M:98.8%,n:255	   
+2022-03-24 15:34:36 INFO:	Extracting missing and fragmented buscos from the file refseq_db.faa...
+2022-03-24 15:34:53 INFO:	Running 1 job(s) on metaeuk, starting at 03/24/2022 15:34:53
+2022-03-24 15:35:23 INFO:	[metaeuk]	1 of 1 task(s) completed
+2022-03-24 15:35:23 INFO:	***** Run HMMER on gene sequences *****
+2022-03-24 15:35:23 INFO:	Running 252 job(s) on hmmsearch, starting at 03/24/2022 15:35:23
+2022-03-24 15:35:24 INFO:	[hmmsearch]	26 of 252 task(s) completed
+2022-03-24 15:35:24 INFO:	[hmmsearch]	51 of 252 task(s) completed
+2022-03-24 15:35:25 INFO:	[hmmsearch]	76 of 252 task(s) completed
+2022-03-24 15:35:25 INFO:	[hmmsearch]	101 of 252 task(s) completed
+2022-03-24 15:35:25 INFO:	[hmmsearch]	126 of 252 task(s) completed
+2022-03-24 15:35:25 INFO:	[hmmsearch]	152 of 252 task(s) completed
+2022-03-24 15:35:25 INFO:	[hmmsearch]	177 of 252 task(s) completed
+2022-03-24 15:35:25 INFO:	[hmmsearch]	202 of 252 task(s) completed
+2022-03-24 15:35:25 INFO:	[hmmsearch]	227 of 252 task(s) completed
+2022-03-24 15:35:25 INFO:	[hmmsearch]	252 of 252 task(s) completed
+2022-03-24 15:35:26 INFO:	Validating exons and removing overlapping matches
+2022-03-24 15:35:26 INFO:	Results:	C:1.2%[S:1.2%,D:0.0%],F:0.0%,M:98.8%,n:255	   
 
-2022-02-24 16:55:46 INFO:	bacteria_odb10 selected
+2022-03-24 15:35:26 INFO:	bacteria_odb10 selected
 
-2022-02-24 16:55:46 INFO:	***** Searching tree for chosen lineage to find best taxonomic match *****
+2022-03-24 15:35:26 INFO:	***** Searching tree for chosen lineage to find best taxonomic match *****
 
-2022-02-24 16:55:46 INFO:	Extract markers...
-2022-02-24 16:55:46 INFO:	Place the markers on the reference tree...
-2022-02-24 16:55:46 INFO:	Running 1 job(s) on sepp, starting at 02/24/2022 16:55:46
-2022-02-24 16:56:35 INFO:	[sepp]	1 of 1 task(s) completed
-2022-02-24 16:56:36 INFO:	Not enough markers were placed on the tree (11). Root lineage bacteria is kept
-2022-02-24 16:56:36 INFO:	
+2022-03-24 15:35:26 INFO:	Extract markers...
+2022-03-24 15:35:26 INFO:	Downloading file 'https://busco-data.ezlab.org/v5/data/placement_files/list_of_reference_markers.bacteria_odb10.2019-12-16.txt.tar.gz'
+2022-03-24 15:35:26 INFO:	Decompressing file '/home/cegg/berkeley/BUSCO/busco_dev/busco_downloads/placement_files/list_of_reference_markers.bacteria_odb10.2019-12-16.txt.tar.gz'
+2022-03-24 15:35:26 INFO:	Downloading file 'https://busco-data.ezlab.org/v5/data/placement_files/tree.bacteria_odb10.2019-12-16.nwk.tar.gz'
+2022-03-24 15:35:26 INFO:	Decompressing file '/home/cegg/berkeley/BUSCO/busco_dev/busco_downloads/placement_files/tree.bacteria_odb10.2019-12-16.nwk.tar.gz'
+2022-03-24 15:35:26 INFO:	Downloading file 'https://busco-data.ezlab.org/v5/data/placement_files/tree_metadata.bacteria_odb10.2019-12-16.txt.tar.gz'
+2022-03-24 15:35:26 INFO:	Decompressing file '/home/cegg/berkeley/BUSCO/busco_dev/busco_downloads/placement_files/tree_metadata.bacteria_odb10.2019-12-16.txt.tar.gz'
+2022-03-24 15:35:26 INFO:	Downloading file 'https://busco-data.ezlab.org/v5/data/placement_files/supermatrix.aln.bacteria_odb10.2019-12-16.faa.tar.gz'
+2022-03-24 15:35:26 INFO:	Decompressing file '/home/cegg/berkeley/BUSCO/busco_dev/busco_downloads/placement_files/supermatrix.aln.bacteria_odb10.2019-12-16.faa.tar.gz'
+2022-03-24 15:35:27 INFO:	Downloading file 'https://busco-data.ezlab.org/v5/data/placement_files/mapping_taxids-busco_dataset_name.bacteria_odb10.2019-12-16.txt.tar.gz'
+2022-03-24 15:35:27 INFO:	Decompressing file '/home/cegg/berkeley/BUSCO/busco_dev/busco_downloads/placement_files/mapping_taxids-busco_dataset_name.bacteria_odb10.2019-12-16.txt.tar.gz'
+2022-03-24 15:35:27 INFO:	Downloading file 'https://busco-data.ezlab.org/v5/data/placement_files/mapping_taxid-lineage.bacteria_odb10.2019-12-16.txt.tar.gz'
+2022-03-24 15:35:27 INFO:	Decompressing file '/home/cegg/berkeley/BUSCO/busco_dev/busco_downloads/placement_files/mapping_taxid-lineage.bacteria_odb10.2019-12-16.txt.tar.gz'
+2022-03-24 15:35:27 INFO:	Place the markers on the reference tree...
+2022-03-24 15:35:27 INFO:	Running 1 job(s) on sepp, starting at 03/24/2022 15:35:27
+2022-03-24 15:36:15 INFO:	[sepp]	1 of 1 task(s) completed
+2022-03-24 15:36:16 INFO:	Not enough markers were placed on the tree (11). Root lineage bacteria is kept
+2022-03-24 15:36:16 INFO:	
 
 	--------------------------------------------------
 	|Results from dataset bacteria_odb10              |
@@ -104,12 +119,12 @@
 	|97	Missing BUSCOs (M)                        |
 	|124	Total BUSCO groups searched               |
 	--------------------------------------------------
-2022-02-24 16:56:36 INFO:	BUSCO analysis done with WARNING(s). Total running time: 137 seconds
+2022-03-24 15:36:16 INFO:	BUSCO analysis done with WARNING(s). Total running time: 138 seconds
 
 ***** Summary of warnings: *****
-2022-02-24 16:54:19 WARNING:busco.run_BUSCO	Running Auto Lineage Selector as no lineage dataset was specified. This will take a little longer than normal. If you know what lineage dataset you want to use, please specify this in the config file or using the -l (--lineage-dataset) flag in the command line.
+2022-03-24 15:33:58 WARNING:busco.run_BUSCO	Running Auto Lineage Selector as no lineage dataset was specified. This will take a little longer than normal. If you know what lineage dataset you want to use, please specify this in the config file or using the -l (--lineage-dataset) flag in the command line.
 
-2022-02-24 16:56:36 INFO:	Results written in /home/cegg/berkeley/BUSCO/busco_dev/test_bacteria
-2022-02-24 16:56:36 INFO:	For assistance with interpreting the results, please consult the userguide: https://busco.ezlab.org/busco_userguide.html
+2022-03-24 15:36:16 INFO:	Results written in /home/cegg/berkeley/BUSCO/busco_dev/test_bacteria
+2022-03-24 15:36:16 INFO:	For assistance with interpreting the results, please consult the userguide: https://busco.ezlab.org/busco_userguide.html
 
-2022-02-24 16:56:36 INFO:	Visit this page https://gitlab.com/ezlab/busco#how-to-cite-busco to see how to cite BUSCO
+2022-03-24 15:36:16 INFO:	Visit this page https://gitlab.com/ezlab/busco#how-to-cite-busco to see how to cite BUSCO


=====================================
test_data/eukaryota/expected_log.txt
=====================================
@@ -1,133 +1,144 @@
-2022-02-24 16:56:37 INFO:	***** Start a BUSCO v5.3.0 analysis, current time: 02/24/2022 16:56:36 *****
-2022-02-24 16:56:37 INFO:	Configuring BUSCO with local environment
-2022-02-24 16:56:37 INFO:	Mode is genome
-2022-02-24 16:56:37 INFO:	'Force' option selected; overwriting previous results directory
-2022-02-24 16:56:37 INFO:	Downloading information on latest versions of BUSCO data...
-2022-02-24 16:56:37 WARNING:	Running Auto Lineage Selector as no lineage dataset was specified. This will take a little longer than normal. If you know what lineage dataset you want to use, please specify this in the config file or using the -l (--lineage-dataset) flag in the command line.
-2022-02-24 16:56:37 INFO:	Input file is /home/cegg/berkeley/BUSCO/busco_dev/test_data/eukaryota/genome.fna
-2022-02-24 16:56:37 INFO:	No lineage specified. Running lineage auto selector.
+2022-03-24 15:36:16 INFO:	***** Start a BUSCO v5.3.1 analysis, current time: 03/24/2022 15:36:16 *****
+2022-03-24 15:36:16 INFO:	Configuring BUSCO with local environment
+2022-03-24 15:36:16 INFO:	Mode is genome
+2022-03-24 15:36:16 INFO:	Downloading information on latest versions of BUSCO data...
+2022-03-24 15:36:17 WARNING:	Running Auto Lineage Selector as no lineage dataset was specified. This will take a little longer than normal. If you know what lineage dataset you want to use, please specify this in the config file or using the -l (--lineage-dataset) flag in the command line.
+2022-03-24 15:36:17 INFO:	Input file is /home/cegg/berkeley/BUSCO/busco_dev/test_data/eukaryota/genome.fna
+2022-03-24 15:36:17 INFO:	No lineage specified. Running lineage auto selector.
 
-2022-02-24 16:56:37 INFO:	***** Starting Auto Select Lineage *****
+2022-03-24 15:36:17 INFO:	***** Starting Auto Select Lineage *****
 	This process runs BUSCO on the generic lineage datasets for the domains archaea, bacteria and eukaryota. Once the optimal domain is selected, BUSCO automatically attempts to find the most appropriate BUSCO dataset to use based on phylogenetic placement.
 	--auto-lineage-euk and --auto-lineage-prok are also available if you know your input assembly is, or is not, an eukaryote. See the user guide for more information.
 	A reminder: Busco evaluations are valid when an appropriate dataset is used, i.e., the dataset belongs to the lineage of the species to test. Because of overlapping markers/spurious matches among domains, busco matches in another domain do not necessarily mean that your genome/proteome contains sequences from this domain. However, a high busco score in multiple domains might help you identify possible contaminations.
-2022-02-24 16:56:37 INFO:	Running BUSCO using lineage dataset archaea_odb10 (prokaryota, 2021-02-23)
-2022-02-24 16:56:37 INFO:	***** Run Prodigal on input to predict and extract genes *****
-2022-02-24 16:56:37 INFO:	Running Prodigal with genetic code 11 in single mode
-2022-02-24 16:56:37 INFO:	Running 1 job(s) on prodigal, starting at 02/24/2022 16:56:37
-2022-02-24 16:56:38 INFO:	[prodigal]	1 of 1 task(s) completed
-2022-02-24 16:56:38 INFO:	Genetic code 11 selected as optimal
-2022-02-24 16:56:38 INFO:	***** Run HMMER on gene sequences *****
-2022-02-24 16:56:38 INFO:	Running 194 job(s) on hmmsearch, starting at 02/24/2022 16:56:38
-2022-02-24 16:56:39 INFO:	[hmmsearch]	20 of 194 task(s) completed
-2022-02-24 16:56:39 INFO:	[hmmsearch]	39 of 194 task(s) completed
-2022-02-24 16:56:39 INFO:	[hmmsearch]	59 of 194 task(s) completed
-2022-02-24 16:56:39 INFO:	[hmmsearch]	78 of 194 task(s) completed
-2022-02-24 16:56:39 INFO:	[hmmsearch]	97 of 194 task(s) completed
-2022-02-24 16:56:39 INFO:	[hmmsearch]	117 of 194 task(s) completed
-2022-02-24 16:56:40 INFO:	[hmmsearch]	136 of 194 task(s) completed
-2022-02-24 16:56:40 INFO:	[hmmsearch]	156 of 194 task(s) completed
-2022-02-24 16:56:40 INFO:	[hmmsearch]	175 of 194 task(s) completed
-2022-02-24 16:56:40 INFO:	[hmmsearch]	194 of 194 task(s) completed
-2022-02-24 16:56:40 INFO:	Results:	C:1.0%[S:1.0%,D:0.0%],F:0.5%,M:98.5%,n:194	   
+2022-03-24 15:36:17 INFO:	Running BUSCO using lineage dataset archaea_odb10 (prokaryota, 2021-02-23)
+2022-03-24 15:36:17 INFO:	***** Run Prodigal on input to predict and extract genes *****
+2022-03-24 15:36:17 INFO:	Running Prodigal with genetic code 11 in single mode
+2022-03-24 15:36:17 INFO:	Running 1 job(s) on prodigal, starting at 03/24/2022 15:36:17
+2022-03-24 15:36:17 INFO:	[prodigal]	1 of 1 task(s) completed
+2022-03-24 15:36:17 INFO:	Genetic code 11 selected as optimal
+2022-03-24 15:36:17 INFO:	***** Run HMMER on gene sequences *****
+2022-03-24 15:36:17 INFO:	Running 194 job(s) on hmmsearch, starting at 03/24/2022 15:36:17
+2022-03-24 15:36:18 INFO:	[hmmsearch]	20 of 194 task(s) completed
+2022-03-24 15:36:18 INFO:	[hmmsearch]	39 of 194 task(s) completed
+2022-03-24 15:36:18 INFO:	[hmmsearch]	59 of 194 task(s) completed
+2022-03-24 15:36:18 INFO:	[hmmsearch]	78 of 194 task(s) completed
+2022-03-24 15:36:19 INFO:	[hmmsearch]	97 of 194 task(s) completed
+2022-03-24 15:36:19 INFO:	[hmmsearch]	117 of 194 task(s) completed
+2022-03-24 15:36:19 INFO:	[hmmsearch]	136 of 194 task(s) completed
+2022-03-24 15:36:19 INFO:	[hmmsearch]	156 of 194 task(s) completed
+2022-03-24 15:36:19 INFO:	[hmmsearch]	175 of 194 task(s) completed
+2022-03-24 15:36:19 INFO:	[hmmsearch]	194 of 194 task(s) completed
+2022-03-24 15:36:19 INFO:	Results:	C:1.0%[S:1.0%,D:0.0%],F:0.5%,M:98.5%,n:194	   
 
-2022-02-24 16:56:40 INFO:	Running BUSCO using lineage dataset bacteria_odb10 (prokaryota, 2020-03-06)
-2022-02-24 16:56:40 INFO:	***** Run Prodigal on input to predict and extract genes *****
-2022-02-24 16:56:40 INFO:	Genetic code 11 selected as optimal
-2022-02-24 16:56:40 INFO:	***** Run HMMER on gene sequences *****
-2022-02-24 16:56:40 INFO:	Running 124 job(s) on hmmsearch, starting at 02/24/2022 16:56:40
-2022-02-24 16:56:41 INFO:	[hmmsearch]	13 of 124 task(s) completed
-2022-02-24 16:56:41 INFO:	[hmmsearch]	25 of 124 task(s) completed
-2022-02-24 16:56:41 INFO:	[hmmsearch]	38 of 124 task(s) completed
-2022-02-24 16:56:41 INFO:	[hmmsearch]	50 of 124 task(s) completed
-2022-02-24 16:56:41 INFO:	[hmmsearch]	63 of 124 task(s) completed
-2022-02-24 16:56:41 INFO:	[hmmsearch]	75 of 124 task(s) completed
-2022-02-24 16:56:41 INFO:	[hmmsearch]	87 of 124 task(s) completed
-2022-02-24 16:56:41 INFO:	[hmmsearch]	87 of 124 task(s) completed
-2022-02-24 16:56:41 INFO:	[hmmsearch]	100 of 124 task(s) completed
-2022-02-24 16:56:42 INFO:	[hmmsearch]	112 of 124 task(s) completed
-2022-02-24 16:56:42 INFO:	[hmmsearch]	124 of 124 task(s) completed
-2022-02-24 16:56:42 WARNING:	BUSCO did not find any match. Make sure to check the log files if this is unexpected.
-2022-02-24 16:56:42 INFO:	Results:	C:0.0%[S:0.0%,D:0.0%],F:0.0%,M:100.0%,n:124	   
+2022-03-24 15:36:19 INFO:	Running BUSCO using lineage dataset bacteria_odb10 (prokaryota, 2020-03-06)
+2022-03-24 15:36:19 INFO:	***** Run Prodigal on input to predict and extract genes *****
+2022-03-24 15:36:19 INFO:	Genetic code 11 selected as optimal
+2022-03-24 15:36:19 INFO:	***** Run HMMER on gene sequences *****
+2022-03-24 15:36:19 INFO:	Running 124 job(s) on hmmsearch, starting at 03/24/2022 15:36:19
+2022-03-24 15:36:20 INFO:	[hmmsearch]	13 of 124 task(s) completed
+2022-03-24 15:36:20 INFO:	[hmmsearch]	25 of 124 task(s) completed
+2022-03-24 15:36:20 INFO:	[hmmsearch]	38 of 124 task(s) completed
+2022-03-24 15:36:20 INFO:	[hmmsearch]	50 of 124 task(s) completed
+2022-03-24 15:36:20 INFO:	[hmmsearch]	63 of 124 task(s) completed
+2022-03-24 15:36:20 INFO:	[hmmsearch]	75 of 124 task(s) completed
+2022-03-24 15:36:20 INFO:	[hmmsearch]	87 of 124 task(s) completed
+2022-03-24 15:36:20 INFO:	[hmmsearch]	100 of 124 task(s) completed
+2022-03-24 15:36:20 INFO:	[hmmsearch]	112 of 124 task(s) completed
+2022-03-24 15:36:20 INFO:	[hmmsearch]	124 of 124 task(s) completed
+2022-03-24 15:36:21 WARNING:	BUSCO did not find any match. Make sure to check the log files if this is unexpected.
+2022-03-24 15:36:21 INFO:	Results:	C:0.0%[S:0.0%,D:0.0%],F:0.0%,M:100.0%,n:124	   
 
-2022-02-24 16:56:42 INFO:	Running BUSCO using lineage dataset eukaryota_odb10 (eukaryota, 2020-09-10)
-2022-02-24 16:56:42 INFO:	Running 1 job(s) on metaeuk, starting at 02/24/2022 16:56:42
-2022-02-24 16:57:06 INFO:	[metaeuk]	1 of 1 task(s) completed
-2022-02-24 16:57:06 INFO:	***** Run HMMER on gene sequences *****
-2022-02-24 16:57:06 INFO:	Running 255 job(s) on hmmsearch, starting at 02/24/2022 16:57:06
-2022-02-24 16:57:07 INFO:	[hmmsearch]	26 of 255 task(s) completed
-2022-02-24 16:57:07 INFO:	[hmmsearch]	51 of 255 task(s) completed
-2022-02-24 16:57:07 INFO:	[hmmsearch]	77 of 255 task(s) completed
-2022-02-24 16:57:07 INFO:	[hmmsearch]	102 of 255 task(s) completed
-2022-02-24 16:57:07 INFO:	[hmmsearch]	128 of 255 task(s) completed
-2022-02-24 16:57:08 INFO:	[hmmsearch]	153 of 255 task(s) completed
-2022-02-24 16:57:08 INFO:	[hmmsearch]	179 of 255 task(s) completed
-2022-02-24 16:57:08 INFO:	[hmmsearch]	204 of 255 task(s) completed
-2022-02-24 16:57:08 INFO:	[hmmsearch]	230 of 255 task(s) completed
-2022-02-24 16:57:08 INFO:	[hmmsearch]	255 of 255 task(s) completed
-2022-02-24 16:57:09 INFO:	Results:	C:19.2%[S:19.2%,D:0.0%],F:0.8%,M:80.0%,n:255	   
+2022-03-24 15:36:21 INFO:	Running BUSCO using lineage dataset eukaryota_odb10 (eukaryota, 2020-09-10)
+2022-03-24 15:36:21 INFO:	Running 1 job(s) on metaeuk, starting at 03/24/2022 15:36:21
+2022-03-24 15:36:43 INFO:	[metaeuk]	1 of 1 task(s) completed
+2022-03-24 15:36:43 INFO:	***** Run HMMER on gene sequences *****
+2022-03-24 15:36:43 INFO:	Running 255 job(s) on hmmsearch, starting at 03/24/2022 15:36:43
+2022-03-24 15:36:44 INFO:	[hmmsearch]	26 of 255 task(s) completed
+2022-03-24 15:36:45 INFO:	[hmmsearch]	51 of 255 task(s) completed
+2022-03-24 15:36:45 INFO:	[hmmsearch]	77 of 255 task(s) completed
+2022-03-24 15:36:45 INFO:	[hmmsearch]	102 of 255 task(s) completed
+2022-03-24 15:36:45 INFO:	[hmmsearch]	128 of 255 task(s) completed
+2022-03-24 15:36:45 INFO:	[hmmsearch]	153 of 255 task(s) completed
+2022-03-24 15:36:45 INFO:	[hmmsearch]	179 of 255 task(s) completed
+2022-03-24 15:36:45 INFO:	[hmmsearch]	204 of 255 task(s) completed
+2022-03-24 15:36:45 INFO:	[hmmsearch]	230 of 255 task(s) completed
+2022-03-24 15:36:46 INFO:	[hmmsearch]	255 of 255 task(s) completed
+2022-03-24 15:36:46 INFO:	Results:	C:19.2%[S:19.2%,D:0.0%],F:0.8%,M:80.0%,n:255	   
 
-2022-02-24 16:57:09 INFO:	Extracting missing and fragmented buscos from the file refseq_db.faa...
-2022-02-24 16:57:24 INFO:	Running 1 job(s) on metaeuk, starting at 02/24/2022 16:57:24
-2022-02-24 16:57:42 INFO:	[metaeuk]	1 of 1 task(s) completed
-2022-02-24 16:57:42 INFO:	***** Run HMMER on gene sequences *****
-2022-02-24 16:57:42 INFO:	Running 206 job(s) on hmmsearch, starting at 02/24/2022 16:57:42
-2022-02-24 16:57:43 INFO:	[hmmsearch]	21 of 206 task(s) completed
-2022-02-24 16:57:43 INFO:	[hmmsearch]	42 of 206 task(s) completed
-2022-02-24 16:57:43 INFO:	[hmmsearch]	62 of 206 task(s) completed
-2022-02-24 16:57:43 INFO:	[hmmsearch]	83 of 206 task(s) completed
-2022-02-24 16:57:43 INFO:	[hmmsearch]	104 of 206 task(s) completed
-2022-02-24 16:57:43 INFO:	[hmmsearch]	124 of 206 task(s) completed
-2022-02-24 16:57:43 INFO:	[hmmsearch]	145 of 206 task(s) completed
-2022-02-24 16:57:43 INFO:	[hmmsearch]	165 of 206 task(s) completed
-2022-02-24 16:57:43 INFO:	[hmmsearch]	165 of 206 task(s) completed
-2022-02-24 16:57:43 INFO:	[hmmsearch]	186 of 206 task(s) completed
-2022-02-24 16:57:43 INFO:	[hmmsearch]	206 of 206 task(s) completed
-2022-02-24 16:57:44 INFO:	Validating exons and removing overlapping matches
-2022-02-24 16:57:44 INFO:	Results:	C:19.2%[S:19.2%,D:0.0%],F:0.8%,M:80.0%,n:255	   
+2022-03-24 15:36:46 INFO:	Extracting missing and fragmented buscos from the file refseq_db.faa...
+2022-03-24 15:37:00 INFO:	Running 1 job(s) on metaeuk, starting at 03/24/2022 15:37:00
+2022-03-24 15:37:18 INFO:	[metaeuk]	1 of 1 task(s) completed
+2022-03-24 15:37:18 INFO:	***** Run HMMER on gene sequences *****
+2022-03-24 15:37:18 INFO:	Running 206 job(s) on hmmsearch, starting at 03/24/2022 15:37:18
+2022-03-24 15:37:19 INFO:	[hmmsearch]	21 of 206 task(s) completed
+2022-03-24 15:37:19 INFO:	[hmmsearch]	42 of 206 task(s) completed
+2022-03-24 15:37:19 INFO:	[hmmsearch]	62 of 206 task(s) completed
+2022-03-24 15:37:19 INFO:	[hmmsearch]	83 of 206 task(s) completed
+2022-03-24 15:37:20 INFO:	[hmmsearch]	104 of 206 task(s) completed
+2022-03-24 15:37:20 INFO:	[hmmsearch]	124 of 206 task(s) completed
+2022-03-24 15:37:20 INFO:	[hmmsearch]	145 of 206 task(s) completed
+2022-03-24 15:37:20 INFO:	[hmmsearch]	165 of 206 task(s) completed
+2022-03-24 15:37:20 INFO:	[hmmsearch]	186 of 206 task(s) completed
+2022-03-24 15:37:20 INFO:	[hmmsearch]	206 of 206 task(s) completed
+2022-03-24 15:37:21 INFO:	Validating exons and removing overlapping matches
+2022-03-24 15:37:21 INFO:	Results:	C:19.2%[S:19.2%,D:0.0%],F:0.8%,M:80.0%,n:255	   
 
-2022-02-24 16:57:44 INFO:	eukaryota_odb10 selected
+2022-03-24 15:37:21 INFO:	eukaryota_odb10 selected
 
-2022-02-24 16:57:44 INFO:	***** Searching tree for chosen lineage to find best taxonomic match *****
+2022-03-24 15:37:21 INFO:	***** Searching tree for chosen lineage to find best taxonomic match *****
 
-2022-02-24 16:57:44 INFO:	Extract markers...
-2022-02-24 16:57:44 INFO:	Place the markers on the reference tree...
-2022-02-24 16:57:44 INFO:	Running 1 job(s) on sepp, starting at 02/24/2022 16:57:44
-2022-02-24 16:59:25 INFO:	[sepp]	1 of 1 task(s) completed
-2022-02-24 16:59:25 INFO:	Lineage saccharomycetes is selected, supported by 18 markers out of 19
-2022-02-24 16:59:25 INFO:	Running BUSCO using lineage dataset saccharomycetes_odb10 (eukaryota, 2020-08-05)
-2022-02-24 16:59:25 INFO:	Running 1 job(s) on metaeuk, starting at 02/24/2022 16:59:25
-2022-02-24 16:59:29 INFO:	[metaeuk]	1 of 1 task(s) completed
-2022-02-24 16:59:30 INFO:	***** Run HMMER on gene sequences *****
-2022-02-24 16:59:30 INFO:	Running 2137 job(s) on hmmsearch, starting at 02/24/2022 16:59:30
-2022-02-24 16:59:31 INFO:	[hmmsearch]	214 of 2137 task(s) completed
-2022-02-24 16:59:32 INFO:	[hmmsearch]	428 of 2137 task(s) completed
-2022-02-24 16:59:33 INFO:	[hmmsearch]	642 of 2137 task(s) completed
-2022-02-24 16:59:34 INFO:	[hmmsearch]	855 of 2137 task(s) completed
-2022-02-24 16:59:35 INFO:	[hmmsearch]	1069 of 2137 task(s) completed
-2022-02-24 16:59:36 INFO:	[hmmsearch]	1283 of 2137 task(s) completed
-2022-02-24 16:59:37 INFO:	[hmmsearch]	1496 of 2137 task(s) completed
-2022-02-24 16:59:38 INFO:	[hmmsearch]	1710 of 2137 task(s) completed
-2022-02-24 16:59:39 INFO:	[hmmsearch]	1924 of 2137 task(s) completed
-2022-02-24 16:59:40 INFO:	[hmmsearch]	2137 of 2137 task(s) completed
-2022-02-24 16:59:42 INFO:	Extracting missing and fragmented buscos from the file refseq_db.faa...
-2022-02-24 16:59:47 INFO:	Running 1 job(s) on metaeuk, starting at 02/24/2022 16:59:47
-2022-02-24 16:59:52 INFO:	[metaeuk]	1 of 1 task(s) completed
-2022-02-24 16:59:52 INFO:	***** Run HMMER on gene sequences *****
-2022-02-24 16:59:52 INFO:	Running 2093 job(s) on hmmsearch, starting at 02/24/2022 16:59:52
-2022-02-24 16:59:55 INFO:	[hmmsearch]	210 of 2093 task(s) completed
-2022-02-24 16:59:55 INFO:	[hmmsearch]	419 of 2093 task(s) completed
-2022-02-24 16:59:57 INFO:	[hmmsearch]	628 of 2093 task(s) completed
-2022-02-24 16:59:58 INFO:	[hmmsearch]	1047 of 2093 task(s) completed
-2022-02-24 16:59:59 INFO:	[hmmsearch]	1256 of 2093 task(s) completed
-2022-02-24 17:00:00 INFO:	[hmmsearch]	1466 of 2093 task(s) completed
-2022-02-24 17:00:01 INFO:	[hmmsearch]	1675 of 2093 task(s) completed
-2022-02-24 17:00:02 INFO:	[hmmsearch]	1884 of 2093 task(s) completed
-2022-02-24 17:00:03 INFO:	[hmmsearch]	2093 of 2093 task(s) completed
-2022-02-24 17:00:06 INFO:	Validating exons and removing overlapping matches
-2022-02-24 17:00:06 INFO:	Results:	C:2.1%[S:2.1%,D:0.0%],F:0.0%,M:97.9%,n:2137	   
+2022-03-24 15:37:21 INFO:	Extract markers...
+2022-03-24 15:37:21 INFO:	Downloading file 'https://busco-data.ezlab.org/v5/data/placement_files/list_of_reference_markers.eukaryota_odb10.2019-12-16.txt.tar.gz'
+2022-03-24 15:37:21 INFO:	Decompressing file '/home/cegg/berkeley/BUSCO/busco_dev/busco_downloads/placement_files/list_of_reference_markers.eukaryota_odb10.2019-12-16.txt.tar.gz'
+2022-03-24 15:37:21 INFO:	Downloading file 'https://busco-data.ezlab.org/v5/data/placement_files/tree.eukaryota_odb10.2019-12-16.nwk.tar.gz'
+2022-03-24 15:37:21 INFO:	Decompressing file '/home/cegg/berkeley/BUSCO/busco_dev/busco_downloads/placement_files/tree.eukaryota_odb10.2019-12-16.nwk.tar.gz'
+2022-03-24 15:37:21 INFO:	Downloading file 'https://busco-data.ezlab.org/v5/data/placement_files/tree_metadata.eukaryota_odb10.2019-12-16.txt.tar.gz'
+2022-03-24 15:37:21 INFO:	Decompressing file '/home/cegg/berkeley/BUSCO/busco_dev/busco_downloads/placement_files/tree_metadata.eukaryota_odb10.2019-12-16.txt.tar.gz'
+2022-03-24 15:37:21 INFO:	Downloading file 'https://busco-data.ezlab.org/v5/data/placement_files/supermatrix.aln.eukaryota_odb10.2019-12-16.faa.tar.gz'
+2022-03-24 15:37:21 INFO:	Decompressing file '/home/cegg/berkeley/BUSCO/busco_dev/busco_downloads/placement_files/supermatrix.aln.eukaryota_odb10.2019-12-16.faa.tar.gz'
+2022-03-24 15:37:22 INFO:	Downloading file 'https://busco-data.ezlab.org/v5/data/placement_files/mapping_taxids-busco_dataset_name.eukaryota_odb10.2019-12-16.txt.tar.gz'
+2022-03-24 15:37:22 INFO:	Decompressing file '/home/cegg/berkeley/BUSCO/busco_dev/busco_downloads/placement_files/mapping_taxids-busco_dataset_name.eukaryota_odb10.2019-12-16.txt.tar.gz'
+2022-03-24 15:37:22 INFO:	Downloading file 'https://busco-data.ezlab.org/v5/data/placement_files/mapping_taxid-lineage.eukaryota_odb10.2019-12-16.txt.tar.gz'
+2022-03-24 15:37:22 INFO:	Decompressing file '/home/cegg/berkeley/BUSCO/busco_dev/busco_downloads/placement_files/mapping_taxid-lineage.eukaryota_odb10.2019-12-16.txt.tar.gz'
+2022-03-24 15:37:22 INFO:	Place the markers on the reference tree...
+2022-03-24 15:37:22 INFO:	Running 1 job(s) on sepp, starting at 03/24/2022 15:37:22
+2022-03-24 15:39:04 INFO:	[sepp]	1 of 1 task(s) completed
+2022-03-24 15:39:04 INFO:	Lineage saccharomycetes is selected, supported by 18 markers out of 19
+2022-03-24 15:39:04 INFO:	Downloading file 'https://busco-data.ezlab.org/v5/data/lineages/saccharomycetes_odb10.2020-08-05.tar.gz'
+2022-03-24 15:39:06 INFO:	Decompressing file '/home/cegg/berkeley/BUSCO/busco_dev/busco_downloads/lineages/saccharomycetes_odb10.tar.gz'
+2022-03-24 15:39:15 INFO:	Running BUSCO using lineage dataset saccharomycetes_odb10 (eukaryota, 2020-08-05)
+2022-03-24 15:39:15 INFO:	Running 1 job(s) on metaeuk, starting at 03/24/2022 15:39:15
+2022-03-24 15:39:20 INFO:	[metaeuk]	1 of 1 task(s) completed
+2022-03-24 15:39:20 INFO:	***** Run HMMER on gene sequences *****
+2022-03-24 15:39:20 INFO:	Running 2137 job(s) on hmmsearch, starting at 03/24/2022 15:39:20
+2022-03-24 15:39:22 INFO:	[hmmsearch]	214 of 2137 task(s) completed
+2022-03-24 15:39:23 INFO:	[hmmsearch]	428 of 2137 task(s) completed
+2022-03-24 15:39:24 INFO:	[hmmsearch]	642 of 2137 task(s) completed
+2022-03-24 15:39:25 INFO:	[hmmsearch]	855 of 2137 task(s) completed
+2022-03-24 15:39:25 INFO:	[hmmsearch]	1069 of 2137 task(s) completed
+2022-03-24 15:39:26 INFO:	[hmmsearch]	1283 of 2137 task(s) completed
+2022-03-24 15:39:27 INFO:	[hmmsearch]	1496 of 2137 task(s) completed
+2022-03-24 15:39:28 INFO:	[hmmsearch]	1710 of 2137 task(s) completed
+2022-03-24 15:39:28 INFO:	[hmmsearch]	1924 of 2137 task(s) completed
+2022-03-24 15:39:30 INFO:	[hmmsearch]	2137 of 2137 task(s) completed
+2022-03-24 15:39:32 INFO:	Extracting missing and fragmented buscos from the file refseq_db.faa...
+2022-03-24 15:39:37 INFO:	Running 1 job(s) on metaeuk, starting at 03/24/2022 15:39:37
+2022-03-24 15:39:43 INFO:	[metaeuk]	1 of 1 task(s) completed
+2022-03-24 15:39:43 INFO:	***** Run HMMER on gene sequences *****
+2022-03-24 15:39:43 INFO:	Running 2093 job(s) on hmmsearch, starting at 03/24/2022 15:39:43
+2022-03-24 15:39:45 INFO:	[hmmsearch]	419 of 2093 task(s) completed
+2022-03-24 15:39:52 INFO:	[hmmsearch]	628 of 2093 task(s) completed
+2022-03-24 15:39:53 INFO:	[hmmsearch]	838 of 2093 task(s) completed
+2022-03-24 15:39:53 INFO:	[hmmsearch]	1047 of 2093 task(s) completed
+2022-03-24 15:39:54 INFO:	[hmmsearch]	1256 of 2093 task(s) completed
+2022-03-24 15:39:55 INFO:	[hmmsearch]	1466 of 2093 task(s) completed
+2022-03-24 15:39:56 INFO:	[hmmsearch]	1675 of 2093 task(s) completed
+2022-03-24 15:39:56 INFO:	[hmmsearch]	1884 of 2093 task(s) completed
+2022-03-24 15:39:57 INFO:	[hmmsearch]	2093 of 2093 task(s) completed
+2022-03-24 15:40:00 INFO:	Validating exons and removing overlapping matches
+2022-03-24 15:40:00 INFO:	Results:	C:2.1%[S:2.1%,D:0.0%],F:0.0%,M:97.9%,n:2137	   
 
-2022-02-24 17:00:06 INFO:	
+2022-03-24 15:40:00 INFO:	
 
 	--------------------------------------------------
 	|Results from generic domain eukaryota_odb10      |
@@ -152,13 +163,13 @@
 	|2091	Missing BUSCOs (M)                        |
 	|2137	Total BUSCO groups searched               |
 	--------------------------------------------------
-2022-02-24 17:00:06 INFO:	BUSCO analysis done with WARNING(s). Total running time: 210 seconds
+2022-03-24 15:40:00 INFO:	BUSCO analysis done with WARNING(s). Total running time: 223 seconds
 
 ***** Summary of warnings: *****
-2022-02-24 16:56:37 WARNING:busco.run_BUSCO	Running Auto Lineage Selector as no lineage dataset was specified. This will take a little longer than normal. If you know what lineage dataset you want to use, please specify this in the config file or using the -l (--lineage-dataset) flag in the command line.
-2022-02-24 16:56:42 WARNING:busco.busco_tools.hmmer	BUSCO did not find any match. Make sure to check the log files if this is unexpected.
+2022-03-24 15:36:17 WARNING:busco.run_BUSCO	Running Auto Lineage Selector as no lineage dataset was specified. This will take a little longer than normal. If you know what lineage dataset you want to use, please specify this in the config file or using the -l (--lineage-dataset) flag in the command line.
+2022-03-24 15:36:21 WARNING:busco.busco_tools.hmmer	BUSCO did not find any match. Make sure to check the log files if this is unexpected.
 
-2022-02-24 17:00:06 INFO:	Results written in /home/cegg/berkeley/BUSCO/busco_dev/test_eukaryota
-2022-02-24 17:00:06 INFO:	For assistance with interpreting the results, please consult the userguide: https://busco.ezlab.org/busco_userguide.html
+2022-03-24 15:40:00 INFO:	Results written in /home/cegg/berkeley/BUSCO/busco_dev/test_eukaryota
+2022-03-24 15:40:00 INFO:	For assistance with interpreting the results, please consult the userguide: https://busco.ezlab.org/busco_userguide.html
 
-2022-02-24 17:00:06 INFO:	Visit this page https://gitlab.com/ezlab/busco#how-to-cite-busco to see how to cite BUSCO
+2022-03-24 15:40:00 INFO:	Visit this page https://gitlab.com/ezlab/busco#how-to-cite-busco to see how to cite BUSCO



View it on GitLab: https://salsa.debian.org/med-team/busco/-/compare/70396107c9963af9000fa3b3faf8c23c5998067a...6c2f83bfeb15ffeffe425df953ae06ac0de621a6


