[Git][qa/jenkins.debian.net][master] 2 commits: drop openqa worker override (now use auto units)

Holger Levsen (@holger) gitlab at salsa.debian.org
Thu Jun 6 10:01:22 BST 2024



Holger Levsen pushed to branch master at Debian QA / jenkins.debian.net


Commits:
b8c7e010 by Philip Hands at 2024-06-06T11:01:02+02:00
drop openqa worker override (now use auto units)

We now use the openqa-worker-auto-restart@ units, which have the
Restart=always setting built-in, so no need for an override.

Signed-off-by: Holger Levsen <holger at layer-acht.org>

- - - - -
3e092dbd by Philip Hands at 2024-06-06T11:01:02+02:00
limit number of openqa-workers when load is high

I've increased the number of workers to 7, but made sure that the last 3
are only active when system load is relatively low, by individually
setting CRITICAL_LOAD_AVG_THRESHOLD to between 15 and 9, so all 7
workers will only run when the load is under 9.  This is an experiment,
so I guess those numbers might need adjusting.

Signed-off-by: Holger Levsen <holger at layer-acht.org>

- - - - -


3 changed files:

- hosts/osuosl3-amd64/etc/openqa/workers.ini
- − hosts/osuosl3-amd64/etc/systemd/system/openqa-worker-plain\@.service.d/override.conf
- update_jdn.sh


Changes:

=====================================
hosts/osuosl3-amd64/etc/openqa/workers.ini
=====================================
@@ -79,15 +79,15 @@ PACKAGES_CMD = apt list --installed
 # The default value is 40 to prevent system overload based on experiences with
 # system stability so far.
 # Set to 0 to disable.
-#CRITICAL_LOAD_AVG_THRESHOLD = 40
+CRITICAL_LOAD_AVG_THRESHOLD = 15
 
 # This machine is under load, allow for longer timeouts
 TIMEOUT_SCALE = 1.8
 
 # The section ids are the instance of the workers.
 # The key/value pairs will appear in vars.json
-[1]
-WORKER_CLASS = qemu_x86_64
+#[1]
+#WORKER_CLASS = qemu_x86_64
 
 #[2]
 #WORKER_CLASS = qemu_x86_64_staging,qemu_x86_64
@@ -97,3 +97,14 @@ WORKER_CLASS = qemu_x86_64
 
 [https://openqa.debian.net]
 TESTPOOLSERVER = rsync://openqa.debian.net/tests
+
+# avoid over-loading the server when busy
+[5]
+CRITICAL_LOAD_AVG_THRESHOLD = 12
+
+[6]
+CRITICAL_LOAD_AVG_THRESHOLD = 10
+
+[7]
+CRITICAL_LOAD_AVG_THRESHOLD = 9
+


=====================================
hosts/osuosl3-amd64/etc/systemd/system/openqa-worker-plain\@.service.d/override.conf deleted
=====================================
@@ -1,6 +0,0 @@
-[Service]
-# this file is maintained in the jenkins.debian.net git repo
-
-# This setting is needed to keep workers running, because of the
-# TERMINATE_AFTER_JOBS_DONE setting in /etc/openqa/workers.ini
-Restart=always


=====================================
update_jdn.sh
=====================================
@@ -789,7 +789,7 @@ if ! $UP2DATE ; then
 						# openqa does not use slirpvde: reset its status and disable
 						sudo systemctl reset-failed openqa-slirpvde.service
 						sudo systemctl disable --now openqa-slirpvde.service
-						OPENQA_WORKERS_TO_RUN=5
+						OPENQA_WORKERS_TO_RUN=7
 						# disable any excess autostart instances, and all non-autostart workers
 						for id in $(systemctl show openqa-worker\*@\*.service --property=Id | sed 's/^Id=//'); do
 							autostart_instance=$(sed -nE 's/^.*auto-restart[@]([0-9]+).*$/\1/p' <<<$id)



View it on GitLab: https://salsa.debian.org/qa/jenkins.debian.net/-/compare/c5bb6bb837293a025cffff89adf4a4f66bb3ceb1...3e092dbdd616a2ddca0ae7e8bb6b4e834e8f18da

-- 
This project does not include diff previews in email notifications.
View it on GitLab: https://salsa.debian.org/qa/jenkins.debian.net/-/compare/c5bb6bb837293a025cffff89adf4a4f66bb3ceb1...3e092dbdd616a2ddca0ae7e8bb6b4e834e8f18da
You're receiving this email because of your account on salsa.debian.org.


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://alioth-lists.debian.net/pipermail/qa-jenkins-scm/attachments/20240606/55c9079c/attachment-0001.htm>


More information about the Qa-jenkins-scm mailing list