Bug#1100120: libopenmpi-dev: mpi4py spawn tests get OPAL ERROR: Unreachable in file ../../../ompi/runtime/ompi_mpi_finalize.c at line 286

Drew Parsons dparsons at debian.org
Tue Mar 11 19:32:09 GMT 2025


Package: libopenmpi-dev
Followup-For: Bug #1100120

We can see the spawn errors in the mpi4py 4.0.3-2 build logs,
e.g.  https://buildd.debian.org/status/fetch.php?pkg=mpi4py&arch=amd64&ver=4.0.3-2&stamp=1741705221&raw=0

Skipping the test_spawn tests, remaing tests pass
but I get the following error locally (in pdebuild pbuilder chroot).

It looks like a different variation of the OPAL ERROR I reported in
this bug. But this backtrace refers to libucs.so.0.  Could it a bug in
ucx, which has just been upgraded to 1.18.1?

The end of the build log is:

...
test_starmap (test_util_pool.TestThreadPool.test_starmap) ... ok
testConstructor (test_win.TestWinNull.testConstructor) ... ok
testGetName (test_win.TestWinNull.testGetName) ... ok

----------------------------------------------------------------------
Ran 1813 tests in 58.573s

OK (skipped=162)
[sandy:25412:0:25412] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x10)
==== backtrace (tid:  25412) ====
 0  /lib/x86_64-linux-gnu/libucs.so.0(ucs_handle_error+0x2bc) [0x7fd827e8064c]
 1  /lib/x86_64-linux-gnu/libucs.so.0(+0x3182f) [0x7fd827e8082f]
 2  /lib/x86_64-linux-gnu/libucs.so.0(+0x319fa) [0x7fd827e809fa]
 3  /lib/x86_64-linux-gnu/libc.so.6(+0x3fdb0) [0x7fd82aa90db0]
 4  /lib/x86_64-linux-gnu/libopen-pal.so.80(opal_net_get_hostname+0x12) [0x7fd827dd90a2]
 5  /lib/x86_64-linux-gnu/libopen-pal.so.80(+0xfc518) [0x7fd827e0e518]
 6  /lib/x86_64-linux-gnu/libopen-pal.so.80(+0xfc8a5) [0x7fd827e0e8a5]
 7  /lib/x86_64-linux-gnu/libopen-pal.so.80(mca_btl_tcp_proc_create+0x464) [0x7fd827df8924]
 8  /lib/x86_64-linux-gnu/libopen-pal.so.80(mca_btl_tcp_add_procs+0x6f) [0x7fd827df09cf]
 9  /lib/x86_64-linux-gnu/libmpi.so.40(+0xf3c0c) [0x7fd8282f3c0c]
10  /lib/x86_64-linux-gnu/libmpi.so.40(mca_pml_ob1_isend+0x84d) [0x7fd8284841ad]
11  /lib/x86_64-linux-gnu/libmpi.so.40(ompi_dpm_dyn_finalize+0x1b5) [0x7fd828280575]
12  /lib/x86_64-linux-gnu/libmpi.so.40(+0x64c27) [0x7fd828264c27]
13  /lib/x86_64-linux-gnu/libopen-pal.so.80(opal_finalize_cleanup_domain+0x52) [0x7fd827d47ec2]
14  /lib/x86_64-linux-gnu/libopen-pal.so.80(opal_finalize+0x37) [0x7fd827d3a6b7]
15  /lib/x86_64-linux-gnu/libmpi.so.40(ompi_rte_finalize+0x14b) [0x7fd8282980eb]
16  /lib/x86_64-linux-gnu/libmpi.so.40(+0x9adfc) [0x7fd82829adfc]
17  /lib/x86_64-linux-gnu/libmpi.so.40(ompi_mpi_instance_finalize+0xbd) [0x7fd82829c3cd]
18  /lib/x86_64-linux-gnu/libmpi.so.40(ompi_mpi_finalize+0x21f) [0x7fd82829447f]
19  /usr/bin/python3.13() [0x45b262]
20  /usr/bin/python3.13(Py_Exit+0x2f) [0x69abef]
21  /usr/bin/python3.13() [0x68858b]
22  /usr/bin/python3.13() [0x6883fb]
23  /usr/bin/python3.13() [0x67ed22]
24  /usr/bin/python3.13() [0x67eb1e]
25  /usr/bin/python3.13(Py_RunMain+0x3c1) [0x67d961]
26  /usr/bin/python3.13(Py_BytesMain+0x2b) [0x63a72b]
27  /lib/x86_64-linux-gnu/libc.so.6(+0x29ca8) [0x7fd82aa7aca8]
28  /lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0x85) [0x7fd82aa7ad65]
29  /usr/bin/python3.13(_start+0x21) [0x639ae1]
=================================

python3.13:25412 terminated with signal 11 at PC=7fd827dd90a2 SP=7ffe028bd5d0.  Backtrace:
/lib/x86_64-linux-gnu/libopen-pal.so.80(opal_net_get_hostname+0x12) [0x7fd827dd90a2]
/lib/x86_64-linux-gnu/libopen-pal.so.80(+0xfc518) [0x7fd827e0e518]
/lib/x86_64-linux-gnu/libopen-pal.so.80(+0xfc8a5) [0x7fd827e0e8a5]
/lib/x86_64-linux-gnu/libopen-pal.so.80(mca_btl_tcp_proc_create+0x464) [0x7fd827df8924]
/lib/x86_64-linux-gnu/libopen-pal.so.80(mca_btl_tcp_add_procs+0x6f) [0x7fd827df09cf]
/lib/x86_64-linux-gnu/libmpi.so.40(+0xf3c0c) [0x7fd8282f3c0c]
/lib/x86_64-linux-gnu/libmpi.so.40(mca_pml_ob1_isend+0x84d) [0x7fd8284841ad]
/lib/x86_64-linux-gnu/libmpi.so.40(ompi_dpm_dyn_finalize+0x1b5) [0x7fd828280575]
/lib/x86_64-linux-gnu/libmpi.so.40(+0x64c27) [0x7fd828264c27]
/lib/x86_64-linux-gnu/libopen-pal.so.80(opal_finalize_cleanup_domain+0x52) [0x7fd827d47ec2]
/lib/x86_64-linux-gnu/libopen-pal.so.80(opal_finalize+0x37) [0x7fd827d3a6b7]
/lib/x86_64-linux-gnu/libmpi.so.40(ompi_rte_finalize+0x14b) [0x7fd8282980eb]
/lib/x86_64-linux-gnu/libmpi.so.40(+0x9adfc) [0x7fd82829adfc]
/lib/x86_64-linux-gnu/libmpi.so.40(ompi_mpi_instance_finalize+0xbd) [0x7fd82829c3cd]
/lib/x86_64-linux-gnu/libmpi.so.40(ompi_mpi_finalize+0x21f) [0x7fd82829447f]
/usr/bin/python3.13() [0x45b262]
/usr/bin/python3.13(Py_Exit+0x2f) [0x69abef]
/usr/bin/python3.13() [0x68858b]
/usr/bin/python3.13() [0x6883fb]
/usr/bin/python3.13() [0x67ed22]
/usr/bin/python3.13() [0x67eb1e]
/usr/bin/python3.13(Py_RunMain+0x3c1) [0x67d961]
/usr/bin/python3.13(Py_BytesMain+0x2b) [0x63a72b]
/lib/x86_64-linux-gnu/libc.so.6(+0x29ca8) [0x7fd82aa7aca8]
/lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0x85) [0x7fd82aa7ad65]
/usr/bin/python3.13(_start+0x21) [0x639ae1]
make[1]: *** [debian/rules:269: override_dh_auto_test] Error 1



More information about the debian-science-maintainers mailing list