Bug#1100120: libopenmpi-dev: mpi4py spawn tests get OPAL ERROR: Unreachable in file ../../../ompi/runtime/ompi_mpi_finalize.c at line 286
Drew Parsons
dparsons at debian.org
Tue Mar 11 19:32:09 GMT 2025
Package: libopenmpi-dev
Followup-For: Bug #1100120
We can see the spawn errors in the mpi4py 4.0.3-2 build logs,
e.g. https://buildd.debian.org/status/fetch.php?pkg=mpi4py&arch=amd64&ver=4.0.3-2&stamp=1741705221&raw=0
Skipping the test_spawn tests, remaing tests pass
but I get the following error locally (in pdebuild pbuilder chroot).
It looks like a different variation of the OPAL ERROR I reported in
this bug. But this backtrace refers to libucs.so.0. Could it a bug in
ucx, which has just been upgraded to 1.18.1?
The end of the build log is:
...
test_starmap (test_util_pool.TestThreadPool.test_starmap) ... ok
testConstructor (test_win.TestWinNull.testConstructor) ... ok
testGetName (test_win.TestWinNull.testGetName) ... ok
----------------------------------------------------------------------
Ran 1813 tests in 58.573s
OK (skipped=162)
[sandy:25412:0:25412] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x10)
==== backtrace (tid: 25412) ====
0 /lib/x86_64-linux-gnu/libucs.so.0(ucs_handle_error+0x2bc) [0x7fd827e8064c]
1 /lib/x86_64-linux-gnu/libucs.so.0(+0x3182f) [0x7fd827e8082f]
2 /lib/x86_64-linux-gnu/libucs.so.0(+0x319fa) [0x7fd827e809fa]
3 /lib/x86_64-linux-gnu/libc.so.6(+0x3fdb0) [0x7fd82aa90db0]
4 /lib/x86_64-linux-gnu/libopen-pal.so.80(opal_net_get_hostname+0x12) [0x7fd827dd90a2]
5 /lib/x86_64-linux-gnu/libopen-pal.so.80(+0xfc518) [0x7fd827e0e518]
6 /lib/x86_64-linux-gnu/libopen-pal.so.80(+0xfc8a5) [0x7fd827e0e8a5]
7 /lib/x86_64-linux-gnu/libopen-pal.so.80(mca_btl_tcp_proc_create+0x464) [0x7fd827df8924]
8 /lib/x86_64-linux-gnu/libopen-pal.so.80(mca_btl_tcp_add_procs+0x6f) [0x7fd827df09cf]
9 /lib/x86_64-linux-gnu/libmpi.so.40(+0xf3c0c) [0x7fd8282f3c0c]
10 /lib/x86_64-linux-gnu/libmpi.so.40(mca_pml_ob1_isend+0x84d) [0x7fd8284841ad]
11 /lib/x86_64-linux-gnu/libmpi.so.40(ompi_dpm_dyn_finalize+0x1b5) [0x7fd828280575]
12 /lib/x86_64-linux-gnu/libmpi.so.40(+0x64c27) [0x7fd828264c27]
13 /lib/x86_64-linux-gnu/libopen-pal.so.80(opal_finalize_cleanup_domain+0x52) [0x7fd827d47ec2]
14 /lib/x86_64-linux-gnu/libopen-pal.so.80(opal_finalize+0x37) [0x7fd827d3a6b7]
15 /lib/x86_64-linux-gnu/libmpi.so.40(ompi_rte_finalize+0x14b) [0x7fd8282980eb]
16 /lib/x86_64-linux-gnu/libmpi.so.40(+0x9adfc) [0x7fd82829adfc]
17 /lib/x86_64-linux-gnu/libmpi.so.40(ompi_mpi_instance_finalize+0xbd) [0x7fd82829c3cd]
18 /lib/x86_64-linux-gnu/libmpi.so.40(ompi_mpi_finalize+0x21f) [0x7fd82829447f]
19 /usr/bin/python3.13() [0x45b262]
20 /usr/bin/python3.13(Py_Exit+0x2f) [0x69abef]
21 /usr/bin/python3.13() [0x68858b]
22 /usr/bin/python3.13() [0x6883fb]
23 /usr/bin/python3.13() [0x67ed22]
24 /usr/bin/python3.13() [0x67eb1e]
25 /usr/bin/python3.13(Py_RunMain+0x3c1) [0x67d961]
26 /usr/bin/python3.13(Py_BytesMain+0x2b) [0x63a72b]
27 /lib/x86_64-linux-gnu/libc.so.6(+0x29ca8) [0x7fd82aa7aca8]
28 /lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0x85) [0x7fd82aa7ad65]
29 /usr/bin/python3.13(_start+0x21) [0x639ae1]
=================================
python3.13:25412 terminated with signal 11 at PC=7fd827dd90a2 SP=7ffe028bd5d0. Backtrace:
/lib/x86_64-linux-gnu/libopen-pal.so.80(opal_net_get_hostname+0x12) [0x7fd827dd90a2]
/lib/x86_64-linux-gnu/libopen-pal.so.80(+0xfc518) [0x7fd827e0e518]
/lib/x86_64-linux-gnu/libopen-pal.so.80(+0xfc8a5) [0x7fd827e0e8a5]
/lib/x86_64-linux-gnu/libopen-pal.so.80(mca_btl_tcp_proc_create+0x464) [0x7fd827df8924]
/lib/x86_64-linux-gnu/libopen-pal.so.80(mca_btl_tcp_add_procs+0x6f) [0x7fd827df09cf]
/lib/x86_64-linux-gnu/libmpi.so.40(+0xf3c0c) [0x7fd8282f3c0c]
/lib/x86_64-linux-gnu/libmpi.so.40(mca_pml_ob1_isend+0x84d) [0x7fd8284841ad]
/lib/x86_64-linux-gnu/libmpi.so.40(ompi_dpm_dyn_finalize+0x1b5) [0x7fd828280575]
/lib/x86_64-linux-gnu/libmpi.so.40(+0x64c27) [0x7fd828264c27]
/lib/x86_64-linux-gnu/libopen-pal.so.80(opal_finalize_cleanup_domain+0x52) [0x7fd827d47ec2]
/lib/x86_64-linux-gnu/libopen-pal.so.80(opal_finalize+0x37) [0x7fd827d3a6b7]
/lib/x86_64-linux-gnu/libmpi.so.40(ompi_rte_finalize+0x14b) [0x7fd8282980eb]
/lib/x86_64-linux-gnu/libmpi.so.40(+0x9adfc) [0x7fd82829adfc]
/lib/x86_64-linux-gnu/libmpi.so.40(ompi_mpi_instance_finalize+0xbd) [0x7fd82829c3cd]
/lib/x86_64-linux-gnu/libmpi.so.40(ompi_mpi_finalize+0x21f) [0x7fd82829447f]
/usr/bin/python3.13() [0x45b262]
/usr/bin/python3.13(Py_Exit+0x2f) [0x69abef]
/usr/bin/python3.13() [0x68858b]
/usr/bin/python3.13() [0x6883fb]
/usr/bin/python3.13() [0x67ed22]
/usr/bin/python3.13() [0x67eb1e]
/usr/bin/python3.13(Py_RunMain+0x3c1) [0x67d961]
/usr/bin/python3.13(Py_BytesMain+0x2b) [0x63a72b]
/lib/x86_64-linux-gnu/libc.so.6(+0x29ca8) [0x7fd82aa7aca8]
/lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0x85) [0x7fd82aa7ad65]
/usr/bin/python3.13(_start+0x21) [0x639ae1]
make[1]: *** [debian/rules:269: override_dh_auto_test] Error 1
More information about the debian-science-maintainers
mailing list