Bug#1004556: libmpich-dev: The simplest MPI program compiled with mpich crashes
Drew Parsons
dparsons at debian.org
Wed Feb 23 13:44:39 GMT 2022
On 2022-02-22 19:49, Alastair McKinstry wrote:
> I can disable pmix support in mpich.
Thanks Alastair. My tests are now passing with mpich 4.0-3.
> Pmix is working fine in openmpi, so it’s an mpich/pmix issue of some
> sort (or maybe ch4)
"Working fine" might not be quite the word for it. There's some strange
stuff going on with openmpi, at least in multinode execution (RMA).
https://github.com/open-mpi/ompi/issues/10026
That's actually the reason why I was rebuilding with mpich. nwchem is
completely useless with openmpi in a multi-node job. (Well, they hope
the situation will be better with openmpi 5.)
Drew
>
> Regards
> Alastair
>
> On 22/02/2022, 14:57, "debian-science-maintainers on behalf of Drew
> Parsons"
> <debian-science-maintainers-bounces+mckinstry=debian.org at alioth-lists.debian.net
> on behalf of dparsons at debian.org> wrote:
>
> Package: mpich
> Followup-For: Bug #1004556
>
> My guess is that this bug is ongoing in 4.0-2 because of pmix support.
>
> 4.0-2 is still configured
> --with-pmix=/usr/lib/x86_64-linux-gnu/pmix2
> and still Depends: libpmix2 (>= 4.1.2)
>
> pmix support was added in 4.0~b1-2 along with ucx.
>
> I gather ucx was deactivated in 4.0-2 but pmix was not. Looks like pmix
> also needs to go (unless the problem is ch4, which was also added in
> 4.0~b1-2). But the error message references PMIX.
>
>
> Ironically, I find that an executable compiled against mpich 4.0-1
> fails with mpiexec.mpich (as raised in this bug) but actually passes
> when run with mpiexec.openmpi. Awkward.