v4.x REGRESSION: Updating to PMIx v3.1.0 has an issue #6247
One thing that just caught my eye: this is filed wrt OMPI v3.1, which uses PMIx v2.x - not PMIx v3.x. I suspect this is at least part of the problem. Still, it's worth taking a look to see if there is an issue with OMPI v4.
Do you expect that OMPI 3.1 will support PMIx 3.x in the near future, or just v2.x? As I said, the combination worked with rc1. I haven't tested OMPI v4, and we're entering a freeze soon in Debian, so Debian 10 will ship with 3.1.x.
The PMIX_MODEX and PMIX_INFO_ARRAY macros were removed from the PMIx 3.1 standard. Open MPI does not really need them (they are only used to be reported as not supported), so simply #ifdef protect them to support an external PMIx v3.1.
Refs. open-mpi#6247
Signed-off-by: Gilles Gouaillardet <[email protected]>
let's start separating the (too many) variables here
Here is the patch to be applied to the
The PMIX_MODEX and PMIX_INFO_ARRAY macros were removed from the PMIx 3.1 standard. Open MPI does not really need them (they are only used to be reported as not supported), so simply #ifdef protect them to support an external PMIx v3.1.
The change only needs to be done in ext3x/ext3x.c. But since this file is automatically generated from pmix3x/pmix3x.c, we have to update the latter file.
Refs. open-mpi#6247
Signed-off-by: Gilles Gouaillardet <[email protected]>
(back-ported from commit open-mpi/ompi@950ba16)
The PMIX_MODEX and PMIX_INFO_ARRAY macros were removed from the PMIx 3.1 standard. Open MPI does not really need them (they are only used to be reported as not supported), so simply #ifdef protect them to support an external PMIx v3.1.
External PMIx v3 is supported via the pmix/ext2x component, and it worked until PMIx v3.1 removed some macros. The change to support external PMIx v3.1 is minimal, so we do not need to bother creating a new pmix/ext3x component.
The change only needs to be done in ext2x/ext2x.c. But since this file is automatically generated from pmix2x/pmix2x.c, we have to update the latter file.
Refs. open-mpi#6247
Signed-off-by: Gilles Gouaillardet <[email protected]>
(back-ported from commit open-mpi/ompi@950ba16)
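For readers unfamiliar with the technique, the #ifdef protection described in the commit messages above can be sketched roughly as follows. This is only an illustration, not the actual Open MPI patch: every DEMO_ name, the numeric values, and the demo_type_status helper are hypothetical stand-ins, and the real code in pmix3x.c / pmix2x.c may be structured differently.

```c
/* Hypothetical stand-ins for data-type constants that an external PMIx
 * header might provide. The numeric values are arbitrary placeholders.
 * Build with -DDEMO_HEADER_WITHOUT_LEGACY_TYPES to mimic a header that
 * no longer defines the two legacy macros. */
#define DEMO_PMIX_STRING 3
#ifndef DEMO_HEADER_WITHOUT_LEGACY_TYPES
#define DEMO_PMIX_INFO_ARRAY 22
#define DEMO_PMIX_MODEX 29
#endif

/* Hypothetical status codes for this sketch. */
#define DEMO_SUCCESS 0
#define DEMO_ERR_NOT_SUPPORTED (-1)

/* Hypothetical helper: report whether a data type can be converted. */
int demo_type_status(int type)
{
    switch (type) {
    case DEMO_PMIX_STRING:
        return DEMO_SUCCESS;
#ifdef DEMO_PMIX_INFO_ARRAY
    case DEMO_PMIX_INFO_ARRAY:   /* compiled only if the header defines it */
        return DEMO_ERR_NOT_SUPPORTED;
#endif
#ifdef DEMO_PMIX_MODEX
    case DEMO_PMIX_MODEX:        /* compiled only if the header defines it */
        return DEMO_ERR_NOT_SUPPORTED;
#endif
    default:
        /* Unknown types are also reported as not supported, so dropping
         * the guarded cases does not change the result. */
        return DEMO_ERR_NOT_SUPPORTED;
    }
}
```

Because the guarded cases only ever report "not supported", the function returns the same status for those type values whether or not the header still defines the macros; the guards exist purely so the file keeps compiling against either header version.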
@amckinstry I was able to fix this issue on a 32-bit distro; the fix is in openpmix/openpmix#1036
@ggouaillardet can this be closed?
@gpaulsen I'm not sure what you are asking as there are two issues intermixed on this ticket. The external support issue was addressed by @ggouaillardet and committed back in Jan. The 32-bit issue was also addressed in Jan and included in an earlier PMIx release - the commit is here. Bottom line: this issue can be closed.
Courtesy of @amckinstry (filed originally on PMIx repo as openpmix/openpmix#1032):
This is testing within Debian.
3.1.0rc1 works fine; 3.1.0rc2 fails on 32-bit archs.
See https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=918157
This is with openmpi 3.1.3. This will not compile as it stands with rc2 (rc1 was fine), so there was a patch needed:
https://salsa.debian.org/hpc-team/openmpi/blob/debian/master/debian/patches/pmix-modex.patch
Which would be instantly suspect, except the combination works with 64-bit archs (arm64, amd, etc).
The problem is easily reproduced with a simple MPI code on i386:
giving: