-
Notifications
You must be signed in to change notification settings - Fork 900
PGI Fortran: runtime error when using using mpi_f08 module on OpenPOWER #2606
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
@jjhursey |
which version of IBM Compiler did you used ? |
@xiachsh Here is the full version info for xlc and xlf:
|
@PHHargrove. I tried it using the same IBM XLC and XLF compilers as you mentioned to build Openmpi 2.0.2, and it compiled and ran ring_openmpif08.f90 just fine for me. I built Openmpi 2.0.2 on a LE Power8 machine that runs Ubuntu 14.04.1. The only difference in comparing to your config file is that I also added --enable-mpi-fortran=usempif08, which is required to build mpi_f08.mod. Here is the run result: |
If I recall correctly, I had omitted --enable-mpi-fortan=usempif08 because configure had determined that F08 support was present even without passing that flag. Regardless, I have retested today with that flag passed explicitly. I am using:
on
The system is a KVM-based VM on Power8E h/w. Using Open MPI 2.0.2 I still see the error:
Meanwhile, ring_usempi and ring_mpifh ran just fine. |
@PHHargrove My guess is that your mpi_f08.mod might be compiled using a different version of the compiler that may have different module format as your compilation didn't seem fail complaining "use mpi_f08". Can you please try to
export PREFIX=/path_to_openmpi202 ./configure --prefix=$PREFIX
If you still run into the same error, can you please send me your configuration line and the command line you compile ring_usempif08.f90? |
My build scripts start with downloading the release tarball and unpacking. There are also no MPI installs (Open MPI or otherwise) in $PATH, $LD_LIBRARY_PATH, etc. The full configure command:
The full "make -V" is here. This should let you determine if the .mod was compiled with the proper compiler. There is only one xlc/xlf installation on this system. The ring_* tests are build by running "make" in the examples directory with the freshly built mpicc/mpifort as the only MPI in $PATH:
|
@PHHargrove |
@PHHargrove |
I am aware that Open MPI incorrectly passes Are you saying that your results and mine differ because you are using a newer xlf? My email is the same as my github user id plus AT lbl.gov. |
I have access to the following two non-community-edition XL compiler installations:
and
Both are running on
I do not see the error on that system with either pair of compilers. |
|
@PHHargrove |
@jsquyres |
@DanielCChen Excellent; thank you. |
@jsquyres I think this issue can be closed since we updated the README and this is an XLF compiler bug. |
@hppritcha There are actually PGI and XLF bugs in this issue. |
I have confirmed access to PGI 16.10 and 17.{1,3,4,5} for OpenPower. |
Problem re-confirmed with Open MPI 2.0.2 and PGI-17.3. |
@jjhursey @DanielCChen Can one of you guys follow up on the PGI issues on this issue? Thanks! |
I'll try to take a look this week. |
@gpaulsen says to bump to 2.1.2 |
@gvallee and here's the other PGI issue we discussed before you arrived today. |
This is an old thread and I'm trying to capture where we are so please let me know if I get this wrong.
|
The linker is changed in Ubuntu 16.04 in comparison to 14.1 (the one I tried). It no longer searches and links in the definition of the symbol, which caused the issue. I have fixed the XLF compiler to workaround it. |
How about this for
|
* Related to Issue open-mpi#2606 and Issue open-mpi#3075 * The core problem in those two issues is related to a regression in ld upstream. Add a note in the README about this issue. Signed-off-by: Joshua Hursey <[email protected]>
I created a PR for the |
Based on my testing of 3.0.0rc1, can can confirm:
FWIW: it looks like #3075 has the identical failure mode as #2606 - they are duplicate bug reports as far as I can tell. |
I wrote:
I restested XLF V15.1.5 on RHEL 7.3 today and reconfirmed no problem w/ MPI F08 bindings. |
* Related to Issue open-mpi#2606 and Issue open-mpi#3075 * The core problem in those two issues is related to a regression in ld upstream. Add a note in the README about this issue. Signed-off-by: Joshua Hursey <[email protected]> (cherry picked from commit 1c6a253) Signed-off-by: Joshua Hursey <[email protected]>
* Related to Issue open-mpi#2606 and Issue open-mpi#3075 * The core problem in those two issues is related to a regression in ld upstream. Add a note in the README about this issue. Signed-off-by: Joshua Hursey <[email protected]> (cherry picked from commit 1c6a253) Signed-off-by: Joshua Hursey <[email protected]>
* Related to Issue open-mpi#2606 and Issue open-mpi#3075 * The core problem in those two issues is related to a regression in ld upstream. Add a note in the README about this issue. Signed-off-by: Joshua Hursey <[email protected]> (cherry picked from commit 1c6a253) Signed-off-by: Joshua Hursey <[email protected]>
The |
@PHHargrove has reported a runtime problem when using the PGI 16.10 fortran compiler on OpenPOWER.
This posting on devel describes the problem in some more detail:
https://mail-archive.com/[email protected]/msg19836.html
The text was updated successfully, but these errors were encountered: