-
Notifications
You must be signed in to change notification settings - Fork 900
Predefined error handler such as MPI_ERRORS_ARE_FATAL does not work with main branch #11817
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
I did some debug and I found the error was caused by the following commit:
This commit removed the However, for predefined error handle, its |
@wzamazon thank you for the debugging, I can have a look, but it might be the end of the week until I get to it. |
Thank you! I think I have a solution. I looked into the change. IIUC, the removal of So I think we can add |
@wzamazon if that fixes the problem, that would be fantastic. This is what I would have looked into as well btw. |
#11818 did that. Please take a look. Thank you! |
@wzamazon Did we break ERRORS_ARE_FATAL on v5.0.x as well? If so, this is a blocker. |
Yes. As can be seen in mtt output:
|
backport PR has been merged |
Thank you for taking the time to submit an issue!
Background information
What version of Open MPI are you using? (e.g., v3.0.5, v4.0.2, git branch name and hash, etc.)
main branch
Describe how Open MPI was installed (e.g., from a source/distribution tarball, from a git clone, from an operating system distribution package, etc.)
Compiled from source
If you are building/installing from a git clone, please copy-n-paste the output from
git submodule status
.Please describe the system on which you are running
Details of the problem
As can be seen from mtt test result, the
MPI_Errhandler_fatal
test is not working as expected.The test set the error handler to MPI_ERRORS_ARE_FATAL, then called
MPI_Send
to a non-existing rank.It should have caused the application to abort immediately, but
MPI_Send
returned an error.The text was updated successfully, but these errors were encountered: