-
Notifications
You must be signed in to change notification settings - Fork 900
OPAL/MCA/COMMON/UCX: #11632 bugfix - mca string variables registration #11640
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
OPAL/MCA/COMMON/UCX: #11632 bugfix - mca string variables registration #11640
Conversation
Hello! The Git Commit Checker CI bot found a few problems with this PR: c319ccb: OPAL/MCA/COMMON/UCX: changed tls var registration,...
Please fix these problems and, if necessary, force-push new commits back up to the PR branch. Thanks! |
c319ccb
to
831c5b0
Compare
@roiedanino could you update this PR? |
And when merged, mind opening up a v5.0 PR please |
beb2894
to
9104fc1
Compare
9104fc1
to
dfdc992
Compare
@yosefe is this good to go? |
opal/mca/common/ucx/common_ucx.h
Outdated
@@ -5,6 +5,7 @@ | |||
* and Technology (RIST). All rights reserved. | |||
* Copyright (c) 2019-2020 High Performance Computing Center Stuttgart, | |||
* University of Stuttgart. All rights reserved. | |||
* Copyright (c) 2022 IBM Corporation. All rights reserved. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
i don't think we can add this..
e42bfc1
to
0ca009b
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In the future, please separate whitespace / style changes from code changes (i.e., have them in different commits). This helps understand the git history easier.
At a minimum, please squash the 2 commits in this PR down to a single commit. I.e., don't have a wrong commit (with the IBM copyright) followed by a "fixup" commit to fix that errant copyright, because then that wrong commit will forever be in Open MPI's git history.
Thanks.
0ca009b
to
ba0ee36
Compare
We discussed this PR on the Tuesday call today. Conclusions:
|
ba0ee36
to
b210f19
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sorry, this commit was originally cherry-picked from d79d5e8, then @janjust updated it; I'll update the commit message. |
b210f19
to
4c9776d
Compare
This nominally looks correct to me, but I have no good way of testing it. Someone in the UCX community should validate this. |
4c9776d
to
62f4933
Compare
62f4933
to
58fecda
Compare
bugfix open-mpi#11632 Signed-off-by: Roie Danino <[email protected]>
58fecda
to
32aba0b
Compare
@roiedanino please open up v5.0 PR |
Fixing #11632 segfault
The root cause for the segfault:
ompi_info
was hitting a segfault while trying to deregister the UCXtls
mca-variable after UCX components were already unloaded - resulting in illegal memory access.Suggested solution
Cherry-pick from a commit that already fixed this issue in v4.1.x: d79d5e8