Skip to content

btl/base_am_rdma: fix handling of btl_send returning 1 #9695

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged

Conversation

wzamazon
Copy link
Contributor

Active message RDMA uses btl_send to send the initial
request and RDMA response.

btl_send will return 0 when the descriptor has been
successfully queued for send, and will return 1 when
the desciptor has been successfully sent.

Currently, active message RDMA treats the return value
1 as an error, thus will either return the value
to caller, or retry the send.

This patch address the issue by correctly handling
the return value 1.

Signed-off-by: Wei Zhang [email protected]

@wzamazon wzamazon force-pushed the btl_am_rdma_handle_btl_send_return_1 branch from c4b7b1b to 92bba55 Compare November 24, 2021 00:07
@wzamazon wzamazon requested a review from bwbarrett November 24, 2021 00:07
Active message RDMA uses btl_send to send the initial
request and RDMA response.

btl_send will return 0 when the descriptor has been
successfully queued for send, and will return 1 when
the desciptor has been successfully sent.

Currently, active message RDMA treats the return value
1 as an error, thus will either return the value
to caller, or retry the send.

This patch address the issue by correctly handling
the return value 1.

Signed-off-by: Wei Zhang <[email protected]>
@wzamazon wzamazon force-pushed the btl_am_rdma_handle_btl_send_return_1 branch from 92bba55 to 7b177ce Compare November 24, 2021 04:08
@wzamazon wzamazon requested a review from bwbarrett November 24, 2021 04:09
@wzamazon
Copy link
Contributor Author

The PR build checker failed because a git issue:

> git checkout -f ee62a31826851bc8f0e509a4f5b97eebb4e50ac8 # timeout=10
FATAL: Could not checkout ee62a31826851bc8f0e509a4f5b97eebb4e50ac8
hudson.plugins.git.GitException: Command "git checkout -f ee62a31826851bc8f0e509a4f5b97eebb4e50ac8" returned status code 128:
stdout: 
stderr: fatal: reference is not a tree: ee62a31826851bc8f0e509a4f5b97eebb4e50ac8

should not be related to the PR.

@bwbarrett
Copy link
Member

bot:aws:retest

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants