Closed
Description
I'm not sure if this is a bug or a feature, but in testing the accelerator framework work on a system with Nvidia GPUS, I noticed that the osu_mbw_mr test (the 5.8 version of OSU benchmarks) fails using cuda allocated buffers. The problem is that the cuda component. doesn't get initialized until after calling MPI_Init, so the address checker code is not employed. The other mpi/pt2pt tests initialize CUDA before calling MPI_Init and don't see this problem.
I've not tested the rocm component, so don't know if it would have the same issue.