Closed
Description
After investigation of ROCm/pytorch#1232
we found the root cause of the issue to be https://github.com/pytorch/builder/blob/main/common/install_rocm_drm.sh#L100
- The code looks for "amdgpu.ids" at the root of the executable (removing /bin/exec) (i.e /path/to/bin/python => /path/to)
- The
check_for_location_of_amdgpuids
always returns 0 so the search is never cancelled once the first file is found. - because the search is never cancelled the entire python installation is walked through
- the file is not even installed close to the python installation so looking there is pointless anyway
- the amgpu.ids is just a mapping of product id to marketing names; the features is far from critical but takes a significant amount of time for no reasons.
- other part of the code will directly load
/opt/amdgpu/share/libdrm/amdgpu.ids
so why bother looking for it if we know where it is.