Skip to content

Increase BUFFER_SIZEs and add a safeguard; supply GEMM_R for POWER8/9 #2551

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 4 commits into from
Apr 12, 2020
Merged

Increase BUFFER_SIZEs and add a safeguard; supply GEMM_R for POWER8/9 #2551

merged 4 commits into from
Apr 12, 2020

Conversation

martin-frbg
Copy link
Collaborator

@martin-frbg martin-frbg commented Apr 12, 2020

As shown in #2538, the default value for BUFFER_SIZE was smaller than the actual requirement as calculated from the GEMM parameters on ARM64. This turns out to be the case on x86_64 Haswell and SkylakeX platform as well as ARM and ZARCH too.
Furthermore, the formula to derive GEMM_R values not supplied in param.h would never allow the BUFFER_SIZE requirement to be fulfilled, so matching values were calculated for POWER8 and 9.
Fixes #2538 following the suggestion of akobotov there and extending it to other affected architectures.

…_R for POWER8/9

As shown in #2538, default buffersizes on some platforms were smaller than required in memory.c
and the requirement could never be fulfilled for a calculated GEMM_R on PPC given the fomula used
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

ARM64v8-TSV110: dgemm sigfaults on large square matrices single thread
1 participant