Regression with current pre-0.2.9 git and the elk code #329
Comments
@wernsaar, could you look at this issue?
On 15.12.2013 23:36, martin-frbg wrote:
if you want to build for haswell, piledriver or bulldozer, you need Werner
The build system in question is openSUSE 12.3, so fairly recent (binutils 2.23, gcc 4.7.2; using gcc 4.8.2 does not solve the problem). I will do further analysis with valgrind 3.9 instead of the 3.8.1 used for the above. Please note that a Sandy Bridge build of 0.2.8 works without errors, while a Sandy Bridge build of 0.2.9 is unusable on the same system.
Just for the record, updating binutils to 2.24 did not change anything. (Neither did updating valgrind.)
Finally got around to taking another look. It turned out the problem with 0.2.9-rc1 is specific to OpenMP: when Elk is compiled without the "-fopenmp" from its default make.inc settings, all its tests pass on Haswell with 0.2.9-rc1. Conversely, a -fopenmp build linked against 0.2.9-rc1 fails even on the Nehalem architecture, where 0.2.8 works well (provided that it was built with USE_THREAD=0, USE_OPENMP=1).
The problem apparently was introduced well before the Haswell branch was merged. Bisecting now.
dfd1064 is the first bad commit
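For anyone wanting to repeat this kind of bisection, here is a minimal sketch of a helper that could be driven by `git bisect run`. It is not the procedure actually used in this thread: the function names `bisect_verdict` and `bisect_step` are hypothetical, and the build and test commands are left for the caller to supply. The only fixed convention is git's own: exit code 0 marks a commit good, 125 tells git to skip it, and other codes from 1 to 127 mark it bad.

```shell
#!/bin/sh
# Translate build/test outcomes into the exit codes `git bisect run` expects:
#   0   -> commit is good
#   125 -> commit cannot be tested (skip), e.g. the library failed to build
#   1   -> commit is bad (the test suite failed)
bisect_verdict() {
    build_status=$1
    test_status=$2
    if [ "$build_status" -ne 0 ]; then
        echo 125
    elif [ "$test_status" -ne 0 ]; then
        echo 1
    else
        echo 0
    fi
}

# Hypothetical per-commit step.
#   $1: command that rebuilds the library at the current commit
#   $2: command that relinks the application and runs its tests
# Both commands are placeholders supplied by the caller.
bisect_step() {
    build_status=0
    sh -c "$1" >/dev/null 2>&1 || build_status=$?
    test_status=0
    if [ "$build_status" -eq 0 ]; then
        sh -c "$2" >/dev/null 2>&1 || test_status=$?
    fi
    return "$(bisect_verdict "$build_status" "$test_status")"
}
```

A wrapper script that calls `bisect_step` with real commands and exits with its status could then be passed to `git bisect run`, after `git bisect start` and after marking one known-good and one known-bad revision. Skipping unbuildable commits rather than marking them bad keeps unrelated build breakage from misleading the search.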
I have now confirmed that removing the distinction between F(P)FLAGS and LAPACK_F(P)FLAGS, introduced by the above change to Makefile.system, also fixes my problem in current git head.
@martin-frbg, thank you for the investigation. I added dfd1064 to fix the SEGFAULT with OpenMP on Windows.
Yes, I saw that but it was not clear to me if that was a real fix, and not just papering over a different problem.
Please try develop branch.
Thank you. (Might it make sense to revisit #287 now that 0.2.9 contains a newer LAPACK?)
Will see if I can get openblas&elk built on a windows/mingw system in the near future for additional insight.
Just a quick heads up - I will try to pinpoint the problem if possible later:
Using the ELK "computational chemistry" code from elk.sourceforge.net, I see lots of failures in the test problems distributed with the code when I build it against the current git version of OpenBLAS optimized for either Haswell or Sandybridge.
Using 0.2.8 built for Sandybridge, all is well on this i7-4770 (though openblas gives no measurable speedup on these small problems).
To reproduce:
0.2.9, runtime is more than doubled (due to the test cases failing to converge)
and most tests report failure.
One hint: in test-002, valgrind emits lots of "use of uninitialized value" warnings in calls to dgemv_t, dgemm_kernel, zscal_k etc.
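To compare builds on this hint, one rough option is to save valgrind's output to a file and count the relevant reports. The sketch below assumes such a log already exists; the function name `count_uninit_reports` and the log file name are hypothetical. Valgrind itself spells the word "uninitialised", so the pattern accepts both spellings.

```shell
#!/bin/sh
# Count lines in a saved valgrind log that report uninitialised values.
# A log could be produced along these lines (binary path is an assumption):
#   valgrind --track-origins=yes --log-file=valgrind.log ./elk
count_uninit_reports() {
    # grep -c prints 0 but exits non-zero when nothing matches,
    # so `|| true` keeps the function from signalling failure.
    grep -E -c 'uninitiali[sz]ed value' "$1" || true
}
```

Running it on logs from a 0.2.8 build and a current git build gives a quick, if crude, indication of whether the warnings are new. The `--track-origins=yes` option makes valgrind report where each uninitialised value came from, at the cost of a slower run.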