Skip to content

Commit 7feedc6

Browse files
committed
[ExecuTorch][Llama] Use Eigen blas for custom sdpa
OpenBlas's implementation is not thread safe. Thus when used within parallel_for, it produces incorrect output. This has been documented in a few places like here OpenMathLib/OpenBLAS#1441 and OpenMathLib/OpenBLAS#2543. I tried few options to disable openblas's multithreading but none of them seemed to work. It is possible that upstream openblas has fixed this. Will validate this by pulling in latest, but in the meanwhile using eigen_blas to unblock. Differential Revision: [D64334733](https://our.internmc.facebook.com/intern/diff/D64334733/) [ghstack-poisoned]
1 parent cd2d2b4 commit 7feedc6

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

kernels/optimized/lib_defs.bzl

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -144,7 +144,7 @@ def define_libs():
144144
(
145145
"^android-arm64.*$",
146146
[
147-
"fbsource//third-party/openblas:openblas",
147+
"fbsource//arvr/third-party/eigen:eigen3_blas",
148148
],
149149
),
150150
],

0 commit comments

Comments
 (0)