Skip to content

Commit d743c25

Browse files
committed
Due to bug in OpenBLAS remove thread binding from defaults
The bug OpenMathLib/OpenBLAS#2238 is fixed in the development version
1 parent 22aadde commit d743c25

File tree

3 files changed

+7
-9
lines changed

3 files changed

+7
-9
lines changed

docs/computing/running/creating-job-scripts-mahti.md

Lines changed: 3 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -96,9 +96,9 @@ export OMP_NUM_THREADS=1
9696
srun myprog -i input -o output
9797
```
9898

99-
For hybrid applications, one should use `OMP_PLACES` and
100-
`OMP_PROC_BIND` OpenMP runtime environment variables for obtaining
101-
optimum placement of OpenMP threads. As an example, in order to run
99+
For hybrid applications, one should use
100+
`OMP_PROC_BIND` OpenMP runtime environment variable for
101+
laceming the OpenMP threads. As an example, in order to run
102102
one MPI tasks per NUMA domain and one OpenMP thread per L3cache one
103103
can set
104104

@@ -107,7 +107,6 @@ can set
107107
#SBATCH --cpus-per-task=16
108108

109109
export OMP_NUM_THREADS=4
110-
export OMP_PLACES=cores
111110
export OMP_PROC_BIND=spread
112111

113112
module load myprog/1.2.3

docs/computing/running/example-job-scripts-mahti.md

Lines changed: 0 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -52,8 +52,6 @@ srun myprog <options>
5252
5353
# Set the number of threads based on --cpus-per-task
5454
export OMP_NUM_THREADS=$SLURM_CPUS_PER_TASK
55-
# Bind OpenMP threads to cores
56-
export OMP_PLACES=cores
5755
5856
srun myprog <options>
5957
```
@@ -75,8 +73,6 @@ srun myprog <options>
7573
7674
# Set the number of threads based on --cpus-per-task
7775
export OMP_NUM_THREADS=$SLURM_CPUS_PER_TASK
78-
# Bind OpenMP threads to hardware threads
79-
export OMP_PLACES=threads
8076
8177
srun myprog <options>
8278
```

docs/computing/running/performance-checklist.md

Lines changed: 4 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -54,7 +54,10 @@ higher and poor load balancing gets more likely.
5454

5555
Many HPC applications benefit from binding OpenMP threads to CPU cores
5656
which can be achieved by setting `export OMP_PLACES=cores` in the
57-
batch job script. When starting new production runs it is also good
57+
batch job script. Note! Due to bug in OpenBLAS thread binding should not be
58+
specified when using threaded OpenBLAS (-omp module).
59+
60+
When starting new production runs it is also good
5861
practice to ensure correct thread affinity by adding to batch job
5962
script
6063
```

0 commit comments

Comments
 (0)