Optimize busy loop for modern linux by default #2041
Conversation
The Bulldozer spin code right above gives questionable benefit on Piledriver: you get no turbo, though in theory the reaction to threads finishing is immediate.
Nothing here is new; I am just pointing out cause and consequence.
So why sleep again, like in my earlier "failed" PR, rather than nop or pause?
A nop does not un-schedule the process, while a sleep does. That eases the LXC scenario: the core spinning the busy loop actually gives cycles to whatever else is on the system, since it is not contributing to the computation at that point anyway.
E.g. run an idle-priority process or thread per core outside the OpenBLAS process, then check how much CPU time it accumulated; with the sleep version it gets a few seconds instead of those cycles being burned inside the kernel.
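A rough way to try that measurement (my own sketch, not part of the PR; it assumes Linux and the `SCHED_IDLE` policy): run one of these spinners per core outside the OpenBLAS process while a threaded gemm runs, then compare how much CPU time each spinner picked up under the spinning yield versus the sleeping yield.

```c
#define _GNU_SOURCE          /* for SCHED_IDLE */
#include <sched.h>
#include <stdio.h>
#include <time.h>

/* Idle-priority spinner: it only runs when nothing else wants this core. */
int main(void)
{
    struct sched_param sp = { .sched_priority = 0 };
    if (sched_setscheduler(0, SCHED_IDLE, &sp) != 0)
        perror("sched_setscheduler");          /* falls back to normal priority */

    struct timespec wall0, wall, cpu;
    clock_gettime(CLOCK_MONOTONIC, &wall0);
    do {                                       /* spin for ~30 s of wall time */
        clock_gettime(CLOCK_MONOTONIC, &wall);
    } while (wall.tv_sec - wall0.tv_sec < 30);

    clock_gettime(CLOCK_PROCESS_CPUTIME_ID, &cpu);
    printf("CPU time this idle spinner got: %ld.%03ld s\n",
           (long)cpu.tv_sec, cpu.tv_nsec / 1000000L);
    return 0;
}
```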
A nop chain should be fine in the cases where it lets the SIMD part of the CPU sleep: the core is not completely idle, but power goes down and turbo goes up. It is also possible that one system in a hundred disables idle mwait/hlt in the kernel, so that short naps are not naps at all. As long as this lives in generic code, no single choice will be best for every case, but it should at least not be very bad by default; assuming nanosleep ends up in hlt or mwait when no other process runs, it should be safe for the common case.
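For comparison, the nop/pause alternative discussed above looks roughly like this (illustrative only, `SPIN_YIELD` is not the actual OpenBLAS macro): the thread stays runnable the whole time, it just tells the core to relax for a few cycles per iteration.

```c
/* Spin-yield via a pause hint (a nop chain is the same idea): the thread
 * never leaves the CPU, so nothing else can run on that core meanwhile. */
#if defined(__x86_64__) || defined(__i386__)
#define SPIN_YIELD() __asm__ __volatile__("pause")
#else
#define SPIN_YIELD() __asm__ __volatile__("nop;nop;nop;nop;nop;nop;nop;nop")
#endif
```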
Sleep is wildly better on a virtual machine, while on a real CPU it makes little difference either way. I don't know why.
I will make a new PR with a top-level option, leaving the defaults intact; packagers can then try to measure.
Use a plain clocked wait in place of sched_yield, which is heavy on the modern kernel side, also when called from light virtualisations like LXC or chroots.
I managed to get 8000 sched_yields per second without PTI and 5000 with it, so this is already better for cases where a zero threading threshold is in force, and, since the kernel is no longer being hammered, also for turbo CPUs.
It dropped about 5% of the time spent in gemm on a huge problem set; I did not test much more.
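A minimal sketch of the kind of change described (the function name, the build switch and the delay value are illustrative, not the exact patch): the worker's busy-wait yield does a short clocked sleep instead of calling sched_yield(), so the spinning core is really descheduled.

```c
#include <sched.h>
#include <time.h>

static inline void busy_loop_yield(void)
{
#ifdef USE_SLEEP_YIELD                        /* hypothetical build switch */
    struct timespec ts = { 0, 4000 };         /* a few microseconds; tune per system */
    clock_nanosleep(CLOCK_MONOTONIC, 0, &ts, NULL);
#else
    sched_yield();                            /* old default: expensive with PTI, LXC */
#endif
}
```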
Explanation repeated from a different point in the file:
This does not solve the problem of a busy loop being employed where some light IPC could work; it just eases the life of the current code.