[SPARK-49133][CORE] Make member `MemoryConsumer#used` atomic to avoid user code causing deadlock #51849

zhztheplayer · 2025-08-05T15:38:15Z

What changes were proposed in this pull request?

Turn the field MemoryConsumer#used from long to AtomicLong.

Why are the changes needed?

MemoryConsumer is not internally thread-safe, so developers must add their own locking when multiple threads allocate memory concurrently in the same task.

Consider multiple threads allocating memory in the same task (a special case under Spark's memory model). To keep MemoryConsumer thread-safe, the user has to lock every call to its API. In that case, if one memory consumer spills another concurrently, there is a risk of an ABBA deadlock. For example:

In thread 1, consumer A acquires memory from the TaskMemoryManager (TMM).
In thread 2, consumer B acquires memory from TMM and spills consumer A.

The deadlock occurs when thread 1 holds consumer A's lock and waits for TMM's lock, while thread 2, acting for consumer B, holds TMM's lock and waits for A's lock in order to spill it.
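The lock ordering described above can be sketched in isolation. This is a minimal illustration, not Spark code: `consumerALock` and `tmmLock` are hypothetical stand-ins for the user-added lock on consumer A and the TMM's own lock, and `tryLock` with a timeout replaces a plain `lock()` so the example terminates instead of hanging.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.locks.ReentrantLock;

public class AbbaDeadlockSketch {
    // Hypothetical stand-ins (assumptions, not Spark classes):
    // the user's lock around consumer A, and the TMM's internal lock.
    static final ReentrantLock consumerALock = new ReentrantLock();
    static final ReentrantLock tmmLock = new ReentrantLock();

    /**
     * Locks `first`, waits until the other thread also holds its first lock,
     * then tries to take `second`. Returns whether `second` was obtained.
     */
    static boolean takeBoth(ReentrantLock first, ReentrantLock second,
                            CountDownLatch bothHoldFirst, CountDownLatch bothTried) {
        first.lock();
        try {
            bothHoldFirst.countDown();
            bothHoldFirst.await();
            // With a plain lock() both threads would block here forever.
            boolean got = second.tryLock(100, TimeUnit.MILLISECONDS);
            if (got) {
                second.unlock();
            }
            // Keep holding `first` until the other thread has finished trying.
            bothTried.countDown();
            bothTried.await();
            return got;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        } finally {
            first.unlock();
        }
    }

    /** Runs both threads; element 0 is thread 1's result, element 1 is thread 2's. */
    static boolean[] run() {
        CountDownLatch bothHoldFirst = new CountDownLatch(2);
        CountDownLatch bothTried = new CountDownLatch(2);
        boolean[] gotSecond = new boolean[2];
        // Thread 1: consumer A's lock first, then the TMM lock (A then B).
        Thread t1 = new Thread(() ->
            gotSecond[0] = takeBoth(consumerALock, tmmLock, bothHoldFirst, bothTried));
        // Thread 2: the TMM lock first, then consumer A's lock (B then A).
        Thread t2 = new Thread(() ->
            gotSecond[1] = takeBoth(tmmLock, consumerALock, bothHoldFirst, bothTried));
        t1.start();
        t2.start();
        try {
            t1.join();
            t2.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return gotSecond;
    }

    public static void main(String[] args) {
        boolean[] r = run();
        System.out.println("thread 1 obtained TMM lock: " + r[0]);
        System.out.println("thread 2 obtained consumer A lock: " + r[1]);
    }
}
```

Neither thread can obtain its second lock while the other still holds it, which is the state that turns into a permanent hang once the timeout is removed.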

To fix this problem, Spark can make MemoryConsumer thread-safe by using an atomic MemoryConsumer#used, so users do not have to add a lock in most cases.
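As a rough sketch of why an atomic counter removes the need for a user-side lock: `SketchConsumer` below is a hypothetical, heavily simplified class (not Spark's MemoryConsumer), and its `acquire` and `free` methods only do the bookkeeping on `used`. Several threads update the counter without any external lock and no update is lost.

```java
import java.util.concurrent.atomic.AtomicLong;

public class AtomicUsedSketch {
    /** Hypothetical simplified consumer; only the `used` bookkeeping is modeled. */
    static class SketchConsumer {
        protected final AtomicLong used = new AtomicLong(0L);

        void acquire(long bytes) {
            used.addAndGet(bytes);   // atomic read-modify-write, no lock needed
        }

        void free(long bytes) {
            used.addAndGet(-bytes);
        }

        long getUsed() {
            return used.get();
        }
    }

    /** Each thread acquires 2 bytes and frees 1 byte per iteration, with no lock. */
    static long runConcurrently(int threads, int opsPerThread) {
        SketchConsumer consumer = new SketchConsumer();
        Thread[] workers = new Thread[threads];
        for (int i = 0; i < threads; i++) {
            workers[i] = new Thread(() -> {
                for (int j = 0; j < opsPerThread; j++) {
                    consumer.acquire(2);
                    consumer.free(1);
                }
            });
            workers[i].start();
        }
        for (Thread worker : workers) {
            try {
                worker.join();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        }
        return consumer.getUsed();
    }

    public static void main(String[] args) {
        // Net effect is +1 byte per iteration: 4 threads * 10000 iterations.
        System.out.println(runConcurrently(4, 10_000));
    }
}
```

With a plain `long` field and `used += bytes`, the same run could lose updates unless every call were wrapped in a lock, which is the lock that makes the ABBA ordering possible.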

Does this PR introduce any user-facing change?

Yes, a developer-facing change:

protected long used;

will become

protected final AtomicLong used = new AtomicLong(0L);

To handle this, developers who only need to read the value of used can call getUsed() instead. It works across all Spark versions, so they do not have to maintain a shim layer for this change.
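A small sketch of that migration advice, under stated assumptions: `OldBase` and `NewBase` are hypothetical stand-ins for the base class before and after the change, and only the `used` field and `getUsed()` are modeled. The subclass method `report()` is identical in both cases because it reads the value through `getUsed()` rather than touching the field.

```java
import java.util.concurrent.atomic.AtomicLong;

public class GetUsedSketch {
    /** Hypothetical stand-in for the base class before the change. */
    public abstract static class OldBase {
        protected long used;

        public long getUsed() {
            return used;
        }
    }

    /** Hypothetical stand-in for the base class after the change. */
    public abstract static class NewBase {
        protected final AtomicLong used = new AtomicLong(0L);

        public long getUsed() {
            return used.get();
        }
    }

    /** Developer subclass compiled against the old field type. */
    public static class OldUser extends OldBase {
        public OldUser(long initial) {
            used = initial;
        }

        public String report() {
            return "used=" + getUsed();   // does not depend on the field type
        }
    }

    /** Developer subclass compiled against the new field type. */
    public static class NewUser extends NewBase {
        public NewUser(long initial) {
            used.set(initial);
        }

        public String report() {
            return "used=" + getUsed();   // same source line as in OldUser
        }
    }

    public static void main(String[] args) {
        System.out.println(new OldUser(64).report());
        System.out.println(new NewUser(64).report());
    }
}
```

Code that writes to the field directly would still need to be adapted; only reads go through `getUsed()` unchanged.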

How was this patch tested?

No test from Spark's own code is needed, but a test case that emulates a developer's calls can be added if preferred.

Was this patch authored or co-authored using generative AI tooling?

No.


yaooqinn · 2025-08-06T03:03:55Z

cc @JoshRosen @rednaxelafx

[SPARK-49133][CORE] Make member `MemoryConsumer#used` atomic to avoid deadlock

e6e8aaf

github-actions bot added the CORE label Aug 5, 2025

zhztheplayer changed the title ~~[SPARK-49133][CORE] Make member MemoryConsumer#used atomic to avoid deadlock~~ [SPARK-49133][CORE] Make member MemoryConsumer#used atomic to avoid user code causing deadlock Aug 5, 2025

empty

76cd33e
