Thread filter optim #238

r1viollet · 2025-07-07T07:51:19Z

What does this PR do?:

Reserve padded slots
Introduce a register / unregister to retrieve slots
manage a free list

Motivation:

Improve throughput of applications that run on many threads with many context updates.

Additional Notes:

How to test the change?:

For Datadog employees:

If this PR touches code that signs or publishes builds or packages, or handles
credentials of any kind, I've requested a review from @DataDog/security-design-and-guidance.
This PR doesn't touch any of that.
JIRA: [JIRA-XXXX]

Unsure? Have a question? Request a review!

github-actions · 2025-07-07T07:51:51Z

🔧 Report generated by pr-comment-cppcheck

CppCheck Report

Errors (2)

Warnings (8)

The class 'RecordingBuffer' defines member function with name 'flushIfNeeded' also defined in its parent class 'Buffer'.
Member variable 'CodeCache::_image_base' is not initialized in the copy constructor.
Member variable 'CodeCache::_image_base' is not assigned a value in 'CodeCache::operator='.
Member variable 'JitWriteProtection::_restore' is not initialized in the constructor.
Member variable 'JitWriteProtection::_restore' is not initialized in the constructor.
Either the condition 'uc!=nullptr' is redundant or there is possible null pointer dereference: uc.
Obsolete function 'alloca' called.
Member variable 'ElfParser::_vaddr_diff' is not initialized in the constructor.

Style Violations (306)

github-actions · 2025-07-07T07:52:38Z

🔧 Report generated by pr-comment-scanbuild

r1viollet · 2025-07-07T11:48:41Z

I have reasonable performance on most runs:

Benchmark                                                       (command)  (skipResults)  (workload)  Mode  Cnt    Score   Error  Units
ThreadFilterBenchmark.threadFilterStress01  cpu=100us,wall=100us,filter=1           true           0  avgt         0.039          us/op
ThreadFilterBenchmark.threadFilterStress01  cpu=100us,wall=100us,filter=1           true           7  avgt         0.041          us/op
ThreadFilterBenchmark.threadFilterStress01  cpu=100us,wall=100us,filter=1           true       70000  avgt       111.094          us/op
ThreadFilterBenchmark.threadFilterStress02  cpu=100us,wall=100us,filter=1           true           0  avgt         0.132          us/op
ThreadFilterBenchmark.threadFilterStress02  cpu=100us,wall=100us,filter=1           true           7  avgt         0.139          us/op
ThreadFilterBenchmark.threadFilterStress02  cpu=100us,wall=100us,filter=1           true       70000  avgt       108.666          us/op
ThreadFilterBenchmark.threadFilterStress04  cpu=100us,wall=100us,filter=1           true           0  avgt         0.258          us/op
ThreadFilterBenchmark.threadFilterStress04  cpu=100us,wall=100us,filter=1           true           7  avgt         0.278          us/op
ThreadFilterBenchmark.threadFilterStress04  cpu=100us,wall=100us,filter=1           true       70000  avgt       118.940          us/op
ThreadFilterBenchmark.threadFilterStress08  cpu=100us,wall=100us,filter=1           true           0  avgt         0.624          us/op
ThreadFilterBenchmark.threadFilterStress08  cpu=100us,wall=100us,filter=1           true           7  avgt         0.646          us/op
ThreadFilterBenchmark.threadFilterStress08  cpu=100us,wall=100us,filter=1           true       70000  avgt       160.170          us/op
ThreadFilterBenchmark.threadFilterStress16  cpu=100us,wall=100us,filter=1           true           0  avgt         1.780          us/op
ThreadFilterBenchmark.threadFilterStress16  cpu=100us,wall=100us,filter=1           true           7  avgt         2.288          us/op
ThreadFilterBenchmark.threadFilterStress16  cpu=100us,wall=100us,filter=1           true       70000  avgt       221.987          us/op

I'm not sure why some runs still blow up for higher numbers of threads.

ddprof-lib/src/main/cpp/profiler.cpp

ddprof-lib/src/main/cpp/threadFilter.cpp

r1viollet · 2025-07-10T15:54:56Z

CppCheck Report

Errors (2)

Warnings (8)

The class 'RecordingBuffer' defines member function with name 'flushIfNeeded' also defined in its parent class 'Buffer'.
Member variable 'CodeCache::_image_base' is not initialized in the copy constructor.
Member variable 'CodeCache::_image_base' is not assigned a value in 'CodeCache::operator='.
Member variable 'JitWriteProtection::_restore' is not initialized in the constructor.
Member variable 'JitWriteProtection::_restore' is not initialized in the constructor.
Either the condition 'uc!=nullptr' is redundant or there is possible null pointer dereference: uc.
Obsolete function 'alloca' called.
Member variable 'ElfParser::_vaddr_diff' is not initialized in the constructor.

Style Violations (306)

- Reserve padded slots - Introduce a register / unregister to retrieve slots - manage a free list

jbachorik · 2025-07-24T11:26:08Z

I did run some comparison of native memory usage with different thread filter implementations - data is in the notebook

TL;DR there is no observable increase in the native memory usage (the UNDEFINED category). Anyway, it would be useful to have an extra counter for the ThreadIDTable utilization.

…t/thread_filter_squash

If the TLS cleanup fires before the JVMTI hook, we want to ensure that we don't crash while retrieving the ProfiledThread - Add a check on validity of ProfiledThread

zhengyu123 · 2025-08-12T18:45:19Z

ddprof-lib/src/main/cpp/threadFilter.cpp

+
+    // Allocate a new slot
+    SlotID index = _next_index.fetch_add(1, std::memory_order_relaxed);
+    if (index >= kMaxThreads) {


Not sure if it is important, but it can race unregisterThread, you may want to check return value of fetch_sub to determinate if it is really full.

zhengyu123 · 2025-08-12T18:49:48Z

ddprof-lib/src/main/cpp/threadFilter.cpp


-  _enabled = true;
+    // Ensure the chunk is initialized (lock-free)
+    if (chunk_idx >= _num_chunks.load(std::memory_order_acquire)) {


I don't quite understand, are index and chunk_idx 1-to-1 matched?

zhengyu123 · 2025-08-12T19:03:02Z

ddprof-lib/src/main/cpp/threadFilter.cpp

+    ChunkStorage* expected = nullptr;
+    if (_chunks[chunk_idx].compare_exchange_strong(expected, new_chunk, std::memory_order_acq_rel)) {
+        // Successfully installed - initialize all slots
+        for (auto& slot : new_chunk->slots) {


I believe that initializing new_chunk can be done before compare_exchange_strong. Then you don't need initialized flag, which can result an awkward situation, e.g. you found chunk, but it is not initialized.

r1viollet force-pushed the r1viollet/thread_filter_squash branch 3 times, most recently from e5bce28 to 0918008 Compare July 7, 2025 11:45

r1viollet mentioned this pull request Jul 7, 2025

Refactor thread filter mechanisms #209

Closed

3 tasks

jbachorik force-pushed the r1viollet/thread_filter_squash branch 2 times, most recently from e0ac246 to 2421ba9 Compare July 10, 2025 12:48

r1viollet commented Jul 10, 2025

View reviewed changes

ddprof-lib/src/main/cpp/profiler.cpp Show resolved Hide resolved

r1viollet commented Jul 10, 2025

View reviewed changes

ddprof-lib/src/main/cpp/threadFilter.cpp Show resolved Hide resolved

r1viollet and others added 4 commits July 21, 2025 16:56

Thread filter optim

94b2559

- Reserve padded slots - Introduce a register / unregister to retrieve slots - manage a free list

Add an automatic register in case we failed to register the thread

51cb97f

Exterminate the last remnants of false sharing

cc02c1e

Minor tweaks

50a8d5f

jbachorik force-pushed the r1viollet/thread_filter_squash branch from 2421ba9 to 50a8d5f Compare July 21, 2025 14:57

Merge cleanup

30d32c0

jbachorik and others added 3 commits July 24, 2025 21:43

Merge branch 'main' into r1viollet/thread_filter_squash

bf23309

Merge branch 'main' of github.com:DataDog/java-profiler into r1violle…

90651f5

…t/thread_filter_squash

Adjust ThreadEnd hook

28e23ee

If the TLS cleanup fires before the JVMTI hook, we want to ensure that we don't crash while retrieving the ProfiledThread - Add a check on validity of ProfiledThread

zhengyu123 reviewed Aug 12, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Thread filter optim #238

Thread filter optim #238

Uh oh!

r1viollet commented Jul 7, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Jul 7, 2025 •

edited

Loading

Errors (2)

Warnings (8)

Style Violations (306)

Uh oh!

github-actions bot commented Jul 7, 2025 •

edited

Loading

Uh oh!

r1viollet commented Jul 7, 2025

Uh oh!

Uh oh!

Uh oh!

r1viollet commented Jul 10, 2025 •

edited by dd-octo-sts bot

Loading

Errors (2)

Warnings (8)

Style Violations (306)

Uh oh!

jbachorik commented Jul 24, 2025

Uh oh!

zhengyu123 Aug 12, 2025

Uh oh!

zhengyu123 Aug 12, 2025

Uh oh!

zhengyu123 Aug 12, 2025 •

edited

Loading

Uh oh!

Uh oh!

Thread filter optim #238

Are you sure you want to change the base?

Thread filter optim #238

Uh oh!

Conversation

r1viollet commented Jul 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Jul 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

CppCheck Report

Errors (2)

Warnings (8)

Style Violations (306)

Uh oh!

github-actions bot commented Jul 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

r1viollet commented Jul 7, 2025

Uh oh!

Uh oh!

Uh oh!

r1viollet commented Jul 10, 2025 • edited by dd-octo-sts bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

CppCheck Report

Errors (2)

Warnings (8)

Style Violations (306)

Uh oh!

jbachorik commented Jul 24, 2025

Uh oh!

zhengyu123 Aug 12, 2025

Choose a reason for hiding this comment

Uh oh!

zhengyu123 Aug 12, 2025

Choose a reason for hiding this comment

Uh oh!

zhengyu123 Aug 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

r1viollet commented Jul 7, 2025 •

edited

Loading

github-actions bot commented Jul 7, 2025 •

edited

Loading

github-actions bot commented Jul 7, 2025 •

edited

Loading

r1viollet commented Jul 10, 2025 •

edited by dd-octo-sts bot

Loading

zhengyu123 Aug 12, 2025 •

edited

Loading