
runtime/trace: Add traceEvCPUProfileRate Event #60701

Open

Description

@felixge

Use Case 1: Extract CPU Profile

If an execution trace contains CPU sample events, it would be useful to extract a cpu/nanoseconds CPU profile similar to the one produced by runtime/pprof.StartCPUProfile.

This can be useful when building tools. E.g. a "trace to CPU profile" tool. Or perhaps a tool for explaining the _Grunning time of a goroutine using the CPU samples for that goroutine. A naive solution would give equal weight to each collected CPU sample and stretch it over the goroutine's total _Grunning time, but that could be misleading in the presence of scheduler latency, see below.
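
A minimal sketch of what such a "trace to CPU profile" tool could look like, using the github.com/google/pprof/profile package. The cpuSample type here is a hypothetical stand-in for a parsed traceEvCPUSample event, and the fixed period parameter is exactly the information this issue proposes to record:

```go
package traceprofile

import (
	"time"

	"github.com/google/pprof/profile"
)

// cpuSample is a hypothetical stand-in for a parsed traceEvCPUSample event.
type cpuSample struct {
	stack []uint64 // program counters, leaf first
}

// samplesToProfile converts trace CPU samples into a pprof profile. Without
// a traceEvCPUProfileRate event, period must be assumed (10ms by default).
func samplesToProfile(samples []cpuSample, period time.Duration) *profile.Profile {
	p := &profile.Profile{
		SampleType: []*profile.ValueType{
			{Type: "samples", Unit: "count"},
			{Type: "cpu", Unit: "nanoseconds"},
		},
		PeriodType: &profile.ValueType{Type: "cpu", Unit: "nanoseconds"},
		Period:     period.Nanoseconds(),
	}
	locs := make(map[uint64]*profile.Location)
	for _, s := range samples {
		var stack []*profile.Location
		for _, pc := range s.stack {
			loc, ok := locs[pc]
			if !ok {
				loc = &profile.Location{ID: uint64(len(locs) + 1), Address: pc}
				locs[pc] = loc
				p.Location = append(p.Location, loc)
			}
			stack = append(stack, loc)
		}
		p.Sample = append(p.Sample, &profile.Sample{
			Location: stack,
			Value:    []int64{1, period.Nanoseconds()},
		})
	}
	return p
}
```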

Use Case 2: Understand OS Scheduler Latency

A Go application might experience two types of scheduler latency: OS Scheduler Latency and Go Scheduler Latency. The latter can easily be analyzed using the execution tracer.

Detecting OS scheduler latency is a bit more tricky, but possible. Over a long enough time period, the cumulative time goroutines spend in the running state should converge to the cumulative number of traceEvCPUSample events multiplied by the sample period (10ms by default). If there are significantly fewer traceEvCPUSample events than expected, that's a strong indicator that the application is not getting enough scheduling time from the OS. That's a common problem for some setups, so it'd be nice to use tracing data to detect it.

(There are some dragons here when it comes to CPU samples received during syscalls/cgo ... but I think that deserves a separate discussion)
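
As a back-of-the-envelope sketch of this heuristic (the inputs are assumed to come from a trace post-processor, and the 50% threshold is arbitrary, for illustration only):

```go
import "time"

// osLatencySuspected reports whether the observed number of CPU samples is
// far below what the cumulative _Grunning time predicts. runningTime would be
// summed from the trace; period currently has to be assumed (10ms default).
func osLatencySuspected(runningTime time.Duration, observedSamples int, period time.Duration) bool {
	expected := int(runningTime / period)
	// Sampling is statistical, so only flag a large deficit.
	return observedSamples < expected/2
}
```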

Problem:

The traceEvCPUSample event does not include a value indicating how much CPU time it represents:

```go
traceEvCPUSample = 49 // CPU profiling sample [timestamp, real timestamp, real P id (-1 when absent), goroutine id, stack]
```

One could assume that it's always 10ms, but that won't work if the user calls runtime.SetCPUProfileRate. Unfortunately the execution trace does not record this value, and it's not possible to get the currently active value from userland either. Unlike SetMutexProfileFraction, SetCPUProfileRate does not return the previous value, and there is no GetCPUProfileRate function either.

Additionally, it's currently not possible to calculate the expected number of traceEvCPUSample events if the CPU profiler is not enabled for the entire duration of the trace.

Suggestion:

Add a new traceEvCPUProfileRate event that is recorded in the following cases:

  1. The tracer starts while the CPU profiler is already running.
  2. StartCPUProfile is called while the tracer is running.
  3. StopCPUProfile is called while the tracer is running (recording a rate of 0).

Alternatively we could also have a start/stop event for the CPU profiler.
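
Either way, a trace consumer could then attribute a concrete duration to each sample. A rough sketch of the consuming side, against a hypothetical parsed event stream (none of these types exist today):

```go
import "time"

// event is a hypothetical parsed trace event; only the fields needed for
// this sketch are shown.
type event struct {
	kind string // "cpuProfileRate" (proposed) or "cpuSample"
	rate int64  // samples per second; 0 means the profiler was stopped
}

// cpuTimeRepresented sums the CPU time represented by sample events,
// updating the rate as proposed traceEvCPUProfileRate events are seen.
func cpuTimeRepresented(events []event) time.Duration {
	var total time.Duration
	var rate int64 // unknown until the first rate event
	for _, ev := range events {
		switch ev.kind {
		case "cpuProfileRate":
			rate = ev.rate
		case "cpuSample":
			if rate > 0 {
				total += time.Second / time.Duration(rate)
			}
		}
	}
	return total
}
```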

cc @mknyszek @prattmic @nsrip-dd @rhysh

Activity

rhysh (Contributor) commented on Jun 9, 2023

OS scheduler latency is frustrating, and I agree that better tools for tracking it down would be nice.

First, on the problem of "how to detect OS scheduler latency". When I've seen it in execution traces (from quota exhaustion), it appears as if a goroutine is running on a P with no interruptions for 50+ milliseconds. (But, maybe it shows up in ways that I'm not currently able to see? Can you say more on how it appears in apps you support?)

Historically speaking, and IIUC, the sysmon goroutine has been written to interrupt a G after it's been on a P for a full 10-millisecond time quantum of its own. A G might use the remainder of another goroutine's time quantum "for free" and then get a full time slice of its own, so in my experience CPU-bound goroutines often run for 9.99 + 10 = 19.99 ms in a row. It's possible this behavior has changed recently. But if not, that may be a somewhat reliable way to detect latency from the OS scheduler (though not guaranteed to be stable, including for operators who might tweak the sysmon settings).
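
A sketch of this detection heuristic, assuming a post-processor has already reduced the trace to per-goroutine running intervals (the types and the threshold are illustrative, not an existing API):

```go
import "time"

// runningInterval is a hypothetical reduction of a trace: one uninterrupted
// _Grunning span of a goroutine.
type runningInterval struct {
	goroutine  uint64
	start, end time.Duration // offsets into the trace
}

// longRuns returns intervals longer than threshold, e.g. a value comfortably
// above the ~20ms that sysmon's preemption behavior predicts.
func longRuns(intervals []runningInterval, threshold time.Duration) []runningInterval {
	var out []runningInterval
	for _, iv := range intervals {
		if iv.end-iv.start > threshold {
			out = append(out, iv)
		}
	}
	return out
}
```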

Second, on another hurdle to using CPU profile events for this purpose. The traceEvCPUSample event doesn't include the M, and (on Linux, with timer_create) the 10 ms interval timer is associated with the M. The M is also what the OS chooses to schedule (or not). M-centric execution traces will help here. The current situation gets particularly confusing when an M is stopping, and when it is re-starting. It has no P and no (user) G, so the traceEvCPUSample event isn't attached to anything in particular.

But, for your use case of knowing whether there's OS scheduler latency, if the P+M hasn't taken an exclusive lock on a G (by putting it in the _Grunnable state), then maybe it doesn't matter much if some particular M isn't scheduled by the OS?

Third, on SetCPUProfileRate. That particular function almost doesn't work, and might go away (#40094). On the other hand, that same concept is about to get a new home (#42502). But finally, it looks like "rate" won't continue to be sufficient to describe the meaning of CPU profile samples (#53286). So if there is to be a new tracing event to describe CPU profile samples, I think that event should have a clear path to how it'll work with perf_events or similar extensions.

The fourth thing on my mind is that the time that a P is in _Prunning and the total value of CPU profile samples that its M collects can diverge (on small scales) for other reasons. In runqgrab, there's a call to usleep(3) when a P is trying to steal the runnext goroutine from other Ps (on the fourth and final iteration of the stealWork loop) https://github.com/golang/go/blob/go1.20.5/src/runtime/proc.go#L6163 . That doesn't generate an event in the current execution tracer; it's part of the blank time between a P's Gs.

I'm not sure how much coordination can be expected between the code that asks for a CPU profile and the code that asks for an execution trace. It sounds like you're preparing for there to be little to none. I wonder if there's a path forward that involves accessing more runtime internals via the runtime/metrics package. That wouldn't allow the execution-trace-collecting code to be notified of changes to the CPU profiler config, but it could allow it to ask about things like "is the sysmon interval still 10ms". And then identify OS scheduler latency by looking for goroutines that appear to run for more than 20ms. As I understand it, Go's 10ms scheduling quanta are small relative to the OS's scheduling quanta (which for the case of CPU quota might be 100ms or 250ms), and that could be an advantage in detection.
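
The runtime/metrics read pattern for such a query could look roughly like this; the metric name below is hypothetical and does not exist today, it's shown only to illustrate where such a knob could surface:

```go
import "runtime/metrics"

// readSysmonInterval illustrates the runtime/metrics access pattern. The
// metric name below is hypothetical; no such metric exists today.
func readSysmonInterval() (float64, bool) {
	s := []metrics.Sample{{Name: "/sched/sysmon-interval:seconds"}}
	metrics.Read(s)
	if s[0].Value.Kind() != metrics.KindFloat64 {
		return 0, false // metric not provided by this runtime
	}
	return s[0].Value.Float64(), true
}
```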

felixge (Contributor, Author) commented on Jun 11, 2023

@rhysh thanks for your thoughtful analysis. In addition to my comments below, I've also added another use case to the issue description that is hopefully simpler: Extract CPU Profile.

> OS scheduler latency is frustrating, and I agree that better tools for tracking it down would be nice.

> First, on the problem of "how to detect OS scheduler latency". When I've seen it in execution traces (from quota exhaustion), it appears as if a goroutine is running on a P with no interruptions for 50+ milliseconds. (But, maybe it shows up in ways that I'm not currently able to see? Can you say more on how it appears in apps you support?)

I just started looking into this problem, so I haven't had a chance to analyze any real-world data yet. That being said, I suspect it will look similar to what you're describing.

> Historically speaking, and IIUC, the sysmon goroutine has been written to interrupt a G after it's been on a P for a full 10-millisecond time quantum of its own. A G might use the remainder of another goroutine's time quantum "for free" and then get a full time slice of its own, so in my experience CPU-bound goroutines often run for 9.99 + 10 = 19.99 ms in a row. It's possible this behavior has changed recently. But if not, that may be a somewhat reliable way to detect latency from the OS scheduler (though not guaranteed to be stable, including for operators who might tweak the sysmon settings).

Last time I looked at this, I saw goroutines run up to 25ms before being preempted on darwin, but this 5ms mismatch is likely due to usleep accuracy on that platform. But on a platform with accurate timers I'd also expect goroutines running for > 20ms in a row to be a sign of OS scheduling latency.

That being said, using such a heuristic feels brittle in the future, see #60693.

> Second, on another hurdle to using CPU profile events for this purpose. The traceEvCPUSample event doesn't include the M, and (on Linux, with timer_create) the 10 ms interval timer is associated with the M. The M is also what the OS chooses to schedule (or not). M-centric execution traces will help here. The current situation gets particularly confusing when an M is stopping, and when it is re-starting. It has no P and no (user) G, so the traceEvCPUSample event isn't attached to anything in particular.

> But, for your use case of knowing whether there's OS scheduler latency, if the P+M hasn't taken an exclusive lock on a G (by putting it in the _Grunnable state), then maybe it doesn't matter much if some particular M isn't scheduled by the OS?

Yeah. I'm not trying to infer the OS scheduling latency experienced by an M. I'm trying to understand the impact on Gs that are in _Grunning.

> Third, on SetCPUProfileRate. That particular function almost doesn't work, and might go away (#40094). On the other hand, that same concept is about to get a new home (#42502).

Yup.

> But finally, it looks like "rate" won't continue to be sufficient to describe the meaning of CPU profile samples (#53286). So if there is to be a new tracing event to describe CPU profile samples, I think that event should have a clear path to how it'll work with perf_events or similar extensions.

Good point. It seems like all of these events would still have a rate, so perhaps the only thing missing from my proposed traceEvCPUProfileRate event is a field to indicate the selected CPUProfileEvent?

> The fourth thing on my mind is that the time that a P is in _Prunning and the total value of CPU profile samples that its M collects can diverge (on small scales) for other reasons. In runqgrab, there's a call to usleep(3) when a P is trying to steal the runnext goroutine from other Ps (on the fourth and final iteration of the stealWork loop) https://github.com/golang/go/blob/go1.20.5/src/runtime/proc.go#L6163 . That doesn't generate an event in the current execution tracer; it's part of the blank time between a P's Gs.

Interesting. But during this usleep the P would not have a G in _Grunning state, would it? I'm trying to sum up _Grunning time rather than _Prunning time right now.

> I'm not sure how much coordination can be expected between the code that asks for a CPU profile and the code that asks for an execution trace. It sounds like you're preparing for there to be little to none.

I expect the code that starts the trace to also be the code that controls the CPU profile rate. I'm just worried that I can't guarantee this right now.

> I wonder if there's a path forward that involves accessing more runtime internals via the runtime/metrics package. That wouldn't allow the execution-trace-collecting code to be notified of changes to the CPU profiler config, but it could allow it to ask about things like "is the sysmon interval still 10ms". And then identify OS scheduler latency by looking for goroutines that appear to run for more than 20ms. As I understand it, Go's 10ms scheduling quanta are small relative to the OS's scheduling quanta (which for the case of CPU quota might be 100ms or 250ms), and that could be an advantage in detection.

I'm definitely open to exploring alternative solutions here, but as mentioned above, I'm worried that the concept of a scheduling quantum might go away when there are idle Ps, see #60693.

prattmic (Member) commented on Jun 12, 2023

Regarding OS scheduler latency, this could be a good opportunity to combine the Go execution tracer with external data, notably perf sched, which records OS scheduler behavior and can be used to understand scheduling latency.

https://github.com/google/schedviz/blob/master/doc/walkthrough.md is an example of a visualization tool for this data.

With this data, one could imagine the trace viewer having "M descheduled" events on a per-M view, as well as adding a per-CPU view.

mknyszek (Contributor) commented on Jun 12, 2023

Given that there may be better ways to identify OS scheduler latency, I think we should consider this in the context of a more general effort to make execution traces a superset of CPU profiles.

In that context, it might not make sense to have an event for the rate and it might be better to attach this information to a broader "CPUProfileStart" event. (A "CPUProfileStop" event would also be a fairly natural place to drop in some of the other information, for example an encoded form of the memory mapping.)
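
Sketched in the style of the existing event definitions, such a pair might look like this (names, numbers, and payloads are purely illustrative; nothing like this exists in the runtime today):

```go
traceEvCPUProfileStart = 50 // CPU profiler started [timestamp, rate in Hz, profiled event kind]
traceEvCPUProfileStop  = 51 // CPU profiler stopped [timestamp, optional metadata such as memory mappings]
```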

ianlancetaylor (Contributor) commented on Jun 14, 2023

I don't think this needs to go through the proposal process, so removing the proposal label. I think this is up to the runtime team.

added the NeedsInvestigation label on Jun 14, 2023
added this to the Backlog milestone on Jun 14, 2023