Parallelise smaps #2715

wacuuu · 2020-11-09T14:09:52Z

In relation to #2622, I introduced a change that should lower the cost of gathering referenced-bytes information. As described in the issue, using this feature with big containers in the current implementation stops cAdvisor from refreshing metrics while it waits for referenced bytes to be read. To mitigate the problem, I implemented reading referenced bytes independently of the main execution, in goroutines. Moreover, to avoid peaks in cAdvisor's CPU usage, the reads are distributed over the operation interval.

It is also important to mention that, for a precise estimation of WSS, it is good to reset referenced bytes independently of reading them, hence the new parameter.

Signed-off-by: Jakub Walecki [email protected]

k8s-ci-robot · 2020-11-09T14:10:01Z

Hi @wacuuu. Thanks for your PR.

I'm waiting for a google member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

iwankgb · 2020-11-10T11:06:31Z

/ok-to-test

Creatone · 2020-11-26T13:24:05Z

docs/runtime_options.md

+--pid=host \
+--privileged \
+--name=cadvisor \
+cadvisor:$CADVISOR_TAG


According to docs/deploy.md, shouldn't it be google/cadvisor:$CADVISOR_TAG?

No, as this is only an example.

Creatone · 2020-11-26T13:33:01Z

docs/runtime_options.md

+--privileged \
+--name=cadvisor \
+cadvisor:$CADVISOR_TAG
+--disable_metrics="cpu_topology,resctrl,udp,sched,hugetlb,node_vmstat,memory_numa,tcp,advtcp,percpu,process" \


Got: invalid value "cpu_topology,resctrl,udp,sched,hugetlb,node_vmstat,memory_numa,tcp,advtcp,percpu,process" for flag -disable_metrics: unsupported metric "node_vmstat" specified in disable_metrics

Creatone · 2020-11-26T14:21:26Z

docs/runtime_options.md

+- `--referenced_read_interval` duration Read interval for referenced bytes (container_referenced_bytes metric), number of seconds after which referenced bytes are read, if set to 0 referenced bytes are never read (default: 0s)
+- `--referenced_reset_interval` duration Reset interval for referenced bytes (container_referenced_bytes metric), number of seconds after which referenced bytes are cleared, if set to 0 referenced bytes are never cleared (default: 0s)
+
+The referenced memory value is based on one of two files in `/sys/<PID>`: smaps or smaps_rollup. If the latter exists it reports the same value as the first one, but in aggregated form. This is only implementation flavoring and there is no difference whether smaps_rollup exists or not.


Do you mean `/proc/<PID>` instead of `/sys/<PID>`?

Creatone · 2020-11-26T14:40:24Z

container/libcontainer/handler.go

 		if err != nil {
 			klog.V(4).Infof("Could not get PIDs for container %d: %v", h.pid, err)
 		} else {
-			stats.ReferencedMemory, err = referencedBytesStat(pids, h.cycles, *referencedResetInterval)
+			stats.ReferencedMemory = h.referencedMemory * 1024


Can you add a comment about converting ReferencedMemory to bytes?

Creatone · 2020-11-26T15:31:02Z

container/libcontainer/handler.go

-		smapsFilePath := fmt.Sprintf(smapsFilePathPattern, pid)
+		smapsFilePath = fmt.Sprintf(smapsRollupFilePattern, pid)
+		if _, err := os.Stat(smapsFilePath); err == nil {
+			klog.V(6).Infof("Using smaps_rollup for pid %d instead of smaps", pid)


Can you provide a reason why this verbosity level is set to 6?

Because this information is useful when debugging and understanding the code, yet it would clutter the logs if reported at a lower level.

Creatone · 2020-11-27T13:13:20Z

container/libcontainer/handler.go

+		return
+	}
+	castResetInterval := *referencedResetInterval
+	time.Sleep(time.Duration(rand.Intn(int(castResetInterval.Seconds()))))


Could you explain this sleep?

Creatone · 2020-11-27T13:13:27Z

container/libcontainer/handler.go

+		return
+	}
+	castReadInterval := *referencedReadInterval
+	time.Sleep(time.Duration(rand.Intn(int(castReadInterval.Seconds()))))


Could you explain this sleep?

Both of those sleeps prevent CPU usage spikes. Imagine that cAdvisor is started on a machine populated by a lot of containers, or that containers are created in a batch, that is, a lot of containers in a short period of time. In either case, every container creation starts a new handler, and with it the thread responsible for reading the smaps file (and, if requested, another one for resetting it).

As the interval is common to all the threads (it comes from a parameter), you end up in a situation where all of those reading threads access their respective smaps files at almost the same moment. As smaps is not a regular file but an interface to a kernel function, this triggers kernel execution and eats up CPU. Have a lot of not-so-multiprocess containers and you'll find yourself with CPU usage spikes every referenced_read_interval.

Signed-off-by: Jakub Walecki <[email protected]>

Creatone · 2021-09-27T10:23:09Z

Feel free to reopen this PR if you think it's important.

google-cla bot added the cla: yes label Nov 9, 2020

k8s-ci-robot added the needs-ok-to-test label Nov 9, 2020

wacuuu force-pushed the jwalecki/parallel_smaps_2 branch from 32a1e4b to f51dd03 Compare November 10, 2020 10:50

k8s-ci-robot added ok-to-test and removed needs-ok-to-test labels Nov 10, 2020

Creatone reviewed Nov 27, 2020

JensErat mentioned this pull request Dec 17, 2020

Export entire smaps memory metrics instead of only referenced_memory #2767

Closed

5 tasks

Jakub Walecki added 3 commits December 27, 2020 16:51

Parallelise smaps

fa41abd

Signed-off-by: Jakub Walecki <[email protected]>

Fix linting issues

ae8490d

Signed-off-by: Jakub Walecki <[email protected]>

Review requests and rebase

1625fe6

Signed-off-by: Jakub Walecki <[email protected]>

wacuuu force-pushed the jwalecki/parallel_smaps_2 branch from f51dd03 to 1625fe6 Compare December 27, 2020 16:13

Creatone closed this Sep 27, 2021
