
K3s etcd snapshot reconcile consumes excessive memory when a large number of snapshots are present #10450

Closed
purha opened this issue Jul 4, 2024 · 9 comments

purha commented Jul 4, 2024

Environmental Info:
K3s Version: v1.28.10+k3s1 (a4c5612)
go version go1.21.9

This also affects some earlier versions.

Node(s) CPU architecture, OS, and Version: Linux kubernetes-worker-f-1 6.1.0-22-arm64 #1 SMP Debian 6.1.94-1 (2024-06-21) aarch64 GNU/Linux

The issue also exists at least on Ubuntu 22.04 arm64.

Cluster Configuration: 3 servers, each with all roles

Describe the bug:

When there are a lot of snapshots in S3, K3s will at some point consume all available memory, and then the OOM killer kicks in.
Normally old snapshots get cleaned up, but because of bug #10292 they don't.
This only affects a single node in the cluster.

Steps To Reproduce:

  • Install K3s with etcd snapshots to S3 (AWS) enabled. As snapshots pile up, memory runs out faster and faster.

Expected behavior:

Memory should not run out.

Actual behavior:

Memory runs out on the node, causing the OOM killer to start killing processes. Restarting the k3s service fixes the issue for a while, until memory runs out again.

Additional context / logs:

Memory consumption example graph attached.
[screenshot: memory consumption graph]


brandond commented Jul 5, 2024

> When there are a lot of snapshots in S3, K3s will at some point consume all available memory, and then the OOM killer kicks in.

Is there actually a cumulative memory leak, or is the memory required to manage the snapshots directly proportional to the number of snapshots found on disk and in S3?

If there is a cumulative memory leak, this should show up as increasing memory usage over time despite a static number of etcd snapshots.

@brandond brandond self-assigned this Jul 5, 2024
@brandond brandond added this to the v1.30.3+k3s1 milestone Jul 5, 2024

purha commented Jul 6, 2024

> > When there are a lot of snapshots in S3, K3s will at some point consume all available memory, and then the OOM killer kicks in.
>
> Is there actually a cumulative memory leak, or is the memory required to manage the snapshots directly proportional to the number of snapshots found on disk and in S3?
>
> If there is a cumulative memory leak, this should show up as increasing memory usage over time despite a static number of etcd snapshots.

It seems cumulative; the number of snapshots affects how long it takes for memory to run out. With S3 snapshots disabled, the issue is gone and memory usage is stable.


brandond commented Jul 8, 2024

What are the units on your graph? Can you show the actual memory utilization of the k3s process in bytes? How many snapshots did you have in the cluster when you saw the memory utilization growing?

I'm trying to reproduce this by profiling k3s with S3 enabled, retention set to 120, and snapshots taken once per minute, but I'm not quite sure that I'm seeing exactly the same thing as you.
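
For reference, a config.yaml along these lines matches that test setup; the bucket and region values below are placeholders rather than anything taken from this issue:

```yaml
# /etc/rancher/k3s/config.yaml
etcd-snapshot-schedule-cron: "* * * * *"  # one snapshot per minute
etcd-snapshot-retention: 120              # keep the last 120 snapshots
etcd-s3: true
etcd-s3-bucket: k3s-snapshots             # placeholder bucket name
etcd-s3-region: us-east-1                 # placeholder region
```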


brandond commented Jul 8, 2024

I am also curious if you've tried adding a memory limit to the k3s systemd unit. By default the k3s systemd unit does not have a memory limit on it, and without any external memory pressure, golang will not free memory back to the operating system. So you could just be seeing secondary effects of k3s requiring more memory to reconcile a large number of snapshots, and golang not freeing memory until it absolutely needs to.
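
As a rough sketch, a systemd drop-in like the one below adds such a limit; the unit name k3s.service is the default from the install script, and the limit values are examples only:

```ini
# /etc/systemd/system/k3s.service.d/memory-limit.conf
[Service]
# Soft limit: the kernel applies memory pressure above this, which also
# encourages the Go runtime to return freed memory to the OS sooner.
MemoryHigh=2G
# Hard limit: exceeding this triggers the OOM killer for the unit.
MemoryMax=3G
```

After creating the drop-in, run `systemctl daemon-reload` and restart k3s for it to take effect.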


brandond commented Jul 8, 2024

Just to share what I'm seeing: I do see k3s allocating a lot of memory while reconciling snapshots, but this memory is freed at the end of each snapshot save cycle. Note that the memory is allocated but no longer in use, which means that it is available to be freed or reused. This is NOT a leak, but I can try to see if there is some potential for enhancement here to avoid the momentary spike in memory during reconcile.

[screenshot: alloc_space profile]
[screenshot: inuse_space profile]
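
For anyone who wants to reproduce these two views, and assuming k3s was started with --enable-pprof (which exposes /debug/pprof on the supervisor port, 6443 by default; whether the endpoint needs credentials may vary by version), something like the following works:

```sh
# Grab a heap profile from the local supervisor endpoint
curl -sk https://127.0.0.1:6443/debug/pprof/heap -o heap.pprof

# Memory currently held by live objects (inuse_space)
go tool pprof -sample_index=inuse_space -top heap.pprof

# Cumulative allocations, including memory that has since been freed (alloc_space)
go tool pprof -sample_index=alloc_space -top heap.pprof
```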


brandond commented Jul 8, 2024

Just from glancing at this, I suspect that just adding some pagination to the various list operations would take the memory utilization down a lot. The current code pulls a full list into memory on every pass, which will be expensive with hundreds of snapshots.

The profiling also makes it clear that this is NOT a leak, and is not related to minio. So I am going to edit the issue title to better reflect the root of the problem.

[screenshot: heap profile]
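
As an illustration of the pagination/streaming idea (not the actual k3s code), the minio-go v7 client already lets a listing be consumed as a stream instead of being accumulated into one big slice; the endpoint, credentials, bucket, and prefix below are made up:

```go
package main

import (
	"context"
	"log"

	"github.com/minio/minio-go/v7"
	"github.com/minio/minio-go/v7/pkg/credentials"
)

func main() {
	// Hypothetical endpoint and credentials, for illustration only.
	client, err := minio.New("s3.example.com", &minio.Options{
		Creds:  credentials.NewStaticV4("ACCESS_KEY", "SECRET_KEY", ""),
		Secure: true,
	})
	if err != nil {
		log.Fatal(err)
	}

	// ListObjects returns a channel, so each snapshot's metadata can be
	// handled one object at a time rather than held in a full in-memory list.
	opts := minio.ListObjectsOptions{Prefix: "etcd-snapshot-", Recursive: true}
	for obj := range client.ListObjects(context.Background(), "k3s-snapshots", opts) {
		if obj.Err != nil {
			log.Fatal(obj.Err)
		}
		log.Printf("reconciling %s (%d bytes)", obj.Key, obj.Size)
	}
}
```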

@brandond brandond changed the title Memory leak in s3 snapshots / minio client K3s etcd snapshot reconcile consumes excessive memory when a large number of snapshots are present Jul 8, 2024

purha commented Jul 9, 2024

I don't have the data anymore, but I think there were around 300 snapshots or more, from roughly 60 days, plus a few on-demand snapshots. I deleted all but the last 14 days, and you can see from the graph that this helped slightly. You can also see when I disabled the snapshots altogether. I've also attached the heap profile that I took; at that point k3s was already consuming gigabytes of memory. I didn't try setting memory limits for the service.

[attachment: heap profile (profile008)]

[screenshot: memory usage graph]


purha commented Jul 9, 2024

The usage is in percent, and that's a node with 8 GB of memory.

@aganesh-suse

Closing based on release-1.30 results here: #10559
