High disk usage during docker:dind scenarios with Testcontainers #875

Open
mawl opened this issue Nov 25, 2024 · 1 comment
Labels: bug (Something isn't working), question (Further information is requested)


mawl commented Nov 25, 2024

Environment:
Rancher: v2.9.3
sysbox: registry.nestybox.com/nestybox/sysbox-deploy-k8s:v0.6.5

Container Runtime Version: cri-o://1.30.4
Image: Ubuntu 22.04.5 LTS
Kernel Version: 5.15.0-124-generic
Kube Proxy Version: v1.30.5+rke2r1
Kubelet Version: v1.30.5+rke2r1
vCPUs: 16
RAM: 36GB

Hey,

We're seeing high disk usage and failing CI jobs on Rancher worker nodes once or twice a week, mainly in docker:dind scenarios using Testcontainers. Updating to Sysbox v0.6.5 did not fix this.

[Screenshot attached to the original issue]

sudo journalctl -eu crio

Nov 25 12:47:28 ranchernode crio[1642137]: time="2024-11-25 12:47:28.516519173Z" level=warning msg="Stopping container ad738e20887bad39204cd6106c7d16308a1f8a53a5f21d54cf26ca3714938edb with stop signal timed out. Killing..."
Nov 25 12:47:28 ranchernode crio[1642137]: time="2024-11-25 12:47:28.517333181Z" level=warning msg="Stopping container 68095211d7cb09101f5ad415e7f160f8298c9d5c232068553ce10ebbd64a8ce1 with stop signal timed out. Killing..."

Thanks in advance for any help.
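
To help tell whether the same containers repeatedly fail to stop, the crio warnings quoted above could be tallied with a short script. This is only a sketch: the regular expression assumes the exact message wording shown in this issue (other CRI-O versions may phrase it differently), and the function name `timed_out_containers` is made up for illustration.

```python
import re
import sys
from collections import Counter

# Assumption: matches the CRI-O warning wording quoted in this issue.
TIMEOUT_RE = re.compile(
    r'msg="Stopping container (?P<id>[0-9a-f]{12,64}) with stop signal timed out'
)


def timed_out_containers(lines):
    """Count stop-timeout warnings per container ID in journal lines."""
    counts = Counter()
    for line in lines:
        match = TIMEOUT_RE.search(line)
        if match:
            counts[match.group("id")] += 1
    return counts


if __name__ == "__main__":
    # Reads journal text from stdin and prints short ID plus warning count.
    for container_id, n in timed_out_containers(sys.stdin).most_common():
        print(f"{container_id[:12]}  {n}")
```

One possible way to run it, assuming the script is saved under the hypothetical name `crio_timeouts.py`: `sudo journalctl -u crio --no-pager | python3 crio_timeouts.py`.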

ctalledo (Member) commented Dec 9, 2024

Hi @mawl, can you provide more info on the setup and how to reproduce the problem?

I see you are installing Sysbox on a Rancher K8s cluster, but what Testcontainers workload are you running on it? And when does the problem manifest itself? Is it somehow related to the number (or size) of images created by the Docker-in-Docker engine? In other words, does the problem occur when the Docker engine running inside the Sysbox pod has too many images? And does it occur when the pod gets started or stopped, or while the pod is running?

The more info you can provide, the better. Thanks.

ctalledo self-assigned this Dec 9, 2024
ctalledo added the question and bug labels Dec 9, 2024