Skip to content

Self-hosted runner (AMD mi250 CI caller) #424

Self-hosted runner (AMD mi250 CI caller)

Self-hosted runner (AMD mi250 CI caller) #424

Triggered via workflow run December 15, 2023 14:46
@younesbelkadayounesbelkada
completed 74cae67
Status Failure
Total duration 27m 13s
Artifacts
AMD mi250  /  Check Runner Status
6s
AMD mi250 / Check Runner Status
Matrix: AMD mi250 / Check Runners
Matrix: AMD mi250 / Setup
Matrix: AMD mi250 / Model tests
AMD mi250  /  Send results to webhook
2m 14s
AMD mi250 / Send results to webhook
Fit to window
Zoom out
Zoom in

Annotations

3 errors and 1 warning
AMD mi250 / Check Runners (multi-gpu)
The self-hosted runner: hf-amd-mi250-ci-2gpu-1 lost communication with the server. Verify the machine is running and has a healthy network connection. Anything in your workflow that terminates the runner process, starves it for CPU/Memory, or blocks its network access can cause this error.
AMD mi250 / Check Runners (single-gpu)
The job was canceled because "multi-gpu" failed.
AMD mi250 / Check Runners (single-gpu)
The self-hosted runner: hf-amd-mi250-ci-1gpu-2 lost communication with the server. Verify the machine is running and has a healthy network connection. Anything in your workflow that terminates the runner process, starves it for CPU/Memory, or blocks its network access can cause this error.
AMD mi250 / Check Runners (multi-gpu)
Delete stale container networks failed, docker network prune fail with exit code 1