Self-hosted runner (AMD mi250 CI caller) #424
Triggered via workflow run
December 15, 2023 14:46
younesbelkada
completed
74cae67
Status
Failure
Total duration
27m 13s
Artifacts
–
self-push-amd-mi250-caller.yml
on: workflow_run
AMD mi250
/
Check Runner Status
6s
Matrix: AMD mi250 / Check Runners
Matrix: AMD mi250 / Setup
Matrix: AMD mi250 / Model tests
AMD mi250
/
Send results to webhook
2m 14s
Annotations
3 errors and 1 warning
AMD mi250 / Check Runners (multi-gpu)
The self-hosted runner: hf-amd-mi250-ci-2gpu-1 lost communication with the server. Verify the machine is running and has a healthy network connection. Anything in your workflow that terminates the runner process, starves it for CPU/Memory, or blocks its network access can cause this error.
|
AMD mi250 / Check Runners (single-gpu)
The job was canceled because "multi-gpu" failed.
|
AMD mi250 / Check Runners (single-gpu)
The self-hosted runner: hf-amd-mi250-ci-1gpu-2 lost communication with the server. Verify the machine is running and has a healthy network connection. Anything in your workflow that terminates the runner process, starves it for CPU/Memory, or blocks its network access can cause this error.
|
AMD mi250 / Check Runners (multi-gpu)
Delete stale container networks failed, docker network prune fail with exit code 1
|