Self-hosted runner (AMD mi250 CI caller) #411
Triggered via workflow run
December 14, 2023 14:53
amyeroberts
completed
388fd31
Status
Failure
Total duration
18m 16s
Artifacts
–
self-push-amd-mi250-caller.yml
on: workflow_run
AMD mi250
/
Check Runner Status
7s
Matrix: AMD mi250 / Check Runners
Matrix: AMD mi250 / Setup
Matrix: AMD mi250 / Model tests
AMD mi250
/
Send results to webhook
29s
Annotations
2 errors and 1 warning
AMD mi250 / Check Runners (multi-gpu)
The self-hosted runner: hf-amd-mi250-ci-2gpu-2 lost communication with the server. Verify the machine is running and has a healthy network connection. Anything in your workflow that terminates the runner process, starves it for CPU/Memory, or blocks its network access can cause this error.
|
AMD mi250 / Check Runners (single-gpu)
The job was canceled because "multi-gpu" failed.
|
AMD mi250 / Check Runners (single-gpu)
Runner hf-amd-mi250-ci-1gpu-3 did not respond to a cancelation request with 00:05:00.
|