Self-hosted runner (nightly-ci) #36

Triggered via schedule June 25, 2024 02:28
Status Failure
Total duration 6h 58m 35s
Artifacts 3
Build Nightly CI Docker Images / Nightly PyTorch + Stable TensorFlow: 15m 23s
Build Nightly CI Docker Images / Nightly PyTorch + DeepSpeed: 22m 37s
Matrix: DeepSpeed CI / Setup
Matrix: Model CI / Setup
Matrix: DeepSpeed CI / Examples directory
Matrix: DeepSpeed CI / TensorFlow pipelines
Matrix: DeepSpeed CI / PyTorch pipelines
Matrix: DeepSpeed CI / Torch CUDA extension tests
Matrix: Model CI / Examples directory
Matrix: Model CI / TensorFlow pipelines
Matrix: Model CI / PyTorch pipelines
Matrix: Model CI / Torch CUDA extension tests
DeepSpeed CI / Extract warnings in CI artifacts: 0s
Model CI / Extract warnings in CI artifacts: 19s
DeepSpeed CI / Slack Report / Send results to webhook: 16s
Model CI / Slack Report / Send results to webhook: 21s

Annotations

5 errors and 7 warnings
DeepSpeed CI / Torch CUDA extension tests (single-gpu)
Process completed with exit code 1.
DeepSpeed CI / Torch CUDA extension tests (multi-gpu)
Value cannot be null. (Parameter 'ContainerId')
DeepSpeed CI / Torch CUDA extension tests (multi-gpu)
Value cannot be null. (Parameter 'ContainerId')
DeepSpeed CI / Torch CUDA extension tests (multi-gpu)
Docker pull failed with exit code 1
Model CI / Setup (multi-gpu)
Docker pull failed with exit code 1
Build Nightly CI Docker Images / Nightly PyTorch + Stable TensorFlow
Node.js 16 actions are deprecated. Please update the following actions to use Node.js 20: docker/setup-buildx-action@v2, docker/login-action@v2, docker/build-push-action@v3. For more information see: https://github.blog/changelog/2023-09-22-github-actions-transitioning-from-node-16-to-node-20/.
Build Nightly CI Docker Images / Nightly PyTorch + DeepSpeed
Node.js 16 actions are deprecated. Please update the following actions to use Node.js 20: docker/setup-buildx-action@v2, docker/login-action@v2, docker/build-push-action@v3. For more information see: https://github.blog/changelog/2023-09-22-github-actions-transitioning-from-node-16-to-node-20/.
DeepSpeed CI / Torch CUDA extension tests (multi-gpu)
Docker pull failed with exit code 1, back off 5.223 seconds before retry.
DeepSpeed CI / Torch CUDA extension tests (multi-gpu)
Docker pull failed with exit code 1, back off 7.879 seconds before retry.
Model CI / Setup (multi-gpu)
Docker pull failed with exit code 1, back off 8.221 seconds before retry.
Model CI / Setup (multi-gpu)
Docker pull failed with exit code 1, back off 6.731 seconds before retry.
Model CI / Slack Report / Send results to webhook
No files were found with the provided path: ci_results_run_models_gpu. No artifacts will be uploaded.

Artifacts

Produced during runtime
Name (status): Size
ci_results_run_torch_cuda_extensions_gpu (Expired): 1.46 KB
single-gpu_run_torch_cuda_extensions_gpu_test_reports (Expired): 62.2 KB
warnings_in_ci (Expired): 765 Bytes