Skip to content

[do not land] H100 for float8 #2386

[do not land] H100 for float8

[do not land] H100 for float8 #2386

Triggered via pull request November 6, 2024 16:48
Status Failure
Total duration 11m 48s
Artifacts

float8_test.yml

on: pull_request
Matrix: test
Fit to window
Zoom out
Zoom in

Annotations

3 errors and 1 warning
test (H100, linux.aws.h100, --pre torch --index-url https://download.pytorch.org/whl/nightly/cu12... / linux-job
The self-hosted runner: i-0e4172a00b41fa6c6-1004 lost communication with the server. Verify the machine is running and has a healthy network connection. Anything in your workflow that terminates the runner process, starves it for CPU/Memory, or blocks its network access can cause this error.
test (H100, linux.aws.h100, --pre torch --index-url https://download.pytorch.org/whl/nightly/cu12... / linux-job
An error occurred trying to start process '/usr/bin/bash' with working directory '/home/david/_work/ao/ao/pytorch/ao'. No such file or directory
test (SM-89, linux.g6.4xlarge.experimental.nvidia.gpu, --pre torch --index-url https://download.p... / linux-job
Node.js 16 actions are deprecated. Please update the following actions to use Node.js 20: actions/checkout@v3, nick-fields/retry@3e91a01664abd3c5cd539100d10d33b9c5b68482, ./test-infra/.github/actions/setup-ssh, pmeier/[email protected]. For more information see: https://github.blog/changelog/2023-09-22-github-actions-transitioning-from-node-16-to-node-20/.