[do not land] H100 for float8 #1233

Open
wants to merge 1 commit into base: main from msaroufim-patch-25

Trying out H100 for float8

ef39a09

PyTorch Bot / Dr.CI completed Nov 6, 2024 in 0s

Dr.CI classification results

{
  "FAILED": [
    {
      "workflowId": 11708125812,
      "workflowUniqueId": 111172765,
      "id": 32609111605,
      "runnerName": "i-0a8cc2098906ea9d8",
      "authorEmail": "[email protected]",
      "name": "Run Float8 Tests / test (SM-89, linux.g6.4xlarge.experimental.nvidia.gpu, --pre torch --index-url https://download.p... / linux-job",
      "jobName": "test (SM-89, linux.g6.4xlarge.experimental.nvidia.gpu, --pre torch --index-url https://download.p... / linux-job",
      "conclusion": "failure",
      "completed_at": "2024-11-06T16:53:54.000000000Z",
      "html_url": "https://github.com/pytorch/ao/actions/runs/11708125812/job/32609111605",
      "head_branch": "msaroufim-patch-25",
      "pr_number": 1233,
      "head_sha": "ef39a09940810b09777833b6731ee7eb40d0cead",
      "head_sha_timestamp": "2024-11-06T16:47:53.000000000Z",
      "failure_captures": [
        "RuntimeError: Command docker exec -t 1a351f0a6fde2395c907d2437d667b02d7cfa0ddf1208f3310e1bc4300aa3358 /exec failed with exit code 1"
      ],
      "failure_lines": [
        "RuntimeError: Command docker exec -t 1a351f0a6fde2395c907d2437d667b02d7cfa0ddf1208f3310e1bc4300aa3358 /exec failed with exit code 1"
      ],
      "failure_context": [
        "+ pip install -r dev-requirements.txt",
        "+ pip install --pre torch --index-url https://download.pytorch.org/whl/nightly/cu121",
        "+ python -m pip install --upgrade pip",
        "+ PATH=/opt/rh/devtoolset-10/root/usr/bin/:/opt/conda/envs/venv/bin:/opt/conda/condabin:/opt/conda/bin:/usr/local/cuda-12.1/bin:/opt/rh/devtoolset-9/root/usr/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin",
        "+ export PATH=/opt/rh/devtoolset-10/root/usr/bin/:/opt/conda/envs/venv/bin:/opt/conda/condabin:/opt/conda/bin:/usr/local/cuda-12.1/bin:/opt/rh/devtoolset-9/root/usr/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin",
        "+ yum install -y devtoolset-10-binutils",
        "+ echo '::group::Install newer objcopy that supports --set-section-alignment'",
        "+ hash -r",
        "+ '[' -n '' ']'",
        "+ '[' -n '' ']'",
        "+ __conda_hashr",
        "++ CONDA_PYTHON_EXE=/opt/conda/bin/python"
      ],
      "time": "2024-11-06T16:48:05.000000000Z"
    }
  ],
  "FLAKY": [
    {
      "workflowId": 11708125812,
      "workflowUniqueId": 111172765,
      "id": 32609112037,
      "runnerName": "i-0e4172a00b41fa6c6-1004",
      "authorEmail": "[email protected]",
      "name": "Run Float8 Tests / test (H100, linux.aws.h100, --pre torch --index-url https://download.pytorch.org/whl/nightly/cu12... / linux-job",
      "jobName": "test (H100, linux.aws.h100, --pre torch --index-url https://download.pytorch.org/whl/nightly/cu12... / linux-job",
      "conclusion": "failure",
      "completed_at": "2024-11-06T16:59:49.000000000Z",
      "html_url": "https://github.com/pytorch/ao/actions/runs/11708125812/job/32609112037",
      "head_branch": "msaroufim-patch-25",
      "pr_number": 1233,
      "head_sha": "ef39a09940810b09777833b6731ee7eb40d0cead",
      "head_sha_timestamp": "2024-11-06T16:47:53.000000000Z",
      "failure_captures": [],
      "failure_lines": [],
      "failure_context": [],
      "time": "2024-11-06T16:48:05.000000000Z"
    }
  ],
  "BROKEN_TRUNK": [
    {
      "workflowId": 11708125930,
      "workflowUniqueId": 89543087,
      "id": 32609114408,
      "runnerName": "i-0069938c239a24bae",
      "authorEmail": "[email protected]",
      "name": "Run Regression Tests / test (CPU Nightly, linux.4xlarge, --pre torch --index-url https://download.pytorch.org/whl/nightl... / linux-job",
      "jobName": "test (CPU Nightly, linux.4xlarge, --pre torch --index-url https://download.pytorch.org/whl/nightl... / linux-job",
      "conclusion": "failure",
      "completed_at": "2024-11-06T16:51:05.000000000Z",
      "html_url": "https://github.com/pytorch/ao/actions/runs/11708125930/job/32609114408",
      "head_branch": "msaroufim-patch-25",
      "pr_number": 1233,
      "head_sha": "ef39a09940810b09777833b6731ee7eb40d0cead",
      "head_sha_timestamp": "2024-11-06T16:47:53.000000000Z",
      "failure_captures": [],
      "failure_lines": [],
      "failure_context": [],
      "time": "2024-11-06T16:48:07.000000000Z"
    },
    {
      "workflowId": 11708125930,
      "workflowUniqueId": 89543087,
      "id": 32609116399,
      "runnerName": "i-0f8e208c2d9a34885",
      "authorEmail": "[email protected]",
      "name": "Run Regression Tests / test (CUDA Nightly, linux.g5.12xlarge.nvidia.gpu, --pre torch --index-url https://download.pytorc... / linux-job",
      "jobName": "test (CUDA Nightly, linux.g5.12xlarge.nvidia.gpu, --pre torch --index-url https://download.pytorc... / linux-job",
      "conclusion": "failure",
      "completed_at": "2024-11-06T16:55:03.000000000Z",
      "html_url": "https://github.com/pytorch/ao/actions/runs/11708125930/job/32609116399",
      "head_branch": "msaroufim-patch-25",
      "pr_number": 1233,
      "head_sha": "ef39a09940810b09777833b6731ee7eb40d0cead",
      "head_sha_timestamp": "2024-11-06T16:47:53.000000000Z",
      "failure_captures": [],
      "failure_lines": [],
      "failure_context": [],
      "time": "2024-11-06T16:48:10.000000000Z"
    }
  ],
  "UNSTABLE": []
}
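
The classification payload above is easier to act on once it is reduced to per-category counts and the jobs that carry an actual failure capture. The following is a minimal sketch, not part of Dr.CI itself: the helper names (`count_by_category`, `jobs_with_captures`) are hypothetical, the embedded payload is a trimmed copy that keeps only the `id`, `html_url`, and `failure_captures` fields from the results above, and treating only jobs with a non-empty `failure_captures` list as needing attention is an assumption about how to read the output.

```python
import json

# Trimmed copy of the Dr.CI payload above. Only the fields this sketch reads
# are kept; all values are copied from the classification results.
DRCI_PAYLOAD = """
{
  "FAILED": [
    {"id": 32609111605,
     "html_url": "https://github.com/pytorch/ao/actions/runs/11708125812/job/32609111605",
     "failure_captures": ["RuntimeError: Command docker exec -t 1a351f0a6fde2395c907d2437d667b02d7cfa0ddf1208f3310e1bc4300aa3358 /exec failed with exit code 1"]}
  ],
  "FLAKY": [
    {"id": 32609112037,
     "html_url": "https://github.com/pytorch/ao/actions/runs/11708125812/job/32609112037",
     "failure_captures": []}
  ],
  "BROKEN_TRUNK": [
    {"id": 32609114408,
     "html_url": "https://github.com/pytorch/ao/actions/runs/11708125930/job/32609114408",
     "failure_captures": []},
    {"id": 32609116399,
     "html_url": "https://github.com/pytorch/ao/actions/runs/11708125930/job/32609116399",
     "failure_captures": []}
  ],
  "UNSTABLE": []
}
"""


def count_by_category(payload: str) -> dict:
    """Return the number of jobs listed under each classification key."""
    results = json.loads(payload)
    return {category: len(jobs) for category, jobs in results.items()}


def jobs_with_captures(payload: str) -> list:
    """Return (category, job) pairs whose failure_captures list is non-empty.

    Assumption: a job with no captured failure line gives nothing to debug
    from this payload alone, so it is skipped here.
    """
    results = json.loads(payload)
    return [
        (category, job)
        for category, jobs in results.items()
        for job in jobs
        if job.get("failure_captures")
    ]


if __name__ == "__main__":
    for category, count in count_by_category(DRCI_PAYLOAD).items():
        print(f"{category}: {count}")
    for category, job in jobs_with_captures(DRCI_PAYLOAD):
        print(f"[{category}] job {job['id']}: {job['failure_captures'][0]}")
        print(f"  {job['html_url']}")
```

On this payload the only job with a captured failure line is the SM-89 float8 job under `FAILED`; the H100 job is listed under `FLAKY` with empty capture fields, so its log has to be read from the job URL directly.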