Skip to content

Actions: microsoft/DeepSpeed

cpu-torch-latest

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
3,209 workflow runs
3,209 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

cpu-torch-latest
cpu-torch-latest #3832: Scheduled
December 28, 2024 00:11 14m 28s master
December 28, 2024 00:11 14m 28s
[BUG FIX]:fix get torch.version.cuda error when cuda is None in rocm
cpu-torch-latest #3831: Pull request #6909 synchronize by hj-wei
December 27, 2024 03:06 Action required hj-wei:dev_hjwei
December 27, 2024 03:06 Action required
cpu-torch-latest
cpu-torch-latest #3828: Scheduled
December 27, 2024 00:11 15m 47s master
December 27, 2024 00:11 15m 47s
Stage3: Use new torch grad accumulation hooks API
cpu-torch-latest #3827: Pull request #6773 synchronize by loadams
December 26, 2024 20:09 15m 2s deepcharm:stage3-use-new-grad-acc-api
December 26, 2024 20:09 15m 2s
Stage3: Use new torch grad accumulation hooks API
cpu-torch-latest #3825: Pull request #6773 synchronize by loadams
December 26, 2024 17:40 14m 41s deepcharm:stage3-use-new-grad-acc-api
December 26, 2024 17:40 14m 41s
[BUG FIX]:fix get torch.version.cuda error when cuda is None in rocm
cpu-torch-latest #3824: Pull request #6909 synchronize by loadams
December 26, 2024 17:15 Action required hj-wei:dev_hjwei
December 26, 2024 17:15 Action required
Use ds-specific module id to avoid conflicts
cpu-torch-latest #3823: Pull request #6847 synchronize by loadams
December 26, 2024 17:13 14m 53s olruwase/pr_6772
December 26, 2024 17:13 14m 53s
Fix checkpointable_layers Logic
cpu-torch-latest #3822: Pull request #6881 synchronize by loadams
December 26, 2024 17:12 14m 45s Quentin-Anthony:qanthony/fix-act-recomp
December 26, 2024 17:12 14m 45s
Update Gaudi2 jobs to latest 1.19 build
cpu-torch-latest #3821: Pull request #6905 synchronize by loadams
December 26, 2024 17:12 15m 30s raza-sikander:master
December 26, 2024 17:12 15m 30s
Add fp8_gemm fallback for non-triton systems
cpu-torch-latest #3819: Pull request #6916 opened by oelayan7
December 26, 2024 08:52 Action required oelayan7:fp8_gemm_no_triton
December 26, 2024 08:52 Action required
cpu-torch-latest
cpu-torch-latest #3818: Scheduled
December 26, 2024 00:11 15m 16s master
December 26, 2024 00:11 15m 16s
[BUG FIX]:fix get torch.version.cuda error when cuda is None in rocm
cpu-torch-latest #3814: Pull request #6909 synchronize by hj-wei
December 25, 2024 02:18 Action required hj-wei:dev_hjwei
December 25, 2024 02:18 Action required
Add the missing view operations from sequence parallel(async).
cpu-torch-latest #3813: Pull request #6750 synchronize by inkcherry
December 25, 2024 01:50 Action required inkcherry:ds_overlap_fix
December 25, 2024 01:50 Action required
cpu-torch-latest
cpu-torch-latest #3812: Scheduled
December 25, 2024 00:11 14m 46s master
December 25, 2024 00:11 14m 46s
[BUG FIX]:fix get torch.version.cuda error when cuda is None in rocm
cpu-torch-latest #3811: Pull request #6909 opened by hj-wei
December 24, 2024 07:38 Action required hj-wei:dev_hjwei
December 24, 2024 07:38 Action required
[inf] Add config var to enable keeping module on host
cpu-torch-latest #3810: Pull request #6846 synchronize by oelayan7
December 24, 2024 06:49 6h 0m 25s oelayan7:keep_module_on_host
December 24, 2024 06:49 6h 0m 25s
cpu-torch-latest
cpu-torch-latest #3809: Scheduled
December 24, 2024 00:11 15m 30s master
December 24, 2024 00:11 15m 30s
Tecorigin sdaa accelerator
cpu-torch-latest #3808: Pull request #6903 synchronize by tjruwase
December 23, 2024 23:13 Action required siqi654321:Tecorigin-SDAA-accelerator
December 23, 2024 23:13 Action required