Skip to content

Actions: microsoft/DeepSpeed

nv-accelerate-v100

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
5,035 workflow runs
5,035 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

nv-accelerate-v100
nv-accelerate-v100 #12613: Scheduled
December 28, 2024 00:07 3m 54s master
December 28, 2024 00:07 3m 54s
[BUG FIX]:fix get torch.version.cuda error when cuda is None in rocm
nv-accelerate-v100 #12612: Pull request #6909 synchronize by hj-wei
December 27, 2024 03:06 Action required hj-wei:dev_hjwei
December 27, 2024 03:06 Action required
nv-accelerate-v100
nv-accelerate-v100 #12609: Scheduled
December 27, 2024 00:07 3m 56s master
December 27, 2024 00:07 3m 56s
Stage3: Use new torch grad accumulation hooks API
nv-accelerate-v100 #12608: Pull request #6773 synchronize by loadams
December 26, 2024 20:09 18m 24s deepcharm:stage3-use-new-grad-acc-api
December 26, 2024 20:09 18m 24s
Stage3: Use new torch grad accumulation hooks API
nv-accelerate-v100 #12606: Pull request #6773 synchronize by loadams
December 26, 2024 17:40 11m 23s deepcharm:stage3-use-new-grad-acc-api
December 26, 2024 17:40 11m 23s
[BUG FIX]:fix get torch.version.cuda error when cuda is None in rocm
nv-accelerate-v100 #12605: Pull request #6909 synchronize by loadams
December 26, 2024 17:15 Action required hj-wei:dev_hjwei
December 26, 2024 17:15 Action required
Use ds-specific module id to avoid conflicts
nv-accelerate-v100 #12604: Pull request #6847 synchronize by loadams
December 26, 2024 17:13 27m 49s olruwase/pr_6772
December 26, 2024 17:13 27m 49s
Fix checkpointable_layers Logic
nv-accelerate-v100 #12603: Pull request #6881 synchronize by loadams
December 26, 2024 17:12 17m 14s Quentin-Anthony:qanthony/fix-act-recomp
December 26, 2024 17:12 17m 14s
Update Gaudi2 jobs to latest 1.19 build
nv-accelerate-v100 #12602: Pull request #6905 synchronize by loadams
December 26, 2024 17:12 12m 59s raza-sikander:master
December 26, 2024 17:12 12m 59s
Add fp8_gemm fallback for non-triton systems
nv-accelerate-v100 #12600: Pull request #6916 opened by oelayan7
December 26, 2024 08:52 Action required oelayan7:fp8_gemm_no_triton
December 26, 2024 08:52 Action required
nv-accelerate-v100
nv-accelerate-v100 #12599: Scheduled
December 26, 2024 00:07 3m 53s master
December 26, 2024 00:07 3m 53s
[BUG FIX]:fix get torch.version.cuda error when cuda is None in rocm
nv-accelerate-v100 #12595: Pull request #6909 synchronize by hj-wei
December 25, 2024 02:18 Action required hj-wei:dev_hjwei
December 25, 2024 02:18 Action required
Add the missing view operations from sequence parallel(async).
nv-accelerate-v100 #12594: Pull request #6750 synchronize by inkcherry
December 25, 2024 01:50 Action required inkcherry:ds_overlap_fix
December 25, 2024 01:50 Action required
nv-accelerate-v100
nv-accelerate-v100 #12593: Scheduled
December 25, 2024 00:07 3m 54s master
December 25, 2024 00:07 3m 54s
[BUG FIX]:fix get torch.version.cuda error when cuda is None in rocm
nv-accelerate-v100 #12592: Pull request #6909 opened by hj-wei
December 24, 2024 07:38 Action required hj-wei:dev_hjwei
December 24, 2024 07:38 Action required
[inf] Add config var to enable keeping module on host
nv-accelerate-v100 #12591: Pull request #6846 synchronize by oelayan7
December 24, 2024 06:49 3m 52s oelayan7:keep_module_on_host
December 24, 2024 06:49 3m 52s
nv-accelerate-v100
nv-accelerate-v100 #12590: Scheduled
December 24, 2024 00:07 3m 50s master
December 24, 2024 00:07 3m 50s
Tecorigin sdaa accelerator
nv-accelerate-v100 #12589: Pull request #6903 synchronize by tjruwase
December 23, 2024 23:13 Action required siqi654321:Tecorigin-SDAA-accelerator
December 23, 2024 23:13 Action required