Actions: microsoft/DeepSpeed

nv-accelerate-v100

5,028 workflow runs

hpu_accelerator: use torch.use_deterministic_algorithms
nv-accelerate-v100 #12562: Pull request #6897 opened by nelyahu
December 19, 2024 07:23 12m 18s nelyahu:patch-2
nv-accelerate-v100
nv-accelerate-v100 #12561: Scheduled
December 19, 2024 00:07 56m 45s master
Allow to compile collective for PT > 2.3
nv-accelerate-v100 #12560: Pull request #6674 reopened by loadams
December 18, 2024 21:53 2h 44m 47s nelyahu:compile_collectives
Allow to compile collective for PT > 2.3
nv-accelerate-v100 #12559: Pull request #6674 synchronize by loadams
December 18, 2024 21:07 39m 26s nelyahu:compile_collectives
Copy #6674: Allow to compile collective for PT > 2.3
nv-accelerate-v100 #12558: Pull request #6894 opened by loadams
December 18, 2024 21:01 53m 58s loadams/test-compile-collectives
Fix checkpointable_layers Logic
nv-accelerate-v100 #12557: Pull request #6881 synchronize by Quentin-Anthony
December 18, 2024 20:25 2h 14m 25s Quentin-Anthony:qanthony/fix-act-recomp
Fix checkpointable_layers Logic
nv-accelerate-v100 #12556: Pull request #6881 synchronize by Quentin-Anthony
December 18, 2024 20:24 Action required Quentin-Anthony:qanthony/fix-act-recomp
Support latest transformers with DSChat
nv-accelerate-v100 #12555: Pull request #6711 synchronize by loadams
December 18, 2024 20:24 1h 52m 38s loadams/fix-ds-chat-transformers
Training ops kernels: Speeding up the Llama-based MoE architectures
nv-accelerate-v100 #12554: Pull request #6734 synchronize by loadams
December 18, 2024 19:27 Action required RezaYazdaniAminabadi:tops-kernels
Add the missing view operations from sequence parallel(async).
nv-accelerate-v100 #12553: Pull request #6750 synchronize by loadams
December 18, 2024 18:59 Action required inkcherry:ds_overlap_fix
Fix error caused by all_reduce call in domino
nv-accelerate-v100 #12552: Pull request #6880 synchronize by hwchen2017
December 18, 2024 18:02 1h 32m 20s hongwei/fix_domino_allreduce
Stage3: Use new torch grad accumulation hooks API
nv-accelerate-v100 #12551: Pull request #6773 synchronize by loadams
December 18, 2024 17:55 17m 44s deepcharm:stage3-use-new-grad-acc-api
Zero2: avoid graph breaks in torch.compile by using param_idx
nv-accelerate-v100 #12550: Pull request #6803 synchronize by loadams
December 18, 2024 17:55 25m 53s nelyahu:zero2_param_idx
Update version.txt after 0.16.2 release
nv-accelerate-v100 #12549: Pull request #6893 opened by loadams
December 18, 2024 17:52 16m 35s AutoPR/0.16.2
Inference ops unit test failures/fixes
nv-accelerate-v100 #12546: Pull request #6879 synchronize by loadams
December 18, 2024 16:53 17m 55s loadams/inference-ops-test-repro
Stage3: Use new torch grad accumulation hooks API
nv-accelerate-v100 #12545: Pull request #6773 synchronize by loadams
December 18, 2024 16:51 15m 9s deepcharm:stage3-use-new-grad-acc-api
Zero2: avoid graph breaks in torch.compile by using param_idx
nv-accelerate-v100 #12544: Pull request #6803 synchronize by loadams
December 18, 2024 16:51 12m 36s nelyahu:zero2_param_idx
Update code owners
nv-accelerate-v100 #12543: Pull request #6890 synchronize by loadams
December 18, 2024 16:30 11m 16s olruwase/code_owners
Use ds-specific module id to avoid conflicts
nv-accelerate-v100 #12541: Pull request #6847 synchronize by tjruwase
December 18, 2024 13:59 11m 37s olruwase/pr_6772
Update code owners
nv-accelerate-v100 #12540: Pull request #6890 opened by tjruwase
December 18, 2024 12:04 11m 5s olruwase/code_owners
Fix error caused by all_reduce call in domino
nv-accelerate-v100 #12539: Pull request #6880 synchronize by tjruwase
December 18, 2024 11:51 11m 49s hongwei/fix_domino_allreduce
Stage3: Use new torch grad accumulation hooks API
nv-accelerate-v100 #12538: Pull request #6773 synchronize by deepcharm
December 18, 2024 09:44 11m 24s deepcharm:stage3-use-new-grad-acc-api
Change compile for pipeline module torch.compile
nv-accelerate-v100 #12537: Pull request #6478 synchronize by NirSonnenschein
December 18, 2024 07:44 Action required NirSonnenschein:torch_compile_micro_offset_fix