Skip to content

Actions: ModelTC/lightllm

Actions

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
676 workflow runs
676 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

optimze decode mla att
Pre-commit checks #334: Pull request #616 opened by shihaobai
November 25, 2024 10:49 29s mla_att
November 25, 2024 10:49 29s
add vllm pynccl for cuda graph compatibility
Pre-commit checks #333: Pull request #615 synchronize by WANDY666
November 25, 2024 10:16 38s pynccl2
November 25, 2024 10:16 38s
add vllm pynccl for cuda graph compatibility
Pre-commit checks #332: Pull request #615 opened by WANDY666
November 25, 2024 10:08 28s pynccl2
November 25, 2024 10:08 28s
Deepseek2 Support tp DP (#614)
Docker #299: Commit 692555e pushed by shihaobai
November 25, 2024 06:54 1m 53s main
November 25, 2024 06:54 1m 53s
Deepseek2 Support PD mode
Pre-commit checks #331: Pull request #614 opened by hiworldwzj
November 25, 2024 05:30 37s wzj_pd
November 25, 2024 05:30 37s
add vllm pynccl for cuda graph compatibility
Pre-commit checks #330: Pull request #613 synchronize by WANDY666
November 22, 2024 10:26 29s pynccl
November 22, 2024 10:26 29s
add vllm pynccl for cuda graph compatibility
Pre-commit checks #329: Pull request #613 synchronize by shihaobai
November 22, 2024 10:20 30s pynccl
November 22, 2024 10:20 30s
add vllm pynccl for cuda graph compatibility
Pre-commit checks #328: Pull request #613 synchronize by WANDY666
November 22, 2024 09:26 30s pynccl
November 22, 2024 09:26 30s
add vllm pynccl for cuda graph compatibility
Pre-commit checks #327: Pull request #613 synchronize by WANDY666
November 22, 2024 09:23 30s pynccl
November 22, 2024 09:23 30s
add vllm pynccl for cuda graph compatibility
Pre-commit checks #326: Pull request #613 opened by WANDY666
November 22, 2024 09:11 33s pynccl
November 22, 2024 09:11 33s
refact quantization, support torchao quant and vllm w8a8(int/fp), su…
Docker #298: Commit c6a654c pushed by hiworldwzj
November 22, 2024 06:46 2m 23s main
November 22, 2024 06:46 2m 23s
refact quantization, support torchao quant and vllm w8a8(int/fp), support mix quantization.
Pre-commit checks #325: Pull request #596 synchronize by hiworldwzj
November 22, 2024 06:45 28s quantization
November 22, 2024 06:45 28s
refact quantization, support torchao quant and vllm w8a8(int/fp), support mix quantization.
Pre-commit checks #324: Pull request #596 synchronize by hiworldwzj
November 22, 2024 06:43 32s quantization
November 22, 2024 06:43 32s
refact quantization, support torchao quant and vllm w8a8(int/fp), support mix quantization.
Pre-commit checks #323: Pull request #596 synchronize by shihaobai
November 22, 2024 06:03 33s quantization
November 22, 2024 06:03 33s
refact quantization, support torchao quant and vllm w8a8(int/fp), support mix quantization.
Pre-commit checks #322: Pull request #596 synchronize by shihaobai
November 22, 2024 04:36 26s quantization
November 22, 2024 04:36 26s
refact quantization, support torchao quant and vllm w8a8(int/fp), support mix quantization.
Pre-commit checks #321: Pull request #596 synchronize by shihaobai
November 22, 2024 04:33 32s quantization
November 22, 2024 04:33 32s
refact quantization, support torchao quant and vllm w8a8(int/fp), support mix quantization.
Pre-commit checks #320: Pull request #596 synchronize by shihaobai
November 21, 2024 15:12 29s quantization
November 21, 2024 15:12 29s
refact quantization, support torchao quant and vllm w8a8(int/fp), support mix quantization.
Pre-commit checks #319: Pull request #596 synchronize by shihaobai
November 21, 2024 15:09 32s quantization
November 21, 2024 15:09 32s
refact quantization, support torchao quant and vllm w8a8(int/fp), support mix quantization.
Pre-commit checks #318: Pull request #596 synchronize by shihaobai
November 21, 2024 10:45 47s quantization
November 21, 2024 10:45 47s
refact quantization, support torchao quant and vllm w8a8(int/fp), support mix quantization.
Pre-commit checks #317: Pull request #596 synchronize by shihaobai
November 21, 2024 10:15 28s quantization
November 21, 2024 10:15 28s
refact quantization, support torchao quant and vllm w8a8(int/fp), support mix quantization.
Pre-commit checks #316: Pull request #596 synchronize by shihaobai
November 21, 2024 08:11 30s quantization
November 21, 2024 08:11 30s
refact quantization, support torchao quant and vllm w8a8(int/fp), support mix quantization.
Pre-commit checks #315: Pull request #596 synchronize by shihaobai
November 21, 2024 07:50 29s quantization
November 21, 2024 07:50 29s
refact quantization, support torchao quant and vllm w8a8(int/fp), support mix quantization.
Pre-commit checks #314: Pull request #596 synchronize by shihaobai
November 21, 2024 07:03 32s quantization
November 21, 2024 07:03 32s
refact quantization, support torchao quant and vllm w8a8(int/fp), support mix quantization.
Pre-commit checks #313: Pull request #596 synchronize by shihaobai
November 21, 2024 06:43 29s quantization
November 21, 2024 06:43 29s
feat(misc): Profiler support
Pre-commit checks #312: Pull request #611 opened by WuSiYu
November 20, 2024 15:05 44s WuSiYu:dev/profiler_support
November 20, 2024 15:05 44s