Skip to content

Enable FP8 Per-Tensor Scales and Integrate Marlin/MoE Kernels Repo for ROCm #2063

Enable FP8 Per-Tensor Scales and Integrate Marlin/MoE Kernels Repo for ROCm

Enable FP8 Per-Tensor Scales and Integrate Marlin/MoE Kernels Repo for ROCm #2063

Annotations

1 warning

build (cuda-trtllm)  /  build-and-push

succeeded Dec 18, 2024 in 1h 34m 33s