Skip to content

Tutel v0.1.1

Compare
Choose a tag to compare
@ghostplant ghostplant released this 10 Oct 14:00
· 92 commits to v0.1.x since this release

What's New in v0.1.1:

  1. Enable fp16 support for AMDGPU.
  2. Using NVRTC for JIT compilation if available.
  3. Add new system_init interface for initializing NUMA settings in distributed GPUs.
  4. Extend more gating types: Top3Gate & Top4Gate.
  5. Allow high level to change capacity value in Tutel fast dispatcher.
  6. Add custom AllToAll extension for old Pytorch version without builtin AllToAll operator support.
How to Setup:
python3 -m pip install --user https://github.com/microsoft/tutel/archive/refs/tags/v0.1.1.tar.gz

Contributors: @jspark1105 , @ngoyal2707 , @guoshzhao, @ghostplant .