Skip to content

GPTQModel v1.3.1

Latest
Compare
Choose a tag to compare
@Qubitium Qubitium released this 29 Nov 04:10
· 19 commits to main since this release
e7f1437

What's Changed

⚡ Olmo2 model support.
⚡ Intel XPU acceleration via IPEX.
Sharding compat fix due to api deprecation in HF Transformers.
Removed triton dependency. Triton kernel now optionally dependent on triton pkg.
Fixed Hymba Test (Hymba requires desc_act=False)

Full Changelog: v1.3.0...v1.3.1