Benchmarks for PaddingFree and Granite. Fix for LowCPUMemMode for Quant.
Release
aadp
no version bump: Updates on PaddingFree bench only.peft
v0.3.1: patch version. fixes forlow_cpu_mem_mode
for issues introduced since transformers0.45
. Also provide fallback iftarget_modules=None
.foak
v0.3.1: patch version: Support forbias
, needed for Granite models.
What's Changed
- Update Benches: Orca by @fabianlim in #85
- Update Benchmarks and Documentation for GraniteCausalLM by @fabianlim in #86
- Fixes to Accelerated Peft by @fabianlim in #89
Full Changelog: v0.4.0...v0.4.1