This repository has been archived by the owner on Aug 30, 2024. It is now read-only.
[LLM Runtime] Support 3bits & 4bits GPTQ for gpt-j #285
| Job | Run time |
|---|---|
| | 5m 59s |
| | 6m 28s |
| | 12s |
| | 12m 39s |