[QST] Does CUTLASS 3.5.1 support int4 x float16 GEMMs natively? #1928

SimpleTheoryOfTypes · 2024-11-07T23:50:41Z

What is your question?
By "natively," I mean without relying on third-party implementations. If I understand correctly, FasterTransformer and TVM have already developed their own CUTLASS extensions for constructing INT4/INT8 x FLOAT16 GEMMs. Just wondering if the latest CUTLASS release can already do this now? Thanks!

thakkarV · 2024-11-08T00:04:17Z

see example 55

SimpleTheoryOfTypes · 2024-11-08T00:07:53Z

> see example 55

Thank you so much! Sorry, I forgot to mention that my question is about int4 x fp16 GEMMs on Ampere, not Hopper. :)

thakkarV · 2024-11-08T00:15:52Z

#1084
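
For readers new to the term, here is a minimal reference sketch of what an int4 x fp16 (mixed-input) GEMM computes: the 4-bit weights are unpacked, scaled to half precision, and multiplied against fp16 activations. It is not CUTLASS code and does not reflect how example 55 or #1084 implement it. The function names, the low-nibble-first packing along N, and the per-column scale are all assumptions made for illustration.

```python
import struct


def to_fp16(x: float) -> float:
    """Round a Python float to the nearest IEEE half-precision value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def unpack_int4(byte: int) -> tuple:
    """Split one byte into two signed 4-bit values (low nibble first, assumed layout)."""
    def sign(v: int) -> int:
        return v - 16 if v >= 8 else v
    return sign(byte & 0xF), sign((byte >> 4) & 0xF)


def mixed_gemm(a, b_packed, scales):
    """Reference C[M,N] = A[M,K] (fp16) x dequant(B[K,N]) (packed int4).

    a        : M rows of K floats, rounded to fp16 before use
    b_packed : K rows of N/2 bytes, two int4 values per byte (hypothetical layout)
    scales   : N per-column dequantization scales (hypothetical scheme)
    """
    k, n = len(b_packed), len(scales)
    # Dequantize: int4 -> scaled fp16, done once up front for clarity.
    b = []
    for row in b_packed:
        vals = [v for byte in row for v in unpack_int4(byte)]
        b.append([to_fp16(v * s) for v, s in zip(vals, scales)])
    # Plain triple loop; accumulation happens in Python floats, not fp16.
    return [
        [sum(to_fp16(a_row[p]) * b[p][j] for p in range(k)) for j in range(n)]
        for a_row in a
    ]


if __name__ == "__main__":
    a = [[1.0, 2.0]]                        # 1 x 2 activations
    b = [bytes([0x21]), bytes([0xF8])]      # 2 x 2 weights: [1, 2] and [-8, -1]
    print(mixed_gemm(a, b, [0.5, 1.0]))
```

A real kernel would fuse the unpack and conversion into the main loop and feed tensor cores rather than dequantizing the whole matrix first; the sketch only pins down the arithmetic being asked about.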

SimpleTheoryOfTypes added the "? - Needs Triage" and "question" labels on Nov 7, 2024
