Use subgroup operations when possible #553

beaufortfrancois · 2024-08-20T11:30:40Z

Subgroups can substantially enhance performance and adaptability for machine learning tasks on GPUs. Since they're now available in an origin trial, https://webllm.mlc.ai/ could take advantage of them.

I'm not sure what is needed yet to make it work... I assume some work in Apache TVM as well.

I highly recommend you check out the quick-start guide at https://developer.chrome.com/blog/new-in-webgpu-128#experimenting_with_subgroups. For info, only subgroupBallot and subgroupBroadcast are available for now, but more built-in functions such as subgroupAdd, subgroupAll, subgroupElect, and subgroupShuffle will be added in the near future.

beaufortfrancois · 2024-09-03T09:54:08Z

@CharlieFRuan @tqchen What are your thoughts on this?

tqchen · 2024-09-03T14:49:46Z

This is great. Subgroup shuffle can be useful for reduction operations. We did have warp shuffle support for the Metal backend, so maybe we can try adding a codegen backend for WebGPU.

beaufortfrancois · 2024-09-03T15:36:26Z

The following subgroup shuffle functions are actually in Chrome 129 (currently beta):

subgroupShuffle(value, id): Returns value from the active invocation whose subgroup_invocation_id matches id.
subgroupShuffleXor(value, mask): Returns value from the active invocation whose subgroup_invocation_id matches subgroup_invocation_id ^ mask. mask must be dynamically uniform.
subgroupShuffleUp(value, delta): Returns value from the active invocation whose subgroup_invocation_id matches subgroup_invocation_id - delta.
subgroupShuffleDown(value, delta): Returns value from the active invocation whose subgroup_invocation_id matches subgroup_invocation_id + delta.
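
To make the semantics above concrete, here is a minimal CPU-side sketch in Python that models one subgroup as a list, with one entry per invocation. It is not WGSL and not what TVM would generate. The helper names, the subgroup size of 8, and returning `None` for lanes whose source index falls outside the subgroup are all assumptions made for illustration. The last function shows the kind of reduction mentioned earlier in the thread, built as an XOR butterfly.

```python
# CPU-side model of a single subgroup: index = subgroup_invocation_id.
# All names here are hypothetical; this only mirrors the descriptions above.

def subgroup_shuffle_xor(values, mask):
    """Lane i reads the value held by lane i ^ mask."""
    n = len(values)
    return [values[i ^ mask] if (i ^ mask) < n else None for i in range(n)]

def subgroup_shuffle_up(values, delta):
    """Lane i reads the value held by lane i - delta."""
    n = len(values)
    # Out-of-range sources are modelled as None (an assumption of this sketch).
    return [values[i - delta] if i - delta >= 0 else None for i in range(n)]

def subgroup_shuffle_down(values, delta):
    """Lane i reads the value held by lane i + delta."""
    n = len(values)
    return [values[i + delta] if i + delta < n else None for i in range(n)]

def subgroup_reduce_add(values):
    """Sum across the subgroup with an XOR butterfly.

    After log2(n) shuffle steps every lane holds the full sum.
    Assumes the subgroup size is a power of two.
    """
    n = len(values)
    assert n > 0 and n & (n - 1) == 0, "sketch assumes a power-of-two size"
    acc = list(values)
    mask = n // 2
    while mask >= 1:
        partner = subgroup_shuffle_xor(acc, mask)
        acc = [a + b for a, b in zip(acc, partner)]
        mask //= 2
    return acc

if __name__ == "__main__":
    lanes = [1, 2, 3, 4, 5, 6, 7, 8]  # assumed subgroup size of 8
    print(subgroup_reduce_add(lanes))
```

Each butterfly step halves the mask, so a subgroup of 8 needs three shuffles instead of a round trip through workgroup memory, which is why shuffles are attractive for reductions.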

beaufortfrancois · 2024-09-09T06:39:48Z

@tqchen @CharlieFRuan Is this being implemented in Apache TVM?

CharlieFRuan · 2024-09-10T17:35:14Z

Hi @beaufortfrancois, we really appreciate the info and suggestions! We think it is a good idea to have it implemented in the TVM flow. Unfortunately, we are a bit out of bandwidth as of now. We'll revisit in the future!
