This repository has been archived by the owner on Aug 7, 2024. It is now read-only.

bring back torch.autograd.Function for float8 matmul #336

Closed
wants to merge 5 commits

Conversation

vkuzo
Contributor

@vkuzo commented Jul 25, 2024

Stack from ghstack (oldest at bottom):

Summary:

This is a redo of #316.

With upcoming support for scaling granularities other than tensorwise,
we need a good way to control which gemm kernel to call and how to scale
the input tensors in the forward and backward passes. A `torch.autograd.Function`
override is the cleanest way to do that, and in 2024 this now works with
`torch.compile`.
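
To make the mechanism concrete, here is a minimal sketch (illustrative only, not the code in this PR) of a `torch.autograd.Function` that owns both the input scaling and the matmul choice in the forward and backward passes. The names `Float8Matmul`, `tensorwise_scale`, and `to_float8_and_back` are assumptions made for this example; float8 precision is emulated by a round trip through the float8 dtypes so the snippet runs without a dedicated float8 gemm kernel.

```python
# Hedged sketch: emulates float8 matmul scaling inside an autograd.Function.
# A real implementation would call a float8 gemm (e.g. a scaled-mm kernel)
# instead of the emulated cast-and-back used here.
import torch

E4M3_MAX = torch.finfo(torch.float8_e4m3fn).max
E5M2_MAX = torch.finfo(torch.float8_e5m2).max

def tensorwise_scale(t: torch.Tensor, float8_max: float) -> torch.Tensor:
    # One scale per tensor; other granularities (rowwise, blockwise)
    # would return a differently shaped scale here.
    return float8_max / t.abs().max().clamp(min=1e-12)

def to_float8_and_back(t: torch.Tensor, scale: torch.Tensor, dtype) -> torch.Tensor:
    # Emulate float8 precision loss, then return to the original dtype
    # so a plain torch.mm can stand in for the float8 gemm.
    return ((t * scale).to(dtype).to(t.dtype)) / scale

class Float8Matmul(torch.autograd.Function):
    @staticmethod
    def forward(ctx, x: torch.Tensor, w: torch.Tensor) -> torch.Tensor:
        ctx.save_for_backward(x, w)
        # Forward: activations and weights commonly use e4m3.
        x_fp8 = to_float8_and_back(x, tensorwise_scale(x, E4M3_MAX), torch.float8_e4m3fn)
        w_fp8 = to_float8_and_back(w, tensorwise_scale(w, E4M3_MAX), torch.float8_e4m3fn)
        return torch.mm(x_fp8, w_fp8.t())

    @staticmethod
    def backward(ctx, grad_out: torch.Tensor):
        x, w = ctx.saved_tensors
        # Backward: gradients commonly use e5m2, and each gemm can pick
        # its own scaling independently of the forward.
        g_fp8 = to_float8_and_back(grad_out, tensorwise_scale(grad_out, E5M2_MAX), torch.float8_e5m2)
        grad_x = torch.mm(g_fp8, w)        # (M, N) @ (N, K) -> (M, K)
        grad_w = torch.mm(g_fp8.t(), x)    # (N, M) @ (M, K) -> (N, K)
        return grad_x, grad_w

# Usage (also works when wrapped in torch.compile on recent PyTorch):
#   y = Float8Matmul.apply(x, w)
```

With this structure, changing the scaling granularity or the gemm path only means changing the helpers called inside `forward`/`backward`, which is the flexibility described above.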

Test Plan:

`./test/test_everything.sh`

Reviewers:

Subscribers:

Tasks:

Tags:

Differential Revision: [D60252068](https://our.internmc.facebook.com/intern/diff/D60252068)

vkuzo added a commit that referenced this pull request Jul 25, 2024
ghstack-source-id: 6cb1588bf59be73b5782f6af94e7a360eba7f40e
Pull Request resolved: #336
@facebook-github-bot added the CLA Signed label Jul 25, 2024
vkuzo added a commit that referenced this pull request Jul 25, 2024
ghstack-source-id: 42dd59511e4ec2a55846c2593955c4ff5f12b254
Pull Request resolved: #336
@vkuzo
Contributor Author

vkuzo commented Jul 25, 2024

@vkuzo has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@vkuzo
Contributor Author

vkuzo commented Jul 26, 2024

started a new PR due to ghstack error

@vkuzo closed this Jul 26, 2024