[cleanup][2/x] split float8 mm by delayed vs dynamic
Summary:

Before this PR, the float8 mm logic was split by axiswise vs tensorwise.
After this PR, the float8 mm logic is split by dynamic vs non-dynamic scaling.

Motivation: there is growing evidence that dynamic scaling will be common to the most important lowp recipes.

This PR is a step toward making the dynamic scaling logic in `torchao.float8` simpler and easier to understand. There are many other simplifications to make, but stopping here keeps the PR small.

This is a pure refactor without any logic changes.

Test Plan:

```
./test/float8/test_everything.sh
```

Reviewers:

Subscribers:

Tasks:

Tags:

ghstack-source-id: 8a9792272e3aeea12d705eb5a466b9830dac6420
ghstack-comment-id: 2564049757
Pull Request resolved: #1461
Showing 2 changed files with 240 additions and 141 deletions.