Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bumping flash attention version to 2.6.3 and adding option for softcap in attention and lm_head logits. #1374
Bumping flash attention version to 2.6.3 and adding option for softcap in attention and lm_head logits. #1374
Changes from all commits
aeb650f
8ed7e1c
4cf075c
d1e738e
87d4114
debd411
1e4a3aa
9260f19
65c5fa8
059617d
13119e2
02b4f04
b06adf6
c72458a
258e048
b875fa3
ee72ff6
63d8676
65c17b0
756e127
a3b568d
9c31f17
bf5e94e
9a68b8f
1a4123a
21dd8bd
File filter
Filter by extension
Conversations
Jump to
There are no files selected for viewing