Bumping flash attention version to 2.6.3 and adding option for softcap in attention and lm_head logits. #7828
Annotations
1 warning
The following actions use a deprecated Node.js version and will be forced to run on node20: actions/checkout@v3, actions/setup-python@v4. For more info: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/
|
This job succeeded
Loading