Skip to content

Issues: explosion/curated-transformers

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Add suggested PyTorch LLM optimizations feat/generation Feature: Generation feat/model Feature: models
#356 opened Dec 1, 2023 by danieldk
Add support for attention sinks feat/layers Feature: Layers feat/model Feature: models type/feature Type: Feature
#350 opened Oct 4, 2023 by danieldk Undecided
Support DeBERTa v2/3 feat/model Feature: models type/feature Type: Feature
#348 opened Oct 3, 2023 by danieldk Undecided
Add a an extras/contrib package type/maintenance Type: Maintenance
#347 opened Oct 3, 2023 by danieldk Undecided
Make QkvMode ADT-like feat/layers Feature: Layers type/maintenance Type: Maintenance
#344 opened Oct 3, 2023 by danieldk v2.0.0
Add support for Mistral feat/model Feature: models type/feature Type: Feature
#341 opened Oct 3, 2023 by danieldk v2.1.0
Optimal Qlora settings feat/training Feature: Training/Fine-tuning type/feature Type: Feature
#316 opened Sep 2, 2023 by KnutJaegersberg Undecided
Add Low-Rank Adapters injection into base models feat/training Feature: Training/Fine-tuning type/feature Type: Feature
#312 opened Aug 28, 2023 by bilelomrani1 Undecided
Output logits for generation feat/generation Feature: Generation type/feature Type: Feature
#311 opened Aug 24, 2023 by mayankjobanputra v2.0.0
Thoughts on jaxtyping feat/misc Feature: Miscellaneous type/feature Type: Feature
#246 opened Jul 14, 2023 by Ryu1845 Undecided
ProTip! Updated in the last three days: updated:>2024-11-24.