Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature request] Support Mistral #356

Closed
brunnolou opened this issue Oct 12, 2023 · 2 comments · Fixed by #379
Closed

[Feature request] Support Mistral #356

brunnolou opened this issue Oct 12, 2023 · 2 comments · Fixed by #379
Labels
enhancement New feature or request

Comments

@brunnolou
Copy link

Support the Mistral model

Reason for request

  1. It is the most trending model on Hugging Face.
  2. Outperforms Llama 2 13B on all benchmarks tested.
  3. It's the best 7B, Apache 2.0
  4. They are committed to open models

Context

- error Error: Unsupported model type: mistral
    at AutoModelForQuestionAnswering.from_pretrained (webpack-internal:///(rsc)/./node_modules/@xenova/transformers/src/models.js:3239:19)

Is there any plans to add support for it soon?

@brunnolou brunnolou added the enhancement New feature or request label Oct 12, 2023
@xenova
Copy link
Collaborator

xenova commented Oct 18, 2023

Hi there! 👋 Good news: @echarlaix recently added support for mistral models to be exported with Optimum, so this is now possible (PR: huggingface/optimum#1425)! On that note, do you maybe have examples of these models which are slightly smaller in size? At the moment, we don't yet support >=7B models, so I won't be able to test without a smaller model.

@brunnolou
Copy link
Author

Cool! Thanks for the update @xenova!
While I couldn't find an example, I found we can prune Mistral-7B to be smaller (1.3B - 3B) for local usage instead of pre-training from scratch using: https://xiamengzhou.github.io/sheared-llama/
I don't know if @xiamengzhou or @gaotianyu1350 (the creators) provided the code or if they have plans to do it for Mistral already.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants