-
Notifications
You must be signed in to change notification settings - Fork 448
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
OpenCoder, Request for adding a model #2058
Comments
Hi @insop thanks for creating the issue. Given the model is still relatively new we would like to wait and see a bit before onboarding it as part of our core offering. Fortunately that shouldn't stop you from being able to finetune it with torchtune. We encourage folks to plug in custom components, and for this model it should be relatively easy to do so. Since the architecture is the same as Llama you should be able to do the following:
The tokenizer I will need to look at a bit more closely, but given that it appears to use SentencePiece with some additional preprocessing I suspect it should be a small modification off of our Llama2Tokenizer. But happy to provide more detailed pointers to help you getting started here. |
Hi @ebsmothers Thank you so much for how I could approach, it makes sense on when to bring the model in. I am new to torchtune, so anypointer will be helpful and appreciated. Thank you, |
The OpenCoder team has released OpenCoder 1.5B and 8B models. They seem very promising.
Requesting Team to add this model in torchtune.
Thank you!
The text was updated successfully, but these errors were encountered: