Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

how to adapt my new model? #2077

Open
belle9217 opened this issue Nov 27, 2024 · 2 comments
Open

how to adapt my new model? #2077

belle9217 opened this issue Nov 27, 2024 · 2 comments

Comments

@belle9217
Copy link

No description provided.

@felipemello1
Copy link
Contributor

Hi @belle9217 , can you give more details of what you are trying to do?

@belle9217
Copy link
Author

I have a model with a similar architecture to the LLaMA model, which has been modified based on LLaMA by altering the multi-head attention mechanism for QKV. now I want to use FSDP(deepspeed zero3) and torch.compile,and I found torchtune has the two functionalities.so I want to add my model to the torchtune framework. since I see torchtune has support Qwen、llama、phi, how can I adjust my modeling.py to use torchtune? is there any API or Interface?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants