how to adapt my new model? #2077

belle9217 · 2024-11-27T03:54:26Z

No description provided.

felipemello1 · 2024-11-27T04:06:13Z

Hi @belle9217 , can you give more details of what you are trying to do?

belle9217 · 2024-11-27T06:00:50Z

I have a model with a similar architecture to the LLaMA model, which has been modified based on LLaMA by altering the multi-head attention mechanism for QKV. now I want to use FSDP(deepspeed zero3) and torch.compile,and I found torchtune has the two functionalities.so I want to add my model to the torchtune framework. since I see torchtune has support Qwen、llama、phi, how can I adjust my modeling.py to use torchtune? is there any API or Interface？

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

how to adapt my new model? #2077

how to adapt my new model? #2077

belle9217 commented Nov 27, 2024

felipemello1 commented Nov 27, 2024

belle9217 commented Nov 27, 2024

how to adapt my new model? #2077

how to adapt my new model? #2077

Comments

belle9217 commented Nov 27, 2024

felipemello1 commented Nov 27, 2024

belle9217 commented Nov 27, 2024