Best practice recommendation update for dpo_trainer.mdx (#1325)
As the document currently stands, the best practice recommendations seem neither consistent nor correct. For example, the documentation links to a tweet recommending that adapters be merged into a quantized model, and to a script that supposedly illustrates how to apply that recommendation. But the script actually does the opposite of what the tweet recommends: it dequantizes the model first. There are similar inconsistencies and ambiguities further on in that paragraph, for example the claim that using an unquantized model would lead to lower performance (I changed it to "higher memory demand"). Overall, I updated the paragraph to improve consistency and added links to somewhat more evidence-based merging recommendations.