Skip to content

Commit

Permalink
Update 2024-03-21-LLM.md
Browse files Browse the repository at this point in the history
  • Loading branch information
rkuo2000 authored Apr 20, 2024
1 parent 3021eb1 commit 59ac4b9
Showing 1 changed file with 7 additions and 0 deletions.
7 changes: 7 additions & 0 deletions _posts/2024-03-21-LLM.md
Original file line number Diff line number Diff line change
Expand Up @@ -366,6 +366,13 @@ representation fine-tuning (ReFT) library, a Powerful, Parameter-Efficient, and
**Paper:** [ReFT: Representation Finetuning for Language Models](https://arxiv.org/abs/2404.03592)<br>
**Code:** [https://github.com/stanfordnlp/pyreft](https://github.com/stanfordnlp/pyreft)<br>

---
### ORPO
**model:** [kaist-ai/mistral-orpo-beta](https://huggingface.co/kaist-ai/mistral-orpo-beta)<br>
**Paper:** [ORPO: Monolithic Preference Optimization without Reference Model](https://arxiv.org/abs/2403.07691)<br>
**Code:** [https://github.com/xfactlab/orpo](https://github.com/xfactlab/orpo)<br>
**Blog:** [Fine-tune Llama 3 with ORPO](https://huggingface.co/blog/mlabonne/orpo-llama-3)<br>

---
## Deep LLM

Expand Down

0 comments on commit 59ac4b9

Please sign in to comment.