Update 2024-03-21-LLM.md

rkuo2000 · Apr 20, 2024 · 59ac4b9 · 59ac4b9
1 parent 3021eb1
commit 59ac4b9
Showing 1 changed file with 7 additions and 0 deletions.
diff --git a/_posts/2024-03-21-LLM.md b/_posts/2024-03-21-LLM.md
@@ -366,6 +366,13 @@ representation fine-tuning (ReFT) library, a Powerful, Parameter-Efficient, and
 **Paper:** [ReFT: Representation Finetuning for Language Models](https://arxiv.org/abs/2404.03592)<br>
 **Code:** [https://github.com/stanfordnlp/pyreft](https://github.com/stanfordnlp/pyreft)<br>
 
+---
+### ORPO
+**model:** [kaist-ai/mistral-orpo-beta](https://huggingface.co/kaist-ai/mistral-orpo-beta)<br>
+**Paper:** [ORPO: Monolithic Preference Optimization without Reference Model](https://arxiv.org/abs/2403.07691)<br>
+**Code:** [https://github.com/xfactlab/orpo](https://github.com/xfactlab/orpo)<br>
+**Blog:** [Fine-tune Llama 3 with ORPO](https://huggingface.co/blog/mlabonne/orpo-llama-3)<br>
+
 ---
 ## Deep LLM