From 59ac4b9aad253f498b734a66535f206e408bf106 Mon Sep 17 00:00:00 2001
From: Richard Kuo
Date: Sat, 20 Apr 2024 18:14:58 +0800
Subject: [PATCH] Update 2024-03-21-LLM.md

---
 _posts/2024-03-21-LLM.md | 7 +++++++
 1 file changed, 7 insertions(+)

diff --git a/_posts/2024-03-21-LLM.md b/_posts/2024-03-21-LLM.md
index 8efb1324..dfff3322 100644
--- a/_posts/2024-03-21-LLM.md
+++ b/_posts/2024-03-21-LLM.md
@@ -366,6 +366,13 @@ representation fine-tuning (ReFT) library, a Powerful, Parameter-Efficient, and
**Paper:** [ReFT: Representation Finetuning for Language Models](https://arxiv.org/abs/2404.03592)
**Code:** [https://github.com/stanfordnlp/pyreft](https://github.com/stanfordnlp/pyreft)
+---
+### ORPO
+**model:** [kaist-ai/mistral-orpo-beta](https://huggingface.co/kaist-ai/mistral-orpo-beta)
+**Paper:** [ORPO: Monolithic Preference Optimization without Reference Model](https://arxiv.org/abs/2403.07691)
+**Code:** [https://github.com/xfactlab/orpo](https://github.com/xfactlab/orpo)
+**Blog:** [Fine-tune Llama 3 with ORPO](https://huggingface.co/blog/mlabonne/orpo-llama-3)
+
---
## Deep LLM