Skip to content

Commit

Permalink
Update README.md to clarify model requirement (#1315)
Browse files Browse the repository at this point in the history
Clarify that language models must be transformers models for text.  This is a bit redundant with intro description, but attempts to better address a question that that comes up (issue 1257).

Closes: #1257
  • Loading branch information
markstur authored Feb 15, 2024
1 parent 9bc478e commit 31b02d0
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -168,7 +168,7 @@ train_stats = ppo_trainer.step([query_tensor[0]], [response_tensor[0]], reward)
The PPO implementation largely follows the structure introduced in the paper **"Fine-Tuning Language Models from Human Preferences"** by D. Ziegler et al. \[[paper](https://arxiv.org/pdf/1909.08593.pdf), [code](https://github.com/openai/lm-human-preferences)].

### Language models
The language models utilize the `transformers` library by 🤗 Hugging Face.
The language models utilize the `transformers` library by 🤗 Hugging Face. Currently, `trl` only supports `transformers` models **for text**.

## Citation

Expand Down

0 comments on commit 31b02d0

Please sign in to comment.