Add "Language Modeling to Unpaired Preference" in Utilities for converting dataset types #2436

AMindToThink · 2024-12-04T03:48:43Z

This link adds a section to "Dataset Formats" for converting Language Modeling data to Unpaired Preference Data.

My motivation is that I wanted to apply reinforcement learning techniques like BCO to the WMDP dataset: a dataset of for "unlearning" harmful information.

I used the snippet I added to the docs to make an unpaired preference version of the WMDP cyber dataset. Others may want to make similar datasets.

This PR fixes a typo or improves the docs.

qgallouedec · 2024-12-04T14:52:31Z

Thanks @AMindToThink for the addition.

Strictly speaking, you can't convert a language-modelling dataset (only text column) to a prompt-completion dataset, because you'd have to be able to extract the prompt from it. I'm afraid that adding this part to the documentation will create confusion.

The workaround you use is to have an empty prompt column. Which is a bit strange.

How about instead making the algorithms natively support this new type in which you only have a text column and a label colone?

AMindToThink added 2 commits December 3, 2024 21:23

Add completion to unpaired preference

584b560

Add link to dataset conversion table

5b2934b

qgallouedec added the 😴 stale No update from the author, will be closed soon label Dec 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add "Language Modeling to Unpaired Preference" in Utilities for converting dataset types #2436

Add "Language Modeling to Unpaired Preference" in Utilities for converting dataset types #2436

AMindToThink commented Dec 4, 2024

qgallouedec commented Dec 4, 2024 •

edited

Loading

Add "Language Modeling to Unpaired Preference" in Utilities for converting dataset types #2436

Are you sure you want to change the base?

Add "Language Modeling to Unpaired Preference" in Utilities for converting dataset types #2436

Conversation

AMindToThink commented Dec 4, 2024

qgallouedec commented Dec 4, 2024 • edited Loading

qgallouedec commented Dec 4, 2024 •

edited

Loading