Clean dependencies #2298

qgallouedec · 2024-10-30T16:42:14Z

What does this PR do?

This PR cleans up TRL's dependencies by splitting them into clearer groups and following packaging best practices.
It updates the test workflow accordingly.

For a test extra, best practice is usually to include only the dependencies required for testing, not all optional dependencies. This PR makes that change.
The optional dependencies deepspeed and liger-kernel are only supported on Linux. This PR ensures that they are not installed as optional dependencies on Windows.
Add tests for Python 3.12 (stable for more than a year now).
Test with both the latest and dev versions of our dependencies to better track breaking changes.
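
One way to express the Linux-only constraint described above is a PEP 508 environment marker on the requirement string, so that pip skips the package on Windows. The following is a minimal sketch, not the PR's actual `setup.py`: the extra names, the unpinned requirements, and the `all_optional` helper are assumptions made for illustration.

```python
# Hypothetical extras layout for illustration; not copied from TRL's setup.py.
# PEP 508 environment marker: the requirement is skipped when installing on Windows.
NOT_WINDOWS = "sys_platform != 'win32'"

EXTRAS = {
    # Only what the test suite itself needs, not every optional dependency.
    "test": ["pytest"],
    "deepspeed": [f"deepspeed; {NOT_WINDOWS}"],
    "liger": [f"liger-kernel; {NOT_WINDOWS}"],
}


def all_optional(extras, exclude=("test",)):
    """Flatten every extra except the excluded ones into one sorted list."""
    merged = set()
    for name, requirements in extras.items():
        if name not in exclude:
            merged.update(requirements)
    return sorted(merged)


if __name__ == "__main__":
    print(all_optional(EXTRAS))
```

A dictionary shaped like `EXTRAS` can be passed to setuptools as `extras_require`. Keeping the marker in the requirement string means the CI workflow can request the same extras on every OS without special-casing Windows.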

Job	OS	Python	Required dependencies	Optional dependencies
Tests	Ubuntu	3.9	Latest	Latest
Tests	Ubuntu	3.10	Latest	Latest
Tests	Ubuntu	3.11	Latest	Latest
Tests	Ubuntu	3.12	Latest	Latest
Tests	Windows	3.9	Latest	Latest
Tests	Windows	3.10	Latest	Latest
Tests	Windows	3.11	Latest	Latest
Tests	Windows	3.12	Latest	Latest
Tests without optional dependencies	Ubuntu	3.12	Latest	No
Tests with dev dependencies	Ubuntu	3.12	Dev	Latest

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline, Pull Request section?
Was this discussed/approved via a GitHub issue? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the documentation guidelines.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

HuggingFaceDocBuilderDev · 2024-10-30T16:46:23Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

qgallouedec · 2024-10-31T08:03:12Z

trl/trainer/judges.py

@@ -21,7 +21,6 @@
 import numpy as np
 from accelerate import Accelerator
 from huggingface_hub import InferenceClient
-from scipy.special import softmax


remove this dep
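
Dropping the `scipy.special.softmax` import means the softmax has to be computed with NumPy, which the module already imports. A minimal sketch of a numerically stable replacement follows. The function name and signature are assumptions, and this is not necessarily the code that was merged.

```python
import numpy as np


def softmax(logits, axis=-1):
    """Numerically stable softmax using only NumPy."""
    logits = np.asarray(logits, dtype=float)
    # Subtracting the max leaves the result unchanged but avoids overflow in exp.
    shifted = logits - np.max(logits, axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / np.sum(exp, axis=axis, keepdims=True)
```

The max subtraction is the important detail: a naive `np.exp(x) / np.exp(x).sum()` overflows for large logits, while the shifted form stays finite.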

qgallouedec · 2024-10-31T08:03:46Z

tests/test_peft_models.py

@@ -95,7 +98,7 @@ def test_create_peft_model_from_config(self):
        nb_trainable_params = sum(p.numel() for p in trl_model.parameters() if p.requires_grad)
        assert nb_trainable_params == 10273

-    @require_bitsandbytes
+    @require_torch_gpu_if_bnb_not_multi_backend_enabled


it requires cuda
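
The general pattern behind this kind of decorator is a conditional skip: the test is collected everywhere but only runs where the hardware is present. A minimal sketch under stated assumptions follows. `cuda_available` and `require_cuda` are hypothetical names, not the decorator used in the diff, and the sketch ignores the multi-backend bitsandbytes case entirely.

```python
import importlib.util
import unittest


def cuda_available():
    """Return True only if torch is installed and reports a usable CUDA device."""
    if importlib.util.find_spec("torch") is None:
        return False
    import torch

    return bool(torch.cuda.is_available())


# Hypothetical decorator: skip the test unless a CUDA device is available.
require_cuda = unittest.skipUnless(cuda_available(), "test requires CUDA")


class ExampleTest(unittest.TestCase):
    @require_cuda
    def test_needs_gpu(self):
        import torch

        # Only reached on machines with a CUDA device.
        x = torch.ones(2, device="cuda")
        self.assertEqual(x.sum().item(), 2.0)
```

On a CPU-only CI runner the test is reported as skipped rather than failed, which keeps the matrix green without hiding the test from GPU runs.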

qgallouedec · 2024-10-31T08:27:16Z

The failing tests are expected and will be fixed later (in #2288, for example).

trl/trainer/judges.py

lewtun

LGTM with a small tweak to the softmax

lewtun · 2024-10-31T09:30:07Z

.github/workflows/tests.yml

+          cache: "pip"
+          cache-dependency-path: |
+              setup.py
+              requirements.txt


If we find the dependencies are a hassle to maintain, we might want to switch to the transformers approach, which uses a look-up table to give fine-grained control: https://github.com/huggingface/transformers/blob/main/setup.py
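
A look-up-table approach of the kind mentioned above can be sketched as follows: every version pin lives in one list, and both the required dependencies and the extras are assembled by name. This is a minimal illustration, not the transformers implementation. The names `_deps`, `deps`, and `deps_list` and all version pins are assumptions.

```python
import re

# Single source of truth for version pins. The pins below are placeholders,
# not the versions TRL or transformers actually require.
_deps = [
    "accelerate>=0.34.0",
    "datasets>=2.21.0",
    "pytest",
    "rich",
]

# Map bare package name -> full requirement string,
# e.g. "accelerate" -> "accelerate>=0.34.0".
_NAME = re.compile(r"^([A-Za-z0-9._-]+)")
deps = {_NAME.match(spec).group(1): spec for spec in _deps}


def deps_list(*pkgs):
    """Look up the pinned requirement string for each package name."""
    return [deps[pkg] for pkg in pkgs]


install_requires = deps_list("accelerate", "datasets")
extras = {"test": deps_list("pytest")}
```

The benefit is that a pin is changed in exactly one place, and a typo in a package name fails loudly with a `KeyError` instead of silently installing an unpinned package.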

trl/trainer/judges.py

qgallouedec added 3 commits October 30, 2024 16:23

clean deps

9fb772e

new tests

3691aad

tests

8d9dee3

qgallouedec added 17 commits October 30, 2024 16:49

Add tests without optional dependencies workflow

9fe63f0

Update dependencies in tests.yml

ad37bdc

cpu version of torch

c5c2393

Update dependencies and installation commands

9e6a3df

Disable fail-fast in test workflow

5636198

Update test matrix in workflows file

21dcac5

try fix windows

ab6619b

Remove "rich" from required packages in setup.py

784d5a8

Update dependency installation in tests.yml

04aa363

Add torch and deepspeed installation for windows-latest

ab7cb68

Fix conditional statement in workflow file

c7dda4e

Add torch and deepspeed installation for Windows

16765b5

Fix if statement

5cb82c0

Update torch and deepspeed dependencies

fe65acd

Update liger package requirement for non-Windows platforms

64a5a38

remove scipy dep

8a9611e

Add torch GPU requirement for testing_utils

2bdcb88

qgallouedec commented Oct 31, 2024

qgallouedec requested review from lewtun, kashif and edbeeching October 31, 2024 08:26

kashif reviewed Oct 31, 2024

qgallouedec mentioned this pull request Oct 31, 2024

Fix _save_checkpoint for online methods #2288

Merged

5 tasks

lewtun approved these changes Oct 31, 2024

Update trl/trainer/judges.py

9a84242

qgallouedec merged commit 06be6f4 into main Oct 31, 2024
10 of 12 checks passed

qgallouedec deleted the clean-deps branch October 31, 2024 10:08
