
Fixed nll with label_smoothing to just nll #28708

Merged: 7 commits into huggingface:main, Feb 20, 2024

Conversation

@nileshkokane01 (Contributor) commented Jan 25, 2024

What does this PR do?

This PR fixes #28167 by setting label_smoothing=0.

Fixes #28167
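
For reference, the change boils down to dropping the hard-coded smoothing when the loss is built. A minimal sketch, assuming the loss construction in modeling_blip_text.py looked roughly like this (the exact surrounding code differs):

```python
import torch.nn as nn

# Before the fix: smoothing was hard-coded, so outputs.loss was not the true NLL
loss_fct = nn.CrossEntropyLoss(reduction="mean", label_smoothing=0.1)

# After the fix: no smoothing, so the reported loss is the plain negative log-likelihood
loss_fct = nn.CrossEntropyLoss(reduction="mean", label_smoothing=0.0)
```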

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines, and
    here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

@younesbelkada

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@younesbelkada (Contributor) left a comment

Thanks! Could you apply the same changes to the TF modeling file?

@nileshkokane01 (Contributor, Author) commented

@younesbelkada, can you please review the changes?

@younesbelkada (Contributor) left a comment

Thanks! There is a small merge conflict with main; could you resolve that? After that we'll be able to merge.

Review thread on src/transformers/models/blip/modeling_tf_blip_text.py (outdated, resolved)
@nileshkokane01 (Contributor, Author) commented

@younesbelkada, I rebased and resolved the conflict. I hope that's the right way; if not, let me know.

@younesbelkada (Contributor) left a comment

Thank you @nileshkokane01 for your contribution!

@HuggingFaceDocBuilderDev commented

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@ArthurZucker (Collaborator) left a comment

IMO that's a breaking change. Not sure whether the best fix is to update the documentation or to update this! Should we add a config arg for this?

@ArthurZucker (Collaborator) commented

cc @younesbelkada

@younesbelkada (Contributor) commented Feb 9, 2024

@ArthurZucker IMO this is not really breaking; it is even the opposite, as it fixes some subtle bugs with respect to training with BLIP. See @NielsRogge's comment here: #28167 (comment)

@younesbelkada (Contributor) commented Feb 9, 2024

Thinking a bit about it, indeed maybe we should make this configurable through a variable in the config so that users could revert to the original behaviour if needed.
@nileshkokane01 would you be happy to adjust the PR accordingly? You just need to add a new variable label_smoothing in the BLIP config class and set it to 0.
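
A rough sketch of that suggestion, assuming the new attribute lives on the text config with the proposed default of 0 (the real BlipTextConfig takes many more parameters):

```python
from transformers import PretrainedConfig


class BlipTextConfig(PretrainedConfig):
    """Trimmed-down illustration of adding label_smoothing to the config."""

    def __init__(self, label_smoothing=0.0, **kwargs):
        super().__init__(**kwargs)
        # 0.0 keeps outputs.loss equal to the true NLL; a user can raise it
        # to restore the previous smoothed-loss behaviour.
        self.label_smoothing = label_smoothing
```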

@nileshkokane01 (Contributor, Author) commented

@younesbelkada sure! I'll do that.

@younesbelkada (Contributor) commented

Thank you @nileshkokane01!

@nileshkokane01 (Contributor, Author) commented

@younesbelkada, do I have to change the NLL loss to the following as well:

loss_fct = CrossEntropyLoss(reduction=reduction, label_smoothing=self.config.label_smoothing)

@younesbelkada (Contributor) commented

Hi @nileshkokane01, yes please, this sounds great!
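
With that attribute in place, a user who wants the pre-fix smoothed loss back could override it at load time. An illustrative sketch, assuming the attribute ends up on the text config, that the previously hard-coded smoothing value was 0.1, and using a public BLIP checkpoint purely as an example:

```python
from transformers import BlipConfig, BlipForConditionalGeneration

# Load the config, then restore the old smoothed-loss behaviour.
config = BlipConfig.from_pretrained("Salesforce/blip-image-captioning-base")
config.text_config.label_smoothing = 0.1

model = BlipForConditionalGeneration.from_pretrained(
    "Salesforce/blip-image-captioning-base", config=config
)
```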

@nileshkokane01 (Contributor, Author) commented

@younesbelkada, can you have a look?

@younesbelkada (Contributor) left a comment

Looking good, thanks! Left a few nits on the docstrings!

Review threads on src/transformers/models/blip/configuration_blip.py (outdated, resolved)
@amyeroberts (Collaborator) left a comment

Thanks for fixing and making this controllable!

@younesbelkada merged commit 49c0b29 into huggingface:main Feb 20, 2024
18 checks passed
itazap pushed a commit that referenced this pull request May 14, 2024
* Fixed nll with label_smoothing to nll

* Resolved conflict by rebase

* Fixed nll with label_smoothing to nll

* Resolved conflict by rebase

* Added label_smoothing to config file

* Fixed nits

Successfully merging this pull request may close these issues:

Misleading doc on BLIP outputs.loss: doesn't return true NLL but NLL *with label smoothing* (#28167)