Sentence Transformers test (soon) no longer expected to fail #1918

tomaarsen · 2023-12-18T09:46:02Z

Hello!

Pull Request overview

Remove xfail from Sentence Transformer "save_to_hub" test.
Add an assert to verify that it should work.

Details

The upcoming Sentence Transformers 2.3.0 has removed the hardcoded endpoint in save_to_hub, so we can soon expect for this behaviour to work correctly.

Tom Aarsen

HuggingFaceDocBuilderDev · 2023-12-18T09:50:47Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Wauplin

Yay! Thanks for opening this PR @tomaarsen!
Just let me know what sentence-transformers is released so we can merge this PR.

tomaarsen · 2023-12-18T09:59:45Z

Will do!
In the meantime, do the contrib tests download the bleeding edge version from the respective repos?

Wauplin · 2023-12-18T10:01:53Z

In the meantime, do the contrib tests download the bleeding edge version from the respective repos?

Yes it does! I forgot about that. Then we should be good to merge once the CI is green.

Wauplin · 2023-12-18T10:04:28Z

contrib/sentence_transformers/test_sentence_transformers.py

+
+    # Check model has been pushed properly
+    model_id = f"{user}/{repo_name}"
+    assert model_info(model_id).library_name == "sentence-transformers"


It looks like for some reason the library_name attribute is not set when requesting the model info 😕

Certainly odd. When I install with git+https://github.com/UKPLab/sentence-transformers.git#egg=sentence-transformers, it fails the first 2 times that I run pytest, and passes the 2 subsequent times :/

When it fails, the only siblings in ModelInfo is [RepoSibling(rfilename='.gitattributes', size=None, blob_id=None, lfs=None)], so the repo only has the gitattributes. If I breakpoint and inspect the times that it does pass, then all files are seen by the model_info.

and between each run, is model_id = f"{user}/{repo_name}" changing? The value should be unique for each run (generated here). I'm asking because having dependency between tests is odd

Yes, it does differ between runs.

Haha, now the spaCy tests are failing & sentence_transformers is passing. Perhaps it's indeed a flaky test setup where model_info gets slightly outdated data if it is called "too soon".

cc @Kakulukian @coyotte508 do you know if something has changed recently on the /api/models/<repo_id> endpoint that would make the model cache longer to update? In the test and thread above, we are doing:

create new repo

push model with modelcard

get model info (GET /api/models/repo_id)

check model_info.library_name.

It looks like adding a 1 second delay between steps 2. and 3. makes the test more robust. But I don't remember this was the case before. It is not a problem to add this delay in our tests but prefer to let you know in case it's a bigger problem server-side.

Now we rely on cache instead of building from scratch. So if the cache update (following the push) is going on in the background, it can display outdated info.

You can bypass this by passing the commit ID instead of HEAD, eg /api/models/<repo_id>/revision/<commit_id>

You can pass main as the commit id (for now it should work, maybe later we'll optimize)

We can add support for Cache-Control header to skip cache if needed later on

Note that this seeems to happen only for commit endpoint, not push (as it's been awaited since https://github.com/huggingface/moon-landing/pull/5501)

We can fix it but potentially commit endpoint will take longer

Thanks for the explanation! No need to optimize anything on the endpoint as it's mostly a problem in internal tests. Good to know about the revision workaround 👍

…maarsen/huggingface_hub into contrib/sentence_transformers

codecov · 2023-12-18T10:47:04Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (a071655) 49.01% compared to head (a74d62a) 81.99%.

❗ Current head a74d62a differs from pull request most recent head ae7389c. Consider uploading reports for the commit ae7389c to get more accurate results

Additional details and impacted files

@@             Coverage Diff             @@
##             main    #1918       +/-   ##
===========================================
+ Coverage   49.01%   81.99%   +32.98%     
===========================================
  Files          65       65               
  Lines        8092     8092               
===========================================
+ Hits         3966     6635     +2669     
+ Misses       4126     1457     -2669

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

Wauplin · 2023-12-18T11:08:01Z

Thanks for debugging @tomaarsen. Let's merge this now :)

Sentence Transformers test (soon) no longer expected to fail

0461bfe

tomaarsen mentioned this pull request Dec 18, 2023

Remove hardcoded HF endpoint UKPLab/sentence-transformers#1767

Closed

Wauplin approved these changes Dec 18, 2023

View reviewed changes

Wauplin reviewed Dec 18, 2023

View reviewed changes

Wauplin and others added 3 commits December 18, 2023 11:13

Merge branch 'main' into contrib/sentence_transformers

07a94e7

Sleep before calling model_info

26e6d61

Merge branch 'contrib/sentence_transformers' of https://github.com/to…

a74d62a

…maarsen/huggingface_hub into contrib/sentence_transformers

Add sleep to spaCy tests, too

ae7389c

Wauplin merged commit c4ddfc7 into huggingface:main Dec 18, 2023
17 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sentence Transformers test (soon) no longer expected to fail #1918

Sentence Transformers test (soon) no longer expected to fail #1918

tomaarsen commented Dec 18, 2023

HuggingFaceDocBuilderDev commented Dec 18, 2023

Wauplin left a comment

tomaarsen commented Dec 18, 2023

Wauplin commented Dec 18, 2023

Wauplin Dec 18, 2023

tomaarsen Dec 18, 2023

tomaarsen Dec 18, 2023

Wauplin Dec 18, 2023

tomaarsen Dec 18, 2023

tomaarsen Dec 18, 2023

Wauplin Dec 18, 2023

coyotte508 Dec 18, 2023 •

edited

Loading

coyotte508 Dec 18, 2023 •

edited

Loading

Wauplin Dec 18, 2023

codecov bot commented Dec 18, 2023 •

edited

Loading

Wauplin commented Dec 18, 2023

Sentence Transformers test (soon) no longer expected to fail #1918

Sentence Transformers test (soon) no longer expected to fail #1918

Conversation

tomaarsen commented Dec 18, 2023

Pull Request overview

Details

HuggingFaceDocBuilderDev commented Dec 18, 2023

Wauplin left a comment

Choose a reason for hiding this comment

tomaarsen commented Dec 18, 2023

Wauplin commented Dec 18, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

coyotte508 Dec 18, 2023 • edited Loading

Choose a reason for hiding this comment

coyotte508 Dec 18, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

codecov bot commented Dec 18, 2023 • edited Loading

Codecov Report

Wauplin commented Dec 18, 2023

coyotte508 Dec 18, 2023 •

edited

Loading

coyotte508 Dec 18, 2023 •

edited

Loading

codecov bot commented Dec 18, 2023 •

edited

Loading