Text generation inference, fix offline #1341

oOraph · 2023-12-13T13:30:37Z

What does this PR do?

Allows for Text Generation Inference to succeed in loading prefetched and cached private models if no token is provided at the time the text-generation-inference service is launched

oOraph · 2023-12-13T16:03:19Z

server/text_generation_server/utils/hub.py

@@ -138,33 +179,33 @@ def download_weights(
 ) -> List[Path]:
    """Download the safetensors files from the hub"""

-    def download_file(filename, tries=5, backoff: int = 5):
-        local_file = try_to_load_from_cache(model_id, revision, filename)
+    def download_file(fname, tries=5, backoff: int = 5):


Note: these var renaming (filename -> fname, start_time -> stime and i -> idx) are not mandatory, just renamed them to avoid pep warnings like "shadows var_name from outer scope"

oOraph · 2023-12-13T16:40:47Z

@OlivierDehaene @Narsil, I think the unit tests that fail only do because I miss some secrets in my forked repository. Could you please push a new branch with the proposed change in this repo (or give me the right to do it) so as to confirm this ? Thanks :)

Signed-off-by: Raphael Glon <[email protected]>

@oOraph

@oOraph --------- Signed-off-by: Raphael Glon <[email protected]> Co-authored-by: Raphael Glon <[email protected]>

@oOraph

@oOraph --------- Signed-off-by: Raphael Glon <[email protected]> Co-authored-by: Raphael Glon <[email protected]>

oOraph force-pushed the dev/fix_offline branch 4 times, most recently from ae77ab1 to 4959025 Compare December 13, 2023 16:01

oOraph commented Dec 13, 2023

View reviewed changes

oOraph marked this pull request as ready for review December 13, 2023 16:39

oOraph requested review from OlivierDehaene and Narsil December 13, 2023 16:39

Text generation inference, fix offline

b0b76ce

Signed-off-by: Raphael Glon <[email protected]>

oOraph force-pushed the dev/fix_offline branch from 4959025 to b0b76ce Compare December 13, 2023 16:43

OlivierDehaene changed the base branch from main to fix/offline December 14, 2023 14:57

OlivierDehaene merged commit 47cd67e into huggingface:fix/offline Dec 14, 2023
3 of 7 checks passed

OlivierDehaene added a commit that referenced this pull request Dec 18, 2023

fix: fix offline (#1341) (#1347)

8428ed1

@oOraph --------- Signed-off-by: Raphael Glon <[email protected]> Co-authored-by: Raphael Glon <[email protected]>

kdamaszk pushed a commit to kdamaszk/tgi-gaudi that referenced this pull request Apr 29, 2024

fix: fix offline (huggingface#1341) (huggingface#1347)

5ff9e81

@oOraph --------- Signed-off-by: Raphael Glon <[email protected]> Co-authored-by: Raphael Glon <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Text generation inference, fix offline #1341

Text generation inference, fix offline #1341

oOraph commented Dec 13, 2023 •

edited

Loading

oOraph Dec 13, 2023

oOraph commented Dec 13, 2023 •

edited

Loading

Text generation inference, fix offline #1341

Text generation inference, fix offline #1341

Conversation

oOraph commented Dec 13, 2023 • edited Loading

What does this PR do?

oOraph Dec 13, 2023

Choose a reason for hiding this comment

oOraph commented Dec 13, 2023 • edited Loading

oOraph commented Dec 13, 2023 •

edited

Loading

oOraph commented Dec 13, 2023 •

edited

Loading