
fix custom cache dir #2226

Merged · merged 4 commits into main on Jul 15, 2024
Conversation

@ErikKaum (Member) commented Jul 12, 2024

What does this PR do?

How to reproduce the bug

  • the env vars HUGGINGFACE_HUB_CACHE=/some/other/dir and HF_HUB_OFFLINE=1 have to be set
  • launch with a tokenizer, e.g. text-generation-router --tokenizer-name meta-llama/Meta-Llama-3-8B-Instruct
  • result: tokenizer_file_name is None, and so is the config
2024-07-12T12:40:29.724257Z  WARN text_generation_router: router/src/main.rs:328: Could not find tokenizer config locally and no API specified
2024-07-12T12:40:29.724292Z  INFO text_generation_router: router/src/main.rs:353: Using config None
2024-07-12T12:40:29.724307Z  WARN text_generation_router: router/src/main.rs:355: Could not find a fast tokenizer implementation for meta-llama/Meta-Llama-3-8B-Instruct

Fix

  • check whether HUGGINGFACE_HUB_CACHE is set and, if so, use it
  • otherwise fall back to Cache::default() (see the sketch below)

This most likely only comes up when running TGI in Docker, since there ENV HUGGINGFACE_HUB_CACHE=/data is set to something other than the default.
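A minimal sketch of that lookup, assuming the hf_hub crate exposes Cache::new(PathBuf) alongside Cache::default(); the helper name resolve_cache is illustrative only and not taken from this PR:

    use std::env;
    use std::path::PathBuf;

    use hf_hub::Cache;

    // Prefer HUGGINGFACE_HUB_CACHE when it is set and non-empty,
    // otherwise fall back to the default hub cache location.
    fn resolve_cache() -> Cache {
        match env::var("HUGGINGFACE_HUB_CACHE") {
            Ok(dir) if !dir.is_empty() => Cache::new(PathBuf::from(dir)),
            _ => Cache::default(),
        }
    }

With this, an offline launch against a custom cache dir resolves the tokenizer and config files from that directory instead of silently falling back to the default path.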

@OlivierDehaene (Member) left a comment:

Thanks

@ErikKaum merged commit 457fb0a into main on Jul 15, 2024
9 checks passed
@ErikKaum deleted the fix/hf-cache-dir branch on July 15, 2024 at 13:17
ErikKaum added a commit that referenced this pull request Jul 25, 2024
* fix to not ignore HUGGINGFACE_HUB_CACHE in cache

* delete printlns

* delete newlines

* maybe fix trailing whitespace
ErikKaum added a commit that referenced this pull request Jul 26, 2024
* fix to not ignore HUGGINGFACE_HUB_CACHE in cache

* delete printlns

* delete newlines

* maybe fix trailing whitespace
yuanwu2017 pushed a commit to yuanwu2017/tgi-gaudi that referenced this pull request Sep 26, 2024
* fix to not ignore HUGGINGFACE_HUB_CACHE in cache

* delete printlns

* delete newlines

* maybe fix trailing whitespace
Successfully merging this pull request may close these issues.

loading fast tokenizer implementation for cached model with HF_HUB_OFFLINE=1 fails