Model not automatically loaded from cache #1461

DavidHuebner · 2022-03-11T13:09:33Z

First, let me thank you for maintaining this great package!

I think that the model-caching logic in SentenceTransformer.py could be improved. When I specify a model and a cache folder like so: SentenceTransformer("distiluse-base-multilingual-cased-v1", cache_folder="~/.models/"), I hoped for (and expected) the following behaviour:

1. When executed for the first time, it should download the model into the cache_folder (this is ok!).
2. When executed for a second time, it should re-load the existing model from that cache folder (does not work!). Even worse, it will download the model every time.

To actually make the model reload from cache, one needs to pass the cached directory as model_name_or_path, like so: SentenceTransformer(model_name_or_path="~/.models/sentence-transformers_distiluse-base-multilingual-cased-v1"). This leaves me with two different calls to SentenceTransformer (one for the first call, one for later re-loads), with some checking in between.

The fix to this is rather simple. The loading code should check if the model was already downloaded to the cache_dir. The call to snapshot_download in https://github.com/UKPLab/sentence-transformers/blob/master/sentence_transformers/SentenceTransformer.py#L86 should only be executed if there is no existing model at model_path yet.
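To illustrate the idea, here is a minimal sketch of a "download only if not cached" check. It is not the library's actual code: resolve_model_path and the download callable are hypothetical names, and the directory naming (prefixing "sentence-transformers/" and replacing "/" with "_") is an assumption inferred from the cached path quoted above.

```python
import os
from typing import Callable


def resolve_model_path(
    model_name: str,
    cache_folder: str,
    download: Callable[[str, str], None],
) -> str:
    """Return the local model directory, downloading only if it is missing.

    `download(repo_id, target_dir)` is a hypothetical stand-in for whatever
    the library uses to fetch the files (the issue mentions snapshot_download).
    """
    cache_folder = os.path.expanduser(cache_folder)

    # Assumed naming scheme, inferred from the path quoted in this issue:
    # "<cache_folder>/sentence-transformers_<model name>".
    repo_id = model_name if "/" in model_name else "sentence-transformers/" + model_name
    model_path = os.path.join(cache_folder, repo_id.replace("/", "_"))

    # Only hit the network when there is no (non-empty) directory yet.
    if not os.path.isdir(model_path) or not os.listdir(model_path):
        download(repo_id, model_path)

    return model_path
```

With something like this in place, the same call could be used for the first run and for later re-loads. Note that the check only looks for a non-empty directory, so it cannot tell a complete download from a partial one.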



sadnoodles · 2023-05-31T11:26:51Z

Sorry for the incomplete PR.

#1923 is also not a fix. If the download is not complete, this will go wrong.

I think we should check the integrity of the cached model.
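One possible way to guard against incomplete downloads, sketched under assumptions: fetch into a scratch directory and rename it to the final cache path only after the download has finished, so that an existing cache directory is always complete. download_atomically and the download callable are hypothetical names, and this is not what either PR or the library does.

```python
import os
import shutil
import tempfile
from typing import Callable


def download_atomically(model_path: str, download: Callable[[str], None]) -> str:
    """Download into a scratch directory and publish it only once complete.

    `download(target_dir)` is a hypothetical callable that writes all model
    files into `target_dir` and raises if it is interrupted.
    """
    if os.path.isdir(model_path):
        # Directories only appear here after a successful download.
        return model_path

    parent = os.path.dirname(os.path.abspath(model_path))
    os.makedirs(parent, exist_ok=True)

    # Scratch directory next to the target, so the rename stays on one filesystem.
    tmp_dir = tempfile.mkdtemp(prefix=".partial-", dir=parent)
    try:
        download(tmp_dir)
        os.rename(tmp_dir, model_path)
    except BaseException:
        # Leave no half-written cache entry behind.
        shutil.rmtree(tmp_dir, ignore_errors=True)
        raise
    return model_path
```

An interrupted download then leaves nothing at the cache path, and the next call simply downloads again.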

tomaarsen · 2024-01-29T10:15:57Z

Hello!

The caching functionality has been overhauled in the new v2.3.0. It shouldn't re-download the model every time! I'll close this for now. Feel free to let me know if other model loading issues pop up with the new release.

Tom Aarsen

sadnoodles mentioned this issue May 24, 2022

Download from online only if not cached #1565

Merged

CharlesJu1 added a commit to CharlesJu1/sentence-transformers that referenced this issue May 18, 2023

use cache even if module.json is not present in the cache. Fix issus U…

889537c

…KPLab#1461 and pull UKPLab#1565

CharlesJu1 mentioned this issue May 18, 2023

use cache even if module.json is not present in the cache. #1923

Closed

tomaarsen closed this as completed Jan 29, 2024
