fix discard_names bug in safetensors convertion #1052

zhangsibo1129 · 2023-09-25T03:03:30Z

What does this PR do?

The model class attributes `_tied_weights_keys` and `_keys_to_ignore_on_load_missing` can only be `None` or a list. When `_keys_to_ignore_on_load_missing` is explicitly set to `None`, `getattr(class_, "_keys_to_ignore_on_load_missing", [])` returns `None` rather than the `[]` default, so `discard_names.extend(None)` raises an exception even though `_tied_weights_keys` exists.
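A minimal reproduction of the failure described above. `FakeModel` is a hypothetical stand-in for a transformers model class, used only so the snippet runs without the library installed.

```python
class FakeModel:
    # Hypothetical stand-in: one attribute is a list, the other is
    # explicitly set to None, as some model classes do.
    _tied_weights_keys = ["lm_head.weight"]
    _keys_to_ignore_on_load_missing = None


# Copy into a new list so the class attribute itself is not mutated.
discard_names = list(getattr(FakeModel, "_tied_weights_keys", []))

# The default [] is only used when the attribute is missing entirely.
# Here the attribute exists and is None, so getattr returns None.
ignored = getattr(FakeModel, "_keys_to_ignore_on_load_missing", [])

try:
    discard_names.extend(ignored)
    error = None
except TypeError as exc:
    # list.extend requires an iterable, and None is not one.
    error = exc

print(repr(ignored), type(error).__name__)
```

The `getattr` default does not help here because it only applies to a missing attribute, not to one whose value is `None`.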

Who can review?

@OlivierDehaene @Narsil


# What does this PR do?

Support a local config file to avoid an unexpected `discard_names`, which causes #1057.

When launching in local mode without a `model.safetensors` file, the original code ends up with `discard_names = []` because `hf_hub_download` throws a connection error.

```python
# server/text_generation_server/cli.py
try:
    import transformers
    import json

    config_filename = hf_hub_download(model_id, revision=revision, filename="config.json")
    with open(config_filename, "r") as f:
        config = json.load(f)
    architecture = config["architectures"][0]

    class_ = getattr(transformers, architecture)

    # Name for this variable depends on transformers version.
    discard_names = getattr(class_, "_tied_weights_keys", [])
    discard_names.extend(getattr(class_, "_keys_to_ignore_on_load_missing", []))

except Exception as e:
    discard_names = []
```

The expected `_tied_weights_keys` of OPT-1.3b is `["lm_head.weight"]`, and its tied weight `"model.decoder.embed_tokens.weight"` should be kept in the safetensors conversion. With the empty `discard_names` above, however, the subsequent method `_remove_duplicate_names` keeps `"lm_head.weight"` and discards `"model.decoder.embed_tokens.weight"`, which causes error #1057.

So add a local mode branch to get the expected `discard_names`, as follows. This modification also applies to other models.

```python
# server/text_generation_server/cli.py
if is_local_model:
    config_filename = os.path.join(model_id, "config.json")
else:
    config_filename = hf_hub_download(model_id, revision=revision, filename="config.json")
```

In addition, when `_tied_weights_keys` or `_keys_to_ignore_on_load_missing` is `None`, the above code will also throw an unexpected error. This is fixed in PR #1052.

## Before submitting

- [ ] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
- [x] Did you read the [contributor guideline](https://github.com/huggingface/transformers/blob/main/CONTRIBUTING.md#start-contributing-pull-requests), Pull Request section?
- [x] Was this discussed/approved via a Github issue or the [forum](https://discuss.huggingface.co/)? Please add a link to it if that's the case.
- [ ] Did you make sure to update the documentation with your changes? Here are the [documentation guidelines](https://github.com/huggingface/transformers/tree/main/docs), and [here are tips on formatting docstrings](https://github.com/huggingface/transformers/tree/main/docs#writing-source-documentation). N/A
- [ ] Did you write any new necessary tests? N/A

## Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag members/contributors who may be interested in your PR.

@Narsil
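The local-mode branch described above can be sketched as a standalone helper. This is an illustration under assumptions, not the code in `cli.py`: the function name `load_architecture` is hypothetical, and the `huggingface_hub` import is deferred so that the local path works without network access or the package installed.

```python
import json
import os


def load_architecture(model_id, revision=None, is_local_model=False):
    """Return the first entry of "architectures" from the model's config.json.

    Hypothetical helper: for a local model, model_id is treated as a
    directory path; otherwise the config is fetched from the Hub.
    """
    if is_local_model:
        # Read the config straight from disk, no network call needed.
        config_filename = os.path.join(model_id, "config.json")
    else:
        # Deferred import: only required for remote models.
        from huggingface_hub import hf_hub_download

        config_filename = hf_hub_download(
            model_id, revision=revision, filename="config.json"
        )

    with open(config_filename, "r") as f:
        config = json.load(f)
    return config["architectures"][0]
```

Called as `load_architecture("/path/to/model", is_local_model=True)`, the helper never touches the network, so a connection error can no longer silently reset `discard_names` to an empty list.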

Narsil · 2023-09-26T13:02:41Z

server/text_generation_server/cli.py

+            if getattr(class_, "_tied_weights_keys", []):
+                discard_names.extend(getattr(class_, "_tied_weights_keys", []))
+            if getattr(class_, "_keys_to_ignore_on_load_missing", []):
+                discard_names.extend(getattr(class_, "_keys_to_ignore_on_load_missing", []))


Actually, I think we should just drop `_keys_to_ignore_on_load_missing` entirely.

These attributes should always be lists if defined, but transformers has indeed recently switched everything to `_tied_weights_keys`.
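One way to read only `_tied_weights_keys` while staying safe against `None` is sketched below. The helper name `get_discard_names` and the two example classes are hypothetical, and this is not necessarily the code that was merged.

```python
def get_discard_names(class_):
    """Return a fresh list of tied-weight names, or [] when unset or None."""
    tied = getattr(class_, "_tied_weights_keys", None)
    # Covers three cases: attribute missing, attribute None, attribute empty.
    # list(...) copies, so callers cannot mutate the class attribute.
    return list(tied) if tied else []


class WithTiedKeys:
    # Hypothetical model class with a tied output head.
    _tied_weights_keys = ["lm_head.weight"]


class WithNone:
    # Hypothetical model class that sets the attribute to None.
    _tied_weights_keys = None


print(get_discard_names(WithTiedKeys))
print(get_discard_names(WithNone))
```

Returning a copy also avoids a side effect of the original pattern, where calling `extend` on the list returned by `getattr` would modify the class attribute in place.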

Narsil · 2023-09-26T13:05:34Z

Since it's a small fix I took the liberty of modifying the PR directly with the proposed fix. I tested on facebook/opt-13.b and it works.

fix discard_names in safetensors convertion

649d975

zhangsibo1129 mentioned this pull request Sep 26, 2023

support local model config file #1058

Merged

5 tasks

Narsil reviewed Sep 26, 2023


Simpler fix.

9c0f679

Narsil merged commit eba6ab1 into huggingface:main Sep 26, 2023
