
fix(convert-hf-to-gguf): requires einops for InternLM2ForCausalLM models #5792

Merged
merged 1 commit into ggerganov:master on Mar 1, 2024

Conversation

@Nold360 (Contributor) commented Feb 29, 2024

Small fix for the container images, which currently fail with:

gguf: This GGUF file is for Little Endian only
Set model parameters
Set model tokenizer
InternLM2 convert token 'b'\x00'' to '🐉'!
gguf: Setting special token type bos to 1
gguf: Setting special token type eos to 2
gguf: Setting special token type pad to 2
gguf: Setting chat_template to {{ bos_token }}{% for message in messages %}{{'<|im_start|>' + message['role'] + '
' + message['content'] + '<|im_end|>' + '
'}}{% endfor %}{% if add_generation_prompt %}{{ '<|im_start|>assistant
' }}{% endif %}
Exporting model to '/workspace/input/ggml-model-f16.gguf'
Traceback (most recent call last):
  File "/app/convert-hf-to-gguf.py", line 1934, in <module>
    main()
  File "/app/convert-hf-to-gguf.py", line 1928, in main
    model_instance.write()
  File "/app/convert-hf-to-gguf.py", line 152, in write
    self.write_tensors()
  File "/app/convert-hf-to-gguf.py", line 1612, in write_tensors
    from einops import rearrange
ModuleNotFoundError: No module named 'einops'

Used 0.7.0 since it's the current release.
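For reference, the fix presumably boils down to a single pinned line in the script's requirements file (the exact filename and pin style below are assumptions, not taken from the actual diff):

```
einops>=0.7.0
```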

@cebtenzzre cebtenzzre requested a review from crasm February 29, 2024 16:38
@cebtenzzre (Collaborator)

@crasm Were these files meant to contain required dependencies, or also include optional dependencies? In the latter case, this PR is a step in the right direction, but we also need to add tiktoken for Qwen models.

@Nold360 (Contributor, Author) commented Feb 29, 2024

> @crasm Were these files meant to contain required dependencies, or also include optional dependencies? In the latter case, this PR is a step in the right direction, but we also need to add tiktoken for Qwen models.

Good point. IMHO they're so tiny compared to the rest that we could just add them rather than building workarounds for containers.

@crasm (Contributor) commented Feb 29, 2024

> @crasm Were these files meant to contain required dependencies, or also include optional dependencies? In the latter case, this PR is a step in the right direction, but we also need to add tiktoken for Qwen models.

It's supposed to be for all dependencies.

The check-requirements.sh script and workflow were intended to prevent these kinds of PRs from being necessary, by forcing all dependencies to be declared in the requirements.txt files.
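One way such a check can surface every dependency a script declares, including the lazy imports buried inside function bodies, is to walk the script's AST. This is a hedged sketch of the idea, not the actual check-requirements.sh logic; the function name collect_imports and the sample script are hypothetical:

```python
import ast

def collect_imports(source: str) -> set[str]:
    """Collect top-level module names from every import statement in a
    Python source string, including lazy imports inside function bodies."""
    modules: set[str] = set()
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Import):
            # "import a.b" depends on the top-level package "a"
            modules.update(alias.name.split(".")[0] for alias in node.names)
        elif isinstance(node, ast.ImportFrom) and node.module and node.level == 0:
            # skip relative imports (node.level > 0); they aren't packages
            modules.add(node.module.split(".")[0])
    return modules

script = """
import gguf

def write_tensors():
    from einops import rearrange  # lazy import, easy to miss
"""
print(sorted(collect_imports(script)))  # -> ['einops', 'gguf']
```

Comparing that set against the names in the requirements file would flag an undeclared einops before it ever reaches a container build.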

Has the workflow check been failing consistently from running out of space, like in the other PR you tagged me in? (edit: yes) That could explain why these are slipping through.

@crasm (Contributor) commented Feb 29, 2024

Maybe it'd be better to forgo the complexity I introduced in the requirements.txt. We could just do #5745 and declare each script's optional dependencies in the pyproject.toml.
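Declared in pyproject.toml, that might look like the following hypothetical sketch; the actual extras group names and pins proposed in #5745 may differ:

```toml
[project.optional-dependencies]
# per-script extras, installed with e.g. `pip install .[convert-hf]`
convert-hf = ["einops>=0.7.0"]
qwen = ["tiktoken"]
```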

@cebtenzzre (Collaborator) left a review comment:

This PR is an improvement, and is fine by me until we have a better solution.

@crasm (Contributor) left a review comment:

Looks good. I'll investigate the disk space issue, which I suspect is a configuration issue rather than an actual resource limitation.

@cebtenzzre cebtenzzre merged commit da3b9ba into ggerganov:master Mar 1, 2024
22 of 23 checks passed
@crasm (Contributor) commented Mar 1, 2024

I'm trying to figure out how to catch any more of these implicit dependencies to prevent more gotchas.

@cebtenzzre Is it normal to have import statements embedded in various places in Python scripts? That seems crazy to me, coming from other languages.

@cebtenzzre (Collaborator) commented Mar 1, 2024

> Is it normal to have import statements embedded in various places in Python scripts? That seems crazy to me coming from other languages.

Yep, this is how optional dependencies are typically implemented in Python: you import them lazily, so they only matter if the code that needs them actually runs. With a venv and a type checker like mypy, it's not hard to detect when these dependencies are missing and shouldn't be; mypy will complain with import-not-found.
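The lazy-import pattern being described can be sketched like this. It is a minimal illustration, not the actual convert-hf-to-gguf.py code; optional_import and write_tensors_sketch are hypothetical names:

```python
import importlib

def optional_import(name: str):
    """Return the named module if it is installed, else None.

    Because the import happens inside a function, the dependency is only
    required when the code path that uses it actually runs.
    """
    try:
        return importlib.import_module(name)
    except ModuleNotFoundError:
        return None

def write_tensors_sketch():
    # Hypothetical stand-in for a conversion step that needs einops:
    einops = optional_import("einops")
    if einops is None:
        # Fail with an actionable message instead of a raw traceback.
        raise RuntimeError(
            "InternLM2 conversion requires einops; install it with "
            "`pip install einops`"
        )
    # ... einops.rearrange(...) would be used here ...
```

Users converting models that never hit this code path pay no cost, which is why the missing requirement went unnoticed until an InternLM2 conversion ran inside a container.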

hazelnutcloud pushed a commit to hazelnutcloud/llama.cpp that referenced this pull request Mar 10, 2024
jordankanter pushed a commit to jordankanter/llama.cpp that referenced this pull request Mar 13, 2024
hodlen pushed a commit to hodlen/llama.cpp that referenced this pull request Apr 1, 2024
3 participants