Wirthual/fix vision #12

wirthual · 2024-10-02T01:47:34Z

I have read the contributing guidelines
Self-reported review complexity:
- Low
- Medium
- High

wirthual · 2024-10-10T03:20:13Z

I saw you started working on mllama support. In this branch I added the new tensor names from Llama 3.2. I have run into two problems so far:

add_tensor_info in the GGUF writer raises a "Duplicated tensor name" error for blk.{bid}.ffn_up. Does this layer exist in both the vision and the language part, so that I need to prefix the name somewhere?
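
To illustrate the suspected cause, here is a minimal sketch of how two towers that both map a layer to blk.{bid}.ffn_up would collide, and how a per-tower prefix avoids it. `TensorRegistry`, `mapped_name`, `register`, and the `"v"` prefix are hypothetical names made up for this example; this is not the real gguf writer API, and the prefix the branch should actually use is an open question.

```python
from typing import Iterable, List, Optional


class TensorRegistry:
    """Hypothetical stand-in for a writer that rejects repeated tensor names."""

    def __init__(self) -> None:
        self.names = set()

    def add(self, name: str) -> None:
        # Mimics the kind of check that produces the error described above.
        if name in self.names:
            raise ValueError(f"Duplicated tensor name {name!r}")
        self.names.add(name)


def mapped_name(bid: int, tower: Optional[str] = None) -> str:
    # "tower" is an assumed per-tower prefix (for example "v" for vision),
    # chosen only for illustration.
    base = f"blk.{bid}.ffn_up"
    return f"{tower}.{base}" if tower else base


def register(names: Iterable[str]) -> List[str]:
    # Register every name in a fresh registry and return the sorted result.
    registry = TensorRegistry()
    for name in names:
        registry.add(name)
    return sorted(registry.names)


# Same block index in both towers, no prefix: the second add() raises.
try:
    register([mapped_name(0), mapped_name(0)])
except ValueError as err:
    print(err)

# With a prefix on the vision tower, both names can coexist.
print(register([mapped_name(0), mapped_name(0, tower="v")]))
```

If the collision really comes from both towers sharing one name template, the fix would belong in the tensor name mapping rather than in the duplicate check itself.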

The other problem is that the loaded tokenizer seems to have one more token than the vocab_size parameter specifies, so the conversion fails on this assertion:

        vocab_size = self.hparams["text_config"].get("vocab_size", len(tokenizer.vocab))
        assert max(tokenizer.vocab.values()) < vocab_size

Is the additional token related to the image, so that this check needs to be changed?
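
One way to find out which token it is would be to list every vocabulary entry whose id is not below vocab_size. The sketch below runs on a toy dictionary; `tokens_outside_vocab` is a hypothetical helper, and the token names and ids are made up for illustration. With the real tokenizer, `tokenizer.vocab` and the vocab_size value from the snippet above would be passed in instead.

```python
from typing import Dict


def tokens_outside_vocab(vocab: Dict[str, int], vocab_size: int) -> Dict[str, int]:
    """Return the tokens whose id is >= vocab_size, i.e. the ones that
    make `max(vocab.values()) < vocab_size` fail."""
    return {token: idx for token, idx in vocab.items() if idx >= vocab_size}


# Toy data only: three tokens, but a declared vocabulary size of two.
toy_vocab = {"<bos>": 0, "hello": 1, "<extra>": 2}
extra = tokens_outside_vocab(toy_vocab, vocab_size=2)
print(extra)
```

If the only entry reported is an image placeholder token, that would suggest special-casing it in the check rather than dropping the assertion entirely.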

Skipping those two checks, I am able to produce a GGUF file.

Best,
wirthual

ngxson · 2024-10-10T09:19:42Z

Thanks for your suggestion. However, my PR ggerganov#9687 targets llava (please read its description for more info). This is to reduce complexity and focus on developing a framework that can support multiple archs in the future.

llama-3.2 vision will be added at some point, so I'll keep this PR open.

wirthual added 2 commits October 2, 2024 03:42

fix missing imports

3c1242a

revert vision.cpp

3ca3898

github-actions bot added the examples label Oct 2, 2024

wirthual added 2 commits October 9, 2024 17:07

Merge branch 'xsn/vision' of https://github.com/ngxson/llama.cpp into…

c430c21

… wirthual/fix-vision

added layer names for mllama

308da5f

github-actions bot added the python label Oct 10, 2024
