
common: ensure token addition to batch does not exceed llama_batch size #9668

Merged
merged 1 commit on Sep 29, 2024

Conversation

@matiaslin (Contributor):

fix #9667

A crash was observed when the number of tokens added to a batch exceeds the context size. Assertions have been added to ensure that the number of tokens added to the batch stays within the bounds of the context size.
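For illustration, a minimal sketch of how the overflow can happen (the capacity value and the wrapper usage below are assumptions for the example, not code from this PR):

```cpp
#include <vector>

#include "common.h"   // llama_batch_add helper
#include "llama.h"

// Illustrative only: a batch allocated for 512 tokens being fed more than that.
void overflow_example(const std::vector<llama_token> & tokens) {
    const int32_t n_batch = 512;                          // capacity passed to llama_batch_init (assumed value)
    llama_batch batch = llama_batch_init(n_batch, 0, 1);  // no embeddings, 1 seq id per token

    // If tokens.size() > n_batch, each extra llama_batch_add writes past the
    // arrays allocated by llama_batch_init -- the crash reported in #9667.
    for (size_t i = 0; i < tokens.size(); ++i) {
        llama_batch_add(batch, tokens[i], (llama_pos) i, { 0 }, false);
    }

    llama_batch_free(batch);
}
```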

@ggerganov (Owner) left a comment:

A more elegant solution is to use the fact that llama_batch.seq_id is null-terminated:

```cpp
batch.seq_id[n_tokens_alloc] = nullptr;
```

We can add an assert directly in llama_batch_add:

```cpp
GGML_ASSERT(batch.seq_id[batch.n_tokens] && "llama_batch size exceeded");
```
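For readers following along: the null terminator referred to here comes from llama_batch_init, which allocates one extra seq_id pointer slot. Roughly (a simplified fragment, not a verbatim copy of llama.cpp):

```cpp
// Inside llama_batch_init(n_tokens_alloc, embd, n_seq_max), simplified:
// the seq_id array gets n_tokens_alloc valid pointers plus one trailing nullptr,
// so batch.seq_id[i] == nullptr exactly when i is past the allocated capacity.
batch.seq_id = (llama_seq_id **) malloc(sizeof(llama_seq_id *) * (n_tokens_alloc + 1));
for (int32_t i = 0; i < n_tokens_alloc; ++i) {
    batch.seq_id[i] = (llama_seq_id *) malloc(sizeof(llama_seq_id) * n_seq_max);
}
batch.seq_id[n_tokens_alloc] = nullptr; // sentinel that the proposed assert relies on
```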

@matiaslin (Contributor, Author):

That's great @ggerganov. I'm uploading the new patch soon.

@github-actions bot added the build, testing, Vulkan, python, devops, server, and ggml labels on Sep 28, 2024
Commit message:

A crash was observed when the number of tokens added to a batch exceeds
llama_batch size. An assertion in llama_batch_add was added to protect
against llama_batch size overflow.
@matiaslin (Contributor, Author):

Done. Now the issue mentioned in #9667 displays the following message if we attempt to add more tokens than the batch size:

```
common.cpp:1435: GGML_ASSERT(batch.seq_id[batch.n_tokens] && "llama_batch size exceeded") failed
```
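For reference, the patched helper in common.cpp then looks roughly like this (a paraphrased sketch of llama_batch_add with the new assert, not a verbatim copy of the merged code):

```cpp
void llama_batch_add(
                 struct llama_batch & batch,
                        llama_token   id,
                          llama_pos   pos,
    const std::vector<llama_seq_id> & seq_ids,
                               bool   logits) {
    // seq_id[] is null-terminated by llama_batch_init, so a null entry at
    // index n_tokens means the batch is already at full capacity.
    GGML_ASSERT(batch.seq_id[batch.n_tokens] && "llama_batch size exceeded");

    batch.token   [batch.n_tokens] = id;
    batch.pos     [batch.n_tokens] = pos;
    batch.n_seq_id[batch.n_tokens] = seq_ids.size();
    for (size_t i = 0; i < seq_ids.size(); ++i) {
        batch.seq_id[batch.n_tokens][i] = seq_ids[i];
    }
    batch.logits  [batch.n_tokens] = logits;

    batch.n_tokens++;
}
```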

@matiaslin changed the title from "parallel: fix adding tokens to batch" to "common: ensure token addition to batch does not exceed llama_batch size" on Sep 28, 2024
@ggerganov added the merge ready label on Sep 29, 2024
@ggerganov ggerganov merged commit faac0ba into ggerganov:master Sep 29, 2024
52 checks passed
@WilliamTambellini (Contributor):

Tks @matiaslin and @ggerganov

dsx1986 pushed a commit to dsx1986/llama.cpp that referenced this pull request Oct 29, 2024
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 15, 2024
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 18, 2024
Labels: build, devops, examples, ggml, merge ready, python, server, testing, Vulkan
Successfully merging this pull request may close these issues.

Bug: llama-parallel crashes when adding more tokens to llama_batch than context size (#9667)