Faster eval #298

jmercat · 2024-07-31T02:53:13Z

We might not want to merge this because it is hacky and there might be a usage that I don't foresee that could be impacted.

Problem:
Somewhere in llm-foundry or composer, the batches are appended with many stop tokens which makes the eval very slow.

Solution:
In OpenLMforCausalLM tests for 0s at the end of the input_ids batch and remove them. Add back fake logits at the end....

Consideration:
This hack could be implemented in SimpleComposerOpenLMCausalLM instead and would have less risk to have unforeseen impacts.
There shouldn't really be a need for this but I could not understand what was happening in Composer or llm-foundry or find where/why the inputs are appended with 0s.

jmercat added 3 commits July 24, 2024 16:11

loading checkpoint with _orig_mod. name prefix

c4ca039

added open_lm_1b_swiglutorch json

def4675

hacky removal of appended 0s in OpenLMforCausalLM

0180973

jmercat mentioned this pull request Jul 31, 2024

Faster eval2 #299

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Faster eval #298

Faster eval #298

jmercat commented Jul 31, 2024

Faster eval #298

Are you sure you want to change the base?

Faster eval #298

Conversation

jmercat commented Jul 31, 2024