Update Llama architecture to handle 3b/1b (#429)
cg123 authored Oct 4, 2024
1 parent c891a09 commit 8522917
Showing 1 changed file with 2 additions and 4 deletions.
6 changes: 2 additions & 4 deletions mergekit/_data/architectures/llama.json
@@ -73,10 +73,8 @@
 {
 "name": "lm_head.weight",
 "input_space": "running_residual",
-"is_embed":true,
-"aliases": [
-"model.lm_head.weight"
-]
+"is_embed": true,
+"optional": true
 }
 ]
 }
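
The change above marks `lm_head.weight` as optional and drops its `model.lm_head.weight` alias. As a rough illustration of what an `optional` flag could mean to code that consumes this file, here is a minimal sketch. It is not mergekit's loader: the `weights` key, the `model.norm.weight` entry, and the `resolve_weights` helper are all hypothetical, and the assumption is that a checkpoint may simply not contain a separate `lm_head.weight` tensor.

```python
import json

# Hypothetical excerpt: only the "lm_head.weight" entry comes from the diff.
# The "weights" key and the "model.norm.weight" entry are assumptions.
ARCH_JSON = """
{
  "weights": [
    {"name": "model.norm.weight"},
    {
      "name": "lm_head.weight",
      "input_space": "running_residual",
      "is_embed": true,
      "optional": true
    }
  ]
}
"""


def resolve_weights(entries, available):
    """Return the weight names to load from a checkpoint.

    Entries flagged "optional" are skipped when absent; any other
    missing entry raises KeyError. (Hypothetical helper, not mergekit API.)
    """
    resolved = []
    for entry in entries:
        name = entry["name"]
        if name in available:
            resolved.append(name)
        elif entry.get("optional", False):
            continue  # optional weight not present in this checkpoint
        else:
            raise KeyError(f"required weight missing: {name}")
    return resolved


entries = json.loads(ARCH_JSON)["weights"]

# Checkpoint without a separate lm_head tensor: the optional entry is skipped.
without_head = resolve_weights(entries, {"model.norm.weight"})

# Checkpoint that does ship lm_head.weight: both entries resolve.
with_head = resolve_weights(entries, {"model.norm.weight", "lm_head.weight"})
```

Under this reading, a required weight that is missing still fails loudly, while an optional one is silently skipped.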
