update modular_modernbert -- add inputs_embeds param to ModernBertModel #35373

jxmorris12 · 2024-12-20T18:57:00Z

What does this PR do?

Hi! Congrats on the release of ModernBERT; it looks amazing. I'd eventually like to use ModernBERT to train a new Contextual Document Embeddings model.

One feature I need is the ability to pass the contextual and word embeddings together in the second stage. That requires an inputs_embeds kwarg so that hidden states can be passed in directly instead of token IDs. Typical BERT and other transformer implementations support this, but ModernBERT does not yet, so I added it. The change is only a few additional lines of code.
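To illustrate the calling pattern this enables, here is a minimal, dependency-free sketch. It is not the code in this PR: ToyEmbeddingModel, embed, encode, and the "exactly one of" check are hypothetical stand-ins, and prepending a single contextual vector is only an assumption about how a second stage might combine the two kinds of embeddings.

```python
from typing import List, Optional

Vector = List[float]


class ToyEmbeddingModel:
    """Hypothetical stand-in for an encoder; not the actual ModernBertModel code."""

    def __init__(self, vocab_size: int, hidden_size: int) -> None:
        # Deterministic toy embedding table: row i is [i, i, ..., i].
        self.table: List[Vector] = [[float(i)] * hidden_size for i in range(vocab_size)]

    def embed(self, input_ids: List[int]) -> List[Vector]:
        # Token-id lookup, the step that inputs_embeds lets a caller skip.
        return [list(self.table[i]) for i in input_ids]

    def encode(self, hidden_states: List[Vector]) -> List[Vector]:
        # Placeholder for the transformer layers: doubles every value.
        return [[2.0 * x for x in vec] for vec in hidden_states]

    def forward(
        self,
        input_ids: Optional[List[int]] = None,
        inputs_embeds: Optional[List[Vector]] = None,
    ) -> List[Vector]:
        # Assumed contract: the caller supplies exactly one of the two inputs.
        if (input_ids is None) == (inputs_embeds is None):
            raise ValueError("Specify exactly one of input_ids or inputs_embeds")
        if inputs_embeds is None:
            hidden_states = self.embed(input_ids)
        else:
            # Skip the lookup and use the caller's embeddings as-is.
            hidden_states = inputs_embeds
        return self.encode(hidden_states)


model = ToyEmbeddingModel(vocab_size=10, hidden_size=4)
token_embeds = model.embed([1, 2, 3])

# Hypothetical second-stage input: one contextual vector prepended
# to the word embeddings.
context_vec = [0.5] * 4

from_ids = model.forward(input_ids=[1, 2, 3])
from_embeds = model.forward(inputs_embeds=token_embeds)
with_context = model.forward(inputs_embeds=[context_vec] + token_embeds)
```

In this sketch, passing the looked-up embeddings through inputs_embeds gives the same result as passing input_ids, while the third call shows the case that token IDs alone cannot express.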

cc: @warner-benjamin @tomaarsen @orionw @staghado @bclavie @NohTow @ArthurZucker

jxmorris12 added 2 commits December 20, 2024 10:54

update modular_modernbert -- add inputs_embeds param to ModernBertModel

78d4ea7

Merge branch 'main' into patch-1

754d959
