DRAFT: Add `transformers` backend support #11330

ArthurZucker · 2024-12-19T09:55:40Z

Initial PR to add support for transformers as a backend! Following huggingface/transformers#35235, a bunch of models should already be supported, we are wramping up support for more models !!
Thanks @Isotr0py for the TP support!

Still have a few things to test / check

Co-authored-by: Isotr0py <[email protected]>

github-actions · 2024-12-19T09:55:53Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can do one of these:

Add ready label to the PR
Enable auto-merge.

🚀

ywang96 · 2024-12-19T11:02:06Z

Hello @ArthurZucker! This is very exciting!

I know this PR is still a draft, but could you provide some context on the scope of this effort? Is it to support any model on transformers?

ArthurZucker · 2024-12-19T16:05:12Z

Yep, overall this should support any model that is supported in transformers, were the cache is "simple" so for now, most of the decoder models and the encoder models for a single modularity!
For MultiModal models, we might need a little bit of extra work, but I thing LLAVA models should work out of the box!

We are refactor our models to make sure it's propagated to as many models as possible!

ArthurZucker · 2024-12-20T10:02:02Z

Might not have time to finish this week, will make it ready for next week 🎄
This should be minimal (no support fort Lora or at least I am not testing it ! This might. need to either call transformers's from pretrain, or replace modules similarly to TP)

Signed-off-by: Harry Mellor <[email protected]>

ArthurZucker and others added 2 commits December 19, 2024 10:33

Merge

0bb5519

Co-authored-by: Isotr0py <[email protected]>

Merge branch 'main' into transformers-backend

8e238f7

mergify bot added the ci/build label Dec 19, 2024

ArthurZucker added 3 commits December 19, 2024 11:01

revert some changes

6d8f1fd

changes are now merged with main of transformers

fb37617

revert more changes

2d0c128

DarkLight1337 requested review from mgoin and ywang96 December 19, 2024 10:05

Isotr0py mentioned this pull request Dec 19, 2024

[Roadmap] vLLM Roadmap Q4 2024 #9006

Open

40 tasks

simon-mo mentioned this pull request Jan 9, 2025

[Roadmap] vLLM Roadmap Q1 2025 #11862

Open

36 tasks

hmellor added 2 commits January 9, 2025 11:35

Merge remote-tracking branch 'upstream/main' into fix-history

31c16a1

Signed-off-by: Harry Mellor <[email protected]>

Undo whitespace changes

a49aa81

Signed-off-by: Harry Mellor <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DRAFT: Add `transformers` backend support #11330

DRAFT: Add `transformers` backend support #11330

ArthurZucker commented Dec 19, 2024 •

edited by github-actions bot

Loading

github-actions bot commented Dec 19, 2024

ywang96 commented Dec 19, 2024

ArthurZucker commented Dec 19, 2024

ArthurZucker commented Dec 20, 2024

DRAFT: Add transformers backend support #11330

Are you sure you want to change the base?

DRAFT: Add transformers backend support #11330

Conversation

ArthurZucker commented Dec 19, 2024 • edited by github-actions bot Loading

github-actions bot commented Dec 19, 2024

ywang96 commented Dec 19, 2024

ArthurZucker commented Dec 19, 2024

ArthurZucker commented Dec 20, 2024

DRAFT: Add `transformers` backend support #11330

DRAFT: Add `transformers` backend support #11330

ArthurZucker commented Dec 19, 2024 •

edited by github-actions bot

Loading