Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

vllm.config: add _set_missing_architectures #119

Conversation

dtrifiro
Copy link

@dtrifiro dtrifiro commented Aug 2, 2024

Some models, such as granite-7b-starter, do not include the architectures list in their config.json, causing a crash when trying to serve the model with vLLM.

This patch attempts to automatically guess the correct architecture when the field is not provided.

https://issues.redhat.com/browse/RHOAIENG-10722

@openshift-ci openshift-ci bot requested review from njhill and Xaenalt August 2, 2024 15:15
@openshift-ci openshift-ci bot added the approved label Aug 2, 2024
@dtrifiro dtrifiro changed the base branch from main to release August 2, 2024 15:15
@dtrifiro dtrifiro force-pushed the fix-serving-for-granite-starter branch from bdcb612 to 8654ff7 Compare August 2, 2024 15:17
Copy link

openshift-ci bot commented Aug 2, 2024

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: dtrifiro

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@dtrifiro dtrifiro changed the title vllm.config: add _set_missing_architectures() vllm.config: add _set_missing_architectures Aug 2, 2024
@dtrifiro
Copy link
Author

dtrifiro commented Aug 5, 2024

Closing as the the architecture field was added in the model's config.json

@dtrifiro dtrifiro closed this Aug 5, 2024
prarit pushed a commit to prarit/vllm that referenced this pull request Oct 18, 2024
* add weight padding for moe

* enable padding by default

* fix linter

* fix linter

* fix linter

* using envs.py

* fix linter
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant