forked from vllm-project/vllm
-
Notifications
You must be signed in to change notification settings - Fork 16
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[DO NOT MERGE] Ibm dev rebased #255
Open
prashantgupta24
wants to merge
43
commits into
main
Choose a base branch
from
ibm-dev-rebased
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
Changes from 41 commits
Commits
Show all changes
43 commits
Select commit
Hold shift + click to select a range
42d3a81
[Docs] Add Nebius as sponsors (#10371)
simon-mo a18c4fb
[Frontend] Add --version flag to CLI (#10369)
russellb 067dd2b
[Doc] Move PR template content to docs (#10159)
russellb 82be00f
[Docs] Misc updates to TPU installation instructions (#10165)
mikegre-google 2ca5912
[Frontend] Automatic detection of chat content format from AST (#9919)
DarkLight1337 ea480a1
[doc] add doc for the plugin system (#10372)
youkaichao 2e453bc
[misc][plugin] improve log messages (#10386)
youkaichao 51813a4
[BugFix] [Kernel] Fix GPU SEGV occuring in fused_moe kernel (#10385)
rasmith 6c45f56
[Misc] Update benchmark to support image_url file or http (#10287)
kakao-steve-ai d93bde9
[Misc] Medusa supports custom bias (#10361)
skylee-01 068451f
[Bugfix] Fix M-RoPE position calculation when chunked prefill is enab…
imkero d49dacb
[V1] Add code owners for V1 (#10397)
WoosukKwon 11d2bbc
[2/N][torch.compile] make compilation cfg part of vllm cfg (#10383)
youkaichao 2ea854d
[V1] Refactor model executable interface for all text-only language m…
ywang96 2652ea1
[CI/Build] Fix IDC hpu [Device not found] issue (#10384)
xuechendi 47325d5
[Bugfix][CPU] Fix CPU embedding runner with tensor parallel (#10394)
Isotr0py e288d72
[platforms] refactor cpu code (#10402)
youkaichao f6dc8be
[Hardware] [HPU]add `mark_step` for hpu (#10239)
jikunshang dcccc62
[Bugfix] Fix mrope_position_delta in non-last prefill chunk (#10403)
imkero d97d269
[Misc] Enhance offline_inference to support user-configurable paramet…
wchen61 d3fe99b
[Misc] Add uninitialized params tracking for `AutoWeightsLoader` (#10…
Isotr0py 1f165b6
[Bugfix] Ignore ray reinit error when current platform is ROCm or XPU…
HollowMan6 56b9c49
[4/N][torch.compile] clean up set_torch_compile_backend (#10401)
youkaichao 663fb57
[VLM] Report multi_modal_placeholders in output (#10407)
lk-chen 3c64fbb
[Model] Remove redundant softmax when using PoolingType.STEP (#10415)
Maybewuss 20ea0c3
[Model][LoRA]LoRA support added for glm-4v (#10418)
B-201 b1d6a6a
[Model] Remove transformers attention porting in VITs (#10414)
Isotr0py 5d6df78
[Doc] Update doc for LoRA support in GLM-4V (#10425)
B-201 b4c641f
[5/N][torch.compile] torch.jit.script --> torch.compile (#10406)
youkaichao c973945
[Doc] Add documentation for Structured Outputs (#9943)
ismael-dm e19e15a
Fix open_collective value in FUNDING.yml (#10426)
andrew a00d8c1
[Model][Bugfix] Support TP for PixtralHF ViT (#10405)
mgoin 472b9cf
[Hardware][XPU] AWQ/GPTQ support for xpu backend (#10107)
yma11 9147b76
[Kernel] Explicitly specify other value in tl.load calls (#9014)
angusYuhao ce83e18
[Kernel] Initial Machete W4A8 support + Refactors (#9855)
LucasWilkinson bf5bf24
Squash 9522
fialhocoelho 8b27172
Squash 6357
fialhocoelho e8fcdee
Squash 10235
fialhocoelho 1e7f586
Squash 10400
fialhocoelho 09f5b4a
Squash 10430
fialhocoelho 33c1553
pin mistral and install adapter from branch :rocket:
fialhocoelho f931d71
Dockerfile.ubi: remove extra line continuation
dtrifiro 87ebca7
Merge branch 'main' into ibm-dev-rebased
prashantgupta24 File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Validating CODEOWNERS rules …
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,2 +1,2 @@ | ||
github: [vllm-project] | ||
open_collective: [vllm] | ||
open_collective: vllm |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,5 @@ | ||
FILL IN THE PR DESCRIPTION HERE | ||
|
||
FIX #xxxx (*link existing issues this PR will resolve*) | ||
|
||
**BEFORE SUBMITTING, PLEASE READ https://docs.vllm.ai/en/latest/contributing/overview.html ** |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There's an issue in this Dockerfile at line 188 (not part of this PR) which prevents the dockerfile from building. You can either rebase on main or cherry pick 039e209 to fix the issue.