Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[do-not-merge] Ibm dev rebuild #254

Open
wants to merge 42 commits into
base: main
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
42 commits
Select commit Hold shift + click to select a range
c76ac49
[Docs] Add Nebius as sponsors (#10371)
simon-mo Nov 15, 2024
a067f85
[Frontend] Add --version flag to CLI (#10369)
russellb Nov 15, 2024
3e8d14d
[Doc] Move PR template content to docs (#10159)
russellb Nov 15, 2024
4f168f6
[Docs] Misc updates to TPU installation instructions (#10165)
mikegre-google Nov 15, 2024
32e46e0
[Frontend] Automatic detection of chat content format from AST (#9919)
DarkLight1337 Nov 16, 2024
755b853
[doc] add doc for the plugin system (#10372)
youkaichao Nov 16, 2024
2f427c2
[misc][plugin] improve log messages (#10386)
youkaichao Nov 16, 2024
1d75472
[BugFix] [Kernel] Fix GPU SEGV occuring in fused_moe kernel (#10385)
rasmith Nov 16, 2024
8b6725b
[Misc] Update benchmark to support image_url file or http (#10287)
kakao-steve-ai Nov 16, 2024
b98d89e
[Misc] Medusa supports custom bias (#10361)
skylee-01 Nov 16, 2024
361c29e
[Bugfix] Fix M-RoPE position calculation when chunked prefill is enab…
imkero Nov 16, 2024
661a34f
[V1] Add code owners for V1 (#10397)
WoosukKwon Nov 16, 2024
4fd9375
[2/N][torch.compile] make compilation cfg part of vllm cfg (#10383)
youkaichao Nov 17, 2024
643ecf7
[V1] Refactor model executable interface for all text-only language m…
ywang96 Nov 17, 2024
905d0f0
[CI/Build] Fix IDC hpu [Device not found] issue (#10384)
xuechendi Nov 17, 2024
cf349c4
[Bugfix][CPU] Fix CPU embedding runner with tensor parallel (#10394)
Isotr0py Nov 17, 2024
8d74b5a
[platforms] refactor cpu code (#10402)
youkaichao Nov 17, 2024
76aab90
[Hardware] [HPU]add `mark_step` for hpu (#10239)
jikunshang Nov 17, 2024
80d85c5
[Bugfix] Fix mrope_position_delta in non-last prefill chunk (#10403)
imkero Nov 17, 2024
d1557e6
[Misc] Enhance offline_inference to support user-configurable paramet…
wchen61 Nov 17, 2024
c4e4643
[Misc] Add uninitialized params tracking for `AutoWeightsLoader` (#10…
Isotr0py Nov 18, 2024
47826ca
[Bugfix] Ignore ray reinit error when current platform is ROCm or XPU…
HollowMan6 Nov 18, 2024
51bb12d
[4/N][torch.compile] clean up set_torch_compile_backend (#10401)
youkaichao Nov 18, 2024
c7dec92
[VLM] Report multi_modal_placeholders in output (#10407)
lk-chen Nov 18, 2024
01aae1c
[Model] Remove redundant softmax when using PoolingType.STEP (#10415)
Maybewuss Nov 18, 2024
5be4e52
[Model][LoRA]LoRA support added for glm-4v (#10418)
B-201 Nov 18, 2024
e7ebb66
[Model] Remove transformers attention porting in VITs (#10414)
Isotr0py Nov 18, 2024
4186be8
[Doc] Update doc for LoRA support in GLM-4V (#10425)
B-201 Nov 18, 2024
7851b45
[5/N][torch.compile] torch.jit.script --> torch.compile (#10406)
youkaichao Nov 18, 2024
31894a2
[Doc] Add documentation for Structured Outputs (#9943)
ismael-dm Nov 18, 2024
4f686d1
Fix open_collective value in FUNDING.yml (#10426)
andrew Nov 18, 2024
281cc4b
[Model][Bugfix] Support TP for PixtralHF ViT (#10405)
mgoin Nov 18, 2024
6b2d25e
[Hardware][XPU] AWQ/GPTQ support for xpu backend (#10107)
yma11 Nov 18, 2024
c2170a5
[Kernel] Explicitly specify other value in tl.load calls (#9014)
angusYuhao Nov 18, 2024
96d999f
[Kernel] Initial Machete W4A8 support + Refactors (#9855)
LucasWilkinson Nov 18, 2024
3444895
Merge branch 'main' of https://github.com/vllm-project/vllm into ibm-…
fialhocoelho Nov 18, 2024
e58ac91
Squash 9522
fialhocoelho Nov 18, 2024
9223ac3
Squash 6357
fialhocoelho Nov 18, 2024
b715908
Squash 10235
fialhocoelho Nov 18, 2024
6aa7422
Squash 10400
fialhocoelho Nov 18, 2024
565cfcc
Squash 10430
fialhocoelho Nov 18, 2024
049cf55
pin mistral and install adapter from branch :rocket:
fialhocoelho Nov 18, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion .buildkite/run-hpu-test.sh
Original file line number Diff line number Diff line change
Expand Up @@ -13,4 +13,4 @@ trap remove_docker_container EXIT
remove_docker_container

# Run the image and launch offline inference
docker run --runtime=habana --name=hpu-test --network=host -e VLLM_SKIP_WARMUP=true --entrypoint="" hpu-test-env python3 examples/offline_inference.py
docker run --runtime=habana --name=hpu-test --network=host -e HABANA_VISIBLE_DEVICES=all -e VLLM_SKIP_WARMUP=true --entrypoint="" hpu-test-env python3 examples/offline_inference.py
17 changes: 10 additions & 7 deletions .github/CODEOWNERS
Validating CODEOWNERS rules …
Original file line number Diff line number Diff line change
Expand Up @@ -3,13 +3,16 @@

# This lists cover the "core" components of vLLM that require careful review
/vllm/attention/backends/abstract.py @WoosukKwon @zhuohan123 @youkaichao @alexm-neuralmagic @comaniac @njhill
/vllm/core @WoosukKwon @zhuohan123 @youkaichao @alexm-neuralmagic @comaniac @njhill
/vllm/engine/llm_engine.py @WoosukKwon @zhuohan123 @youkaichao @alexm-neuralmagic @comaniac @njhill
/vllm/executor/executor_base.py @WoosukKwon @zhuohan123 @youkaichao @alexm-neuralmagic @comaniac @njhill
/vllm/worker/worker_base.py @WoosukKwon @zhuohan123 @youkaichao @alexm-neuralmagic @comaniac @njhill
/vllm/worker/worker.py @WoosukKwon @zhuohan123 @youkaichao @alexm-neuralmagic @comaniac @njhill
/vllm/model_executor/layers/sampler.py @WoosukKwon @zhuohan123 @youkaichao @alexm-neuralmagic @comaniac @njhill
CMakeLists.txt @tlrmchlsmth @WoosukKwon
/vllm/core @zhuohan123 @youkaichao @alexm-neuralmagic @comaniac @njhill
/vllm/engine/llm_engine.py @zhuohan123 @youkaichao @alexm-neuralmagic @comaniac @njhill
/vllm/executor/executor_base.py @zhuohan123 @youkaichao @alexm-neuralmagic @comaniac @njhill
/vllm/worker/worker_base.py @zhuohan123 @youkaichao @alexm-neuralmagic @comaniac @njhill
/vllm/worker/worker.py @zhuohan123 @youkaichao @alexm-neuralmagic @comaniac @njhill
/vllm/model_executor/layers/sampler.py @zhuohan123 @youkaichao @alexm-neuralmagic @comaniac @njhill
CMakeLists.txt @tlrmchlsmth

# vLLM V1
/vllm/v1 @WoosukKwon @robertgshaw2-neuralmagic @njhill @ywang96 @comaniac @alexm-neuralmagic

# Test ownership
/tests/async_engine @njhill @robertgshaw2-neuralmagic @simon-mo
Expand Down
2 changes: 1 addition & 1 deletion .github/FUNDING.yml
Original file line number Diff line number Diff line change
@@ -1,2 +1,2 @@
github: [vllm-project]
open_collective: [vllm]
open_collective: vllm
29 changes: 0 additions & 29 deletions .github/ISSUE_TEMPLATE/100-documentation.yml

This file was deleted.

47 changes: 0 additions & 47 deletions .github/ISSUE_TEMPLATE/200-installation.yml

This file was deleted.

45 changes: 0 additions & 45 deletions .github/ISSUE_TEMPLATE/300-usage.yml

This file was deleted.

107 changes: 0 additions & 107 deletions .github/ISSUE_TEMPLATE/400-bug report.yml

This file was deleted.

38 changes: 0 additions & 38 deletions .github/ISSUE_TEMPLATE/500-feature request.yml

This file was deleted.

40 changes: 0 additions & 40 deletions .github/ISSUE_TEMPLATE/600-new model.yml

This file was deleted.

Loading
Loading