Skip to content

Actions: vllm-project/vllm

PR Reminder Comment Bot

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
2,786 workflow runs
2,786 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[Bugfix] Validate lora adapters to avoid crashing server
PR Reminder Comment Bot #2786: Pull request #11727 opened by joerunde
January 3, 2025 22:24 14s
January 3, 2025 22:24 14s
[V1] Chore: cruft removal
PR Reminder Comment Bot #2785: Pull request #11724 opened by robertgshaw2-neuralmagic
January 3, 2025 20:00 13s
January 3, 2025 20:00 13s
[Bugfix] Remove block size constraint
PR Reminder Comment Bot #2784: Pull request #11723 opened by comaniac
January 3, 2025 19:38 10s
January 3, 2025 19:38 10s
[Ignore] Test multi-modal models extended
PR Reminder Comment Bot #2783: Pull request #11722 opened by mgoin
January 3, 2025 19:32 10s
January 3, 2025 19:32 10s
[V1] Improve TP>1 Error Handling + Stack Trace
PR Reminder Comment Bot #2782: Pull request #11721 opened by robertgshaw2-neuralmagic
January 3, 2025 18:15 14s
January 3, 2025 18:15 14s
[Misc]Add BNB quantization for Qwen2VL
PR Reminder Comment Bot #2781: Pull request #11719 opened by jeejeelee
January 3, 2025 16:36 11s
January 3, 2025 16:36 11s
Update bnb.md with example for OpenAI
PR Reminder Comment Bot #2780: Pull request #11718 opened by bet0x
January 3, 2025 16:28 14s
January 3, 2025 16:28 14s
[VLM] Merged multi-modal processors for LLaVA-NeXT-Video and LLaVA-OneVision
PR Reminder Comment Bot #2779: Pull request #11717 opened by DarkLight1337
January 3, 2025 10:47 11s
January 3, 2025 10:47 11s
lbx modify
PR Reminder Comment Bot #2778: Pull request #11716 opened by PZS-ModelCloud
January 3, 2025 10:41 12s
January 3, 2025 10:41 12s
[Model] LoRA with lm_head and embed_tokens fully trained - 4
PR Reminder Comment Bot #2777: Pull request #11714 opened by sergeykochetkov
January 3, 2025 09:11 10s
January 3, 2025 09:11 10s
[Frontend] Add segments to OpenAI Requests
PR Reminder Comment Bot #2776: Pull request #11713 opened by ruediste
January 3, 2025 08:25 14s
January 3, 2025 08:25 14s
[V1] Add RayExecutor support for AsyncLLM (api server)
PR Reminder Comment Bot #2775: Pull request #11712 opened by jikunshang
January 3, 2025 06:54 12s
January 3, 2025 06:54 12s
[perf-benchmark] Fix dependency for steps in benchmark pipeline
PR Reminder Comment Bot #2774: Pull request #11710 opened by khluu
January 3, 2025 06:19 15s
January 3, 2025 06:19 15s
[Bugfix] Fix ColumnParallelLinearWithLoRA slice
PR Reminder Comment Bot #2773: Pull request #11708 opened by zinccat
January 3, 2025 05:44 12s
January 3, 2025 05:44 12s
Update tool_calling.md
PR Reminder Comment Bot #2772: Pull request #11701 opened by Bryce1010
January 3, 2025 02:41 12s
January 3, 2025 02:41 12s
PD Disagg Performance enhance & benchmark tool update
PR Reminder Comment Bot #2771: Pull request #11699 opened by chenqianfzh
January 3, 2025 00:38 10s
January 3, 2025 00:38 10s
[Kernel][Triton][AMD] Change default block size for triton_scaled_mm to 128 for 3-5x speedup
PR Reminder Comment Bot #2770: Pull request #11698 opened by rasmith
January 3, 2025 00:31 13s
January 3, 2025 00:31 13s
Make detokenization optional in benchmark scripts
PR Reminder Comment Bot #2769: Pull request #11697 opened by JArnoldAMD
January 3, 2025 00:05 13s
January 3, 2025 00:05 13s
[Hardware][Apple] Native support for macOS Apple Silicon
PR Reminder Comment Bot #2768: Pull request #11696 opened by wallashss
January 2, 2025 21:27 12s
January 2, 2025 21:27 12s
Update requirements-tpu.txt to support python 3.9 and 3.11
PR Reminder Comment Bot #2767: Pull request #11695 opened by mgoin
January 2, 2025 20:51 10s
January 2, 2025 20:51 10s
Update default max_num_batch_tokens for chunked prefill
PR Reminder Comment Bot #2766: Pull request #11694 opened by SachinVarghese
January 2, 2025 20:31 11s
January 2, 2025 20:31 11s
[V1] Add BlockTable class
PR Reminder Comment Bot #2765: Pull request #11693 opened by WoosukKwon
January 2, 2025 17:05 12s
January 2, 2025 17:05 12s
[V1][Minor] Optimize token_ids_cpu copy
PR Reminder Comment Bot #2764: Pull request #11692 opened by WoosukKwon
January 2, 2025 16:43 17s
January 2, 2025 16:43 17s
[Frontend] Add split_special_tokens to the Tokenize Endpoint
PR Reminder Comment Bot #2763: Pull request #11691 opened by ruediste
January 2, 2025 16:20 11s
January 2, 2025 16:20 11s
[Kernel] Move attn_type to Attention.__init__()
PR Reminder Comment Bot #2762: Pull request #11690 opened by heheda12345
January 2, 2025 16:04 15s
January 2, 2025 16:04 15s