-
-
Notifications
You must be signed in to change notification settings - Fork 3
Pull requests: Snowflake-Labs/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
cherrypick two PRs from Snowflakedb main to swiftkv-refactor
#22
opened Nov 25, 2024 by
sfc-gh-yewang
Loading…
Add span metrics for model_forward, scheduler and sampler time
#21
opened Aug 1, 2024 by
sfc-gh-mkeralapura
Loading…
add the instrumentation needed to measure model forward time
#19
opened Jul 3, 2024 by
sfc-gh-reyazda
Loading…
Add the fp8-quantized GeMM for dense linear layers
#18
opened Jun 11, 2024 by
sfc-gh-reyazda
Loading…
ProTip!
Exclude everything labeled
bug
with -label:bug.