[WIP] Add support for vllm container image #351

rhatdan · 2024-10-21T21:48:33Z

No description provided.

Signed-off-by: Daniel J Walsh <[email protected]>

rhatdan · 2024-10-21T21:49:21Z

@ericcurtin First attempt to get vllm based container for running ramalama. Sadly blows up after pulling Model. Not sure what is going wrong.

ericcurtin · 2024-10-21T21:57:14Z

Maybe we should reach out to IBM and ask them to rebase their fork against upstream:

https://github.com/IBM/vllm

.gguf support only made vllm very recently. Could try using upstream also to see if it makes a difference.

It would also be interesting to query some IBM folk as to why they fork.

dtrifiro · 2024-11-13T15:59:20Z

IBM is currently using the version here: https://github.com/opendatahub-io/vllm, the main Dockerfile is Dockerfile.ubi.

This is built and pushed to quay.io/repository/opendatahub/vllm (fast tag)

danielezonca · 2024-11-13T17:33:22Z

@dtrifiro
Can you help with this PR?
The goal is to add vLLM (mainly CPU+cuda I would say, we can iterate and other accelerator later) to ramalama.
I don't think it should require special customization but maybe there are options that can help

@ericcurtin
That IBM fork is old and not used anymore

Add support for vllm container image

05fbb5b

Signed-off-by: Daniel J Walsh <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Add support for vllm container image #351

[WIP] Add support for vllm container image #351

rhatdan commented Oct 21, 2024

rhatdan commented Oct 21, 2024

ericcurtin commented Oct 21, 2024

dtrifiro commented Nov 13, 2024 •

edited

Loading

danielezonca commented Nov 13, 2024

[WIP] Add support for vllm container image #351

Are you sure you want to change the base?

[WIP] Add support for vllm container image #351

Conversation

rhatdan commented Oct 21, 2024

rhatdan commented Oct 21, 2024

ericcurtin commented Oct 21, 2024

dtrifiro commented Nov 13, 2024 • edited Loading

danielezonca commented Nov 13, 2024

dtrifiro commented Nov 13, 2024 •

edited

Loading