[Frontend] add build_server and make run_server async #107
Signed-off-by: Travis Johnson <[email protected]>
…ed. (vllm-project#6645) Signed-off-by: Thomas Parnell <[email protected]>
Co-authored-by: Michael Goin <[email protected]>
Skipping CI for Draft Pull Request.
[APPROVALNOTIFIER] This PR is APPROVED. This pull-request has been approved by: dtrifiro
f2d9ecb to 08f4f19
Update Dockerfile.rocm
This aims to split `run_server` into two: one part responsible for building the `uvicorn.Server` and another responsible for running it.

With this:
- `run_server()` is now a coroutine, which can be run using `asyncio.run()` or just awaited when running in an async context
- `build_server()` can now take optional kwargs, which are passed to uvicorn and allow for further customization of the server
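
A minimal sketch of the build/run split described above, not the code from this PR. To keep it runnable without uvicorn installed, `StubServer` is a hypothetical stand-in for `uvicorn.Server`; the `host`, `port` and `log_level` defaults, and `run_server()` returning the server, are assumptions made for illustration only.

```python
import asyncio
from dataclasses import dataclass, field
from typing import Any, Dict


@dataclass
class StubServer:
    """Hypothetical stand-in for uvicorn.Server (assumption, not the real class)."""

    options: Dict[str, Any] = field(default_factory=dict)
    served: bool = False

    async def serve(self) -> None:
        # A real server would stay here until shutdown; the stub yields once and returns.
        await asyncio.sleep(0)
        self.served = True


def build_server(host: str = "127.0.0.1", port: int = 8000, **server_kwargs: Any) -> StubServer:
    """Build, but do not start, the server. Extra kwargs override the defaults."""
    options: Dict[str, Any] = {"host": host, "port": port, "log_level": "info"}
    options.update(server_kwargs)
    return StubServer(options=options)


async def run_server(**server_kwargs: Any) -> StubServer:
    """Coroutine that builds the server and then runs it."""
    server = build_server(**server_kwargs)
    await server.serve()
    # Returning the server is an illustrative choice so callers can inspect it.
    return server


# Synchronous entry point: drive the coroutine with asyncio.run().
server = asyncio.run(run_server(port=9000, log_level="debug"))
```

From code that is already running inside an event loop, the same coroutine would be awaited directly (`await run_server(...)`) instead of being passed to `asyncio.run()`.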