Load Balancing #59

Open
HRashidi opened this issue Feb 26, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

@HRashidi
Contributor

Enhancement Description

Currently, Ray does not take differing server capabilities into account and distributes requests evenly across all servers. This shortcoming can overwhelm the slowest server and turn it into a bottleneck for the whole system.
This will happen if we use commercial GPUs.

Advantages

  • Fixing this issue would allow us to run the SDK on a Ray cluster whose servers have different hardware specs.

Possible Implementation
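
A minimal sketch of one possible approach, assuming each server can be assigned a relative capacity score (for example, from a benchmark): send each request to the server with the lowest outstanding load per unit of capacity. This is plain Python and does not use Ray's API; `Server`, `capacity`, `pick_server`, and `dispatch` are hypothetical names introduced only for illustration, and how such a policy would hook into Ray is left open.

```python
from dataclasses import dataclass


@dataclass
class Server:
    name: str            # hypothetical server identifier
    capacity: float      # assumed relative throughput score (higher = faster)
    in_flight: int = 0   # requests currently assigned to this server


def pick_server(servers):
    """Return the server whose load per unit of capacity would be lowest
    after taking one more request. Ties go to the earlier server in the list."""
    return min(servers, key=lambda s: (s.in_flight + 1) / s.capacity)


def dispatch(servers, n_requests):
    """Assign n_requests one at a time and return the per-server counts.

    Request completion is not modelled here; a real balancer would also
    decrement in_flight when a request finishes.
    """
    for _ in range(n_requests):
        pick_server(servers).in_flight += 1
    return {s.name: s.in_flight for s in servers}


if __name__ == "__main__":
    cluster = [Server("fast-gpu", capacity=3.0), Server("slow-gpu", capacity=1.0)]
    print(dispatch(cluster, 8))
```

With the capacities above (3:1), eight requests are split six to the fast server and two to the slow one, instead of the even four and four that overloads the slower machine.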

HRashidi added the enhancement label Feb 26, 2024