batching multi-client server #42
I also found this fast batching whisper backend: https://github.com/Vaibhavs10/insanely-fast-whisper
First, you need a multi-client server. It would handle each client the same way as the single-client one, but it needs a new subclass of ASRBase that connects through an API to a batching backend (see the sketch after this comment). Maybe the API could be shared with #34? Then you need the Whisper batching backend and its API -- I don't know which option is optimal: a subprocess, a network API, etc. From a code-policy point of view, make a new entry point for the multi-client server. I suggest a separate project that uses Whisper-Streaming as a module, because I would not be available to maintain it in this repo.
But more projects could use this feature, such as https://github.com/ufal/correctable-lecture-translator. Open-sourcing and collaboration are welcome!
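Not from the original thread: below is a minimal sketch of the ASRBase subclass idea described above, assuming the `transcribe(audio, init_prompt)` / `ts_words` / `segments_end_ts` interface from whisper_online.py and a hypothetical HTTP batching backend; the endpoint URL, request payload, and response shape are all made up for illustration.

```python
import io

import requests   # hypothetical transport; any HTTP client would do
import soundfile as sf

from whisper_online import ASRBase


class RemoteBatchingASR(ASRBase):
    """Sketch of an ASRBase subclass that forwards audio to a separate
    batching backend over HTTP instead of running a local model.
    Endpoint, payload, and response shape are assumptions."""

    def load_model(self, modelsize, cache_dir, model_dir):
        # No local model to load; just remember where the backend lives.
        self.endpoint = "http://localhost:9000/transcribe"  # placeholder

    def transcribe(self, audio, init_prompt=""):
        # Serialize the 16 kHz float32 chunk as WAV and POST it.
        buf = io.BytesIO()
        sf.write(buf, audio, 16000, format="WAV", subtype="PCM_16")
        buf.seek(0)
        resp = requests.post(
            self.endpoint,
            files={"audio": buf},
            # self.original_language is set by ASRBase.__init__ (None for "auto")
            data={"language": self.original_language, "init_prompt": init_prompt},
            timeout=60,
        )
        resp.raise_for_status()
        # Assume Whisper-style segments with per-word timestamps come back.
        return resp.json()["segments"]

    def ts_words(self, segments):
        # (start, end, word) triples, mirroring the faster-whisper subclass.
        return [(w["start"], w["end"], w["word"])
                for s in segments for w in s["words"]]

    def segments_end_ts(self, segments):
        return [s["end"] for s in segments]

    def use_vad(self):
        pass  # in this design, VAD would be the backend's responsibility
```

With something like this, each client connection on the server keeps its own OnlineASRProcessor, while all GPU work funnels through the one shared backend.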
Thank you. Using one GPU per client is a tall ask for me, as there could be up to a dozen clients active at a time in my use case. A few backends do support batched processing, e.g. https://github.com/Blair-Johnson/batch-whisper.
Could you share any references, or point me to the parts of the code where changes would be needed to implement this?
Or is it alright if I create a new issue for this?
Originally posted by @umaryasin33 in #10 (comment)
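For the backend side (again, not from the thread), one possible shape is a micro-batching loop: requests from many clients are queued, collected for a short window, and run through the model as one batch. `run_batch` below is a placeholder for whatever batched call the chosen backend exposes (batch-whisper, insanely-fast-whisper, etc.); the batch size and window are made-up numbers.

```python
import asyncio

MAX_BATCH = 12    # e.g. up to a dozen concurrent clients
WINDOW_S = 0.05   # how long to wait for more requests to join a batch


async def handle_client(queue, audio):
    """Called once per client request; resolves when its result is ready."""
    fut = asyncio.get_running_loop().create_future()
    await queue.put((audio, fut))
    return await fut


async def batch_worker(queue, run_batch):
    """Single consumer that drains the queue into GPU-sized batches."""
    loop = asyncio.get_running_loop()
    while True:
        # Block until at least one request arrives.
        audio, fut = await queue.get()
        batch, futs = [audio], [fut]
        deadline = loop.time() + WINDOW_S
        # Top up the batch until it is full or the window closes.
        while len(batch) < MAX_BATCH:
            timeout = deadline - loop.time()
            if timeout <= 0:
                break
            try:
                audio, fut = await asyncio.wait_for(queue.get(), timeout)
            except asyncio.TimeoutError:
                break
            batch.append(audio)
            futs.append(fut)
        # One GPU call for the whole batch; run it in a thread so the
        # event loop keeps accepting new clients meanwhile.
        results = await asyncio.to_thread(run_batch, batch)
        for f, r in zip(futs, results):
            f.set_result(r)
```

Whether this loop lives in a subprocess behind a pipe or behind a small HTTP/WebSocket server is exactly the open question from the comment above; the batching logic is the same either way.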