Replies: 1 comment
-
I just thought that maybe I could split the [7B] model file in half, where the first half would contain layers 0-15 and the second half would contain layers 16-31. I could then distribute those partial models across the nodes and forward the contents of the |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
So I was wondering, would it be possible to have the model split across multiple nodes communicating over a slow link?
My use case would be to split the model into <4 GB chunks ([32-bit] WebAssembly memory limit) distributed across peers connected via WebRTC.
BLOOM already caters to a use case similar to mine (i.e. Petals), except for that its requirements exceed the resources available in this case…
Beta Was this translation helpful? Give feedback.
All reactions