Proper way to use multiple GPUs with DeepSpeed when each GPU does not have enough VRAM? #1369
Unanswered
atipasvanund asked this question in Q&A
Replies: 1 comment
You should be able to use ZeRO-3 with offload. Also, make sure to adjust the optimizer settings, etc.
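For reference, here is a minimal sketch of what a ZeRO-3 + CPU offload DeepSpeed config might look like. This is an assumption-laden starting point, not the exact config used here; the `"auto"` values assume a trainer (such as Hugging Face `Trainer`) that fills them in, and you will likely need to tune batch size and offload settings for your hardware.

```json
{
  "bf16": { "enabled": true },
  "zero_optimization": {
    "stage": 3,
    "offload_optimizer": { "device": "cpu", "pin_memory": true },
    "offload_param": { "device": "cpu", "pin_memory": true },
    "overlap_comm": true,
    "contiguous_gradients": true
  },
  "train_micro_batch_size_per_gpu": 1,
  "gradient_accumulation_steps": "auto",
  "gradient_clipping": "auto"
}
```

Offloading the optimizer state (and optionally parameters) to CPU RAM trades step speed for a much lower per-GPU memory footprint, which is usually what makes full fine-tuning fit on consumer cards.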
What's the proper way to fine-tune on multiple GPUs with DeepSpeed when each GPU does not have enough VRAM? For example, if I want to use 4 x RTX 4090 to fine-tune a 7B model without LoRA or quantization, is this possible? I tried zero1.json all the way to zero3_bf16.json and got various errors, including CUDA out-of-memory. Can someone please help me?
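A rough back-of-the-envelope estimate shows why plain ZeRO-3 alone can still run out of memory on 24 GB cards. Assuming bf16 mixed-precision training with Adam (bf16 weights and gradients, plus fp32 master weights and two fp32 optimizer moments, i.e. roughly 16 bytes per parameter), the total state for a 7B model is far larger than the combined VRAM of four RTX 4090s would comfortably hold, even when sharded:

```python
# Rough memory estimate for full fine-tuning a 7B model with Adam.
# Assumption: bf16 mixed precision with fp32 master weights and fp32 Adam moments;
# activations and fragmentation add further overhead on top of this.
params = 7e9
bytes_per_param = 2 + 2 + 4 + 4 + 4  # bf16 weights + bf16 grads + fp32 master + 2x fp32 moments
total_gb = params * bytes_per_param / 1e9
per_gpu_gb = total_gb / 4  # ZeRO-3 shards model/optimizer state across 4 GPUs
print(f"total ~= {total_gb:.0f} GB, per GPU ~= {per_gpu_gb:.0f} GB")
# ~28 GB of state per GPU already exceeds a 4090's 24 GB before activations,
# which is why CPU offload (or a smaller per-GPU footprint) is needed.
```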