geomlyd/longvu_ckpt_fixes

longvu_ckpt_fixes

Fixes that may help with saving & resuming training with LongVU's codebase under deepspeed

To use DeepSpeed instead of FSDP (see this discussion on LongVU's repo), remove all FSDP-related arguments passed to train.py and pass --deepspeed $path_to_deepspeed_config instead. A suggested DeepSpeed config .json is included in this repo. On the assumption that LongVU "inherits" some of the resume-training issues from Video-LLaVA (see the related discussions in their GitHub issues), I have also modified some .py files, which are included here; diff them against the originals to see exactly what changed.
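As a rough illustration of the argument swap described above, here is a minimal Python sketch that drops FSDP-related flags from a launch command and appends `--deepspeed`. It assumes that every FSDP-related flag starts with `--fsdp` (as the Hugging Face `TrainingArguments` flags `--fsdp` and `--fsdp_config` do). The helper name `swap_fsdp_for_deepspeed`, the example flag values, and the path `ds_config.json` are hypothetical and not taken from LongVU's scripts, so check your own launch script before relying on it.

```python
from typing import List


def swap_fsdp_for_deepspeed(argv: List[str], ds_config: str) -> List[str]:
    """Drop every `--fsdp*` flag (and its values), then append `--deepspeed`.

    Assumption: all FSDP-related flags share the `--fsdp` prefix.
    """
    cleaned: List[str] = []
    skipping = False  # True while consuming values of a dropped FSDP flag
    for token in argv:
        if token.startswith("--"):
            # A new flag begins: drop it if FSDP-related, keep it otherwise.
            skipping = token.startswith("--fsdp")
            if skipping:
                continue
        elif skipping:
            continue  # value that belonged to a dropped FSDP flag
        cleaned.append(token)
    return cleaned + ["--deepspeed", ds_config]


# Hypothetical arguments, for illustration only.
original = [
    "--output_dir", "./checkpoints",
    "--fsdp", "full_shard auto_wrap",
    "--fsdp_config", "fsdp.json",
    "--bf16", "True",
]
print(swap_fsdp_for_deepspeed(original, "ds_config.json"))
```

The resulting list can be passed to train.py through whatever launcher you already use. Editing the launch script by hand works just as well; the sketch only makes explicit which arguments go and which one is added.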

Note: these fixes worked for me, but I take no responsibility whatsoever for whether they will work in your environment.
