-
Notifications
You must be signed in to change notification settings - Fork 17
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Hyper SDXL Lora support #127
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Review in progress
Still need to look at src/maxdiffusion/models/modeling_flax_pytorch_utils.py and src/maxdiffusion/loaders/lora_conversion_utils.py
|
||
@classmethod | ||
@validate_hf_hub_args | ||
def lora_state_dict(cls, pretrained_model_name_or_path: str, **kwargs): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit: I think calling this something like get_lora_state_dict
might be clearer about what the method is doing?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Great work! Approving, but I did have a question on the infinite loop avoidance logic.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm mostly just skimming this file since it seems to just be a lot of wrangling of PyTorch state_dict to Flax. Someone who works more on MaxDiffusion might have more thoughts
@@ -26,4 +26,6 @@ git+https://github.com/mlperf/logging.git | |||
opencv-python-headless==4.10.0.84 | |||
orbax-checkpoint>=0.5.20 | |||
tokenizers==0.20.0 | |||
huggingface_hub==0.24.7 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Repeated line?
This PR establishes LoRA support and includes Hyper-SD XL LoRA loading for inference.