I just combined one of my Stable Diffusion safetensors with the video safetensors from text-to-video-ms-1.7b #46
-
Give us a link to the model?
-
Absolutely, post it on r/StableDiffusion. They love such experiments.
-
goth_girl_miniskirt.mp4
-
I did some experimenting with this and wasn't able to find anything interesting. I created 3 different merges between text2video_pytorch_model.pth and Deliberate v2. (The .pth file is in CKPT format, not Safetensors.) I used Weighted Sum, Don't Copy Config, Bake In VAE: None.

For sampling: prompt "a raccoon walking in the forest, DSLR, detailed", negative prompt "blurry, drawing", 30 steps, CFG 12.5, 30 frames, seed 1, 256 x 256, video at 5 FPS, CRF 12.

The resulting models have different checksums from each other, and are all 5.25 GB like the original model.

https://user-images.githubusercontent.com/33569918/227389067-3eafdb52-312b-4b61-ad34-4802ca2004fd.mp4

https://user-images.githubusercontent.com/33569918/227389129-6b0e7370-1ff2-457e-acb3-bfac6dd99e6a.mp4

https://user-images.githubusercontent.com/33569918/227389153-870924c5-af6a-469b-ac23-f212d1ac931a.mp4

https://user-images.githubusercontent.com/33569918/227389180-74013056-f139-40d8-9ceb-ef2971e38c46.mp4

The videos are not technically identical (they have different checksums), but they are nearly visually identical. I used Photoshop to compare the first frame of each merged model's video against the first frame of the original model's video, using the "Difference" color blending mode:
As you can see, there is no visible difference between the frames. The difference images aren't pure black; there are scattered pixels with values like "010000" instead of "000000", but the frames are basically identical.
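For anyone who wants to check this numerically rather than eyeballing it in Photoshop, here's a rough sketch of the same "Difference" comparison in Python. The filenames are placeholders for first frames extracted from the videos:

```python
# Pixel-wise absolute difference between two frames, like Photoshop's
# "Difference" blend mode. Filenames below are hypothetical placeholders.
import numpy as np
from PIL import Image

# Cast to int16 so the subtraction can't underflow uint8 values.
a = np.asarray(Image.open("frame_original.png").convert("RGB"), dtype=np.int16)
b = np.asarray(Image.open("frame_merged.png").convert("RGB"), dtype=np.int16)

diff = np.abs(a - b)
print("max per-channel difference:", diff.max())    # e.g. 1 -> the "010000"-style pixels
print("mean per-channel difference:", diff.mean())
```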
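As for what the merge is actually doing: my understanding is that Weighted Sum is roughly a per-tensor linear interpolation between the two state dicts. Here's a minimal sketch of that idea, not the merger's actual code; the paths and the weight are placeholders:

```python
# Rough sketch of a weighted-sum checkpoint merge:
# merged = alpha * A + (1 - alpha) * B for every tensor the two share.
import torch

alpha = 0.7  # hypothetical merge weight

a = torch.load("text2video_pytorch_model.pth", map_location="cpu")
b = torch.load("deliberate_v2.ckpt", map_location="cpu")

# .ckpt files often nest the weights under a "state_dict" key.
a = a.get("state_dict", a)
b = b.get("state_dict", b)

merged = {}
for key, ta in a.items():
    tb = b.get(key)
    if tb is not None and tb.shape == ta.shape:
        merged[key] = alpha * ta + (1.0 - alpha) * tb
    else:
        merged[key] = ta  # keys missing from B pass through unchanged

torch.save({"state_dict": merged}, "merged.ckpt")
```

If the two checkpoints share few matching keys (the SD UNet and the text2video UNet have different architectures), most tensors would pass through untouched, which might explain why the merged models produce nearly identical videos to the original.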
-
It didn't break it, and I'm not entirely sure what I was expecting, but the output is... good?
(I know nothing about how safetensors are made, what they consist of, or what combining them does, and I just wanted to see what would happen. I did a 70/30 mix of the txt2vid model and my... octopus?... model, and... yeah.)
Just thought I'd throw it out there. Not quite sure where else to post something like this haha
(The auto1111 extension is working well. Thanks!)