This tutorial shows you how to teach Stable Diffusion a custom face using Google Cloud Platform (GCP).

Prerequisites

A GCP account with a spot A100 GPU quota (instance limit) of at least 1

Set up the VM

(You only need to do these steps once.)

Create a Compute Engine (GCP) instance:

Choose GPU -> NVIDIA A100 40GB machine family
Choose an OS image with PyTorch pre-installed (via the "Switch Image" button)
- e.g. Debian 10 based Deep Learning VM for PyTorch CPU/GPU with CUDA 11.3 M97
Choose a storage size of at least 100 GB
Choose Advanced options -> Management -> Availability policies -> Spot (if you want to save money)
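
The console choices above can also be expressed as one gcloud command. The following is a minimal sketch, not part of the original steps. The instance name, zone, and image family are assumed placeholder values; check the available Deep Learning VM families with `gcloud compute images list --project deeplearning-platform-release` before relying on the one shown. a2-highgpu-1g is the machine type that bundles a single A100 40GB GPU. The function prints the command for review and only executes it when DRY_RUN is empty.

```shell
# Sketch only: INSTANCE_NAME, ZONE and IMAGE_FAMILY are assumed placeholder
# values, not taken from the tutorial. Adjust them for your project.
INSTANCE_NAME="${INSTANCE_NAME:-dreambooth-vm}"
ZONE="${ZONE:-us-central1-a}"
IMAGE_FAMILY="${IMAGE_FAMILY:-pytorch-latest-gpu}"

create_vm() {
  # While DRY_RUN is non-empty the command is printed instead of executed.
  ${DRY_RUN:+echo} gcloud compute instances create "$INSTANCE_NAME" \
    --zone="$ZONE" \
    --machine-type=a2-highgpu-1g \
    --image-family="$IMAGE_FAMILY" \
    --image-project=deeplearning-platform-release \
    --boot-disk-size=100GB \
    --maintenance-policy=TERMINATE \
    --provisioning-model=SPOT
}

# Print the command for review; run `DRY_RUN= create_vm` to execute it.
DRY_RUN=1 create_vm
```

Drop the `--provisioning-model=SPOT` line if you do not want a spot instance.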

SSH into the instance

The instance asks "Would you like to install the Nvidia driver?" on the first login. Choose "y".

Clone the Dreambooth repo

git clone https://github.com/XavierXiao/Dreambooth-Stable-Diffusion.git
cd Dreambooth-Stable-Diffusion/

Create the conda environment

conda env create -f environment.yaml

Download the Stable Diffusion model

Go to https://huggingface.co/CompVis/stable-diffusion-v-1-4-original/tree/main and download the model (sd-v1-4.ckpt). Copy the model over to the VM.

scp sd-v1-4.ckpt <USER>@<VM_IP>:~/Dreambooth-Stable-Diffusion/

Note that you need to be logged in to HuggingFace to download the model (create a free account if you don't have one).
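
The checkpoint is a multi-gigabyte file, so it can be worth confirming that the copy on the VM matches the one you downloaded. The helper below is a small sketch of that check and is not part of the original steps. It assumes `sha256sum` from GNU coreutils is available (on macOS, `shasum -a 256` prints the same digest format), and the function name `verify_sha256` is made up for this example.

```shell
# verify_sha256 FILE EXPECTED_HEX
# Succeeds only if the SHA-256 digest of FILE equals EXPECTED_HEX.
verify_sha256() {
  # sha256sum prints "<digest>  <file>"; keep only the digest field.
  actual=$(sha256sum "$1" | cut -d ' ' -f 1)
  [ -n "$actual" ] && [ "$actual" = "$2" ]
}
```

To use it, run `sha256sum ~/Dreambooth-Stable-Diffusion/sd-v1-4.ckpt` on the VM, then run `verify_sha256 sd-v1-4.ckpt <digest printed on the VM>` on your own machine. A non-zero exit status means the two files differ and the copy should be repeated.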

Create directories for training images and regularization images

mkdir training-images
mkdir regularization-images

You're done! Log out; you can now start training the model with your own images.

Train the model

Crop your training images (5-8 pictures) to 512x512-pixel squares, with the face in the center
Copy your training images to the VM ($HOST stands for your SSH destination, e.g. <USER>@<VM_IP>)

scp training-images/* $HOST:~/Dreambooth-Stable-Diffusion/training-images/

Make sure you don't show teeth in the training images. Stable Diffusion is not very good at drawing teeth, and having teeth in the training images will make it draw teeth in the final images.
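
A quick local check before the copy step can catch a wrong number of pictures. The function below is a minimal sketch with a made-up name, `check_image_count`. It only counts the files in a directory against the 5-8 range given above. It does not inspect image dimensions or content.

```shell
# check_image_count DIR
# Succeeds when DIR holds between 5 and 8 regular files (the range the
# tutorial recommends). Subdirectories are not searched.
check_image_count() {
  n=$(find "$1" -maxdepth 1 -type f | wc -l)
  n=$((n))  # strip the whitespace padding that some wc implementations add
  echo "$1: $n file(s)"
  [ "$n" -ge 5 ] && [ "$n" -le 8 ]
}
```

For example, `check_image_count training-images && scp training-images/* $HOST:~/Dreambooth-Stable-Diffusion/training-images/` copies the pictures only when the count is in range.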

SSH into the instance
Activate the Conda environment

cd Dreambooth-Stable-Diffusion/
conda activate ldm

Generate 10 regularization images (replace <subject> with a class word describing your subject, e.g. person)

python scripts/stable_txt2img.py \
  --prompt "photo of <subject>" \
  --n_samples 10 \
  --skip_grid \
  --seed $RANDOM \
  --outdir regularization-images \
  --ckpt sd-v1-4.ckpt

Train the model

python main.py \
  --base configs/stable-diffusion/v1-finetune_unfrozen.yaml \
  -t \
  --actual_resume sd-v1-4.ckpt \
  -n a_m \
  --gpus 0, \
  --seed $RANDOM \
  --data_root training-images \
  --reg_data_root regularization-images/samples \
  --class_word '<subject>'

Run inference

python scripts/stable_txt2img.py \
  --n_samples 1 \
  --n_iter 1 \
  --ckpt logs/training-images<TIMESTAMP>/checkpoints/last.ckpt \
  --skip_grid \
  --seed $RANDOM \
  --prompt 'sks <subject>'
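
The `<TIMESTAMP>` part of the checkpoint path differs for every training run. Instead of typing it by hand, the newest checkpoint can be looked up by modification time. This is a small sketch that assumes only the `logs/training-images<TIMESTAMP>/checkpoints/last.ckpt` layout shown in the command above. The function name `latest_checkpoint` is made up for this example.

```shell
# latest_checkpoint [LOG_DIR]
# Prints the most recently modified last.ckpt found under
# LOG_DIR/training-images*/checkpoints/ (LOG_DIR defaults to "logs").
# Prints nothing when no checkpoint exists.
latest_checkpoint() {
  ls -t "${1:-logs}"/training-images*/checkpoints/last.ckpt 2>/dev/null | head -n 1
}
```

Run from the Dreambooth-Stable-Diffusion directory, `--ckpt "$(latest_checkpoint)"` can then replace the hand-written path in the inference command.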

Copy the generated images back to your own machine

scp $HOST:~/Dreambooth-Stable-Diffusion/outputs/txt2img-samples/samples/* ./

Download the .ckpt for local use

scp $HOST:~/Dreambooth-Stable-Diffusion/logs/training-images<TIMESTAMP>/checkpoints/last.ckpt ./
