SDXL model

SDXL is an ensemble-of-experts pipeline for latent diffusion. In the first step, the base model generates (noisy) latents, which are then further processed by a refinement model (available here: https://huggingface.co/stabilityai/stable-diffusion-xl-refiner-1.0/) specialized for the final denoising steps. Note that the base model can also be used as a standalone module.
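To make the division of labor concrete, the following is a minimal sketch of how a single denoising schedule might be split between the two experts. The function name `split_denoising_steps`, the 40-step schedule, and the 0.8 handoff fraction are illustrative assumptions, not values taken from this README or from the SDXL code.

```python
from typing import List, Tuple


def split_denoising_steps(
    num_steps: int, handoff_fraction: float
) -> Tuple[List[int], List[int]]:
    """Split one denoising schedule between a base and a refiner model.

    Hypothetical helper: the base model handles the first part of the
    schedule (high noise), the refiner handles the remaining final steps.
    """
    if num_steps <= 0:
        raise ValueError("num_steps must be positive")
    if not 0.0 <= handoff_fraction <= 1.0:
        raise ValueError("handoff_fraction must lie in [0, 1]")

    # Index of the first step that is handed over to the refiner.
    cutoff = round(num_steps * handoff_fraction)
    steps = list(range(num_steps))
    return steps[:cutoff], steps[cutoff:]


# Example values are assumptions chosen for illustration only.
base_steps, refiner_steps = split_denoising_steps(40, 0.8)
print(len(base_steps), "steps for the base model")
print(len(refiner_steps), "steps for the refiner")
```

Because both experts share one schedule, the total number of denoising steps stays the same whether or not the refiner is used. Setting `handoff_fraction` to 1.0 corresponds to running the base model as a standalone module.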

Alternatively, we can use a two-stage pipeline as follows: First, the base model is used to generate latents of the desired output size. In the second step, we use a specialized high-resolution model and apply a technique called SDEdit (https://arxiv.org/abs/2108.01073, also known as "img2img") to the latents generated in the first step, using the same prompt. This approach is slightly slower than the first one because it requires more function evaluations.
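The extra cost can be estimated with simple arithmetic. The sketch below compares the number of denoiser calls for the two approaches. It assumes that the SDEdit stage re-noises the latents to an intermediate level given by a `strength` value in [0, 1] and then runs roughly `strength * refine_steps` denoising steps. The function names, step counts, and strength value are hypothetical and chosen only for illustration.

```python
def ensemble_evaluations(num_steps: int) -> int:
    """Denoiser calls when base and refiner share one schedule."""
    return num_steps


def two_stage_evaluations(
    base_steps: int, refine_steps: int, strength: float
) -> int:
    """Denoiser calls when a full base run is followed by SDEdit.

    Assumption: the second stage starts from partially re-noised latents
    and therefore runs only a `strength` fraction of its own schedule.
    """
    if not 0.0 <= strength <= 1.0:
        raise ValueError("strength must lie in [0, 1]")
    sdedit_steps = min(int(refine_steps * strength), refine_steps)
    return base_steps + sdedit_steps


# Illustrative numbers only; none of them come from this README.
shared = ensemble_evaluations(40)
staged = two_stage_evaluations(base_steps=40, refine_steps=40, strength=0.25)
print("ensemble of experts:", shared, "evaluations")
print("two-stage with SDEdit:", staged, "evaluations")
```

Under these assumptions the two-stage pipeline always costs at least as much as the full base run, and the overhead grows with `strength`.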

Source code is available at https://github.com/Stability-AI/generative-models.

Model Description

Developed by: Stability AI
Model type: Diffusion-based text-to-image generative model
License: CreativeML Open RAIL++-M License
Model Description: This is a model that can be used to generate and modify images based on text prompts. It is a Latent Diffusion Model that uses two fixed, pretrained text encoders (OpenCLIP-ViT/G and CLIP-ViT/L).
Resources for more information: Check out our GitHub Repository (https://github.com/Stability-AI/generative-models) and the SDXL report on arXiv (https://arxiv.org/abs/2307.01952).
