diff --git a/index.html b/index.html index c379c40..8ef3210 100644 --- a/index.html +++ b/index.html @@ -246,7 +246,7 @@

Image Fidelity and Text-to-Image Alignment


Compositional Robustness

Image Composition is a known limitation of Diffusion Models. Through evaluation of our new benchmark MCC-250 we show that multimodal prompting leads to more compositional robustness as judged by humans.

- method
+ method

Multilinguality

method

Evaluating the alignment of prompt embeddings as well as generated images across multiple languagese we show that well aligned embeddings enable the transfer of multiligualism to downstream tasks even for task-specific monolingual training data.