From e647e082a75d4b4e273b4e55bf79657c55c4ef5e Mon Sep 17 00:00:00 2001 From: HannahBenita <77296142+HannahBenita@users.noreply.github.com> Date: Fri, 1 Dec 2023 12:54:25 +0100 Subject: [PATCH] Update index.html --- index.html | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/index.html b/index.html index c379c40..8ef3210 100644 --- a/index.html +++ b/index.html @@ -246,7 +246,7 @@

Image Fidelity and Text-to-Image Alignment


Compositional Robustness

Image Composition is a known limitation of Diffusion Models. Through evaluation of our new benchmark MCC-250 we show that multimodal prompting leads to more compositional robustness as judged by humans.

- method
+ method

Multilinguality

method

Evaluating the alignment of prompt embeddings as well as generated images across multiple languagese we show that well aligned embeddings enable the transfer of multiligualism to downstream tasks even for task-specific monolingual training data.