Hello,
I made some fairly large changes to the cross attention processor in order to support regional prompts. However, after this change I observe a lot of recompilations (about 1 out of 10 renders). I investigated, and it is linked to a constant change in the number of inputs, caused by the varying number of "regional prompts".
If I do not JIT-compile the UNet I don't have this issue, but then I lose the good speedups.
This is very annoying. Do you have any idea how to prevent this?
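For context, here is a rough sketch of the kind of padding that would keep the traced input signature constant (MAX_REGIONS and the tensor layout are placeholders, not my actual code):

```python
import torch

MAX_REGIONS = 8  # assumed upper bound on the number of regional prompts

def pad_regional_embeddings(region_embeds, max_regions=MAX_REGIONS):
    """Stack a variable-length list of (seq_len, dim) prompt embeddings into a
    fixed (max_regions, seq_len, dim) tensor plus a validity mask, so the
    traced graph always sees the same number and shape of inputs."""
    if len(region_embeds) > max_regions:
        raise ValueError(f"got {len(region_embeds)} regions, max is {max_regions}")
    seq_len, dim = region_embeds[0].shape
    padded = region_embeds[0].new_zeros(max_regions, seq_len, dim)
    mask = torch.zeros(max_regions, dtype=torch.bool, device=padded.device)
    for i, emb in enumerate(region_embeds):
        padded[i] = emb
        mask[i] = True
    return padded, mask
```

It avoids retracing, but it wastes compute on the padded slots, so I am not sure it is the right approach.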
Alternatively, I was wondering if it is possible to apply lazy_trace only to the self attention layers and not trace the cross attention?
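Something like the following is the kind of thing I have in mind; it is only a sketch, and I am not sure lazy_trace can be applied per module like this (the import path is also a guess):

```python
# Wrap only the self-attention modules (attn1 in diffusers' BasicTransformerBlock)
# with lazy_trace, and leave cross-attention (attn2, which receives the variable
# regional-prompt inputs) eager.
from sfast.jit.trace_helper import lazy_trace  # assumed import path

def trace_self_attention_only(unet):
    for name, module in unet.named_modules():
        if name.endswith("attn1"):  # attn1 = self-attention, attn2 = cross-attention
            # Assumption: lazy_trace can wrap a bound forward directly.
            module.forward = lazy_trace(module.forward)
    return unet
```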