You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In the file flops_computation.py, I find that the FLOPs calculation of attn_output_layer_norm (in Line 77) does not include self.h multiplication factor. This is different from the FLOPs calculation of output_layer_norm (in Line 85) which includes self.h multiplication factor. It seems that the code logic of these two lines is exactly the same, so that self.h multiplication factor is missed in the FLOPs calculation of attn_output_layer_norm.
The text was updated successfully, but these errors were encountered:
In the file flops_computation.py, I find that the FLOPs calculation of attn_output_layer_norm (in Line 77) does not include self.h multiplication factor. This is different from the FLOPs calculation of output_layer_norm (in Line 85) which includes self.h multiplication factor. It seems that the code logic of these two lines is exactly the same, so that self.h multiplication factor is missed in the FLOPs calculation of attn_output_layer_norm.
The text was updated successfully, but these errors were encountered: