Skip to content

checking if attention mask present for ignoring pad tokens in ffn (#1… #683

checking if attention mask present for ignoring pad tokens in ffn (#1…

checking if attention mask present for ignoring pad tokens in ffn (#1… #683

Annotations

1 warning

docker-build (2.3.0_cu121_flash2, mosaicml/pytorch:2.3.0_cu121-python3.11-ubuntu20.04, [gpu-flash2])

succeeded May 9, 2024 in 3m 13s