Skip to content

gradient accumulation tests, embeddings w pad_token fix, smaller models #93

gradient accumulation tests, embeddings w pad_token fix, smaller models

gradient accumulation tests, embeddings w pad_token fix, smaller models #93

test-axolotl-multigpu (121, 12.1.1, 3.11, 2.3.1, 2)

succeeded Nov 14, 2024 in 27m 47s