diff --git a/README.md b/README.md
index 3effcfc6..9979d4fa 100644
--- a/README.md
+++ b/README.md
@@ -37,7 +37,7 @@ To see the exact usage for each script, run the script without any arguments.
 
 Throughput numbers from these scripts with various different configuration settings are reported below, measured on a cluster with NVIDIA H100 GPUs.
 
-| Model size | Model arch | Context length | Precision | Throughput[^1] | Training script | Commandline overrides                                    |
+| Model size | Model arch.   | Context length | Precision | Throughput[^1] | Training script | Commandline overrides                                    |
 | :--------: | :--------: | :------------: | :-------: | -----------: | :----------- | :-------- |
 | **1B** | OLMo-1124 | 4096 | BF16 | 55,000 TPS | `OLMo-1B.py` | |
 | | | 4096 | BF16/FP8[^2] | 65,000 TPS | `OLMo-1B.py` | `--model.float8_config.enabled=true` |