Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
epwalsh authored Nov 17, 2024
1 parent 57b38ad commit 4f2c8ef
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -37,7 +37,7 @@ To see the exact usage for each script, run the script without any arguments.

Throughput numbers from these scripts with various different configuration settings are reported below, measured on a cluster with NVIDIA H100 GPUs.

| Model size | Model arch | Context length | Precision | Throughput[^1] | Training script | Commandline overrides                                    |
| Model size | Model arch.   | Context length | Precision | Throughput[^1] | Training script | Commandline overrides                                    |
| :--------: | :--------: | :------------: | :-------: | -----------: | :----------- | :-------- |
| **1B** | OLMo-1124 | 4096 | BF16 | 55,000 TPS | `OLMo-1B.py` | |
| | | 4096 | BF16/FP8[^2] | 65,000 TPS | `OLMo-1B.py` | `--model.float8_config.enabled=true` |
Expand Down

0 comments on commit 4f2c8ef

Please sign in to comment.