-
Notifications
You must be signed in to change notification settings - Fork 60
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
**Description** Cherry-pick bug fixes from v0.10.0 to main. **Major Revisions** * Benchmarks: Microbenchmark - Support different hipblasLt data types in dist_inference #590 * Benchmarks: Microbenchmark - Support in-place for NCCL/RCCL benchmark #591 * Bug Fix - Fix NUMA Domains Swap Issue in NDv4 Topology File #592 * Benchmarks: Microbenchmark - Add data type option for NCCL and RCCL tests #595 * Benchmarks: Bug Fix - Make metrics of dist-inference-cpp aligned with PyTorch version #596 * CI/CD - Add ndv5 topo file #597 * Benchmarks: Microbenchmark - Improve AMD GPU P2P performance with fine-grained GPU memory #593 * Benchmarks: Build Pipeline - fix nccl and nccl test version to 2.18.3 to resolve hang issue in cuda12.2 docker #599 * Dockerfile - Bug fix for rocm docker build and deploy #598 * Benchmarks: Microbenchmark - Adapt to hipblasLt data type changes #603 * Benchmarks: Micro benchmarks - Update hipblaslt metric unit to tflops #604 * Monitor - Upgrade pyrsmi to amdsmi python library. #601 * Benchmarks: Micro benchmarks - add fp8 and initialization for hipblaslt benchmark #605 * Dockerfile - Add rocm6.0 dockerfile #602 * Bug Fix - Bug fix for latest megatron-lm benchmark #600 * Docs - Upgrade version and release note #606 Co-authored-by: Ziyue Yang <[email protected]> Co-authored-by: Yang Wang <[email protected]> Co-authored-by: Yuting Jiang <[email protected]> Co-authored-by: guoshzhao <[email protected]>
- Loading branch information
1 parent
2c2096e
commit 2c88db9
Showing
56 changed files
with
919 additions
and
240 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,34 +1,34 @@ | ||
<system version="1"> | ||
<cpu numaid="0" affinity="0000ffff,0000ffff" arch="x86_64" vendor="AuthenticAMD" familyid="23" modelid="49"> | ||
<pci busid="ffff:ff:01.0" class="0x060400" link_speed="16 GT/s" link_width="16"> | ||
<pci busid="0001:00:00.0" class="0x030200" link_speed="16 GT/s" link_width="16"/> | ||
<pci busid="0101:00:00.0" class="0x020700" link_speed="16 GT/s" link_width="16"/> | ||
<pci busid="0002:00:00.0" class="0x030200" link_speed="16 GT/s" link_width="16"/> | ||
<pci busid="0102:00:00.0" class="0x020700" link_speed="16 GT/s" link_width="16"/> | ||
</pci> | ||
</cpu> | ||
<cpu numaid="1" affinity="0000ffff,0000ffff" arch="x86_64" vendor="AuthenticAMD" familyid="23" modelid="49"> | ||
<pci busid="ffff:ff:02.0" class="0x060400" link_speed="16 GT/s" link_width="16"> | ||
<pci busid="0003:00:00.0" class="0x030200" link_speed="16 GT/s" link_width="16"/> | ||
<pci busid="0103:00:00.0" class="0x020700" link_speed="16 GT/s" link_width="16"/> | ||
<pci busid="0004:00:00.0" class="0x030200" link_speed="16 GT/s" link_width="16"/> | ||
<pci busid="0104:00:00.0" class="0x020700" link_speed="16 GT/s" link_width="16"/> | ||
</pci> | ||
</cpu> | ||
<cpu numaid="2" affinity="0000ffff,0000ffff" arch="x86_64" vendor="AuthenticAMD" familyid="23" modelid="49"> | ||
<pci busid="ffff:ff:03.0" class="0x060400" link_speed="16 GT/s" link_width="16"> | ||
<pci busid="000b:00:00.0" class="0x030200" link_speed="16 GT/s" link_width="16"/> | ||
<pci busid="0105:00:00.0" class="0x020700" link_speed="16 GT/s" link_width="16"/> | ||
<pci busid="000c:00:00.0" class="0x030200" link_speed="16 GT/s" link_width="16"/> | ||
<pci busid="0106:00:00.0" class="0x020700" link_speed="16 GT/s" link_width="16"/> | ||
<cpu numaid="1" affinity="0000ffff,0000ffff" arch="x86_64" vendor="AuthenticAMD" familyid="23" modelid="49"> | ||
<pci busid="ffff:ff:02.0" class="0x060400" link_speed="16 GT/s" link_width="16"> | ||
<pci busid="0001:00:00.0" class="0x030200" link_speed="16 GT/s" link_width="16"/> | ||
<pci busid="0101:00:00.0" class="0x020700" link_speed="16 GT/s" link_width="16"/> | ||
<pci busid="0002:00:00.0" class="0x030200" link_speed="16 GT/s" link_width="16"/> | ||
<pci busid="0102:00:00.0" class="0x020700" link_speed="16 GT/s" link_width="16"/> | ||
</pci> | ||
</cpu> | ||
<cpu numaid="3" affinity="0000ffff,0000ffff" arch="x86_64" vendor="AuthenticAMD" familyid="23" modelid="49"> | ||
<pci busid="ffff:ff:04.0" class="0x060400" link_speed="16 GT/s" link_width="16"> | ||
<cpu numaid="2" affinity="0000ffff,0000ffff" arch="x86_64" vendor="AuthenticAMD" familyid="23" modelid="49"> | ||
<pci busid="ffff:ff:03.0" class="0x060400" link_speed="16 GT/s" link_width="16"> | ||
<pci busid="000d:00:00.0" class="0x030200" link_speed="16 GT/s" link_width="16"/> | ||
<pci busid="0107:00:00.0" class="0x020700" link_speed="16 GT/s" link_width="16"/> | ||
<pci busid="000e:00:00.0" class="0x030200" link_speed="16 GT/s" link_width="16"/> | ||
<pci busid="0108:00:00.0" class="0x020700" link_speed="16 GT/s" link_width="16"/> | ||
</pci> | ||
</cpu> | ||
<cpu numaid="3" affinity="0000ffff,0000ffff" arch="x86_64" vendor="AuthenticAMD" familyid="23" modelid="49"> | ||
<pci busid="ffff:ff:04.0" class="0x060400" link_speed="16 GT/s" link_width="16"> | ||
<pci busid="000b:00:00.0" class="0x030200" link_speed="16 GT/s" link_width="16"/> | ||
<pci busid="0105:00:00.0" class="0x020700" link_speed="16 GT/s" link_width="16"/> | ||
<pci busid="000c:00:00.0" class="0x030200" link_speed="16 GT/s" link_width="16"/> | ||
<pci busid="0106:00:00.0" class="0x020700" link_speed="16 GT/s" link_width="16"/> | ||
</pci> | ||
</cpu> | ||
</system> |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,38 @@ | ||
<system version="1"> | ||
<cpu numaid="0" affinity="ffffffff,ffff0000,00000000" arch="x86_64" vendor="GenuineIntel" familyid="6" modelid="143"> | ||
<pci busid="ffff:ff:01.0" class="0x060400" link_speed="32.0 GT/s PCIe" link_width="16" vendor="0x0000" device="0x0000" subsystem_vendor="0x0000" subsystem_device="0x0000"> | ||
<pci busid="0001:00:00.0" class="0x030200" link_speed="32.0 GT/s PCIe" link_width="16"/> | ||
<pci busid="0101:00:00.0" class="0x020700" link_speed="32.0 GT/s PCIe" link_width="16"/> | ||
</pci> | ||
<pci busid="ffff:ff:02.0" class="0x060400" link_speed="32.0 GT/s PCIe" link_width="16" vendor="0x0000" device="0x0000" subsystem_vendor="0x0000" subsystem_device="0x0000"> | ||
<pci busid="0002:00:00.0" class="0x030200" link_speed="32.0 GT/s PCIe" link_width="16"/> | ||
<pci busid="0102:00:00.0" class="0x020700" link_speed="32.0 GT/s PCIe" link_width="16"/> | ||
</pci> | ||
<pci busid="ffff:ff:03.0" class="0x060400" link_speed="32.0 GT/s PCIe" link_width="16" vendor="0x0000" device="0x0000" subsystem_vendor="0x0000" subsystem_device="0x0000"> | ||
<pci busid="0003:00:00.0" class="0x030200" link_speed="32.0 GT/s PCIe" link_width="16"/> | ||
<pci busid="0103:00:00.0" class="0x020700" link_speed="32.0 GT/s PCIe" link_width="16"/> | ||
</pci> | ||
<pci busid="ffff:ff:04.0" class="0x060400" link_speed="32.0 GT/s PCIe" link_width="16" vendor="0x0000" device="0x0000" subsystem_vendor="0x0000" subsystem_device="0x0000"> | ||
<pci busid="0008:00:00.0" class="0x030200" link_speed="32.0 GT/s PCIe" link_width="16"/> | ||
<pci busid="0104:00:00.0" class="0x020700" link_speed="32.0 GT/s PCIe" link_width="16"/> | ||
</pci> | ||
</cpu> | ||
<cpu numaid="1" affinity="00000000,0000ffff,ffffffff" arch="x86_64" vendor="GenuineIntel" familyid="6" modelid="143"> | ||
<pci busid="ffff:ff:05.0" class="0x060400" link_speed="32.0 GT/s PCIe" link_width="16" vendor="0x0000" device="0x0000" subsystem_vendor="0x0000" subsystem_device="0x0000"> | ||
<pci busid="0009:00:00.0" class="0x030200" link_speed="32.0 GT/s PCIe" link_width="16"/> | ||
<pci busid="0105:00:00.0" class="0x020700" link_speed="32.0 GT/s PCIe" link_width="16"/> | ||
</pci> | ||
<pci busid="ffff:ff:06.0" class="0x060400" link_speed="32.0 GT/s PCIe" link_width="16" vendor="0x0000" device="0x0000" subsystem_vendor="0x0000" subsystem_device="0x0000"> | ||
<pci busid="000a:00:00.0" class="0x030200" link_speed="32.0 GT/s PCIe" link_width="16"/> | ||
<pci busid="0106:00:00.0" class="0x020700" link_speed="32.0 GT/s PCIe" link_width="16"/> | ||
</pci> | ||
<pci busid="ffff:ff:07.0" class="0x060400" link_speed="32.0 GT/s PCIe" link_width="16" vendor="0x0000" device="0x0000" subsystem_vendor="0x0000" subsystem_device="0x0000"> | ||
<pci busid="000b:00:00.0" class="0x030200" link_speed="32.0 GT/s PCIe" link_width="16"/> | ||
<pci busid="0107:00:00.0" class="0x020700" link_speed="32.0 GT/s PCIe" link_width="16"/> | ||
</pci> | ||
<pci busid="ffff:ff:08.0" class="0x060400" link_speed="32.0 GT/s PCIe" link_width="16" vendor="0x0000" device="0x0000" subsystem_vendor="0x0000" subsystem_device="0x0000"> | ||
<pci busid="000c:00:00.0" class="0x030200" link_speed="32.0 GT/s PCIe" link_width="16"/> | ||
<pci busid="0108:00:00.0" class="0x020700" link_speed="32.0 GT/s PCIe" link_width="16"/> | ||
</pci> | ||
</cpu> | ||
</system> |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.