AWS OFI NCCL v1.7.2
This release is intended only for use on AWS P* instances. A general release that supports other Libfabric networks will be made in the near future. This release includes the following changes:
- Fix compilation against CUDA versions prior to 11.3.
- Fix allocation of free lists to avoid accidentally registering user data, which can cause corruption on fork() with older Linux kernels.
- Fix memory leak with registered bounce buffers.
- Fix improper usage of optlen in call to fi_getopt().
- Numerous memory cleanup fixes.
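
For context on the free list fix above: memory registration generally operates on whole pages, so a buffer that shares a page with unrelated heap data can pull that data into the registration. The following is a minimal sketch of one way to avoid that, by allocating buffers that start on a page boundary and span whole pages. It is an illustration under that assumption, not the plugin's actual code; `alloc_page_isolated` and `round_up_to_page` are hypothetical names.

```c
#include <stdint.h>
#include <stdlib.h>
#include <unistd.h>

/* Round len up to a whole number of pages. */
static size_t round_up_to_page(size_t len, size_t page_size)
{
    return (len + page_size - 1) / page_size * page_size;
}

/*
 * Hypothetical helper (not part of the plugin): allocate a buffer that
 * starts on a page boundary and covers whole pages, so that registering
 * it does not also cover neighbouring user data on the same page.
 * On success, *alloc_len holds the rounded-up length to register.
 */
static void *alloc_page_isolated(size_t len, size_t *alloc_len)
{
    long page_size = sysconf(_SC_PAGESIZE);
    void *buf = NULL;

    if (page_size <= 0 || len == 0 || alloc_len == NULL)
        return NULL;

    size_t rounded = round_up_to_page(len, (size_t)page_size);
    if (posix_memalign(&buf, (size_t)page_size, rounded) != 0)
        return NULL;

    *alloc_len = rounded;
    return buf;
}
```

A caller would register the full `*alloc_len` bytes rather than the requested length, and release the buffer with `free()` after deregistering it.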
This release has been tested on P3dn, P4d/P4de, and P5 using the EFA provider in Libfabric.
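
As background for the fi_getopt() item above: fi_getopt() takes its length argument as a pointer (`size_t *optlen`). The sketch below assumes the usual value-result convention for such arguments, as with getsockopt(): the caller sets the length to the buffer size before the call and checks it afterwards. `mock_getopt` is a hypothetical stand-in used so the example is self-contained; it is not libfabric code, and the option value is made up.

```c
#include <stddef.h>
#include <string.h>

/*
 * Hypothetical stand-in for an option getter with a value-result length
 * argument. On entry *optlen is the size of the optval buffer; on
 * success it is set to the number of bytes written.
 */
static int mock_getopt(void *optval, size_t *optlen)
{
    const size_t value = 8192; /* made-up option value */

    if (optval == NULL || optlen == NULL || *optlen < sizeof(value))
        return -1;

    memcpy(optval, &value, sizeof(value));
    *optlen = sizeof(value);
    return 0;
}

/* Caller side: initialise the length before the call, verify it after. */
static int read_option(size_t *out)
{
    size_t val = 0;
    size_t len = sizeof(val); /* must be set to the buffer size first */

    if (mock_getopt(&val, &len) != 0 || len != sizeof(val))
        return -1;

    *out = val;
    return 0;
}
```

Leaving `len` uninitialised, or passing a length that does not match the buffer, is the kind of mistake this pattern guards against.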