Skip to content

Commit 90efbbe

Browse files
[integ-tests] Upgrade NCCL versions
The new NCCL version has some performance improvement on Blackwell. See NCCL release note: https://docs.nvidia.com/deeplearning/nccl/release-notes/rel_2-28-3.html#rel_2-28-3 This upgrade makes NCCL performance on two p6-b200 15% better
1 parent 25ff751 commit 90efbbe

File tree

2 files changed

+4
-4
lines changed

2 files changed

+4
-4
lines changed

tests/integration-tests/tests/common/data/nccl/init_nccl_benchmarks.sh

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -5,8 +5,8 @@ set -xe
55
rm -rf /shared/${1}
66

77
module load ${1}
8-
NCCL_BENCHMARKS_VERSION='2.16.7'
9-
NCCL_VERSION='2.27.7-1'
8+
NCCL_BENCHMARKS_VERSION='2.17.1'
9+
NCCL_VERSION='2.28.3-1'
1010
MPI_HOME=$(which mpirun | awk -F '/bin' '{print $1}')
1111
NVCC_GENCODE="-gencode=arch=compute_80,code=sm_80 -gencode=arch=compute_90,code=sm_90 -gencode=arch=compute_90,code=compute_90" # Arch for NVIDIA A100 and H100, ref https://docs.nvidia.com/cuda/ada-compatibility-guide/index.html
1212

tests/integration-tests/tests/common/data/nccl/nccl_tests_submit_openmpi.sh

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -3,8 +3,8 @@
33
#SBATCH --exclusive
44

55
module load openmpi
6-
NCCL_VERSION='2.27.7-1'
7-
NCCL_BENCHMARKS_VERSION='2.16.7'
6+
NCCL_VERSION='2.28.3-1'
7+
NCCL_BENCHMARKS_VERSION='2.17.1'
88

99
. /etc/os-release
1010
if [[ $ID==rhel || $ID==rocky ]]; then

0 commit comments

Comments
 (0)