Hi,
We observed unexpectedly low host-to-device bandwidth on a GH200 Superchip.
Specs
nvidia-smi topology:
        GPU0    NIC0    NIC1    CPU Affinity    NUMA Affinity   GPU NUMA ID
GPU0     X      SYS     SYS     0-71            0               1
NIC0    SYS      X      PIX
NIC1    SYS     PIX      X
output: nvbandwidth-gh200.log
ref:
Regards.
Are the GPU clocks locked? To what value? Can you attach the output of nvidia-smi -q?
@deepakcu
I have attached the output of lscpu and nvidia-smi -q as you requested. I believe everything is running at stock settings.
Below is some additional information:
$ uname -a
Linux gpu51 5.14.0-362.8.1.el9_3.aarch64+64k #1 SMP PREEMPT_DYNAMIC Thu Nov 9 05:07:41 UTC 2023 aarch64 aarch64 aarch64 GNU/Linux
550.54.14
nvidia_peermem
gdrdrv
I also checked dmesg immediately after running nvbandwidth and didn't observe any warnings or errors.
Thanks.
Can you repeat your test with the GPU clocks locked at the maximum (1980 MHz)?
sudo nvidia-smi --lock-gpu-clocks=1980,1980
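To confirm the lock took effect before rerunning nvbandwidth, something like the following might help. It is only a sketch: the helper names (parse_sm_clocks, clocks_at_max, query_sm_clocks) are made up for illustration, and it assumes nvidia-smi is on the PATH and that the clocks.sm and clocks.max.sm query fields are reported on this system. The reported clock may still differ from the requested value if the GPU is throttling.

```python
import subprocess


def parse_sm_clocks(csv_text):
    """Parse 'current, max' SM clock rows (MHz, one row per GPU) into tuples."""
    rows = []
    for line in csv_text.strip().splitlines():
        current, maximum = (int(field.strip()) for field in line.split(","))
        rows.append((current, maximum))
    return rows


def clocks_at_max(rows):
    """True only if at least one GPU was reported and every GPU runs at its max SM clock."""
    return bool(rows) and all(current == maximum for current, maximum in rows)


def query_sm_clocks():
    """Ask nvidia-smi for current and max SM clocks (requires a working driver)."""
    result = subprocess.run(
        [
            "nvidia-smi",
            "--query-gpu=clocks.sm,clocks.max.sm",
            "--format=csv,noheader,nounits",
        ],
        check=True,
        capture_output=True,
        text=True,
    )
    return parse_sm_clocks(result.stdout)
```

On the GH200 node, calling clocks_at_max(query_sm_clocks()) after the lock command should show whether the SM clock is sitting at its maximum. Once the test is done, sudo nvidia-smi --reset-gpu-clocks returns the clocks to their default behavior.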