-
Notifications
You must be signed in to change notification settings - Fork 31
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
failed to run sm read cases on l40s platform #19
Comments
Can you provide the toolkit and driver versions installed on this system? By any chance, does the system have more than one toolkit installed? |
thanks for your response. The following is the info about nvidia-related versions:
In addition, I installed the toolkit without GPU driver by:
|
Hi @deepakcu , Is there any solution to this issue? I met a similar problem. I have a compute node with 8 L20 GPUs. Testcases Nvidia driver version: 535.161.08 |
Do the devices have nvlink connectivity ? What does nvidia-smi nvlink -s report? |
No they do not have nvlink. |
What's the IOMMU configuration? |
I think it is not enabled. There is nothing under |
When I compiled on A100 and run it on H100, I get the same error |
Have the same error on system running 4xL40S. Are there any updates on this? |
I have a compute node with 8 L40s gpus. when I run nvbandwidth, the following cases failed/aborted:
All these cases aborted with message like:
The following is one of them:
please help to check what is the problem, thanks.
The text was updated successfully, but these errors were encountered: