Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ZLUDA v3.8.8 #71

Merged
merged 39 commits into from
Feb 12, 2025
Merged

ZLUDA v3.8.8 #71

merged 39 commits into from
Feb 12, 2025

Conversation

lshqqytiger
Copy link
Owner

Change logs

  • Update README.
  • Add missing cudaDeviceSynchronize and cudaGetErrorString for cuda runtime api.
  • Fix bug of cuda runtime api.
  • [Nightly] Add nvcuda.zluda_get_nightly_flag.
    It returns 1 if ZLUDA was built with --nightly flag. Otherwise, it returns 0.
  • [Nightly] Unlock cuBLASLt matmul compute type.
  • [Nightly] Experimental cuDNN support on Windows.

About cuDNN

cuDNN is required by various high-performance softwares.

Luckily, there are a few of them that allows us to disable it.

However, we cannot run others that does not. (e.g. ONNX Runtime)

You'll need nightly ZLUDA and a very specific build of MIOpen.

Actually, I would like not to share it because it is virtually unusable and useless at this moment.

As I tested, because MIOpen itself is very unstable and incomplete, there are some limitations as described below.

  • Only the custom build of MIOpen without rocMLIR and composable kernel is tested.
    Other unofficial builds will not work.
  • Only FP32 is supported for Conv2d in gfx1100.
  • There is small memory leak issue due to technical difficulties.
  • The performance is poor due to the reasons I described above.

Despite of the issues, I'm merging this into master because the branches are diverging so much.

@lshqqytiger lshqqytiger added implementation Unimplemented feature(s) release MIOpen labels Feb 12, 2025
@lshqqytiger lshqqytiger self-assigned this Feb 12, 2025
@lshqqytiger lshqqytiger marked this pull request as ready for review February 12, 2025 16:30
@lshqqytiger lshqqytiger merged commit 8352d8e into master Feb 12, 2025
3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
implementation Unimplemented feature(s) MIOpen release
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant