Skip to content

Conversation

@mberenjk
Copy link
Contributor

@mberenjk mberenjk commented Apr 2, 2025

Details

Do not mention proprietary info or link to internal work items in this PR.

Work item: "Internal", or link to GitHub issue (if applicable).
[LWPCOMMLIBS-577]
What were the changes?
Switched to hip_fp8.h for proper FP8 support.
Ensured alignment with NCCL synchronization.
Resolved fnuz incompatibility to enhance stability.

Why were the changes made?
RCCL-Tests needs to switch over to HIP FP8 header file which has become available in recent ROCm releases.

How was the outcome achieved?
rccl-tests have been tested and passed with datatype=fp8_e4m3/fp8_e5m2.

Additional Documentation:
What else should the reviewer know?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why were these added to the header?

Comment on lines +567 to +568
case ncclFloat8e4m3: fp8_e4m3 = ncclVerifiablePremulScalar<rccl_float8>(rank); break;
case ncclFloat8e5m2: fp8_e5m2 = ncclVerifiablePremulScalar<rccl_bfloat8>(rank); break;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Missing if/else. Likely will need to add fp8_e4m3_fnuz and fp8_e5m2_fnuz fields to the union.

@nileshnegi nileshnegi requested a review from rahulvaidya20 as a code owner May 9, 2025 21:52
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants