Skip to content

Conversation

@gnifAMD
Copy link

@gnifAMD gnifAMD commented Jul 29, 2025

This can occur on platforms that operates in WGP (work group processor) mode in which the CUs are grouped in pairs so there are effectively half as many multiprocessing units.

Associated JIRA ticket number/Github issue number

Fixes SWDEV-537865

What type of PR is this? (check all applicable)

  • Refactor
  • Feature
  • Bug Fix
  • Optimization
  • Documentation Update
  • Continuous Integration

What were the changes?

Clamp the results from GenerateThreadDimensions, GenerateBlockDimensions, GenerateThreadDimensionsForShuffle and GenerateBlockDimensionsForShuffle to prevent invalid zero values when multiProcessorCount returns 1.

Why are these changes needed?

Fixes divide by zero errors and invalid parameter errors.

Updated CHANGELOG?

  • Yes
  • No, Does not apply to this PR.

Added/Updated documentation?

  • Yes
  • No, Does not apply to this PR.

Additional Checks

  • I have added tests relevant to the introduced functionality, and the unit tests are passing locally.
  • Any dependent changes have been merged.

This can occur on platforms that operates in WGP (work group processor) mode in
which the CUs are grouped in pairs so there are effectively half as many
multiprocessing units.
@jayhawk-commits
Copy link
Contributor

Imported to ROCm/rocm-systems#625

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants