Skip to content

'Fixed performance issue with random ops'#193

Open
fr30 wants to merge 1 commit intomindspore-ai:masterfrom
fr30:random-ops-performance-fix
Open

'Fixed performance issue with random ops'#193
fr30 wants to merge 1 commit intomindspore-ai:masterfrom
fr30:random-ops-performance-fix

Conversation

@fr30
Copy link
Copy Markdown

@fr30 fr30 commented Aug 5, 2022

What type of PR is this?

/kind bug

What does this PR do / why do we need it:
Improve performance of random ops for CUDA.

Which issue(s) this PR fixes:
Fixes #192

Special notes for your reviewers:
The kernels were refactored as descibed in perfomance guideline in https://docs.nvidia.com/cuda/curand/device-api-overview.html#performance-notes

@CLAassistant
Copy link
Copy Markdown

CLAassistant commented Aug 5, 2022

CLA assistant check
All committers have signed the CLA.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Performance issue with mindspore.ops.normal

2 participants